Full Text Available Examination of the human transcriptome reveals higher levels of RNA editing than in any other organism tested to date. This is indicative of extensive double-stranded RNA (dsRNA formation within the human transcriptome. Most of the editing sites are located in the primate-specific retrotransposed element called Alu. A large fraction of Alus are found in intronic sequences, implying extensive Alu-Alu dsRNA formation in mRNA precursors. Yet, the effect of these intronic Alus on splicing of the flanking exons is largely unknown. Here, we show that more Alus flank alternatively spliced exons than constitutively spliced ones; this is especially notable for those exons that have changed their mode of splicing from constitutive to alternative during human evolution. This implies that Alu insertions may change the mode of splicing of the flanking exons. Indeed, we demonstrate experimentally that two Alu elements that were inserted into an intron in opposite orientation undergo base-pairing, as evident by RNA editing, and affect the splicing patterns of a downstream exon, shifting it from constitutive to alternative. Our results indicate the importance of intronic Alus in influencing the splicing of flanking exons, further emphasizing the role of Alus in shaping of the human transcriptome.
Jarrell, K A
I describe the self-splicing of an RNA that consists of exon sequences flanked by group II intron sequences. I find that this RNA undergoes accurate splicing in vitro, yielding an excised exon circle. This splicing reaction involves the joining of the 5' splice site at the end of an exon to the 3' splice site at the beginning of the same exon; thus, I term it inverse splicing. Inverse splicing provides a potential mechanism for exon scrambling, for exon deletion in alternative splicing pathwa...
Pyle, Anna Marie
Group II introns are large, autocatalytic ribozymes that catalyze RNA splicing and retrotransposition. Splicing by group II introns plays a major role in the metabolism of plants, fungi, and yeast and contributes to genetic variation in many bacteria. Group II introns have played a major role in genome evolution, as they are likely progenitors of spliceosomal introns, retroelements, and other machinery that controls genetic variation and stability. The structure and catalytic mechanism of group II introns have recently been elucidated through a combination of genetics, chemical biology, solution biochemistry, and crystallography. These studies reveal a dynamic machine that cycles progressively through multiple conformations as it stimulates the various stages of splicing. A central active site, containing a reactive metal ion cluster, catalyzes both steps of self-splicing. These studies provide insights into RNA structure, folding, and catalysis, as they raise new questions about the behavior of RNA machines. PMID:27391926
Deshler, J O; Larson, G P; Rossi, J J
The actin (ACT) gene from the budding yeast Kluyveromyces lactis was cloned, and the nucleotide sequence was determined. The gene had a single intron 778 nucleotides in length which possessed the highly conserved splicing signals found in Saccharomyces cerevisiae introns. We demonstrated splicing of heterologous ACT transcripts in both K. lactis and S. cerevisiae.
Zhou, Kemin; Salamov, Asaf; Kuo, Alan; Aerts, Andrea; Grigoriev, Igor
Using four genomes: Chamydomonas reinhardtii, Agaricus bisporus, Aspergillus carbonarius, and Sporotricum thermophile with EST coverage of 2.9x, 8.9x, 29.5x, and 46.3x respectively, we identified 11 alternative splicing (AS) types that were dominated by intron retention (RI; biased toward short introns) and found 15, 35, 52, and 63percent AS of multiexon genes respectively. Genes with AS were more ancient, and number of AS correlated with number of exons, expression level, and maximum intron length of the gene. Introns with tendency to be retained had either stop codons or length of 3n+1 or 3n+2 presumably triggering nonsense-mediated mRNA decay (NMD), but introns retained in major isoforms (0.2-6percent of all introns) were biased toward 3n length and stop codon free. Stopless introns were biased toward phase 0, but 3n introns favored phase 1 that introduced more flexible and hydrophilic amino acids on both ends of introns which would be less disruptive to protein structure. We proposed a model in which minor RI intron could evolve into major RI that could facilitate intron loss through exonization.
Intron retention in transcripts and the presence of 5 and 3 splice sites within these introns mediate alternate splicing, which is widely observed in animals and plants. Here, functional characterisation of the K+ transporter, HvHKT2;1, with stably retained introns from barley (Hordeum vulgare) in yeast (Saccharomyces cerevisiae), and transcript profiling in yeast and transgenic tobacco (Nicotiana tabacum) is presented. Expression of intron-retaining HvHKT2;1 cDNA (HvHKT2;1-i) in trk1, trk2 yeast strain defective in K+ uptake restored growth in medium containing hygromycin in the presence of different concentrations of K+ and mediated hypersensitivity to Na+. HvHKT2;1-i produces multiple transcripts via alternate splicing of two regular introns and three exons in different compositions. HKT isoforms with retained introns and exon skipping variants were detected in relative expression analysis of (i) HvHKT2;1-i in barley under native conditions, (ii) in transgenic tobacco plants constitutively expressing HvHKT2;1-i, and (iii) in trk1, trk2 yeast expressing HvHKT2;1-i under control of an inducible promoter. Mixed proportions of three HKT transcripts: HvHKT2;1-e (first exon region), HvHKT2;1-i1 (first intron) and HvHKT2;1-i2 (second intron) were observed. The variation in transcript accumulation in response to changing K+ and Na+ concentrations was observed in both heterologous and plant systems. These findings suggest a link between intron-retaining transcripts and different splice variants to ion homeostasis, and their possible role in salt stress.
Zhao, Xiaoqing; Li, Hong; Bao, Tonglaga; Ying, Zhiqiang
Many experiment evidences showed that sequence structures of introns and intron loss/gain can influence gene expression, but current mechanisms did not refer to the functions of post-spliced introns directly. We propose that postspliced introns play their functions in gene expression by interacting with their mRNA sequences and the interaction is characterized by the matched segments between introns and their CDS. In this study, we investigated the interaction characters with length series by improved Smith-Waterman local alignment software for the ribosomal protein genes in C. elegans and D. melanogaster. Our results showed that RF values of five intron groups are significantly high in the central non-conserved region and very low in 5'-end and 3'-end splicing region. It is interesting that the number of the optimal matched regions gradually increases with intron length. Distributions of the optimal matched regions are different for five intron groups. Our study revealed that there are more interaction regions between longer introns and their CDS than shorter, and it provides a positive pattern for regulating the gene expression.
Lunghi, Matteo; Spano, Furio; Magini, Alessandro; Emiliani, Carla; Carruthers, Vern B; Di Cristina, Manlio
Apicomplexan parasites including Toxoplasma gondii and Plasmodium species have complex life cycles that include multiple hosts and differentiation through several morphologically distinct stages requiring marked changes in gene expression. This review highlights emerging evidence implicating regulation of mRNA splicing as a mechanism to prime these parasites for rapid gene expression upon differentiation. We summarize the most important insights in alternative splicing including its role in regulating gene expression by decreasing mRNA abundance via 'Regulated Unproductive Splicing and Translation'. As a related but less well-understood mechanism, we discuss also our recent work suggesting a role for intron retention for precluding translation of stage specific isoforms of T. gondii glycolytic enzymes. We additionally provide new evidence that intron retention might be a widespread mechanism during parasite differentiation. Supporting this notion, recent genome-wide analysis of Toxoplasma and Plasmodium suggests intron retention is more pervasive than heretofore thought. These findings parallel recent emergence of intron retention being more prevalent in mammals than previously believed, thereby adding to the established roles in plants, fungi and unicellular eukaryotes. Deeper mechanistic studies of intron retention will provide important insight into its role in regulating gene expression in apicomplexan parasites and more general in eukaryotic organisms. PMID:26194054
McNeil, Bonnie A; Simon, Dawn M; Zimmerly, Steven
Group II introns are ribozymes and retroelements found in bacteria, and are thought to have been the ancestors of nuclear pre-mRNA introns. Whereas nuclear introns undergo prolific alternative splicing in some species, group II introns are not known to carry out equivalent reactions. Here we report a group II intron in the human pathogen Clostridium tetani, which undergoes four alternative splicing reactions in vivo. Together with unspliced transcript, five mRNAs are produced, each encoding a distinct surface layer protein isoform. Correct fusion of exon reading frames requires a shifted 5' splice site located 8 nt upstream of the canonical boundary motif. The shifted junction is accomplished by an altered IBS1-EBS1 pairing between the intron and 5' exon. Growth of C. tetani under a variety of conditions did not result in large changes in alternative splicing levels, raising the possibility that alternative splicing is constitutive. This work demonstrates a novel type of gene organization and regulation in bacteria, and provides an additional parallel between group II and nuclear pre-mRNA introns. PMID:24214997
de Barros Lopes, M; Soden, A; Henschke, P. A.; Langridge, P
The increased use of pure starter cultures in the wine industry has made it necessary to develop a rapid and simple identification system for yeast strains. A method based upon the PCR using oligonucleotide primers that are complementary to intron splice sites has been developed. Since most introns are not essential for gene function, introns have evolved with minimal constraint. By targeting these highly variable sequences, the PCR has proved to be very effective in uncovering polymorphisms ...
McNeil, Bonnie A; Zimmerly, Steven
Group II introns are ribozymes in bacterial and organellar genomes that function as self-splicing introns and as retroelements. Previously, we reported that the group II intron C.te.I1 of Clostridium tetani alternatively splices in vivo to produce five distinct coding mRNAs. Accurate fusion of upstream and downstream reading frames requires a shifted 5' splice site located 8 nt upstream of the usual 5' GUGYG motif. This site is specified by the ribozyme through an altered intron/exon-binding site 1 (IBS1-EBS1) pairing. Here we use mutagenesis and self-splicing assays to investigate in more detail the significance of the structural features of the C.te.I1 ribozyme. The shifted 5' splice site is shown to be affected by structures in addition to IBS1-EBS1, and unlike other group II introns, C.te.I1 appears to require a spacer between IBS1 and the GUGYG motif. In addition, the mechanism of 3' exon recognition is modified from the ancestral IIB mechanism to a IIA-like mechanism that appears to be longer than the typical single base-pair interaction and may extend up to 4 bp. The novel ribozyme properties that have evolved for C.te.I1 illustrate the plasticity of group II introns in adapting new structural and catalytic properties that can be utilized to affect gene expression. PMID:24751650
Wang, Kai; Wernersson, Rasmus; Brunak, Søren
MOTIVATION: The gene concept has recently changed from the classical one protein notion into a much more diverse picture, where overlapping or fused transcripts, alternative transcription initiation, and genes within genes, add to the complexity generated by alternative splicing. Increased...... understanding of the mechanisms controlling pre-mRNA splicing is thus important for a wide range of aspects relating to gene expression. RESULTS: We have discovered a convex gene delineating pattern in the strength of 5' intron splice sites. When comparing the strengths of >18 000 intron containing Human genes......, we found that when analysing them separately according to the number of introns they contain, initial splice sites were always stronger on average than subsequent ones, and that a similar reversed trend exist towards the terminal gene part. The convex pattern is strongest for genes with up to 10...
The thymidylate synthase gene (td) and ribonucleotide reductase B2 subunit gene (nrdB) EMBO both of bacteriophage T4 in origin, are procaryotic intron-containing protein-encoding genes. To screen for other procaryotic introns, southern hybridization analysis of several procaryotic genomes was carried out, using T4 phage td DNA restriction fragments and synthetic oligodeoxynucleotides defining strategic td exon and intron regions. Furthermore, the labeling pattern of total RNA with [α-32P]GTP, a typical reaction of self-splicing RNAs (class I), was examined. Experimental data implicate multiple self-splicing introns only in the T-even phages: five (1, 0.9, 0.83, 0.75 and 0.6 kb) in T4 and three (1, 0.9 and 0.75 kb) each in T2 and T6 phages. Northern hybridization analysis of total RNA extracted from T-even phage-infected cells confirms that the 1 kb RNA from each phage is in fact the excised intron segment from the precursor RNA transcribed from an intron-containing td gene in each case. This RNA cyclizes to form a contiguous circular molecule. The 0.6 kb RNA is most likely the T4 phage nrdB intron which seems to be absent from the corresponding gene in T2 and T6. The remaining RNA species are candidates for other self-splicing introns in these phages
Full Text Available During evolution segments of homeothermic genomes underwent a GC content increase. Our analyses reveal that two exon-intron architectures have evolved from an ancestral state of low GC content exons flanked by short introns with a lower GC content. One group underwent a GC content elevation that abolished the differential exon-intron GC content, with introns remaining short. The other group retained the overall low GC content as well as the differential exon-intron GC content, and is associated with longer introns. We show that differential exon-intron GC content regulates exon inclusion level in this group, in which disease-associated mutations often lead to exon skipping. This group's exons also display higher nucleosome occupancy compared to flanking introns and exons of the other group, thus “marking” them for spliceosomal recognition. Collectively, our results reveal that differential exon-intron GC content is a previously unidentified determinant of exon selection and argue that the two GC content architectures reflect the two mechanisms by which splicing signals are recognized: exon definition and intron definition.
Gibb, Ewan A; Edgell, David R
The td group I intron interrupting the thymidylate synthase (TS) gene of phage T4 is a mobile intron that encodes the homing endonuclease I-TevI. Efficient RNA splicing of the intron is required to restore function of the TS gene, while expression of I-TevI from within the intron is required to initiate intron mobility. Three distinct layers of regulation temporally limit I-TevI expression to late in the T4 infective cycle, yet the biological rationale for stringent regulation has not been tested. Here, we deleted key control elements to deregulate I-TevI expression at early and middle times post T4 infection. Strikingly, we found that deregulation of I-TevI, or of a catalytically inactive variant, generated a thymidine-dependent phenotype that is caused by a reduction in td intron splicing. Prematurely terminating I-TevI translation restores td splicing, full-length TS synthesis, and rescues the thymidine-dependent phenotype. We suggest that stringent translational control of I-TevI evolved to prevent the ribosome from disrupting key structural elements of the td intron that are required for splicing and TS function at early and middle times post T4 infection. Analogous translational regulatory mechanisms in unrelated intron-open reading frame arrangements may also function to limit deleterious consequences on splicing and host gene function. PMID:20497330
Chalamcharla Venkata R; Edgell David R; Belfort Marlene
Abstract Group I and II introns can be considered as molecular parasites that interrupt protein-coding and structural RNA genes in all domains of life. They function as self-splicing ribozymes and thereby limit the phenotypic costs associated with disruption of a host gene while they act as mobile DNA elements to promote their spread within and between genomes. Once considered purely selfish DNA elements, they now seem, in the light of recent work on the molecular mechanisms regulating bacter...
Chalamcharla Venkata R
Full Text Available Abstract Group I and II introns can be considered as molecular parasites that interrupt protein-coding and structural RNA genes in all domains of life. They function as self-splicing ribozymes and thereby limit the phenotypic costs associated with disruption of a host gene while they act as mobile DNA elements to promote their spread within and between genomes. Once considered purely selfish DNA elements, they now seem, in the light of recent work on the molecular mechanisms regulating bacterial and phage group I and II intron dynamics, to show evidence of co-evolution with their hosts. These previously underappreciated relationships serve the co-evolving entities particularly well in times of environmental stress.
Monachello, Dario; Michel, François; Costa, Maria
When assayed in vitro, group IIC self-splicing introns, which target bacterial Rho-independent transcription terminators, generally fail to yield branched products during splicing despite their possessing a seemingly normal branchpoint. Starting with intron O.i.I1 from Oceanobacillus iheyensis, whose crystallographically determined structure lacks branchpoint-containing domain VI, we attempted to determine what makes this intron unfit for in vitro branch formation. A major factor was found to be the length of the helix at the base of domain VI: 4 base pairs (bp) are required for efficient branching, even though a majority of group IIC introns have a 3-bp helix. Equally important for lariat formation is the removal of interactions between ribozyme domains II and VI, which are specific to the second step of splicing. Conversely, mismatching of domain VI and its proposed first-step receptor in subdomain IC1 was found to be detrimental; these data suggest that the intron-encoded protein may promote branch formation partly by modulating the equilibrium between conformations specific to the first and second steps of splicing. As a practical application, we show that by making just two changes to the O.i.I1 ribozyme, it is possible to generate sufficient amounts of lariat intron for the latter to be purified and used in kinetic assays in which folding and reaction are uncoupled. PMID:26769855
Holländer, V; Kück, U
The mitochondrial group IIB intron rI1, from the green algae Scenedesmus obliquus ' LSUrRNA gene, has been introduced into the lacZ gene encoding beta-galacto-sidase. After DNA-mediated transformation of the recombinant lacZ gene into Escherichia coli, we observed correct splicing of the chimeric precursor RNA in vivo. In contrast to autocatalytic in vitro self-splicing, intron processing in vivo is independent of the growth temperature, suggesting that in E.coli, trans -acting factors are in...
Taube, Jennifer R.; Sperle, Karen; Banser, Linda; Seeman, Pavel; Cavan, Barbra Charina V.; Garbern, James Y.; Hobson, Grace M.
Alternative splicing of the proteolipid protein 1 gene (PLP1) produces two forms, PLP1 and DM20, due to alternative use of 5′ splice sites with the same acceptor site in intron 3. The PLP1 form predominates in central nervous system RNA. Mutations that reduce the ratio of PLP1 to DM20, whether mutant or normal protein is formed, result in the X-linked leukodystrophy Pelizaeus-Merzbacher disease (PMD). We investigated the ability of sequences throughout PLP1 intron 3 to regulate alternative sp...
Modulation of pre-mRNA splicing by antisense molecules is a promising mechanism of action for gene therapeutic drugs. In this study, we have examined the potential of peptide nucleic acid (PNA) 9-aminoacridine conjugates to modulate the pre-mRNA splicing of the mdm2 human cancer gene in JAR cells. We screened 10 different 15 mer PNAs targeting intron2 at both the 5' - and the 3'-splice site for their effects on the splicing of mdm2 using RT-PCR analysis. We also tested a PNA (2512) targeting the 3'-splice site of intron3 with a complementarity of 4 bases to intron3 and 11 bases to exon4 for its splicing modulation effect. This PNA2512 was further tested for the effects on the mdm2 protein level as well as for inhibition of cell growth in combination with the DNA damaging agent camptothecin (CPT). We show that several of these PNAs effectively inhibit the splicing thereby producing a larger mRNA still containing intron2, while skipping of exon3 was not observed by any of these PNAs. The most effective PNA (PNA2406) targeting the 3'-splice site of intron2 had a complementarity of 4 bases to intron2 and 11 bases to exon3. PNA (2512) targeting the 3'-splice site of intron3 induced both splicing inhibition (intron3 skipping) and skipping of exon4. Furthermore, treatment of JAR cells with this PNA resulted in a reduction in the level of MDM2 protein and a concomitant increase in the level of tumor suppressor p53. In addition, a combination of this PNA with CPT inhibited cell growth more than CPT alone. We have identified several PNAs targeting the 5'- or 3'-splice sites in intron2 or the 3'-splice site of intron3 of mdm2 pre-mRNA which can inhibit splicing. Antisense targeting of splice junctions of mdm2 pre-mRNA may be a powerful method to evaluate the cellular function of MDM2 splice variants as well as a promising approach for discovery of mdm2 targeted anticancer drugs
MAO Xing-xue; LIU Yan-zhuo; XIAO Xin; CHEN Jian-wei; LUO Wen-yong; LI Xiao-fang
A new method of one-step PGR was devised for detecting the first nucleotide in the splice donor site of Wx intron 1.compared to the regular PCR-Acc I method, the method can produce the same result for detecting +1 nucleotide of Wx intron 1.The reliability of the new method was confirmed with 30 rice varieties. The new technique is more convenient and cheaper than the regular PCR-Acc I method, and could be widely deploded in rice molecular marker assistant selection.
Pourzadegan, F; Shariati, L; Taghizadeh, R; Khanahmad, H; Mohammadi, Z; Tabatabaiefar, M A
Suicide gene therapy is one of the most innovative approaches in which a potential toxic gene is delivered to the targeted cancer cell by different target delivery methods. We constructed a transfer vector to express green fluorescent protein (GFP) in transduced cells but not in packaging cells. We placed gfp under the control of the cytomegalovirus (CMV) promoter, which is positioned between the two long-terminal repeats in reverse direction. The intron-2 sequence of the human beta globin gene with two poly-A signals and several stop codons on the antisense strand was placed on the leading strand between the CMV promoter and gfp. For lentiviral production, the HEK293T and line were co-transfected with the PMD2G, psPAX2 and pLentiGFP-Ins2 plasmids. The HEK293T and line were transduced with this virus. PCR was performed for evaluation of intron splicing in transduced cells. The GFP expression was seen in 65% of the cells transduced. The PCR amplification of the genomic DNA of transduced cells confirmed the splicing of intron 2. The strategy is significant to accomplish our goal for preserving the packaging cells from the toxic gene expression during viral assembly and the resultant reduction in viral titration. Also it serves to address several other issues in the gene therapy. PMID:26679755
Guan, Wuxiang; Cheng, Fang; Huang, Qinfeng; Kleiboeker, Steve; Qiu, Jianming
Alternative splicing of the precursor mRNA (pre-mRNA) of human parvovirus B19 (B19V) plays a key role in posttranscriptional regulation of B19V gene expression. We report that the central exon of the B19V pre-mRNA is defined by three GAA motif-containing exonic splicing enhancers and a G/GU-rich intronic splicing enhancer that lies adjacent to the second donor site. Moreover, targeting of morpholino antisense oligonucleotides to the two splicing enhancers surrounding the second donor site led...
Nguyen Dinh, Sy; Sai, Than Zaw Tun; Nawaz, Ghazala; Lee, Kwanuk; Kang, Hunseung
Despite the increasing understanding of the regulation of chloroplast gene expression in plants, the importance of intron splicing and processing of chloroplast RNA transcripts under stress conditions is largely unknown. Here, to understand how abiotic stresses affect the intron splicing and expression patterns of chloroplast genes in dicots and monocots, we carried out a comprehensive analysis of the intron splicing and expression patterns of chloroplast genes in the coffee plant (Coffea arabica) as a dicot and rice (Oryza sativa) as a monocot under abiotic stresses, including drought, cold, or combined drought and heat stresses. The photosynthetic activity of both coffee plants and rice seedlings was significantly reduced under all stress conditions tested. Analysis of the transcript levels of chloroplast genes revealed that the splicing of tRNAs and mRNAs in coffee plants and rice seedlings were significantly affected by abiotic stresses. Notably, abiotic stresses affected differently the splicing of chloroplast tRNAs and mRNAs in coffee plants and rice seedlings. The transcript levels of most chloroplast genes were markedly downregulated in both coffee plants and rice seedlings upon stress treatment. Taken together, these results suggest that coffee and rice plants respond to abiotic stresses via regulating the intron splicing and expression of different sets of chloroplast genes. PMID:27448724
Valles, Y.; Halanych, K.; Boore, J.L.
Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size allowing for faster replication requirements (booremoritz19981995Lynch2005). Yet, in 1996 two type I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking a long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundreds of triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kbs in length. This finding represents as well the first instance of a group II intron (anthozoans harbor group I introns) in all metazoan lineages. Opposite trends are observed within plants, fungi and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both, plant and fungal mtDNA are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, as metazoans they have a conserved gene content. Protists, on the other hand have a striking variation of gene content and introns that account for the genome size variation. In contrast to
Merico, Daniele; Roifman, Maian; Braunschweig, Ulrich; Yuen, Ryan K. C.; Alexandrova, Roumiana; Bates, Andrea; Reid, Brenda; Nalpathamkalam, Thomas; Wang, Zhuozhi; Thiruvahindrapuram, Bhooma; Gray, Paul; Kakakios, Alyson; Peake, Jane; Hogarth, Stephanie; Manson, David; Buncic, Raymond; Pereira, Sergio L.; Herbrick, Jo-Anne; Blencowe, Benjamin J.; Roifman, Chaim M.; Scherer, Stephen W.
Roifman Syndrome is a rare congenital disorder characterized by growth retardation, cognitive delay, spondyloepiphyseal dysplasia and antibody deficiency. Here we utilize whole-genome sequencing of Roifman Syndrome patients to reveal compound heterozygous rare variants that disrupt highly conserved positions of the RNU4ATAC small nuclear RNA gene, a minor spliceosome component that is essential for minor intron splicing. Targeted sequencing confirms allele segregation in six cases from four unrelated families. RNU4ATAC rare variants have been recently reported to cause microcephalic osteodysplastic primordial dwarfism, type I (MOPD1), whose phenotype is distinct from Roifman Syndrome. Strikingly, all six of the Roifman Syndrome cases have one variant that overlaps MOPD1-implicated structural elements, while the other variant overlaps a highly conserved structural element not previously implicated in disease. RNA-seq analysis confirms extensive and specific defects of minor intron splicing. Available allele frequency data suggest that recessive genetic disorders caused by RNU4ATAC rare variants may be more prevalent than previously reported. PMID:26522830
Zhang, Libin; Bao, Penghui; Leibowitz, Michael J; Zhang, Yi
Pseudoknots play critical roles in packing the active structure of various functional RNAs. The importance of the P3-P7 pseudoknot in refolding of group I intron ribozymes has been recently appreciated, while little is known about the pseudoknot function in co-transcriptional folding. Here we used the Candida group I intron as a model to address the question. We show that co-transcriptional folding of the active self-splicing intron is twice as fast as refolding. The P3-P7 pseudoknot folds slowly during co-transcriptional folding at a rate constant similar to the folding of the active ribozyme, and folding of both P3-P7 and P1-P10 pseudoknots are inhibited by antisense oligonucleotides. We conclude that when RNA folding is coupled with transcription, formation of pseudoknot structures dominates the productive folding pathway and serves as a rate-limiting step in producing the self-splicing competent Candida intron. PMID:19710184
Boudvillain, M; Pyle, A M
Group II introns are self-splicing RNA molecules that are of considerable interest as ribozymes, mobile genetic elements and examples of folded RNA. Although these introns are among the most common ribozymes, little is known about the chemical and structural determinants for their reactivity. By using nucleotide analog interference mapping (NAIM), it has been possible to identify the nucleotide functional groups (Rp phosphoryls, 2'-hydroxyls, guanosine exocyclic amines, adenosine N7 and N6) t...
Landels, E C; Green, P.M.; Ellis, I H; Fensom, A H; Kaback, M M; Lim-Steele, J; Zeiger, K; Levy, N; Bobrow, M
In a previous study we found that a Tay-Sachs disease (TSD) causing mutation in the intron 9 donor splice site of the HEXA gene occurs at high frequency in non-Jewish patients and carriers from the British Isles. It was found more frequently in subjects of Irish, Scottish, and Welsh origin compared with English origin (63% and 31% respectively). We have now tested, in a blind study, 26 American TSD carriers and 28 non-carriers who have British ancestry for the intron 9 splice site mutation. S...
Xiu, Zhihui; Sun, Feng; Shen, Yun; Zhang, Xiaoyan; Jiang, Ruicheng; Bonnard, Géraldine; Zhang, Jianhua; Tan, Bao-Cai
In higher plants, chloroplast and mitochondrial transcripts contain a number of group II introns that need to be precisely spliced before translation into functional proteins. However, the mechanism of splicing and the factors involved in this process are not well understood. By analysing a seed mutant in maize, we report here the identification of Empty pericarp16 (Emp16) that is required for splicing of nad2 intron 4 in mitochondria. Disruption of Emp16 function causes developmental arrest in the embryo and endosperm, giving rise to an empty pericarp phenotype in maize. Differentiation of the basal endosperm transfer layer cells is severely affected. Molecular cloning indicates that Emp16 encodes a P-type pentatricopeptide repeat (PPR) protein with 11 PPR motifs and is localized in the mitochondrion. Transcript analysis revealed that mitochondrial nad2 intron 4 splicing is abolished in the emp16 mutants, leading to severely reduced assembly and activity of complex I. In response, the mutant dramatically increases the accumulation of mitochondrial complex III and the expression of alternative oxidase AOX2. These results imply that EMP16 is specifically required for mitochondrial nad2 intron 4 cis-splicing and is essential for complex I assembly and embryogenesis and development endosperm in maize. PMID:26764126
Shiraishi, Takehiko; Eysturskard, Jonhard; Nielsen, Peter E
ABSTRACT: BACKGROUND: Modulation of pre-mRNA splicing by antisense molecules is a promising mechanism of action for gene therapeutic drugs. In this study, we have examined the potential of peptide nucleic acid (PNA) 9-aminoacridine conjugates to modulate the pre-mRNA splicing of the mdm2 human...... cancer gene in JAR cells. METHODS: We screened 10 different 15 mer PNAs targeting intron2 at both the 5;- and the 3;-splice site for their effects on the splicing of mdm2 using RT-PCR analysis. We also tested a PNA (2512) targeting the 3;-splice site of intron3 with a complementarity of 4 bases to intron......3 and 11 bases to exon4 for its splicing modulation effect. This PNA2512 was further tested for the effects on the mdm2 protein level as well as for inhibition of cell growth in combination with the DNA damaging agent camptothecin (CPT). RESULTS: We show that several of these PNAs effectively...
Zhang, Libin; Bao, Penghui; Leibowitz, Michael J.; Zhang, Yi
Pseudoknots play critical roles in packing the active structure of various functional RNAs. The importance of the P3–P7 pseudoknot in refolding of group I intron ribozymes has been recently appreciated, while little is known about the pseudoknot function in co-transcriptional folding. Here we used the Candida group I intron as a model to address the question. We show that co-transcriptional folding of the active self-splicing intron is twice as fast as refolding. The P3–P7 pseudoknot folds sl...
Full Text Available Aberrant pre-mRNA splice variants of hyaluronan synthase 1 (HAS1 have been identified in malignant cells from cancer patients. Bioinformatic analysis suggests that intronic sequence changes can underlie aberrant splicing. Deletions and mutations were introduced into HAS1 minigene constructs to identify regions that can influence aberrant intronic splicing, comparing the splicing pattern in transfectants with that in multiple myeloma (MM patients. Introduced genetic variations in introns 3 and 4 of HAS1 as shown here can promote aberrant splicing of the type detected in malignant cells from MM patients. HAS1Vd is a novel intronic splice variant first identified here. HAS1Vb, an intronic splice variant previously identified in patients, skips exon 4 and utilizes the same intron 4 alternative 3'splice site as HAS1Vd. For transfected constructs with unaltered introns 3 and 4, HAS1Vd transcripts are readily detectable, frequently to the exclusion of HAS1Vb. In contrast, in MM patients, HAS1Vb is more frequent than HAS1Vd. In the HAS1 minigene, combining deletion in intron 4 with mutations in intron 3 leads to a shift from HAS1Vd expression to HAS1Vb expression. The upregulation of aberrant splicing, exemplified here by the expression of HAS1Vb, is shown here to be influenced by multiple genetic changes in intronic sequences. For HAS1Vb, this includes enhanced exon 4 skipping and increased usage of alternative 3' splice sites. Thus, the combination of introduced mutations in HAS1 intron3 with introduced deletions in HAS1 intron 4 promoted a shift to an aberrant splicing pattern previously shown to be clinically significant. Most MM patients harbor genetic variations in intron 4, and as shown here, nearly half harbor recurrent mutations in HAS1 intron 3. Our work suggests that aberrant intronic HAS1 splicing in MM patients may rely on intronic HAS1 deletions and mutations that are frequent in MM patients but absent from healthy donors.
Eden, E.; Brunak, Søren
Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5' untranslated regions (UTRs), and investigate correlations between this class of splice sites and...... synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2-.3-fold better compared with NetGene2 and GenScan in 5' UTRs. We also...... tested the 5' UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR....
O.L. Holla; S. Nakken; M. Mattingsdal; T. Ranheim; K.E. Berge; J.C. Defesche; T.P. Leren
Screening for mutations in the low density lipoprotein receptor (LDLR) gene has identified more than 1000 mutations as the cause of familial hypercholesterolemia (FH). In addition, numerous intronic mutations with uncertain effects on pre-mRNA splicing have also been identified. In this study, we ha
Garrett, Roger Antony; Aagaard, Claus Sindbjerg; Andersen, Morten; Dalgaard, Jacob; Lykke-Andersen, Jens; Phan, Hoa T.N.; Trevisanato, Siro; Østergaard, Laust; Larsen, Niels; Leffers, Henrik
Over the past decade our laboratory has had a strong interest in defining the phylogenetic status of the archaea. This has involved determining and analysing the sequences of operons of both rRNAs and RNA polymerases and it led to the discovery of the first archaeal rRNA intron. What follows is a...
Full Text Available The polyA tails of mRNAs are monitored by the exosome as a quality control mechanism. We find that fission yeast, Schizosaccharomyces pombe, adopts this RNA quality control mechanism to regulate a group of 30 or more meiotic genes at the level of both splicing and RNA turnover. In vegetative cells the RNA binding protein Mmi1 binds to the primary transcripts of these genes. We find the novel motif U(U/C/GAAAC highly over-represented in targets of Mmi1. Mmi1 can specifically regulate the splicing of particular introns in a transcript: it inhibits the splicing of introns that are in the vicinity of putative Mmi1 binding sites, while allowing the splicing of other introns that are far from such sites. In addition, binding of Mmi1, particularly near the 3' end, alters 3' processing to promote extremely long polyA tails of up to a kilobase. The hyperadenylated transcripts are then targeted for degradation by the nuclear exonuclease Rrp6. The nuclear polyA binding protein Pab2 assists this hyperadenylation-mediated RNA decay. Rrp6 also targets other hyperadenylated transcripts, which become hyperadenylated in an unknown, but Mmi1-independent way. Thus, hyperadenylation may be a general signal for RNA degradation. In addition, binding of Mmi1 can affect the efficiency of 3' cleavage. Inactivation of Mmi1 in meiosis allows meiotic expression, through splicing and RNA stabilization, of at least 29 target genes, which are apparently constitutively transcribed.
Turk, Edward M.; Caprara, Mark G.
Yeast Suv3p is a member of the DEXH/D box family of RNA helicases and is a critical component of the mitochondrial degradosome, which also includes a 3′ → 5′ exonuclease, Dss1p. Defects in the degradosome result in accumulation of aberrant transcripts, unprocessed transcripts, and excised group I introns. In addition, defects in SUV3 result in decreased splicing of the aI5β and bI3 group I introns. Whereas a role for Suv3p in RNA degradation is well established, the function of Suv3p in splic...
Ponthier, Julie L.; Schluepen, Christina; Chen, Weiguo; Lersch,Robert A.; Gee, Sherry L.; Hou, Victor C.; Lo, Annie J.; Short, Sarah A.; Chasis, Joel A.; Winkelmann, John C.; Conboy, John G.
Activation of protein 4.1R exon 16 (E16) inclusion during erythropoiesis represents a physiologically important splicing switch that increases 4.1R affinity for spectrin and actin. Previous studies showed that negative regulation of E16 splicing is mediated by the binding of hnRNP A/B proteins to silencer elements in the exon and that downregulation of hnRNP A/B proteins in erythroblasts leads to activation of E16 inclusion. This paper demonstrates that positive regulation of E16 splicing can be mediated by Fox-2 or Fox-1, two closely related splicing factors that possess identical RNA recognition motifs. SELEX experiments with human Fox-1 revealed highly selective binding to the hexamer UGCAUG. Both Fox-1 and Fox-2 were able to bind the conserved UGCAUG elements in the proximal intron downstream of E16, and both could activate E16 splicing in HeLa cell co-transfection assays in a UGCAUG-dependent manner. Conversely, knockdown of Fox-2 expression, achieved with two different siRNA sequences resulted in decreased E16 splicing. Moreover, immunoblot experiments demonstrate mouse erythroblasts express Fox-2, but not Fox-1. These findings suggest that Fox-2 is a physiological activator of E16 splicing in differentiating erythroid cells in vivo. Recent experiments show that UGCAUG is present in the proximal intron sequence of many tissue-specific alternative exons, and we propose that the Fox family of splicing enhancers plays an important role in alternative splicing switches during differentiation in metazoan organisms.
Setlik, R. F.; Garduno-Juarez, R.; Manchester, J. I.; Shibata, M.; Ornstein, R. L.; Rein, R.
A three-dimensional model of the Tetrahymena thermophila group I intron is used to further explore the catalytic mechanism of the transphosphorylation reaction of the cleavage step. Based on the coordinates of the catalytic core model proposed by Michel and Westhof (Michel, F., Westhof, E. J. Mol. Biol. 216, 585-610 (1990)), we first converted their ligation step model into a model of the cleavage step by the substitution of several bases and the removal of helix P9. Next, an attempt to place a trigonal bipyramidal transition state model in the active site revealed that this modified model for the cleavage step could not accommodate the transition state due to insufficient space. A lowering of P1 helix relative to surrounding helices provided the additional space required. Simultaneously, it provided a better starting geometry to model the molecular contacts proposed by Pyle et al. (Pyle, A. M., Murphy, F. L., Cech, T. R. Nature 358, 123-128. (1992)), based on mutational studies involving the J8/7 segment. Two hydrated Mg2+ complexes were placed in the active site of the ribozyme model, using the crystal structure of the functionally similar Klenow fragment (Beese, L.S., Steitz, T.A. EMBO J. 10, 25-33 (1991)) as a guide. The presence of two metal ions in the active site of the intron differs from previous models, which incorporate one metal ion in the catalytic site to fulfill the postulated roles of Mg2+ in catalysis. The reaction profile is simulated based on a trigonal bipyramidal transition state, and the role of the hydrated Mg2+ complexes in catalysis is further explored using molecular orbital calculations.
Southern hybridization analyses of procaryotic DNA from Escherchia coli, λ bacteriophage, and T1 to T7 phages were carried out. The hybridization probes used consisted of DNA restriction fragments derived from the T4 phage intron-containing thymidylate synthase gene (td) and short synthetic oligodeoxynucleotides defining specific exon and intron regions of the gene. It was shown that intact as well as restricted DNA from the T-even phages hybridized not only to both T4 phage td intron- and exon-specific probes but also to probes defining the td 5' (exon I-intron) and 3' (intron-exon II) presplice junctions. These data strongly suggest that, analogous to the T4 phage, only the T2 and T6 phages among the procaryotes tested contain interrupted td genes. The td intervening sequence in each phage is roughly 1 kilobase pair (kb) in size and interrupts the td gene at a site analogous to that in the T4 phage. This was confirmed by data from Northern (RNA) hybridization analysis of td-specific in vitro transcripts of these phage DNAs. [α-32P]GTP in vitro labeling of total RNA from T4 phage-infected cells produced five species of labeled RNAs that were 1, 0.9, 0.83, 0.75, and 0.6 kb in size. Only the 1-, 0.9-, and 0.75-kb species were labeled in RNA from T2- or T6-infected cells. The commonly present 1-kb RNA is the excised td intron, which exists in both linear and circular forms in the respective T-even-phage-infected cells, while the 0.6-kb RNA unique to T4 may be the excised intron derived from the ribonucleotide reductase small subunit gene (nrdB) of the phage. The remaining labeled RNA species are likely candidates for other self-splicing introns
Full Text Available Dihydropyrimidinase (DHP deficiency is an autosomal recessive disease caused by mutations in the DPYS gene. Patients present with highly elevated levels of dihydrouracil and dihydrothymine in their urine, blood and cerebrospinal fluid. The analysis of the effect of mutations in DPYS on pre-mRNA splicing is hampered by the fact that DHP is primarily expressed in liver and kidney cells. The minigene approach can detect mRNA splicing aberrations using cells that do not express the endogenous mRNA. We have used a minigene-based approach to analyze the effects of a presumptive pre-mRNA splicing mutation in two newly identified Chinese pediatric patients with DHP deficiency. Mutation analysis of DPYS showed that both patients were compound heterozygous for a novel intronic mutation c.1443+5G>A in intron 8 and a previously described missense mutation c.1001A>G (p.Q334R in exon 6. Wild-type and the mutated minigene constructs, containing exons 7, 8 and 9 of DPYS, yielded different splicing products after expression in HEK293 cells. The c.1443+5G>A mutation resulted in altered pre-mRNA splicing of the DPYS minigene construct with full skipping of exon 8. Analysis of the DHP crystal structure showed that the deletion of exon 8 severely affects folding, stability and homooligomerization of the enzyme as well as disruption of the catalytic site. Thus, the analysis suggests that the c.1443+5G>A mutation results in aberrant splicing of the pre-mRNA encoding DHP, underlying the DHP deficiency in two unrelated Chinese patients.
The intron is an important component of eukaryotic gene. Extensive studies have been conducted to get a better understanding of its structure and function. This paper presents a brief review of the structure and function of introns in higher plant genes. It is shown that higher plant introns possess structural properties shared by all eukaryotic introns, however, they also exhibit a striking degree of diversity. The process of intron splicing in higher plant genes involves interaction between multiple cis-acting elements and trans-acting factors, such as 5′ splicing site, 3′ splicing site and many protein factors. The process of intron splicing is an important level at which gene expression is regulated. Especially alternative splicing of intron can regulate time and space of gene expression. In addition, some introns in higher plant genes also regulate gene expression by affecting the pattern of gene expression, enhancing the level of gene expression and driving the gene expression.
Pearson, Matthew D.; McGuire, Abigail M; Neafsey, Daniel Edward; Galagan, James E.
Background: Variations in transcript splicing can reveal how eukaryotes recognize intronic splice sites. Retained introns (RIs) commonly appear when the intron definition (ID) mechanism of splice site recognition inconsistently identifies intron-exon boundaries, and cassette exons (CEs) are often caused by variable recognition of splice junctions by the exon definition (ED) mechanism. We have performed a comprehensive survey of alternative splicing across 42 eukaryotes to gain ins...
Boudvillain, M; Pyle, A M
Group II introns are self-splicing RNA molecules that are of considerable interest as ribozymes, mobile genetic elements and examples of folded RNA. Although these introns are among the most common ribozymes, little is known about the chemical and structural determinants for their reactivity. By using nucleotide analog interference mapping (NAIM), it has been possible to identify the nucleotide functional groups (Rp phosphoryls, 2'-hydroxyls, guanosine exocyclic amines, adenosine N7 and N6) that are most important for composing the catalytic core of the intron. The majority of interference effects occur in clusters located within the two catalytically essential Domains 1 and 5 (D1 and D5). Collectively, the NAIM results indicate that key tetraloop-receptor interactions display a specific chemical signature, that the epsilon-epsilon' interaction includes an elaborate array of additional features and that one of the most important core structures is an uncharacterized three-way junction in D1. By combining NAIM with site-directed mutagenesis, a new tertiary interaction, kappa-kappa', was identified between this region and the most catalytically important section of D5, adjacent to the AGC triad in stem 1. Together with the known zeta-zeta' interaction, kappa-kappa' anchors D5 firmly into the D1 scaffold, thereby presenting chemically essential D5 functionalities for participation in catalysis. PMID:9843513
Full Text Available Exonization of Alu elements is a major mechanism for birth of new exons in primate genomes. Prior analyses of expressed sequence tags show that almost all Alu-derived exons are alternatively spliced, and the vast majority of these exons have low transcript inclusion levels. In this work, we provide genomic and experimental evidence for diverse splicing patterns of exonized Alu elements in human tissues. Using Exon array data of 330 Alu-derived exons in 11 human tissues and detailed RT-PCR analyses of 38 exons, we show that some Alu-derived exons are constitutively spliced in a broad range of human tissues, and some display strong tissue-specific switch in their transcript inclusion levels. Most of such exons are derived from ancient Alu elements in the genome. In SEPN1, mutations of which are linked to a form of congenital muscular dystrophy, the muscle-specific inclusion of an Alu-derived exon may be important for regulating SEPN1 activity in muscle. Realtime qPCR analysis of this SEPN1 exon in macaque and chimpanzee tissues indicates human-specific increase in its transcript inclusion level and muscle specificity after the divergence of humans and chimpanzees. Our results imply that some Alu exonization events may have acquired adaptive benefits during the evolution of primate transcriptomes.
There exists a single nucleotide polymorphism, G or T, at the first base of the donor splice site of waxy gene intron 1 in rice. In order to study the relationship between the first base of the donor splice site of waxy gene intron 1 and amylose content in rice, the one-step PCR method was used to determine whether it is G or T in 220 Yunnan indigenous rice varieties from 14 districts, 55 towns/counties of Yunnan Province, and 101 varieties of which were validated by the PCR-Acc I method. According to the G/T polymorphism, 164 rice varieties showed GG-genotype, while the other 56 fell into TT-genotype, accounting for 74.5% and 25.5% of all the test varieties, respectively. When all the rice varieties were divided into indica and japonica subspecies, it was found that 80.5% of indica rice and 67.0% of japonica rice belonged to GG-genotype. The rice varieties with GG-genotype had significantly higher amylose content (18.95% on average) than those with TT-genotype (all below 16%), but 33 rice varieties with GG-genotype still had low amylose content ranging from 3.91% to 15.93%, and most of them came from the Dai minority area in the Southwest of Yunnan Province. However, there was no significant difference in the mean amylose content of the same GG or TT genotypes between indica and japonica rice,suggesting that different genetic backgrounds, indica or japonica, had no effect on amylose content. The coefficient of correlation between the genotype and amylose content was 0.733 (P＜0.01).
Kenji Fukunaga; Naoko Izuka; Takehiro Hachiken; Satoshi Mizuguchi; Hidemi Ito; Katsuyuki Ichitani
We investigated genetic variation of a rice HEADING DATE 1(HD1) homolog in foxtail millet. First, we searched for a rice HD1 homolog in a foxtail millet genome sequence and designed primers to amplify the entire coding sequence of the gene. We compared full HD1 gene sequences of 11 accessions (including Yugu 1, a Chinese cultivar used for genome sequencing) from various regions in Europe and Asia, found a nucleotide substitution at a putative splice site of intron 1, and designated the accessions with the nucleotide substitution as carrying a splicing variant. We verified by RT-PCR that this single nucleotide substitution causes aberrant splicing of intron 1. We investigated the geographical distribution of the splicing variant in 480 accessions of foxtail millet from various regions of Europe and Asia and part of Africa by dCAPS and found that the splicing variant is broadly distributed in Europe and Asia. Differences of heading times between accessions with wild type allele of the HD1 gene and those with the splicing variant allele were unclear. We also investigated variation in 13 accessions of ssp. viridis, the wild ancestor, and the results suggested that the wild type is predominant in the wild ancestor.
Raghavan, Rahul; Miller, Scott R; Hicks, Linda D; Minnick, Michael F
We describe the presence and characteristics of two self-splicing group I introns in the sole 23S rRNA gene of Coxiella burnetii. The two group I introns, Cbu.L1917 and Cbu.L1951, are inserted at sites 1917 and 1951 (Escherichia coli numbering), respectively, in the 23S rRNA gene of C. burnetii. Both introns were found to be self-splicing in vivo and in vitro even though the terminal nucleotide of Cbu.L1917 is adenine and not the canonical conserved guanine, termed OmegaG, found in Cbu.L1951 and all other group I introns described to date. Predicted secondary structures for both introns were constructed and revealed that Cbu.L1917 and Cbu.L1951 were group IB2 and group IA3 introns, respectively. We analyzed strains belonging to eight genomic groups of C. burnetii to determine sequence variation and the presence or absence of the elements and found both introns to be highly conserved (>/=99%) among them. Although phylogenetic analysis did not identify the specific identities of donors, it indicates that the introns were likely acquired independently; Cbu.L1917 was acquired from other bacteria like Thermotoga subterranea and Cbu.L1951 from lower eukaryotes like Acanthamoeba castellanii. We also confirmed the fragmented nature of mature 23S rRNA in C. burnetii due to the presence of an intervening sequence. The presence of three selfish elements in C. burnetii's 23S rRNA gene is very unusual for an obligate intracellular bacterium and suggests a recent shift to its current lifestyle from a previous niche with greater opportunities for lateral gene transfer. PMID:17644584
Hagiwara, Y; Nishio, H; Kitoh, Y; Takeshima, Y; Narita, N; Wada, H; Yokoyama, M.; Nakamura, H; Matsuo, M.
The mutations in one-third of Duchenne and Becker muscular dystrophy patients remain unknown, as they do not involve gross rearrangements of the dystrophin gene. We now report a defect in the splicing of precursor mRNA (pre-mRNA), resulting from a maternally inherited mutation of the dystrophin gene in a patient with Becker muscular dystrophy. This defect results from a G-to-T transversion at the terminal nucleotide of exon 13, within the 5' splice site of intron 13, and causes complete skipp...
Hagiwara, Yoko; Nishio, Hisahide; Kitoh, Yoshihiko; Takeshima, Yasuhiro; Narita, Naoko; Wada, Hiroko; Yokoyama, Mitsuhiro; Nakamura, Hajime; Matsuo, Masafumi (Kobe Univ. School of Medicine (Japan))
The mutations in one-third of Duchenne and Becker muscular dystrophy patients remain unknown, as they do not involve gross rearrangements of the dystrophin gene. The authors now report a defect in the splicing of precursor mRNA (pre-mRNA), resulting from a maternally inherited mutation of the dystrophin gene in a patient with Becker muscular dystrophy. This defect results from a G-to-T transversion at the terminal nucleotide of exon 13, within the 5[prime] splice site of intron 13, and causes complete skipping of exon 13 during processing of dystrophin pre-mRNA. The predicted polypeptide encoded by the aberrant mRNA is a truncated dystrophin lacking 40 amino acids from the amino-proximal end of the rod domain. This is the first report of an intraexon point mutation that completely inactivates a 5[prime] splice donor site in dystrophin pre-mRNA. Analysis of the genomic context of the G[sup [minus]1]-to-T mutation at the 5[prime] splice site supports the exon-definition model of pre-mRNA splicing and contributes to the understanding of splice-site selection. 48 refs., 5 figs.
Hagiwara, Y; Nishio, H; Kitoh, Y; Takeshima, Y; Narita, N; Wada, H; Yokoyama, M; Nakamura, H; Matsuo, M
The mutations in one-third of Duchenne and Becker muscular dystrophy patients remain unknown, as they do not involve gross rearrangements of the dystrophin gene. We now report a defect in the splicing of precursor mRNA (pre-mRNA), resulting from a maternally inherited mutation of the dystrophin gene in a patient with Becker muscular dystrophy. This defect results from a G-to-T transversion at the terminal nucleotide of exon 13, within the 5' splice site of intron 13, and causes complete skipping of exon 13 during processing of dystrophin pre-mRNA. The predicted polypeptide encoded by the aberrant mRNA is a truncated dystrophin lacking 40 amino acids from the amino-proximal end of the rod domain. This is the first report of an intraexon point mutation that completely inactivates a 5' splice donor site in dystrophin pre-mRNA. Analysis of the genomic context of the G-1-to-T mutation at the 5' splice site supports the exon-definition model of pre-mRNA splicing and contributes to the understanding of splice-site selection. PMID:8279470
Solera, J. (Unidades de Genetica Molecular, Madrid (Spain)); Magallon, M.; Martin-Villar, J. (Hemofilia Hospital, Madrid (Spain)); Coloma, A. (Departamento deBioquimica de la Facultad de Medicina de la Universidad Autonoma, Madrid (Spain))
DNA from a patient with severe hemophilia B was evaluated by RFLP analysis, producing results which suggested the existence of a partial deletion within the factor IX gene. The deletion was further localized and characterized by PCR amplification and sequencing. The altered allele has a 4,442-bp deletion which removes both the donor splice site located at the 5[prime] end of intron d and the two last coding nucleotides located at the 3[prime] end of exon IV in the normal factor IX gene; this fragment has been inserted in inverted orientation. Two homologous sequences have been discovered at the ends of the deleted DNA fragment.
Shultz, Jacqueline C.; Rachel W Goehe; Murudkar, Charuta S.; Wijesinghe, Dayanjan S.; Mayton, Eric K.; Massiello, Autumn; Hawkins, Amy J.; Mukerjee, Prabhat; Pinkerman, Ryan L.; Park, Margaret A; Chalfant, Charles E.
Increasing evidence points to the functional importance of alternative splice variations in cancer pathophysiology with the alternative pre-mRNA processing of caspase 9 as one example. In this study, we delve into the underlying molecular mechanisms that regulate the alternative splicing of caspase 9. Specifically, the pre-mRNA sequence of caspase 9 was analyzed for RNA cis-elements known to interact with SRSF1, a required enhancer for caspase 9 RNA splicing. This analysis revealed thirteen p...
Hellsten, Uffe; Aspden, Julie L.; Rio, Donald C.; Rokhsar, Daniel S.
An intron is an extended genomic feature whose function requires multiple constrained positions - donor and acceptor splice sites, a branch point, a polypyrimidine tract and suitable splicing enhancers - that may be distributed over hundreds or thousands of nucleotides. New introns are therefore unlikely to emerge by incremental accumulation of functional sub-elements. Here we demonstrate that a functional intron can be created de novo in a single step by a segmental genomic duplication. This experiment recapitulates in vivo the birth of an intron that arose in the ancestral jawed vertebrate lineage nearly half a billion years ago.
Nielsen, Henrik; Johansen, Steinar
Group I introns are genetic elements interrupting functional genes. They are removed from precursors at the RNA level and most catalyze their own splicing. The catalytic part of these constitutes one of the major classes of catalytic RNAs, the group I ribozymes. However, group I introns have a lot...... more to offer than their own elimination by splicing. Intron RNA can circularize in at least three different ways and introns are mobile both at the DNA and RNA level. Some group I introns have a very complex organization incorporating functional genes and other sequence elements and have established...... deep relationships with their host genomes. Finally, group I introns can develop into new ribozymes with new biological functions...
HE Miao; LI Jidong; ZHANG Shanghong
A database called eukaryotic intron database (EID) was developed based on the data from GenBank.Studies on the statistical characteristics of EID show that there were 103,848 genes,478,484 introns,and 582,332 exons,with an average of 4.61 introns and 5.61 exons per gene.Introns of 40-120 nt in length were abundant in the database.Results of the statistical analysis on the data from nine model species showed that in eukaryotes,higher species do not necessarily have more introns or exons in a gene than lower species.Furthermore,characteristics of EID,such as intron phase,distribution of different splice sites,and the relationship between genome size and intron proportion or intron density,have been studied.
Madeleine Zerbato; Nathalie Holic; Sophie Moniot-Frin; Dina Ingrao; Anne Galy; Javier Perea
Group II introns are self-splicing mobile elements found in prokaryotes and eukaryotic organelles. These introns propagate by homing into precise genomic locations, following assembly of a ribonucleoprotein complex containing the intron-encoded protein (IEP) and the spliced intron RNA. Engineered group II introns are now commonly used tools for targeted genomic modifications in prokaryotes but not in eukaryotes. We speculate that the catalytic activation of currently known group II introns is...
Trantírková, Silvie; Paris, Zdeněk; Sturm, N. R.; Campbell, D. A.; Lukeš, Julius
Roč. 35, č. 4 (2005), s. 359-366. ISSN 0020-7519 R&D Projects: GA AV ČR IAA5022302 Grant ostatní: NIH(US) AI34536; NIH(US) AI056034 Institutional research plan: CEZ:AV0Z60220518 Keywords : splicing * Trypanosoma * RNA interference Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.346, year: 2005
Osman, Erkan Y; Yen, Pei-Fen; Lorson, Christian L.
Spinal muscular atrophy (SMA) is a neurodegenerative disease caused by loss of survival motor neuron-1 (SMN1). A nearly identical copy gene, SMN2, is present in all SMA patients. Although the SMN2 coding sequence has the potential to produce full-length SMN, nearly 90% of SMN2-derived transcripts are alternatively spliced and encode a truncated protein. SMN2, however, is an excellent therapeutic target. Previously, we developed antisense-based oligonucleotides (bifunctional RNAs) that specifi...
Schwartz, Schraga; Silva, João(CFTP, Departamento de Física, Instituto Superior Técnico, Universidade de Lisboa, Avenida Rovisco Pais 1, 1049, Lisboa, Portugal); Burstein, David; Pupko, Tal; Eyras, Eduardo; Ast, Gil
Introns are among the hallmarks of eukaryotic genes. Splicing of introns is directed by three main splicing signals: the 5′ splice site (5′ss), the branch site (BS), and the polypyrimdine tract/3′splice site (PPT-3′ss). To study the evolution of these splicing signals, we have conducted a systematic comparative analysis of these signals in over 1.2 million introns from 22 eukaryotes. Our analyses suggest that all these signals have dramatically evolved: The PPT is weak among most fungi, inter...
Shepard, PJ; Hertel, KJ
Pre-mRNA splicing is carried out by the spliceosome, which identifies exons and removes intervening introns. Alternative splicing in higher eukaryotes results in the generation of multiple protein isoforms from gene transcripts. The extensive alternative splicing observed implies a flexibility of the spliceosome to identify exons within a given pre-mRNA. To reach this flexibility, splice-site selection in higher eukaryotes has evolved to depend on multiple parameters such as splice-site stren...
Koonin Eugene V
Full Text Available Abstract Background Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes. Results I argue that several lines of evidence now suggest a coherent solution to the introns-early versus introns-late debate, and the emerging picture of intron evolution integrates aspects of both views although, formally, there seems to be no support for the original version of introns-early. Firstly, there is growing evidence that spliceosomal introns evolved from group II self-splicing introns which are present, usually, in small numbers, in many bacteria, and probably, moved into the evolving eukaryotic genome from the α-proteobacterial progenitor of the mitochondria. Secondly, the concept of a primordial pool of 'virus-like' genetic elements implies that self-splicing introns are among the most ancient genetic entities. Thirdly, reconstructions of the ancestral state of eukaryotic genes suggest that the last common ancestor of extant eukaryotes had an intron
Shao, Wei; Zhao, Qiong-Yi; Wang, Xiu-Ye; Xu, Xin-Yan; Tang, Qing; Li, Muwang; Li, Xuan; Xu, Yong-Zhen
Alternative splicing and trans-splicing events have not been systematically studied in the silkworm Bombyx mori. Here, the silkworm transcriptome was analyzed by RNA-seq. The authors identified 320 novel genes, modified 1140 gene models, and found thousands of alternative splicing and 58 trans-splicing events. Studies of three SR proteins show that both their alternative splicing patterns and mRNA products are conserved from insect to human, and one isoform of Srsf6 with a retained intron is ...
de Roos Albert DG
Full Text Available Abstract Background The timing of the origin of introns is of crucial importance for an understanding of early genome architecture. The Exon theory of genes proposed a role for introns in the formation of multi-exon proteins by exon shuffling and predicts the presence of conserved splice sites in ancient genes. In this study, large-scale analysis of potential conserved splice sites was performed using an intron-exon database (ExInt derived from GenBank. Results A set of conserved intron positions was found by matching identical splice sites sequences from distantly-related eukaryotic kingdoms. Most amino acid sequences with conserved introns were homologous to consensus sequences of functional domains from conserved proteins including kinases, phosphatases, small GTPases, transporters and matrix proteins. These included ancient proteins that originated before the eukaryote-prokaryote split, for instance the catalytic domain of protein phosphatase 2A where a total of eleven conserved introns were found. Using an experimental setup in which the relation between a splice site and the ancientness of its surrounding sequence could be studied, it was found that the presence of an intron was positively correlated to the ancientness of its surrounding sequence. Intron phase conservation was linked to the conservation of the gene sequence and not to the splice site sequence itself. However, no apparent differences in phase distribution were found between introns in conserved versus non-conserved sequences. Conclusion The data confirm an origin of introns deep in the eukaryotic branch and is in concordance with the presence of introns in the first functional protein modules in an 'Exon theory of genes' scenario. A model is proposed in which shuffling of primordial short exonic sequences led to the formation of the first functional protein modules, in line with hypotheses that see the formation of introns integral to the origins of genome evolution
Danielson, K.G.; Fazzio, A.; Cohen, I.; Cannizzaro, L.A.; Eichstetter, I.; Iozzo, R.V. (Thomas Jefferson Univ., Philadelphia, PA (United States))
Decorin is a chondroitin/dermatan sulfate proteoglycan expressed by most vascular and avascular connective tissues and, because of its ability to interact with collagen and growth factors, has been implicated in the control of matrix assembly and cellular growth. To understand the molecular mechanisms involved in regulating its tissue expression, we have isolated a number of genomic clones encoding the complete decorin gene. The human decorin gene spans over 38 kb of continuous DNA sequence and contains eight exons and very large introns, two of which are 5.4 and > 13.2 kb. We have discovered two alternatively spliced leader exons, exons Ia and Ib, in the 5[prime] untranslated region. These exons were identified by cloning and sequencing cDNAs obtained by polymerase chain reaction amplification of a fibroblast cDNA library. Using Northern blotting or reverse transcriptase PCR, we detected the two leader exons in a variety of mRNAs isolated from human cell lines and tissues. Interestingly, sequences highly (74-87%) homologous to exons Ia and lb are found in the 5[prime]untranslated region of avian and bovine decorin, respectively. This high degree of conservation among species suggests regulatory functions for these leader exons. In the 3' untranslated region there are several polyadenylation sites, and at least two of these sites could give rise to the transcripts of [approx]1.6 and [approx]1.9 kb, typically detected in a variety of tissues and cells. Using a genomic clone as the labeled probe and in situ hybridization of human metaphase chromosomes, we have mapped the decorin gene to the discrete region of human chromosome 12q23. This sturdy provides the molecular basis for discerning the transcriptional control of the decorin gene and offers the opportunity to investigate genetic disorders linked to this important human gene. 57 refs., 11 figs., 3 tabs.
Ke, Shengdong; CHASIN, LAWRENCE A.
Splicing is a crucial process in gene expression in higher organisms because: (1) most vertebrate genes contain introns; and (2) alternative splicing is primarily responsible for increasing proteomic complexity and functional diversity. Intron definition, the coordination across an intron, is a mandatory step in the splicing process. However, exon definition, the coordination across an exon, is also thought to be required for the splicing of most vertebrate exons. Recent investigations of exo...
Full Text Available Many spliceosomal introns exist in the eukaryotic nuclear genome. Despite much research, the evolution of spliceosomal introns remains poorly understood. In this paper, we tried to gain insights into intron evolution from a novel perspective by comparing the gene structures of cytoplasmic ribosomal proteins (CRPs and mitochondrial ribosomal proteins (MRPs, which are held to be of archaeal and bacterial origin, respectively. We analyzed 25 homologous pairs of CRP and MRP genes that together had a total of 527 intron positions. We found that all 12 of the intron positions shared by CRP and MRP genes resulted from parallel intron gains and none could be considered to be "conserved," i.e., descendants of the same ancestor. This was supported further by the high frequency of proto-splice sites at these shared positions; proto-splice sites are proposed to be sites for intron insertion. Although we could not definitively disprove that spliceosomal introns were already present in the last universal common ancestor, our results lend more support to the idea that introns were gained late. At least, our results show that MRP genes were intronless at the time of endosymbiosis. The parallel intron gains between CRP and MRP genes accounted for 2.3% of total intron positions, which should provide a reliable estimate for future inferences of intron evolution.
Monat, Caroline; Cousineau, Benoit
Group II introns are large RNA enzymes that can excise as lariats, circles or in a linear form through branching, circularization or hydrolysis, respectively. Branching is by far the main and most studied splicing pathway while circularization was mostly overlooked. We previously showed that removal of the branch point A residue from Ll.LtrB, the group II intron from Lactococcus lactis, exclusively leads to circularization. However, the majority of the released intron circles harbored an additional C residue of unknown origin at the splice junction. Here, we exploited the Ll.LtrB-ΔA mutant to study the circularization pathway of bacterial group II introns in vivo. We demonstrated that the non-encoded C residue, present at the intron circle splice junction, corresponds to the first nt of exon 2. Intron circularization intermediates, harboring the first 2 or 3 nts of exon 2, were found to accumulate showing that branch point removal leads to 3' splice site misrecognition. Traces of properly ligated exons were also detected functionally confirming that a small proportion of Ll.LtrB-ΔA circularizes accurately. Overall, our data provide the first detailed molecular analysis of the group II intron circularization pathway and suggests that circularization is a conserved splicing pathway in bacteria. PMID:26673697
Stoddard, Barry; Belfort, Marlene
Homing endonucleases have long been known as the orchestrators of intron mobility. However, the extent of their influence on the intron and its genetic and cellular environment is still being elucidated. The accompanying paper emphasizes the importance of temporal control of endonuclease expression on splicing, expression of the host gene and cellular metabolism, while it raises questions to guide future inquiry.
Choi, T; Huang, M; Gorman, C; Jaenisch, R
To investigate the role of splicing in the regulation of gene expression, we have generated transgenic mice carrying the human histone H4 promoter linked to the bacterial gene for chloramphenicol acetyltransferase (CAT), with or without a heterologous intron in the transcription unit. We found that CAT activity is 5- to 300-fold higher when the transgene incorporates a hybrid intron than with an analogous transgene precisely deleted for the intervening sequences. This hybrid intron, consistin...
Sibley, Christopher R; Emmett, Warren; Blazquez, Lorea; Faro, Ana; Haberman, Nejc; Briese, Michael; Trabzuni, Daniah; Ryten, Mina; Weale, Michael E; Hardy, John; Modic, Miha; Curk, Tomaž; Wilson, Stephen W; Plagnol, Vincent; Ule, Jernej
It is generally believed that splicing removes introns as single units from precursor messenger RNA transcripts. However, some long Drosophila melanogaster introns contain a cryptic site, known as a recursive splice site (RS-site), that enables a multi-step process of intron removal termed recursive splicing. The extent to which recursive splicing occurs in other species and its mechanistic basis have not been examined. Here we identify highly conserved RS-sites in genes expressed in the mammalian brain that encode proteins functioning in neuronal development. Moreover, the RS-sites are found in some of the longest introns across vertebrates. We find that vertebrate recursive splicing requires initial definition of an 'RS-exon' that follows the RS-site. The RS-exon is then excluded from the dominant mRNA isoform owing to competition with a reconstituted 5' splice site formed at the RS-site after the first splicing step. Conversely, the RS-exon is included when preceded by cryptic promoters or exons that fail to reconstitute an efficient 5' splice site. Most RS-exons contain a premature stop codon such that their inclusion can decrease mRNA stability. Thus, by establishing a binary splicing switch, RS-sites demarcate different mRNA isoforms emerging from long genes by coupling cryptic elements with inclusion of RS-exons. PMID:25970246
Ismail, Said I.; Kingsman, Susan M.; Kingsman, Alan J.; Uden, Mark
The inclusion of retrovirus-derived introns within retrovirus-based expression vectors leads to a fraction of the resulting transcripts being spliced. Such splicing has been shown to markedly improve expression (W. J. Krall et al., Gene Ther. 3:37–48, 1996). One way to improve upon this still further might involve the use of more efficient introns instead of those from the provirus. Currently, however, incorporation of such introns remains self-defeating since they are removed in the nucleus ...
Christie, Michael; Carroll, Bernard J.
Transposons and viruses are generally devoid of introns and are prime targets for small interfering RNAs (siRNAs) and RNA silencing. Conversely, endogenous genes often contain introns and are not usually subjected to post-transcriptional gene silencing by siRNAs. In a recent study, we reported that efficient intron splicing directly suppresses siRNA biogenesis and RNA silencing of a Green Fluorescence Protein (GFP) transgene. Splicing-mediated suppression of GFP silencing was dependent on ABH...
Ma, Ming-Yue; Lan, Xin-Ran; Niu, Deng-Ke
The origin and subsequent accumulation of spliceosomal introns are prominent events in the evolution of eukaryotic gene structure. However, the mechanisms underlying intron gain remain unclear because there are few proven cases of recently gained introns. In an RNA-dependent RNA polymerase (RdRp) gene, we found that a tandem duplication occurred after the divergence of potato and its wild relatives among other Solanum plants. The duplicated sequence crosses the intron-exon boundary of the first intron and the second exon. A new intron was detected at this duplicated region, and it includes a small previously exonic segment of the upstream copy of the duplicated sequence and the intronic segment of the downstream copy of the duplicated sequence. The donor site of this new intron was directly obtained from the small previously exonic segment. Most of the splicing signals were inherited directly from the parental intron/exon structure, including a putative branch site, the polypyrimidine tract, the 3' splicing site, two putative exonic splicing enhancers, and the GC contents differed between the intron and exon. In the widely cited model of intron gain by tandem genomic duplication, the duplication of an AGGT-containing exonic segment provides the GT and AG splicing sites for the new intron. Our results illustrate that the tandem duplication model of intron gain should be diverse in terms of obtaining the proper splicing signals. PMID:27547574
Full Text Available Abstract Background Alternative splicing has been reported in various eukaryotic groups including plants, apicomplexans, diatoms, amoebae, animals and fungi. However, whether widespread alternative splicing has evolved independently in the different eukaryotic groups or was inherited from their last common ancestor, and may therefore predate multicellularity, is still unknown. To better understand the origin and evolution of alternative splicing and its usage in diverse organisms, we studied alternative splicing in 12 eukaryotic species, comparing rates of alternative splicing across genes of different functional classes, cellular locations, intron/exon structures and evolutionary origins. Results For each species, we find that genes from most functional categories are alternatively spliced. Ancient genes (shared between animals, fungi and plants show high levels of alternative splicing. Genes with products expressed in the nucleus or plasma membrane are generally more alternatively spliced while those expressed in extracellular location show less alternative splicing. We find a clear correspondence between incidence of alternative splicing and intron number per gene both within and between genomes. In general, we find several similarities in patterns of alternative splicing across these diverse eukaryotes. Conclusion Along with previous studies indicating intron-rich genes with weak intron boundary consensus and complex spliceosomes in ancestral organisms, our results suggest that at least a simple form of alternative splicing may already have been present in the unicellular ancestor of plants, fungi and animals. A role for alternative splicing in the evolution of multicellularity then would largely have arisen by co-opting the preexisting process.
Sun, Xiaoyong; Zuo, Fenghua; Ru, Yuanbin; Guo, Jiqiang; Yan, Xiaoyan; Sablok, Gaurav
Alternative splicing plays a key role in the regulation of the central dogma. Four major types of alternative splicing have been classified as intron retention, exon skipping, alternative 5 splice sites or alternative donor sites, and alternative 3 splice sites or alternative acceptor sites. A few algorithms have been developed to detect splice junctions from RNA-Seq reads. However, there are few tools targeting at the major alternative splicing types at the exon/intron level. This type of analysis may reveal subtle, yet important events of alternative splicing, and thus help gain deeper understanding of the mechanism of alternative splicing. This paper describes a user-friendly R package, extracting, annotating and analyzing alternative splicing types for sequence alignment files from RNA-Seq. SplicingTypesAnno can: (1) provide annotation for major alternative splicing at exon/intron level. By comparing the annotation from GTF/GFF file, it identifies the novel alternative splicing sites; (2) offer a convenient two-level analysis: genome-scale annotation for users with high performance computing environment, and gene-scale annotation for users with personal computers; (3) generate a user-friendly web report and additional BED files for IGV visualization. SplicingTypesAnno is a user-friendly R package for extracting, annotating and analyzing alternative splicing types at exon/intron level for sequence alignment files from RNA-Seq. It is publically available at https://sourceforge.net/projects/splicingtypes/files/ or http://genome.sdau.edu.cn/research/software/SplicingTypesAnno.html. PMID:25720307
Wang, Kai; Singh, Darshan; Zeng, Zheng; Coleman, Stephen J; Huang, Yan; Savich, Gleb L; He, Xiaping; Mieczkowski, Piotr; Grimm, Sara A; Perou, Charles M; MacLeod, James N; Chiang, Derek Y; Prins, Jan F; Liu, Jinze
The accurate mapping of reads that span splice junctions is a critical component of all analytic techniques that work with RNA-seq data. We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU and memory efficiency. MapSplice can be applied to both short (<75 bp) and long reads (≥ 75 bp). MapSplice is not dependent on splice site features or intron length, consequently it can detect novel canonical as well as non-canonical splices. MapSplice leverages the quality and diversity of read alignments of a given splice to increase accuracy. We demonstrate that MapSplice achieves higher sensitivity and specificity than TopHat and SpliceMap on a set of simulated RNA-seq data. Experimental studies also support the accuracy of the algorithm. Splice junctions derived from eight breast cancer RNA-seq datasets recapitulated the extensiveness of alternative splicing on a global level as well as the differences between molecular subtypes of breast cancer. These combined results indicate that MapSplice is a highly accurate algorithm for the alignment of RNA-seq reads to splice junctions. Software download URL: http://www.netlab.uky.edu/p/bioinfo/MapSplice. PMID:20802226
Cyril Férandon; Serge Moukha; Philippe Callac; Jean-Pierre Benedetto; Michel Castroviejo; Gérard Barroso
In eukaryotes, introns are located in nuclear and organelle genes from several kingdoms. Large introns (up to 5 kbp) are frequent in mitochondrial genomes of plant and fungi but scarce in Metazoa, even if these organisms are grouped with fungi among the Opisthokonts. Mitochondrial introns are classified in two groups (I and II) according to their RNA secondary structure involved in the intron self-splicing mechanism. Most of these mitochondrial group I introns carry a “Homing Endonuclease Gen...
Full Text Available BACKGROUND: Group II introns are RNA enzymes that splice themselves from pre-mRNA transcripts. Most bacterial group II introns harbour an open reading frame (ORF, coding for a protein with reverse transcriptase, maturase and occasionally DNA binding and endonuclease activities. Some ORF-containing group II introns were shown to be mobile retroelements that invade new DNA target sites. From an evolutionary perspective, group II introns are hypothesized to be the ancestors of the spliceosome-dependent nuclear introns and the small nuclear RNAs (snRNAs--U1, U2, U4, U5 and U6 that are important functional elements of the spliceosome machinery. The ability of some group II introns fragmented in two or three pieces to assemble and undergo splicing in trans supports the theory that spliceosomal snRNAs evolved from portions of group II introns. METHODOLOGY/PRINCIPAL FINDINGS: We used a transposon-based genetic screen to explore the ability of the Ll.LtrB group II intron from the Gram-positive bacterium Lactococcus lactis to be fragmented into three pieces in vivo. Trans-splicing tripartite variants of Ll.LtrB were selected using a highly efficient and sensitive trans-splicing/conjugation screen. We report that numerous fragmentation sites located throughout Ll.LtrB support tripartite trans-splicing, showing that this intron is remarkably tolerant to fragmentation. CONCLUSIONS/SIGNIFICANCE: This work unveils the great versatility of group II intron fragments to assemble and accurately trans-splice their flanking exons in vivo. The selected introns represent the first evidence of functional tripartite group II introns in bacteria and provide experimental support for the proposed evolutionary relationship between group II introns and snRNAs.
Abstract Background Previous studies have indicated that the wide variation in intron density (the number of introns per gene) among different eukaryotes largely reflects varying degrees of intron loss during evolution. The most popular model, which suggests that organisms lose introns through a mechanism in which reverse-transcribed cDNA recombines with the genomic DNA, concerns only one mutational force. Hypothesis Using exons as the units of splicing-site recognition, exon definition const...
Hastings, Michelle L.; Eric Allemand; Duelli, Dominik M.; Michael P Myers; Krainer, Adrian R.
Pre-mRNA splicing is a crucial step in gene expression, and accurate recognition of splice sites is an essential part of this process. Splice sites with weak matches to the consensus sequences are common, though it is not clear how such sites are efficiently utilized. Using an in vitro splicing-complementation approach, we identified PUF60 as a factor that promotes splicing of an intron with a weak 3' splice-site. PUF60 has homology to U2AF(65), a general splicing factor that facilitates 3' s...
Manuel Irimia; Jakob Lewin Rukov; Scott William Roy
Alternative splicing is tightly regulated in a spatio-temporal and quantitative manner. This regulation is achieved by a complex interplay between spliceosomal (trans) factors that bind to different sequence (cis) elements. cis-elements reside in both introns and exons and may either enhance or silence splicing. Differential combinations of cis-elements allows for a huge diversity of overall splicing signals, together comprising a complex ‘splicing code’. Many cis-elements have been identifie...
Group I intron ribozymes constitute one of the main classes of ribozymes and have been a particularly important model in the discovery of key concepts in RNA biology as well as in the development of new methods. Compared to other ribozyme classes, group I intron ribozymes display considerable...... intronic products of these pathways have the potential to integrate into targets and to form various types of circular RNA molecules. Thus, group I intron ribozymes and associated elements found within group I introns is a rich source of biological phenomena. This chapter provides a strategy and protocols...... for initial characterization of new group I intron ribozymes....
Rous sarcoma virus (RSV) requires incomplete splicing of its viral transcripts to maintain efficient replication. A splicing inhibitor element, the negative regulator of splicing (NRS), is located near the 5' end of the RNA but the significance of this positioning is not known. In a heterologous intron the NRS functions optimally when positioned close to the authentic 5' splice site. This observation led us to investigate the basis of the position dependence. Four explanations were put forth and stressed the role of three major elements involved in splicing, the 3' splice site, the 5' splice site, and the 5' end cap structure. NRS function was unrelated to its position relative to the 3' splice site or the cap structure and appeared to depend on its position relative to the authentic 5' splice site. We conclude that position dependence may reflect distance constraints necessary for competition of the NRS with the authentic 5' splice site for pairing with the 3' splice sites
Kevelam, Sietske H; Taube, Jennifer R; van Spaendonk, Rosalina M L;
causal mutations. In silico analysis of effects of the mutations on splicing and RNA folding was performed. In vitro gene splicing was examined in RNA from patients' fibroblasts and an immortalized immature oligodendrocyte cell line after transfection with mutant minigene splicing constructs. RESULTS......: All patients had unusual hemizygous mutations of PLP1 located in exon 3B (one deletion, one missense and two silent), which is spliced out in isoform DM20, or in intron 3 (five mutations). The deletion led to truncation of PLP1, but not DM20. Four mutations were predicted to affect PLP1/DM20...... alternative splicing by creating exonic splicing silencer motifs or new splice donor sites or by affecting the local RNA structure of the PLP1 splice donor site. Four deep intronic mutations were predicted to destabilize a long-distance interaction structure in the secondary PLP1 RNA fragment involved in...
Irimia, Manuel; Rukov, Jakob Lewin; Penny, David;
, and may therefore predate multicellularity, is still unknown. To better understand the origin and evolution of alternative splicing and its usage in diverse organisms, we studied alternative splicing in 12 eukaryotic species, comparing rates of alternative splicing across genes of different functional...... classes, cellular locations, intron/exon structures and evolutionary origins. RESULTS: For each species, we find that genes from most functional categories are alternatively spliced. Ancient genes (shared between animals, fungi and plants) show high levels of alternative splicing. Genes with products...... expressed in the nucleus or plasma membrane are generally more alternatively spliced while those expressed in extracellular location show less alternative splicing. We find a clear correspondence between incidence of alternative splicing and intron number per gene both within and between genomes. In general...
Splicing is the process that removes introns and joins exons from pre-mesenger RNA (pre-mRNA). It is an essential step in pre-mRNA processing that form the mature RNA. Microarray data indicates that approximately 75% of human genes produce transcripts that are alternatively spliced. Alternative splicing is one of the major mechanisms that ultimately generate high number of protein isoforms from a limited number of genes. The proper catalysis and regulation of alternative splice site selection...
Dabeva, Mariana D.; Post-Beittenmiller, Martha A.; Warner, Jonathan R.
The gene for a yeast ribosomal protein, RPL32, contains a single intron. The product of this gene appears to participate in feedback control of the splicing of the intron from the transcript. This autogenous regulation of splicing provides a striking analogy to the autogenous regulation of translation of ribosomal proteins in Escherichia coli.
Dabeva, M. D.; Post-Beittenmiller, M A; Warner, J R
The gene for a yeast ribosomal protein, RPL32, contains a single intron. The product of this gene appears to participate in feedback control of the splicing of the intron from the transcript. This autogenous regulation of splicing provides a striking analogy to the autogenous regulation of translation of ribosomal proteins in Escherichia coli.
Full Text Available Abstract Background Alternative splicing increases protein diversity by generating multiple transcript isoforms from a single gene through different combinations of exons or through different selections of splice sites. It has been reported that RNA secondary structures are involved in alternative splicing. Here we perform a genomic study of RNA secondary structures around splice sites in humans (Homo sapiens, mice (Mus musculus, fruit flies (Drosophila melanogaster, and nematodes (Caenorhabditis elegans to further investigate this phenomenon. Results We observe that GC content around splice sites is closely associated with the splice site usage in multiple species. RNA secondary structure is the possible explanation, because the structural stability difference among alternative splice sites, constitutive splice sites, and skipped splice sites can be explained by the GC content difference. Alternative splice sites tend to be GC-enriched and exhibit more stable RNA secondary structures in all of the considered species. In humans and mice, splice sites of first exons and long exons tend to be GC-enriched and hence form more stable structures, indicating the special role of RNA secondary structures in promoter proximal splicing events and the splicing of long exons. In addition, GC-enriched exon-intron junctions tend to be overrepresented in tissue-specific alternative splice sites, indicating the functional consequence of the GC effect. Compared with regions far from splice sites and decoy splice sites, real splice sites are GC-enriched. We also found that the GC-content effect is much stronger than the nucleotide-order effect to form stable secondary structures. Conclusion All of these results indicate that GC content is related to splice site usage and it may mediate the splicing process through RNA secondary structures.
Full Text Available Mobile group II introns insert site-specifically into DNA target sites by a mechanism termed retrohoming in which the excised intron RNA reverse splices into a DNA strand and is reverse transcribed by the intron-encoded protein. Retrohoming is mediated by a ribonucleoprotein particle that contains the intron-encoded protein and excised intron RNA, with target specificity determined largely by base pairing of the intron RNA to the DNA target sequence. This feature enabled the development of mobile group II introns into bacterial gene targeting vectors ("targetrons" with programmable target specificity. Thus far, however, efficient group II intron-based gene targeting reactions have not been demonstrated in eukaryotes.By using a plasmid-based Xenopus laevis oocyte microinjection assay, we show that group II intron RNPs can integrate efficiently into target DNAs in a eukaryotic nucleus, but the reaction is limited by low Mg(2+ concentrations. By supplying additional Mg(2+, site-specific integration occurs in up to 38% of plasmid target sites. The integration products isolated from X. laevis nuclei are sensitive to restriction enzymes specific for double-stranded DNA, indicating second-strand synthesis via host enzymes. We also show that group II intron RNPs containing either lariat or linear intron RNA can introduce a double-strand break into a plasmid target site, thereby stimulating homologous recombination with a co-transformed DNA fragment at frequencies up to 4.8% of target sites. Chromatinization of the target DNA inhibits both types of targeting reactions, presumably by impeding RNP access. However, by using similar RNP microinjection methods, we show efficient Mg(2+-dependent group II intron integration into plasmid target sites in zebrafish (Danio rerio embryos and into plasmid and chromosomal target sites in Drosophila melanogster embryos, indicating that DNA replication can mitigate effects of chromatinization.Our results provide an
McManus, Hilary A; Lewis, Louise A; Fučíková, Karolina; Haugen, Peik
The spread of group I introns depends on their association with intron-encoded homing endonucleases. Introns that encode functional homing endonuclease genes (HEGs) are highly invasive, whereas introns that only encode the group I ribozyme responsible for self-splicing are generally stably inherited (i.e., vertical inheritance). A number of recent case studies have provided new knowledge on the evolution of group I introns, however, there are still large gaps in understanding of their distribution on the tree of life, and how they have spread into new hosts and genic sites. During a larger phylogenetic survey of chlorophyceaen green algae, we found that 23 isolates contain at least one group I intron in the rbcL chloroplast gene. Structural analyses show that the introns belong to one of two intron lineages, group IA2 intron-HEG (GIY-YIG family) elements inserted after position 462 in the rbcL gene, and group IA1 introns inserted after position 699. The latter intron type sometimes encodes HNH homing endonucleases. The distribution of introns was analyzed on an exon phylogeny and patterns were recovered that are consistent with vertical inheritance and possible horizontal transfer. The rbcL 462 introns are thus far reported only within the Volvocales, Hydrodictyaceae and Bracteacoccus, and closely related isolates of algae differ in the presence of rbcL introns. Phylogenetic analysis of the intron conserved regions indicates that the rbcL699 and rbcL462 introns have distinct evolutionary origins. The rbcL699 introns were likely derived from ribosomal RNA L2449 introns, whereas the rbcL462 introns form a close relationship with psbA introns. PMID:22056605
Busch, A.; Hertel, KJ
The splicing of pre-mRNAs is an essential step of gene expression in eukaryotes. Introns are removed from split genes through the activities of the spliceosome, a large ribonuclear machine that is conserved throughout the eukaryotic lineage. While unicellular eukaryotes are characterized by less complex splicing, pre-mRNA splicing of multicellular organisms is often associated with extensive alternative splicing that significantly enriches their proteome. The alternative selection of splice s...
Khodor, Yevgenia L.; Rodriguez, Joseph; Abruzzi, Katharine C.; Tang, Chih-Hang Anthony; Marr, Michael T.; Rosbash, Michael
Cotranscriptional splicing, in which mRNA is spliced as it is being transcribed, is thought to be necessary for proper gene regulation of many genes in eukaryotic cells. While studies have shown that splicing takes place cotranscriptionally in yeast, in higher eukaryotes, where genes contain multiple introns with widespread alternative splicing, the question of whether cotranscriptional splicing is a general phenomenon remains. Khodor et al. investigated what fractions of genes are cotranscri...
Ismail, S I; Kingsman, S M; Kingsman, A J; Uden, M
The inclusion of retrovirus-derived introns within retrovirus-based expression vectors leads to a fraction of the resulting transcripts being spliced. Such splicing has been shown to markedly improve expression (W. J. Krall et al., Gene Ther. 3:37-48, 1996). One way to improve upon this still further might involve the use of more efficient introns instead of those from the provirus. Currently, however, incorporation of such introns remains self-defeating since they are removed in the nucleus of the producer cell. In the past, elaborate ways to overcome this problem have included the use of alphaviruses to make the vector transcripts within the cytoplasm, thus avoiding the nuclear splicing machinery during vector production (K. J. Li and H. Garoff, Proc. Natl. Acad. Sci. USA 95:3650-3654, 1998). We now present a novel design for the inclusion of introns within a retroviral vector. In essence, this is achieved by exploiting the retroviral replication process to copy not only the U3 promoter but also a synthetic splice donor to the 5'-long-terminal-repeat position during reverse transcription. Once copied, synthesized transcripts then contain a splice donor at their 5' end capable of interacting with a consensus splice acceptor engineered downstream of the packaging signal. Upon transduction, we demonstrate these vectors to produce enhanced expression from near fully spliced (and thus packaging signal minus) transcripts. The unique design of these high titer and high-expression retroviral vectors may be of use in a number of gene therapy applications. PMID:10666267
The trasformer gene plays a critical role in the sex determination pathways of many insects. We cloned two transformer gene introns from Anastrepha suspensa, the Caribbean fruit fly. These introns have sequences that putatively have a role in sex-specific splicing patterns that affect sex determinat...
RNA from T4-infected cells yields multiple end-labeled species when incubated with [α-32P]GTP under self-splicing conditions. One of these corresponds to the previously characterized intron from the T4 td gene and, as shown in this work, the others represent additional group I introns in T4. Two loci distinct from the td gene were found to hybridize to the mixed GTP-labeled T4 RNA probe. These were mapped to the unlinked genes nrdB and sunY. Cloned DNA from the nrdB region that contained the intron was shown to generate characteristic group I splice products with RNA synthesized in vivo or in vitro. The splice junction of the nrdB gene was determined and the nature of the RNA reaction products characterized. In vivo expression of the nrdB gene and the open reading frame within the intron was studied using in-frame lacZ fusions and primer extension analyses. The data suggest that expression of the intron open reading frame is highly regulated during T4 infection. Possible regulatory mechanisms are discussed
Nielsen, Henrik; Fiskaa, Tonje; Birgisdottir, Asa Birna; Haugen, Peik; Einvik, Christer; Johansen, Steinar
at the expense of the host. The circularization pathway has distinct structural requirements that differ from those of splicing and appears to be specifically suppressed in vivo. The ability to form full-length circles is found in all types of nuclear group I introns, including those from the...
Markmiller, Sebastian; Cloonan, Nicole; Lardelli, Rea M;
Minor class or U12-type splicing is a highly conserved process required to remove a minute fraction of introns from human pre-mRNAs. Defects in this splicing pathway have recently been linked to human disease, including a severe developmental disorder encompassing brain and skeletal abnormalities...
Chiara Naro; Claudio Sette
Alternative splicing (AS) is one of the key processes involved in the regulation of gene expression in eukaryotic cells. AS catalyzes the removal of intronic sequences and the joining of selected exons, thus ensuring the correct processing of the primary transcript into the mature mRNA. The combinatorial nature of AS allows a great expansion of the genome coding potential, as multiple splice-variants encoding for different proteins may arise from a single gene. Splicing is mediated by a large...
Peter J Shepard; Klemens J Hertel
@@ Pre-mRNA splicing is a fundamental process required for the expression of most metazoan genes. It is carried out by the spliceosome, which catalyzes the removal of non-coding intronic sequences to assemble exons into mature mRNAs prior to export and translation.Defects in splicing lead to many human genetic diseases , and splicing mutations in a number of genes involved in growth control have been implicated in multiple types of cancer.
Klinz, F. J.; Gallwitz, D
The size of the 309 bp actin gene intron of the yeast Saccharomyces cerevisiae was enlarged by inserting DNA fragments of different lengths and sequence. Enlarging the intron above 551 bp, the largest known yeast intron, led to a decrease in splicing efficiency. The effect on transcript splicing was dependent on the length of the inserted fragments rather than their sequence. It was also observed that insertion of the actin gene intron into different regions of the normally unsplit yeast YP2 ...
Viles, Kristi D.; Sullenger, Bruce A
Most human pre-mRNAs are cis-spliced, removing introns and joining flanking exons of the same RNA molecule. However, splicing of exons present on separate pre-mRNA molecules can also occur. This trans-splicing reaction can be exploited by pre-trans-splicing molecules (PTMs), which are incapable of cis-splicing. PTM-mediated trans-splicing has been utilized to repair mutant RNAs as a novel approach to gene therapy. Herein we explore how the site of PTM expression influences trans-splicing acti...
In eukaryotes, most protein-coding genes contain introns which are removed by precursor messenger RNA (pre-mRNA) splicing. Alternative splicing is a process by which multiple messenger RNAs (mRNAs) are generated from a single pre-mRNA, resulting in functionally distinct proteins. Recent genome-wide analyses of alternative splicing indicated that in higher eukaryotes alternative splicing is an important mechanism that generates proteomic complexity and regulates gene expression. Mis-regulation of splicing causes a wide range of human diseases. This review describes the current understanding of pre-mRNA splicing and the mechanisms that regulate mammalian pre-mRNA splicing. It also discusses emerging directions in the field of alternative splicing.
In eukaryotes,most protein-coding genes contain introns which are removed by precursor messenger RNA(pre-mRNA) splicing.Alternative splicing is a process by which multiple messenger RNAs(mRNAs) are generated from a single pre-mRNA,resulting in functionally distinct proteins.Recent genome-wide analyses of alternative splicing indicated that in higher eukaryotes alternative splicing is an important mechanism that generates proteomic complexity and regulates gene expression.Mis-regulation of splicing causes a wide range of human diseases.This review describes the current understanding of pre-mRNA splicing and the mechanisms that regulate mammalian pre-mRNA splicing.It also discusses emerging directions in the field of alternative splicing.
Licht, Konstantin; Kapoor, Utkarsh; Mayrhofer, Elisa; Jantsch, Michael F
Alternative splicing and adenosine to inosine (A to I) RNA-editing are major factors leading to co- and post-transcriptional modification of genetic information. Both, A to I editing and splicing occur in the nucleus. As editing sites are frequently defined by exon-intron basepairing, mRNA splicing efficiency should affect editing levels. Moreover, splicing rates affect nuclear retention and will therefore also influence the exposure of pre-mRNAs to the editing-competent nuclear environment. Here, we systematically test the influence of splice rates on RNA-editing using reporter genes but also endogenous substrates. We demonstrate for the first time that the extent of editing is controlled by splicing kinetics when editing is guided by intronic elements. In contrast, editing sites that are exclusively defined by exonic structures are almost unaffected by the splicing efficiency of nearby introns. In addition, we show that editing levels in pre- and mature mRNAs do not match. This phenomenon can in part be explained by the editing state of an RNA influencing its splicing rate but also by the binding of the editing enzyme ADAR that interferes with splicing. PMID:27112566
Tang, Yunjia; Nielsen, Henrik; Masquida, Benoît;
BACKGROUND: Twin-ribozyme introns represent a complex class of mobile group I introns that harbour a lariat capping (LC) ribozyme and a homing endonuclease gene embedded in a conventional self-splicing group I ribozyme (GIR2). Twin-ribozyme introns have so far been confined to nucleolar DNA in...... Naegleria amoeboflagellates and the myxomycete Didymium iridis. RESULTS: We characterize structural organization, catalytic properties and molecular evolution of a new twin-ribozyme intron in Allovahlkampfia (Heterolobosea). The intron contains two ribozyme domains with different functions in ribosomal RNA...... efficient self-cleaving ribozyme that generates a small 2',5' lariat cap at the 5' end of the homing endonuclease mRNA, and thus contributes to intron mobility. CONCLUSIONS: The discovery of a twin-ribozyme intron in a member of Heterolobosea expands the distribution pattern of LC ribozymes. We identify a...
Full Text Available Abstract Background Two categories of introns are known, a common U2 type and a rare U12 type. These two types of introns are removed by distinct spliceosomes. The phylogenetic distribution of spliceosomal RNAs that are characteristic of the U12 spliceosome, i.e. the U11, U12, U4atac and U6atac RNAs, suggest that U12 spliceosomes were lost in many phylogenetic groups. We have now examined the distribution of U2 and U12 introns in many of these groups. Results U2 and U12 introns were predicted by making use of available EST and genomic sequences. The results show that in species or branches where U12 spliceosomal components are missing, also U12 type of introns are lacking. Examples are the choanoflagellate Monosiga brevicollis, Entamoeba histolytica, green algae, diatoms, and the fungal lineage Basidiomycota. Furthermore, whereas U12 splicing does not occur in Caenorhabditis elegans, U12 introns as well as U12 snRNAs are present in Trichinella spiralis, which is deeply branching in the nematode tree. A comparison of homologous genes in T. spiralis and C. elegans revealed different mechanisms whereby U12 introns were lost. Conclusions The phylogenetic distribution of U12 introns and spliceosomal RNAs give further support to an early origin of U12 dependent splicing. In addition, this distribution identifies a large number of instances during eukaryotic evolution where such splicing was lost.
Full Text Available Abstract Background Genome-wide computational analysis of alternative splicing (AS in several flowering plants has revealed that pre-mRNAs from about 30% of genes undergo AS. Chlamydomonas, a simple unicellular green alga, is part of the lineage that includes land plants. However, it diverged from land plants about one billion years ago. Hence, it serves as a good model system to study alternative splicing in early photosynthetic eukaryotes, to obtain insights into the evolution of this process in plants, and to compare splicing in simple unicellular photosynthetic and non-photosynthetic eukaryotes. We performed a global analysis of alternative splicing in Chlamydomonas reinhardtii using its recently completed genome sequence and all available ESTs and cDNAs. Results Our analysis of AS using BLAT and a modified version of the Sircah tool revealed AS of 498 transcriptional units with 611 events, representing about 3% of the total number of genes. As in land plants, intron retention is the most prevalent form of AS. Retained introns and skipped exons tend to be shorter than their counterparts in constitutively spliced genes. The splice site signals in all types of AS events are weaker than those in constitutively spliced genes. Furthermore, in alternatively spliced genes, the prevalent splice form has a stronger splice site signal than the non-prevalent form. Analysis of constitutively spliced introns revealed an over-abundance of motifs with simple repetitive elements in comparison to introns involved in intron retention. In almost all cases, AS results in a truncated ORF, leading to a coding sequence that is around 50% shorter than the prevalent splice form. Using RT-PCR we verified AS of two genes and show that they produce more isoforms than indicated by EST data. All cDNA/EST alignments and splice graphs are provided in a website at http://combi.cs.colostate.edu/as/chlamy. Conclusions The extent of AS in Chlamydomonas that we observed is much
Podar, Mircea; Mullineaux, Lauren; Huang, Hon-Ren; Perlman, Philip S.; Sogin, Mitchell L.
Group II introns are catalytic RNAs and mobile retrotransposable elements known to be present in the genomes of some nonmarine bacteria and eukaryotic organelles. Here we report the discovery of group II introns in a bacterial mat sample collected from a deep-sea hydrothermal vent near 9°N on the East Pacific Rise. One of the introns was shown to self-splice in vitro. This is the first example of marine bacterial introns from molecular population structure studies of microorganisms that live ...
Robberson, B L; Cote, G J; Berget, S M
Interactions at the 3' end of the intron initiate spliceosome assembly and splice site selection in vertebrate pre-mRNAs. Multiple factors, including U1 small nuclear ribonucleoproteins (snRNPs), are involved in initial recognition at the 3' end of the intron. Experiments were designed to test the possibility that U1 snRNP interaction at the 3' end of the intron during early assembly functions to recognize and define the downstream exon and its resident 5' splice site. Splicing precursor RNAs...
SHANG Yuan; WANG Chen; ZHANG Yi
The Ca. LSU intron flanking a 129 bp exon upstream and a 100 bp exon downstream was inserted into the lacZ gene on pRS426 to transform E. coli. Northern blot analysis and RT-PCR showed that splicing of Ca. LSU in E.coli is efficient upon inducible expression of the precursor RNA. In contrast, co-transcriptional self-splicing of the intron in vitro is much less active. Therefore, this E. coli splicing system can be used as a better model to investigate the effect of the ribozyme inhibitors on Ca. LSU splicing in living cell. We examined the effects of neomycin sulfate and pentamidine on Ca. LSU splicing in E. coli, and found that these drugs does-dependently inhibit the intron splicing.However, heomycin is more potent than pentamidine in this action.
Wessagowit, Vesarat; Nalla, Vijay K.; Rogan, Peter K; McGrath, John A
The process of excising introns from pre-mRNA complexes is directed by specific genomic DNA sequences at intron—exon borders known as splice sites. These regions contain well-conserved motifs which allow the splicing process to proceed in a regulated and structured manner. However, as well as conventional splicing, several genes have the inherent capacity to undergo alternative splicing, thus allowing synthesis of multiple gene transcripts, perhaps with different functional properties. Within...
Full Text Available Abstract Background Intron gains reportedly are very rare during evolution of vertebrates, and the mechanisms underlying their creation are largely unknown. Previous investigations have shown that, during metazoan radiation, the exon-intron patterns of serpin superfamily genes were subject to massive changes, in contrast to many other genes. Results Here we investigated intron dynamics in the serpin superfamily in lineages pre- and postdating the split of vertebrates. Multiple intron gains were detected in a group of ray-finned fishes, once the canonical groups of vertebrate serpins had been established. In two genes, co-occurrence of non-standard introns was observed, implying that intron gains in vertebrates may even happen concomitantly or in a rapidly consecutive manner. DNA breakage/repair processes associated with genome compaction are introduced as a novel factor potentially favoring intron gain, since all non-canonical introns were found in a lineage of ray-finned fishes that experienced genomic downsizing. Conclusion Multiple intron acquisitions were identified in serpin genes of a lineage of ray-finned fishes, but not in any other vertebrates, suggesting that insertion rates for introns may be episodically increased. The co-occurrence of non-standard introns within the same gene discloses the possibility that introns may be gained simultaneously. The sequences flanking the intron insertion points correspond to the proto-splice site consensus sequence MAG↑N, previously proposed to serve as intron insertion site. The association of intron gains in the serpin superfamily with a group of fishes that underwent genome compaction may indicate that DNA breakage/repair processes might foster intron birth.
Nozaki, H; Ohta, N; Yamada, T; Takano, H
Group I introns were reported for the first time in the large subunit of Rubisco (rbcL) genes, using two colonial green algae, Pleodorina californica and Gonium multicoccum (Volvocales). The rbcL gene of P. californica contained an intron (PIC intron) of 1320 bp harboring an open reading frame (ORF). The G. multicoccum rbcL gene had two ORF-lacking introns of 549 (GM1 intron) and 295 (GM2 intron) base pairs. Based on the conserved nucleotide sequences of the secondary structure, the PIC and GM1 introns were assigned to group IA2 whereas the GM2 intron belonged to group IA1. Southern hybridization analyses of nuclear and chloroplast DNAs indicated that such intron-containing rbcL genes are located in the chloroplast genome. Sequencing RNAs from the two algae revealed that these introns are spliced out during mRNA maturation. In addition, the PIC and GM1 introns were inserted in the same position of the rbcL exons, and phylogenetic analysis of group IA introns indicated a close phylogenetic relationship between the PIC and GM1 introns within the lineage of bacteriophage group IA2 introns. However, P. californica and G. multicoccum occupy distinct clades in the phylogenetic trees of the colonial Volvocales, and the majority of other colonial volvocalean species do not have such introns in the rbcL genes. Therefore, these introns might have been recently inserted in the rbcL genes independently by horizontal transmission by viruses or bacteriophage. PMID:9620266
Rearick, David; Prakash, Ashwin; McSweeny, Andrew; Shepard, Samuel S.; Fedorova, Larisa; Fedorov, Alexei
It has been widely acknowledged that non-coding RNAs are master-regulators of genomic functions. However, the significance of the presence of ncRNA within introns has not received proper attention. ncRNA within introns are commonly produced through the post-splicing process and are specific signals of gene transcription events, impacting many other genes and modulating their expression. This study, along with the following discussion, details the association of thousands of ncRNAs—snoRNA, miR...
Haugen, P; Huss, V A; Nielsen, Henrik;
The green alga Scenedesmus pupukensis and the red alga Porphyra spiralis contain large group-IC1 introns in their nuclear small subunit ribosomal RNA genes due to the presence of open reading frames at the 5' end of the introns. The putative 555 amino-acid Scenedesmus-encoded protein harbors a...... sequence motif resembling the bacterial S9 ribosomal proteins. The Porphyra intron self-splices in vitro, and generates both ligated exons and a full-length intron RNA circle. The Porphyra intron has an unusual structural organization by encoding a potential 149 amino-acid homing-endonuclease-like protein...... functions of nuclear group-I intron proteins....
Mahrez, Walid; Shin, Juhyun; Muñoz-Viana, Rafael; Figueiredo, Duarte D; Trejo-Arellano, Minerva S; Exner, Vivien; Siretskiy, Alexey; Gruissem, Wilhelm; Köhler, Claudia; Hennig, Lars
Several pathways control time to flowering in Arabidopsis thaliana through transcriptional and posttranscriptional gene regulation. In recent years, mRNA processing has gained interest as a critical regulator of flowering time control in plants. However, the molecular mechanisms linking RNA splicing to flowering time are not well understood. In a screen for Arabidopsis early flowering mutants we identified an allele of BRR2a. BRR2 proteins are components of the spliceosome and highly conserved in eukaryotes. Arabidopsis BRR2a is ubiquitously expressed in all analyzed tissues and involved in the processing of flowering time gene transcripts, most notably FLC. A missense mutation of threonine 895 in BRR2a caused defects in FLC splicing and greatly reduced FLC transcript levels. Reduced FLC expression increased transcription of FT and SOC1 leading to early flowering in both short and long days. Genome-wide experiments established that only a small set of introns was not correctly spliced in the brr2a mutant. Compared to control introns, retained introns were often shorter and GC-poor, had low H3K4me1 and CG methylation levels, and were often derived from genes with a high-H3K27me3-low-H3K36me3 signature. We propose that BRR2a is specifically needed for efficient splicing of a subset of introns characterized by a combination of factors including intron size, sequence and chromatin, and that FLC is most sensitive to splicing defects. PMID:27100965
Jean-Luc Da Lage
Full Text Available Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that "resets" of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures.
Full Text Available Much remains unknown concerning the mechanism by which the splicing machinery pinpoints short exons within intronic sequences and how splicing factors are directed to their pre-mRNA targets. One probable explanation lies in differences in chromatin organization between exons and introns. Proteomic, co-immunoprecipitation, and sedimentation analyses described here indicate that SF3B1, an essential splicing component of the U2 snRNP complex, is strongly associated with nucleosomes. ChIP-seq and RNA-seq analyses reveal that SF3B1 specifically binds nucleosomes located at exonic positions. SF3B1 binding is enriched at nucleosomes positioned over short exons flanked by long introns that are also characterized by differential GC content between exons and introns. Disruption of SF3B1 binding to such nucleosomes affects splicing of these exons similarly to SF3B1 knockdown. Our findings suggest that the association of SF3B1 with nucleosomes is functionally important for splice-site recognition and that SF3B1 conveys splicing-relevant information embedded in chromatin structure.
Burggraf, S.; Larsen, N.; Woese, C. R.; Stetter, K. O.
The 16S rRNA genes of Pyrobaculum aerophilum and Pyrobaculum islandicum were amplified by the polymerase chain reaction, and the resulting products were sequenced directly. The two organisms are closely related by this measure (over 98% similar). However, they differ in that the (lone) 16S rRNA gene of Pyrobaculum aerophilum contains a 713-bp intron not seen in the corresponding gene of Pyrobaculum islandicum. To our knowledge, this is the only intron so far reported in the small subunit rRNA gene of a prokaryote. Upon excision the intron is circularized. A secondary structure model of the intron-containing rRNA suggests a splicing mechanism of the same type as that invoked for the tRNA introns of the Archaea and Eucarya and 23S rRNAs of the Archaea. The intron contains an open reading frame whose protein translation shows no certain homology with any known protein sequence.
Full Text Available Abstract Background Group I introns are one of the four major classes of introns as defined by their distinct splicing mechanisms. Because they catalyze their own removal from precursor transcripts, group I introns are referred to as autocatalytic introns. Group I introns are common in fungal and protist nuclear ribosomal RNA genes and in organellar genomes. In contrast, they are rare in all other organisms and genomes, including bacteria. Results Here we report five group I introns, each containing a LAGLIDADG homing endonuclease gene (HEG, in large subunit (LSU rRNA genes of cyanobacteria. Three of the introns are located in the LSU gene of Synechococcus sp. C9, and the other two are in the LSU gene of Synechococcus lividus strain C1. Phylogenetic analyses show that these introns and their HEGs are closely related to introns and HEGs located at homologous insertion sites in organellar and bacterial rDNA genes. We also present a compilation of group I introns with homing endonuclease genes in bacteria. Conclusion We have discovered multiple HEG-containing group I introns in a single bacterial gene. To our knowledge, these are the first cases of multiple group I introns in the same bacterial gene (multiple group I introns have been reported in at least one phage gene and one prophage gene. The HEGs each contain one copy of the LAGLIDADG motif and presumably function as homodimers. Phylogenetic analysis, in conjunction with their patchy taxonomic distribution, suggests that these intron-HEG elements have been transferred horizontally among organelles and bacteria. However, the mode of transfer and the nature of the biological connections among the intron-containing organisms are unknown.
Jo, Bong-Seok; Choi, Sun Shim
The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced. PMID:26865841
Full Text Available BAP1 is a tumor suppressor gene that is lost or deleted in diverse cancers, including uveal mela¬noma, malignant pleural mesothelioma (MPM, clear cell renal carcinoma, and cholangiocarcinoma. Recently, BAP1 germline mutations have been reported in families with combinations of these same cancers. A particular challenge for mutation screening is the classification of non-truncating BAP1 sequence variants because it is not known whether these subtle changes can affect the protein function sufficiently to predispose to cancer development. Here we report mRNA splicing analysis on a homozygous substitution mutation, BAP1 c. 2054 A&T (p.Glu685Val, identified in an MPM cell line derived from a mesothelioma patient. The mutation occurred at the 3rd nucleotide from the 3' end of exon 16. RT-PCR, cloning and subsequent sequencing revealed several aberrant splicing products not observed in the controls: 1 a 4 bp deletion at the end of exon 16 in all clones derived from the major splicing product. The BAP1 c. 2054 A&T mutation introduced a new 5' splice site (GU, which resulted in the deletion of 4 base pairs and presumably protein truncation; 2 a variety of alternative splicing products that led to retention of different introns: introns 14-16; introns 15-16; intron 14 and intron 16; 3 partial intron 14 and 15 retentions caused by activation of alternative 3' splice acceptor sites (AG in the introns. Taken together, we were unable to detect any correctly spliced mRNA transcripts in this cell line. These results suggest that aberrant splicing caused by this mutation is quite efficient as it completely abolishes normal splicing through creation of a novel 5' splice site and activation of cryptic splice sites. These data support the conclusion that BAP1 c.2054 A&T (p.E685V variant is a pathogenic mutation and contributes to MPM through disruption of normal splicing.
Kevelam, Sietske H; Taube, Jennifer R; van Spaendonk, Rosalina M L; Bertini, Enrico; Sperle, Karen; Tarnopolsky, Mark; Tonduti, Davide; Valente, Enza Maria; Travaglini, Lorena; Sistermans, Erik A; Bernard, Geneviève; Catsman-Berrevoets, Coriene E; van Karnebeek, Clara D M; Østergaard, John R; Friederich, Richard L; Fawzi Elsaid, Mahmoud; Schieving, Jolanda H; Tarailo-Graovac, Maja; Orcesi, Simona; Steenweg, Marjan E; van Berkel, Carola G M; Waisfisz, Quinten; Abbink, Truus E M; van der Knaap, Marjo S; Hobson, Grace M; Wolf, Nicole I
Objective The objective of this study was to investigate the genetic etiology of the X-linked disorder “Hypomyelination of Early Myelinating Structures” (HEMS). Methods We included 16 patients from 10 families diagnosed with HEMS by brain MRI criteria. Exome sequencing was used to search for causal mutations. In silico analysis of effects of the mutations on splicing and RNA folding was performed. In vitro gene splicing was examined in RNA from patients’ fibroblasts and an immortalized immature oligodendrocyte cell line after transfection with mutant minigene splicing constructs. Results All patients had unusual hemizygous mutations of PLP1 located in exon 3B (one deletion, one missense and two silent), which is spliced out in isoform DM20, or in intron 3 (five mutations). The deletion led to truncation of PLP1, but not DM20. Four mutations were predicted to affect PLP1/DM20 alternative splicing by creating exonic splicing silencer motifs or new splice donor sites or by affecting the local RNA structure of the PLP1 splice donor site. Four deep intronic mutations were predicted to destabilize a long-distance interaction structure in the secondary PLP1 RNA fragment involved in regulating PLP1/DM20 alternative splicing. Splicing studies in fibroblasts and transfected cells confirmed a decreased PLP1/DM20 ratio. Interpretation Brain structures that normally myelinate early are poorly myelinated in HEMS, while they are the best myelinated structures in Pelizaeus–Merzbacher disease, also caused by PLP1 alterations. Our data extend the phenotypic spectrum of PLP1-related disorders indicating that normal PLP1/DM20 alternative splicing is essential for early myelination and support the need to include intron 3 in diagnostic sequencing. PMID:26125040
Starokadomskyy P. L.
Full Text Available Inteins, or protein introns, are parts of protein sequences that are post-translationally excised, their flanking regions (exteins being spliced together. This process was called protein splicing. Originally inteins were found in prokaryotic or unicellular eukaryotic organisms. But the general principles of post-translation protein rearrangement are evolving yielding different post-translation modification of proteins in multicellular organisms. For clarity, these non-intein mediated events call either protein rearrangements or protein editing. The most intriguing example of protein editing is proteasome-mediated splicing of antigens in vertebrates that may play important role in antigen presentation. Other examples of protein rearrangements are maturation of Hg-proteins (critical receptors in embryogenesis as well as maturation of several metabolic enzymes. Despite a lack of experimental data we try to analyze some intriguing examples of protein splicing evolution.
Plotch, Stephen J.; Krug, Robert M.
In influenza virus-infected cells, the splicing of the viral NS1 mRNA catalyzed by host nuclear enzymes is controlled so that the steady-state amount of the spliced NS2 mRNA is only 5-10% of that of the unspliced NS1 mRNA. Here we examine the splicing of NS1 mRNA in vitro, using nuclear extracts from HeLa cells. We show that in addition to its consensus 5' and 3' splice sites, NS1 mRNA has an intron branch-point adenosine residue that was functional in lariat formation. Nonetheless, this RNA was not detectably spliced in vitro under conditions in which a human β -globin precursor was efficiently spliced. Using chimeric RNA precursors containing both NS1 and β -globin sequences, we show that the NS1 5' splice site was effectively utilized by the β -globin branch-point sequence and 3' splice site to form a spliced RNA, whereas the NS1 3' splice site did not function in detectable splicing in vitro, even in the presence of the β -globin branch-point sequence or in the presence of both the branch-point sequence and 5' exon and splice site from β -globin With the chimeric precursors that were not detectably spliced, as with NS1 mRNA itself, a low level of a lariat structure containing only intron and not 3' exon sequences was formed. The inability of the consensus 3' splice site of NS1 mRNA to function effectively in in vitro splicing suggests that this site is structurally inaccessible to components of the splicing machinery. Based on these results, we propose two mechanisms whereby NS1 mRNA splicing in infected cells is controlled via the accessibility of its 3' splice site.
Koonin Eugene V
Full Text Available Abstract The GT dinucleotide in the first two intron positions is the most conserved element of the U2 donor splice signals. However, in a small fraction of donor sites, GT is replaced by GC. A substantial enrichment of GC in donor sites of alternatively spliced genes has been observed previously in human, nematode and Arabidopsis, suggesting that GC signals are important for regulation of alternative splicing. We used parsimony analysis to reconstruct evolution of donor splice sites and inferred 298 GT > GC conversion events compared to 40 GC > GT conversion events in primate and rodent genomes. Thus, there was substantive accumulation of GC donor splice sites during the evolution of mammals. Accumulation of GC sites might have been driven by selection for alternative splicing. Reviewers This article was reviewed by Jerzy Jurka and Anton Nekrutenko. For the full reviews, please go to the Reviewers' Reports section.
Bechhofer, D H; Hue, K K; Shub, D A
The thymidylate synthase gene (thy) (EC 184.108.40.206) of Bacillus subtilis bacteriophage beta 22 has a self-splicing, group I intron inserted into a highly conserved region of the coding sequence. The intron is very similar to one that is inserted 21 bp further downstream in the homologous thymidylate synthase gene (td) of Escherichia coli bacteriophage T4. In contrast, the amino acid sequences of the bacteriophage thymidylate synthases are highly divergent. The beta 22 intron has a fragmentary open reading frame (ORF) that encodes a putative helix-turn-helix DNA-binding motif, similar to one at the carboxyl terminus of the homing endonuclease (I-TevI) encoded by the T4 td intron. The td ORF and the thy ORF fragments are inserted into different regions of their respective intron structures. These results suggest that the thymidylate synthase genes, their introns, and their respective intron-ORFs all have separate evolutionary histories and that the acquisition of the intron could not have occurred by a simple homing event. PMID:7972121
Lisson, Ralph; Hellert, Jan; Ringleb, Malte; Machens, Fabian; Kraus, Josef; Hehl, Reinhard
The maize Activator/Dissociation (Ac/Ds) transposable element system was introduced into sugar beet. The autonomous Ac and non-autonomous Ds element excise from the T-DNA vector and integrate at novel positions in the sugar beet genome. Ac and Ds excisions generate footprints in the donor T-DNA that support the hairpin model for transposon excision. Two complete integration events into genomic sugar beet DNA were obtained by IPCR. Integration of Ac leads to an eight bp duplication, while integration of Ds in a homologue of a sugar beet flowering locus gene did not induce a duplication. The molecular structure of the target site indicates Ds integration into a double strand break. Analyses of transposase transcription using RT-PCR revealed low amounts of alternatively spliced mRNAs. The fourth intron of the transposase was found to be partially misspliced. Four different splice products were identified. In addition, the second and third exon were found to harbour two and three novel introns, respectively. These utilize each the same splice donor but several alternative splice acceptor sites. Using the SplicePredictor online tool, one of the two introns within exon two is predicted to be efficiently spliced in maize. Most interestingly, splicing of this intron together with the four major introns of Ac would generate a transposase that lacks the DNA binding domain and two of its three nuclear localization signals, but still harbours the dimerization domain. PMID:20512402
Nguyen, Anh H.; Lee, Jong Uk; Sim, Sang Jun
RNA splicing plays important roles in transcriptome and proteome diversity. Herein, we describe the use of a nanoplasmonic system that unveils RNA folding and assembly during pre-mRNA splicing wherein the quantification of mRNA splice variants is not taken into account. With a couple of SERS-probes and plasmonic probes binding at the boundary sites of exon-2/intron-2 and intron-2/exon-3 of the pre-mature RNA of the β-globin gene, the splicing process brings the probes into the plasmonic bands. For plasmonic probes, a plasmon shift increase of ~29 nm, corresponding to intron removal and exon-2 and exon-3 connection to form the mRNA molecule, is measured by plasmonic coupling. The increased scattering intensity and surface-enhanced Raman scattering (SERS) fingerprinting reveal the clear dynamics of pre-mRNA splicing. Moreover, a time-resolved experiment of individual RNA molecules exhibited a successful splicing and an inhibited splicing event by 33 μM biflavonoid isoginkgetin, a general inhibitor of RNA splicing. The results suggest that the RNA splicing is successfully monitored with the nanoplasmonic system. Thus, this platform can be useful for studying RNA nanotechnology, biomolecular folding, alternative splicing, and maturation of microRNA.
Savisaar, Rosina; Hurst, Laurence D
Exonic splice enhancers (ESEs) are short nucleotide motifs, enriched near exon ends, that enhance the recognition of the splice site and thus promote splicing. Are intronless genes under selection to avoid these motifs so as not to attract the splicing machinery to an mRNA that should not be spliced, thereby preventing the production of an aberrant transcript? Consistent with this possibility, we find that ESEs in putative recent retrocopies are at a higher density and evolving faster than those in other intronless genes, suggesting that they are being lost. Moreover, intronless genes are less dense in putative ESEs than intron-containing ones. However, this latter difference is likely due to the skewed base composition of intronless sequences, a skew that is in line with the general GC richness of few exon genes. Indeed, after controlling for such biases, we find that both intronless and intron-containing genes are denser in ESEs than expected by chance. Importantly, nucleotide-controlled analysis of evolutionary rates at synonymous sites in ESEs indicates that the ESEs in intronless genes are under purifying selection in both human and mouse. We conclude that on the loss of introns, some but not all, ESE motifs are lost, the remainder having functions beyond a role in splice promotion. These results have implications for the design of intronless transgenes and for understanding the causes of selection on synonymous sites. PMID:26802218
OsEBP-89 is a transcription factor gene of rice.It contains two introns. Using RT-PCR and Southern hybridization to study OsEBP-89 tissue-specific expression, we found that its first intron (115 bp in length) of its was retained in a fraction of its transcripts of this gene in rice developing seeds. Furthermore, two OsEBP-89 cDNA clones (c89L and c89LH) were screened from a rice cDNA library.Sequence analysis revealed that the first intron was retained in c89L clone, whereas, both the first and second intron sequences were spliced in c89LH. In addition to developing seeds, the first intron unspliced transcripts of OsEBP-89 are detected in leaves and roots of rice, too. However, the ratio of the first intron unspliced to spliced OsEBP-89 transcripts varied in different tissues examined. The potential biological significance of intron retention in OsEBP-89 transcript was discussed.
The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences has some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algoritnm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. In particular the RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
Osman, Ismail; Tay, Mandy Li-Ian; Pek, Jun Wei
Upon splicing, introns are rapidly degraded. Hence, RNAs derived from introns are commonly deemed as junk sequences. However, the discoveries of intronic-derived small nucleolar RNAs (snoRNAs), small Cajal body associated RNAs (scaRNAs) and microRNAs (miRNAs) suggested otherwise. These non-coding RNAs are shown to play various roles in gene regulation. In this review, we highlight another class of intron-derived RNAs known as stable intronic sequence RNAs (sisRNAs). sisRNAs have been observed since the 1980 s; however, we are only beginning to understand their biological significance. Recent studies have shown or suggested that sisRNAs regulate their own host's gene expression, function as molecular sinks or sponges, and regulate protein translation. We propose that sisRNAs function as an additional layer of gene regulation in the cells. PMID:27147469
Splicing is one of the key processing steps during the maturation of a gene’s primary transcript into the mRNA molecule used as a template for protein production. Splicing involves the removal of segments called introns and re-joining of the remaining segments called exons. It is by now well e
Li, Juan; Sun, Wenchang; Wang, Bing; Xiao, Xiao; Liu, Xiang-Qin
Inteins catalyze protein splicing in a fashion similar to how self-splicing introns catalyze RNA splicing. Split-inteins catalyze precise ligation of two separate polypeptides through trans-splicing in a highly specific manner. Here we report a method of using protein trans-splicing to circumvent the packaging size limit of gene therapy vectors. To demonstrate this method, we chose a large dystrophin gene and an adeno-associated viral (AAV) vector, which has a small packaging size. A highly f...
Full Text Available BACKGROUND: Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown. METHODOLOGY/PRINCIPAL FINDINGS: Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3' ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals. CONCLUSIONS/SIGNIFICANCE: Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression.
Full Text Available Abstract Background More than two thirds of the highly expressed ribosomal protein (RP genes in Saccharomyces cerevisiae contain introns, which is in sharp contrast to the genome-wide five percent intron-containing genes. It is well established that introns carry regulatory sequences and that the transcription of RP genes is extensively and coordinately regulated. Here we test the hypotheses that introns are innately associated with heavily transcribed genes and that introns of RP genes contribute regulatory TF binding sequences. Moreover, we investigate whether promoter features are significantly different between intron-containing and intronless RP genes. Results We find that directly measured transcription rates tend to be lower for intron-containing compared to intronless RP genes. We do not observe any specifically enriched sequence motifs in the introns of RP genes other than those of the branch point and the two splice sites. Comparing the promoters of intron-containing and intronless RP genes, we detect differences in number and position of Rap1-binding and IFHL motifs. Moreover, the analysis of the length distribution and the folding free energies suggest that, at least in a sub-population of RP genes, the 5' untranslated sequences are optimized for regulatory function. Conclusion Our results argue against the direct involvement of introns in the regulation of transcription of highly expressed genes. Moreover, systematic differences in motif distributions suggest that RP transcription factors may act differently on intron-containing and intronless gene promoters. Thus, our findings contribute to the decoding of the RP promoter architecture and may fuel the discussion on the evolution of introns.
Panahi, Bahman; Abbaszadeh, Bahram; Taghizadeghan, Mehdi; Ebrahimie, Esmaeil
Sorghum bicolor is a member of grass family which is an attractive model plant for genome study due to interesting genome features like low genome size. In this research, we performed comprehensive investigation of Alternative Splicing and ontology aspects of genes those have undergone these events in sorghum bicolor. We used homology based alignments between gene rich transcripts, represented by tentative consensus (TC) transcript sequences, and genomic scaffolds to deduce the structure of genes and identify alternatively spliced transcripts in sorghum. Using homology mapping of assembled expressed sequence tags with genomics data, we identified 2,137 Alternative Splicing events in S. bicolor. Our study showed that complex events and intron retention are the main types of Alternative Splicing events in S. bicolor and highlights the prevalence of splicing site recognition for definition of introns in this plant. Annotations of the alternatively spliced genes revealed that they represent diverse biological process and molecular functions, suggesting a fundamental role for Alternative Splicing in affecting the development and physiology of S. bicolor. PMID:25049459
Poole Anthony M
Full Text Available Abstract Martin & Koonin recently proposed that the eukaryote nucleus evolved as a quality control mechanism to prevent ribosome readthrough into introns. In their scenario, the bacterial ancestor of mitochondria was resident in an archaeal cell, and group II introns (carried by the fledgling mitochondrion inserted into coding regions in the archaeal host genome. They suggest that if transcription and translation were coupled, and because splicing is expected to have been slower than translation, the effect of insertion would have been ribosome readthrough into introns, resulting in production of aberrant proteins. The emergence of the nuclear compartment would thus have served to separate transcription and splicing from translation, thereby alleviating this problem. In this article, I argue that Martin & Koonin's model is not compatible with current knowledge. The model requires that group II introns would spread aggressively through an archaeal genome. It is well known that selfish elements can spread through an outbreeding sexual population despite a substantial fitness cost to the host. The same is not true for asexual lineages however, where both theory and observation argue that such elements will be under pressure to reduce proliferation, and may be lost completely. The recent introduction of group II introns into archaea by horizontal transfer provides a natural test case with which to evaluate Martin & Koonin's model. The distribution and behaviour of these introns fits prior theoretical expectations, not the scenario of aggressive proliferation advocated by Martin & Koonin. I therefore conclude that the mitochondrial seed hypothesis for the origin of eukaryote introns, on which their model is based, better explains the early expansion of introns in eukaryotes. The mitochondrial seed hypothesis has the capacity to separate the origin of eukaryotes from the origin of introns, leaving open the possibility that the cell that engulfed the
Gretarsdottir, Solveig; Helgason, Hannes; Helgadottir, Anna;
.7 × 10⁻⁸ and rs17248748-T, OR = 0.92 and Padj = 0.022). The LDLR splice region variant, rs72658867-A, located at position +5 in intron 14 (NM_000527:c.2140+5G>A), causes retention of intron 14 during transcription and is expected to produce a truncated LDL receptor lacking domains essential for function...
Braddock, M; Muckenthaler, M; White, M R; Thorburn, A M; Sommerville, J; Kingsman, A J; Kingsman, S M
The translation of a capped, polyadenylated RNA after injection into the nucleus of Xenopus oocytes occurs only if the RNA contains an intron. A single point mutation in the splice donor site prevents translation. Intron-less RNA is exported efficiently to the cytoplasm and is held, undegraded, in a translationally inert state for several days. Translation can be activated by treating the oocytes with progesterone or by injecting antibodies that bind the FRGY2 class of messenger RNA binding p...
Sophie Germann; Lise Gratadou; Martin Dutertre; Didier Auboeuf
Numerous studies report splicing alterations in a multitude of cancers by using gene-by-gene analysis. However, understanding of the role of alternative splicing in cancer is now reaching a new level, thanks to the use of novel technologies allowing the analysis of splicing at a large-scale level. Genome-wide analyses of alternative splicing...
In most eukaryotic genes, protein-coding exons are separated by non-coding introns which are removed from the primary transcript by a process called 'splicing'. The positions where introns are cut and exons are spliced together are called 'splice sites'. Thus, computational prediction of splice sites is crucial for gene finding in eukaryotes. Weight array models are a powerful probabilistic approach to splice site detection. Parameters for these models are usually derived from m-tuple frequencies in trusted training data and subsequently smoothed to avoid zero probabilities. In this study we compare three different ways of parameter estimation for m-tuple frequencies, namely (a) non-smoothed probability estimation, (b) standard pseudo counts and (c) a Gaussian smoothing procedure that we recently developed
Taher, Leila; Meinicke, Peter; Morgenstern, Burkhard
In most eukaryotic genes, protein-coding exons are separated by non-coding introns which are removed from the primary transcript by a process called "splicing". The positions where introns are cut and exons are spliced together are called "splice sites". Thus, computational prediction of splice sites is crucial for gene finding in eukaryotes. Weight array models are a powerful probabilistic approach to splice site detection. Parameters for these models are usually derived from m-tuple frequencies in trusted training data and subsequently smoothed to avoid zero probabilities. In this study we compare three different ways of parameter estimation for m-tuple frequencies, namely (a) non-smoothed probability estimation, (b) standard pseudo counts and (c) a Gaussian smoothing procedure that we recently developed.
Full Text Available Alternative splicing is tightly regulated in a spatio-temporal and quantitative manner. This regulation is achieved by a complex interplay between spliceosomal (trans factors that bind to different sequence (cis elements. cis-elements reside in both introns and exons and may either enhance or silence splicing. Differential combinations of cis-elements allows for a huge diversity of overall splicing signals, together comprising a complex 'splicing code'. Many cis-elements have been identified, and their effects on exon inclusion levels demonstrated in reporter systems. However, the impact of interspecific differences in these elements on the evolution of alternative splicing levels has not yet been investigated at genomic level. Here we study the effect of interspecific differences in predicted exonic splicing regulators (ESRs on exon inclusion levels in human and chimpanzee. For this purpose, we compiled and studied comprehensive datasets of predicted ESRs, identified by several computational and experimental approaches, as well as microarray data for changes in alternative splicing levels between human and chimpanzee. Surprisingly, we found no association between changes in predicted ESRs and changes in alternative splicing levels. This observation holds across different ESR exon positions, exon lengths, and 5' splice site strengths. We suggest that this lack of association is mainly due to the great importance of context for ESR functionality: many ESR-like motifs in primates may have little or no effect on splicing, and thus interspecific changes at short-time scales may primarily occur in these effectively neutral ESRs. These results underscore the difficulties of using current computational ESR prediction algorithms to identify truly functionally important motifs, and provide a cautionary tale for studies of the effect of SNPs on splicing in human disease.
Irimia, Manuel; Rukov, Jakob Lewin; Roy, Scott William
Alternative splicing is tightly regulated in a spatio-temporal and quantitative manner. This regulation is achieved by a complex interplay between spliceosomal (trans) factors that bind to different sequence (cis) elements. cis-elements reside in both introns and exons and may either enhance or silence splicing. Differential combinations of cis-elements allows for a huge diversity of overall splicing signals, together comprising a complex 'splicing code'. Many cis-elements have been identified, and their effects on exon inclusion levels demonstrated in reporter systems. However, the impact of interspecific differences in these elements on the evolution of alternative splicing levels has not yet been investigated at genomic level. Here we study the effect of interspecific differences in predicted exonic splicing regulators (ESRs) on exon inclusion levels in human and chimpanzee. For this purpose, we compiled and studied comprehensive datasets of predicted ESRs, identified by several computational and experimental approaches, as well as microarray data for changes in alternative splicing levels between human and chimpanzee. Surprisingly, we found no association between changes in predicted ESRs and changes in alternative splicing levels. This observation holds across different ESR exon positions, exon lengths, and 5' splice site strengths. We suggest that this lack of association is mainly due to the great importance of context for ESR functionality: many ESR-like motifs in primates may have little or no effect on splicing, and thus interspecific changes at short-time scales may primarily occur in these effectively neutral ESRs. These results underscore the difficulties of using current computational ESR prediction algorithms to identify truly functionally important motifs, and provide a cautionary tale for studies of the effect of SNPs on splicing in human disease. PMID:19495418
M. Lutzelberger; L.S. Reinert; A.T. Das; B. Berkhout; J. Kjems
Productive infection and successful replication of human immunodeficiency virus 1 (HIV-1) requires the balanced expression of all viral genes. This is achieved by a combination of alternative splicing events and regulated nuclear export of viral RNA. Because viral splicing is incomplete and intron-c
Deogun Jitender S
Full Text Available Abstract Background Predicting and proper ranking of canonical splice sites (SSs is a challenging problem in bioinformatics and machine learning communities. Any progress in SSs recognition will lead to better understanding of splicing mechanism. We introduce several new approaches of combining a priori knowledge for improved SS detection. First, we design our new Bayesian SS sensor based on oligonucleotide counting. To further enhance prediction quality, we applied our new de novo motif detection tool MHMMotif to intronic ends and exons. We combine elements found with sensor information using Naive Bayesian Network, as implemented in our new tool SpliceScan. Results According to our tests, the Bayesian sensor outperforms the contemporary Maximum Entropy sensor for 5' SS detection. We report a number of putative Exonic (ESE and Intronic (ISE Splicing Enhancers found by MHMMotif tool. T-test statistics on mouse/rat intronic alignments indicates, that detected elements are on average more conserved as compared to other oligos, which supports our assumption of their functional importance. The tool has been shown to outperform the SpliceView, GeneSplicer, NNSplice, Genio and NetUTR tools for the test set of human genes. SpliceScan outperforms all contemporary ab initio gene structural prediction tools on the set of 5' UTR gene fragments. Conclusion Designed methods have many attractive properties, compared to existing approaches. Bayesian sensor, MHMMotif program and SpliceScan tools are freely available on our web site. Reviewers This article was reviewed by Manyuan Long, Arcady Mushegian and Mikhail Gelfand.
Graber, Joel H; Salisbury, Jesse; Hutchins, Lucie N; Blumenthal, Thomas
Many mRNAs in Caenorhabditis elegans are generated through a trans-splicing reaction that adds one of two classes of spliced leader RNA to an independently transcribed pre-mRNA. SL1 leaders are spliced mostly to pre-mRNAs from genes with outrons, intron-like sequences at the 5′-ends of the pre-mRNAs. In contrast, SL2 leaders are nearly exclusively trans-spliced to genes that occur downstream in polycistronic pre-mRNAs produced from operons. Operon pre-mRNA processing requires separation into ...
S. C. Cheng; Abelson, J
We have fractionated a yeast whole cell extract that can accurately splice synthetic actin and CYH2 pre-mRNAs. Three fractions, designated I, II, and III, have been separated by use of ammonium sulfate fractionation and chromatography on heparin agarose. Each fraction alone has no splicing activity. Fractions I and II allow the first step of the splicing reaction to proceed, giving rise to the splicing intermediates, free exon 1, and intron-exon 2. Addition of fraction III completes the react...
Wang, Kai; Ussery, David; Brunak, Søren
, splice site prediction program called NetAspGene, for the genus Aspergillus. Gene sequences from Aspergillus fumigatus, the most common mould pathogen, were used to build and test our model. Compared to many animals and plants, Aspergillus contains smaller introns; thus we have applied a larger window...... better splice site prediction than other available tools. NetAspGene will be very helpful for the study in Aspergillus splice sites and especially in alternative splicing. A webpage for NetAspGene is publicly available at http://www.cbs.dtu.dk/services/NetAspGene....
Liu, Canhui; Oliveira, Ana; Higazi, Tarig B.; Ghedin, Elodie; DePasse, Jay; Thomas R Unnasch
Many genes in parasitic nematodes are both cis- and trans-spliced. Previous studies have demonstrated that a 7nt element encoded in the first intron of the B. malayi 70 kDa heat shock protein (BmHSP70) gene was necessary to permit trans-splicing of transgenic mRNAs in embryos transfected with constructs encoding portions of the BmHSP70 gene. Here we demonstrate that this element (the B. malayi HSP70 trans-splicing motif, or BmHSP70 TSM) is necessary and sufficient to direct trans-splicing of ...
Das, Debopriya; Clark, Tyson A.; Schweitzer, Anthony; Marr,Henry; Yamamoto, Miki L.; Parra, Marilyn K.; Arribere, Josh; Minovitsky,Simon; Dubchak, Inna; Blume, John E.; Conboy, John G.
A novel exon microarray format that probes gene expression with single exon resolution was employed to elucidate critical features of a vertebrate muscle alternative splicing program. A dataset of 56 microarray-defined, muscle-enriched exons and their flanking introns were examined computationally in order to investigate coordination of the muscle splicing program. Candidate intron regulatory motifs were required to meet several stringent criteria: significant over-representation near muscle-enriched exons, correlation with muscle expression, and phylogenetic conservation among genomes of several vertebrate orders. Three classes of regulatory motifs were identified in the proximal downstream intron, within 200nt of the target exons: UGCAUG, a specific binding site for Fox-1 related splicing factors; ACUAAC, a novel branchpoint-like element; and UG-/UGC-rich elements characteristic of binding sites for CELF splicing factors. UGCAUG was remarkably enriched, being present in nearly one-half of all cases. These studies suggest that Fox and CELF splicing factors play a major role in enforcing the muscle-specific alternative splicing program, facilitating expression of a set of unique isoforms of cytoskeletal proteins that are critical to muscle cell differentiation. Supplementary materials: There are four supplementary tables and one supplementary figure. The tables provide additional detailed information concerning the muscle-enriched datasets, and about over-represented oligonucleotide sequences in the flanking introns. The supplementary figure shows RT-PCR data confirming the muscle-enriched expression of exons predicted from the microarray analysis.
Full Text Available In eukaryotes, introns are located in nuclear and organelle genes from several kingdoms. Large introns (up to 5 kbp are frequent in mitochondrial genomes of plant and fungi but scarce in Metazoa, even if these organisms are grouped with fungi among the Opisthokonts. Mitochondrial introns are classified in two groups (I and II according to their RNA secondary structure involved in the intron self-splicing mechanism. Most of these mitochondrial group I introns carry a "Homing Endonuclease Gene" (heg encoding a DNA endonuclease acting in transfer and site-specific integration ("homing" and allowing intron spreading and gain after lateral transfer even between species from different kingdoms. Opposed to this gain mechanism, is another which implies that introns, which would have been abundant in the ancestral genes, would mainly evolve by loss. The importance of both mechanisms (loss and gain is matter of debate. Here we report the sequence of the cox1 gene of the button mushroom Agaricus bisporus, the most widely cultivated mushroom in the world. This gene is both the longest mitochondrial gene (29,902 nt and the largest group I intron reservoir reported to date with 18 group I and 1 group II. An exhaustive analysis of the group I introns available in cox1 genes shows that they are mobile genetic elements whose numerous events of loss and gain by lateral transfer combine to explain their wide and patchy distribution extending over several kingdoms. An overview of intron distribution, together with the high frequency of eroded heg, suggests that they are evolving towards loss. In this landscape of eroded and lost intron sequences, the A. bisporus cox1 gene exhibits a peculiar dynamics of intron keeping and catching, leading to the largest collection of mitochondrial group I introns reported to date in a Eukaryote.
Tovar-Corona, Jaime M; Castillo-Morales, Atahualpa; Chen, Lu; Olds, Brett P; Clark, John M; Reynolds, Stuart E; Pittendrigh, Barry R; Feil, Edward J; Urrutia, Araxi O
Genomic and transcriptomics analyses have revealed human head and body lice to be almost genetically identical; although con-specific, they nevertheless occupy distinct ecological niches and have differing feeding patterns. Most importantly, while head lice are not known to be vector competent, body lice can transmit three serious bacterial diseases; epidemictyphus, trench fever, and relapsing fever. In order to gain insights into the molecular bases for these differences, we analyzed alternative splicing (AS) using next-generation sequencing data for one strain of head lice and one strain of body lice. We identified a total of 3,598 AS events which were head or body lice specific. Exon skipping AS events were overrepresented among both head and body lice, whereas intron retention events were underrepresented in both. However, both the enrichment of exon skipping and the underrepresentation of intron retention are significantly stronger in body lice compared with head lice. Genes containing body louse-specific AS events were found to be significantly enriched for functions associated with development of the nervous system, salivary gland, trachea, and ovarian follicle cells, as well as regulation of transcription. In contrast, no functional categories were overrepresented among genes with head louse-specific AS events. Together, our results constitute the first evidence for transcript pool differences in head and body lice, providing insights into molecular adaptations that enabled human lice to adapt to clothing, and representing a powerful illustration of the pivotal role AS can play in functional adaptation. PMID:26169943
Ruskin, B; Pikielny, C W; Rosbash, M; Green, M R
Pre-mRNA splicing in yeast and higher eukaryotes proceeds by similar pathways, in which a probable splicing intermediate and the excised intron are in a lariat configuration. To compare the pre-mRNA splicing mechanisms in yeast and higher eukaryotes, we have analyzed the RNA products resulting from in vitro processing of a yeast intron-containing pre-mRNA in HeLa cell and yeast extracts. In yeast, the RNA branch (2'-5' phosphodiester bond) of the RNA lariat forms at the third adenosine of the...
Coelho, Ana I; Lourenço, Sílvia; Trabuco, Matilde; Silva, Maria João; Oliveira, Anabela; Gaspar, Ana; Diogo, Luísa; Tavares de Almeida, Isabel; Vicente, João B; Rivera, Isabel
In recent years, antisense therapy has emerged as an increasingly important therapeutic approach to tackle several genetic disorders, including inborn errors of metabolism. Intronic mutations activating cryptic splice sites are particularly amenable to antisense therapy, as the canonical splice sites remain intact, thus retaining the potential for restoring constitutive splicing. Mutational analysis of Portuguese galactosemic patients revealed the intronic variation c.820+13A>G as the second most prevalent mutation, strongly suggesting its pathogenicity. The aim of this study was to functionally characterize this intronic variation, to elucidate its pathogenic molecular mechanism(s) and, ultimately, to correct it by antisense therapy. Minigene splicing assays in two distinct cell lines and patients' transcript analyses showed that the mutation activates a cryptic donor splice site, inducing an aberrant splicing of the GALT pre-mRNA, which in turn leads to a frameshift with inclusion of a premature stop codon (p.D274Gfs*17). Functional-structural studies of the recombinant wild-type and truncated GALT showed that the latter is devoid of enzymatic activity and prone to aggregation. Finally, two locked nucleic acid oligonucleotides, designed to specifically recognize the mutation, successfully restored the constitutive splicing, thus establishing a proof of concept for the application of antisense therapy as an alternative strategy for the clearly insufficient dietary treatment in classic galactosemia. PMID:25052314
Nielsen, Henrik; Engberg, J
We have studied the sequence variation of the rDNA intron among six species of Tetrahymena. From these data, the intron appears to be relatively well conserved in evolution. We have evaluated the sequence variations among the most distant of these species in relation to the secondary structure...... involving compensating base pair changes. Interestingly, one of these is found in a region that is known to be dispensable in the in vitro splicing reaction suggesting differences between the in vivo and in vitro reactions. One of the single nucleotide deletions is found in the so-called "internal guide...
Ismail, Said I.; Rohll, Jonathan B.; Kingsman, Susan M.; Kingsman, Alan J.; Uden, Mark
Normal mRNA polyadenylation signals are composed of an AAUAAA motif and G/U box spaced 20 to 30 bp apart. If this spacing is increased further, then polyadenylation is disrupted. Previously it has been demonstrated that insertion of an intron will similarly disrupt this signal even though such introns are removed during a nuclear splicing reaction (X. Liu and J. Mertz, Nucleic Acids Res. 21:5256–5263, 1993). This observation has led to the suggestion that polyadenylation site selection is und...
It has been recently argued that depletion attraction may play an important role in different aspects of cellular organization, ranging from the organization of transcriptional activity in transcription factories to the formation of nuclear bodies. In this paper, we suggest a new application of these ideas in the context of the splicing process, a crucial step of messenger RNA maturation in eukaryotes. We shall show that entropy effects and the resulting depletion attraction may explain the relevance of the aspecific intron length variable in the choice of splice-site recognition modality. On top of that, some qualitative features of the genome architecture of higher eukaryotes can find evolutionary realistic motivation in the light of our model
Lepennetier, Gildas; Catania, Francesco
mRNA-associated processes and gene structure in eukaryotes are typically treated as separate research subjects. Here, we bridge this separation and leverage the extensive multidisciplinary work on Drosophila melanogaster to examine the roles that capping, splicing, cleavage/polyadenylation, and telescripting (i.e, the protection of nascent transcripts from premature cleavage/polyadenylation by the splicing factor U1) might play in shaping exon-intron architecture in protein-coding genes. Our findings suggest that the distance between subsequent internal 5' splice sites (5'ss) in Drosophila genes is constrained such that telescripting effects are maximized, in theory, and thus nascent transcripts are less vulnerable to premature termination. Exceptionally weak 5'ss and constraints on intron-exon size at the gene 5' end also indicate that capping might enhance the recruitment of U1 and, in turn, promote telescripting at this location. Finally, a positive correlation between last exon length and last 5'ss strength suggests that optimal donor splice sites in the proximity of the pre-mRNA tail may inhibit the processing of downstream polyadenylation signals more than weak donor splice sites do. These findings corroborate and build upon previous experimental and computational studies on Drosophila genes. They support the possibility, hitherto scantly explored, that mRNA-associated processes impose significant constraints on the evolution of eukaryotic gene structure. PMID:27172210
Full Text Available In eukaryotic cells, inefficient splicing is surprisingly common and leads to the degradation of transcripts with retained introns. How pre-mRNAs are committed to nuclear decay is unknown. Here, we uncover a mechanism by which specific intron-containing transcripts are targeted for nuclear degradation in fission yeast. Sequence elements within these “decay-promoting” introns co-transcriptionally recruit the exosome specificity factor Mmi1, which induces degradation of the unspliced precursor and leads to a reduction in the levels of the spliced mRNA. This mechanism negatively regulates levels of the RNA helicase DDX5/Dbp2 to promote cell survival in response to stress. In contrast, fast removal of decay-promoting introns by co-transcriptional splicing precludes Mmi1 recruitment and relieves negative expression regulation. We propose that decay-promoting introns facilitate the regulation of gene expression. Based on the identification of multiple additional Mmi1 targets, including mRNAs, long non-coding RNAs, and sn/snoRNAs, we suggest a general role in RNA regulation for Mmi1 through transcript degradation.
Full Text Available Abstract Background Incorporation of exon 11 of the insulin receptor gene is both developmentally and hormonally-regulated. Previously, we have shown the presence of enhancer and silencer elements that modulate the incorporation of the small 36-nucleotide exon. In this study, we investigated the role of inherent splice site strength in the alternative splicing decision and whether recognition of the splice sites is the major determinant of exon incorporation. Results We found that mutation of the flanking sub-optimal splice sites to consensus sequences caused the exon to be constitutively spliced in-vivo. These findings are consistent with the exon-definition model for splicing. In-vitro splicing of RNA templates containing exon 11 and portions of the upstream intron recapitulated the regulation seen in-vivo. Unexpectedly, we found that the splice sites are occupied and spliceosomal complex A was assembled on all templates in-vitro irrespective of splicing efficiency. Conclusion These findings demonstrate that the exon-definition model explains alternative splicing of exon 11 in the IR gene in-vivo but not in-vitro. The in-vitro results suggest that the regulation occurs at a later step in spliceosome assembly on this exon.
Full Text Available Autoimmune polyendocrine syndrome type 1 (APS-1 is a rare autosomal recessive disease defined by the presence of two of the three conditions: mucocutaneous candidiasis, hypoparathyroidism, and Addison's disease. Loss-of-function mutations of the autoimmune regulator (AIRE gene have been linked to APS-1. Here we report mutational analysis and functional characterization of an AIRE mutation in a consanguineous Chinese family with APS-1. All exons of the AIRE gene and adjacent exon-intron sequences were amplified by PCR and subsequently sequenced. We identified a homozygous missense AIRE mutation c.463G>A (p.Gly155Ser in two siblings with different clinical features of APS-1. In silico splice-site prediction and minigene analysis were carried out to study the potential pathological consequence. Minigene splicing analysis and subsequent cDNA sequencing revealed that the AIRE mutation potentially compromised the recognition of the splice donor of intron 3, causing alternative pre-mRNA splicing by intron 3 retention. Furthermore, the aberrant AIRE transcript was identified in a heterozygous carrier of the c.463G>A mutation. The aberrant intron 3-retaining transcript generated a truncated protein (p.G155fsX203 containing the first 154 AIRE amino acids and followed by 48 aberrant amino acids. Therefore, our study represents the first functional characterization of the alternatively spliced AIRE mutation that may explain the pathogenetic role in APS-1.
Full Text Available Saccharomycotina and Taphrinomycotina lack intron in their histone genes, except for an intron in one of histone H4 genes of Yarrowia lipolytica. On the other hand, Basidiomycota and Perizomycotina have introns in their histone genes. We compared the distributions of 81, 47, 79, and 98 introns in the fungal histone H2A, H2B, H3, and H4 genes, respectively. Based on the multiple alignments of the amino acid sequences of histones, we identified 19, 13, 31, and 22 intron insertion sites in the histone H2A, H2B, H3, and H4 genes, respectively. Surprisingly only one hot spot of introns in the histone H2A gene is shared between Basidiomycota and Perizomycotina, suggesting that most of introns of Basidiomycota and Perizomycotina were acquired independently. Our findings suggest that the common ancestor of Ascomycota and Basidiomycota maybe had a few introns in the histone genes. In the course of fungal evolution, Saccharomycotina and Taphrinomycotina lost the histone introns; Basidiomycota and Perizomycotina acquired other introns independently. In addition, most of the introns have sequence similarity among introns of phylogenetically close species, strongly suggesting that horizontal intron transfer events between phylogenetically distant species have not occurred recently in the fungal histone genes.
Full Text Available Abstract Background Alternative transcript diversity manifests itself as a prime cause of complexity in higher eukaryotes. Recently, transcript diversity studies have suggested that 60–80% of human genes are alternatively spliced. We have used a splicing pattern approach for the bioinformatics analysis of Alternative Splicing (AS in chicken, human and mouse. Exons involved in splicing are subdivided into distinct and variant exons, based on the prevalence of the exons across the transcripts. Four possible permutations of these two different groups of exons were categorised as class I (distinct-variant, class II (distinct-variant, class III (variant-distinct and class IV (variant-variant. This classification quantifies the variation in transcript diversity in the three species. Results In all, 3901 chicken AS genes have been compared with 16,715 human and 16,491 mouse AS genes, with 23% of chicken genes being alternatively spliced, compared to 68% in humans and 57% in mice. To minimize any gene structure bias in the input data, comparative genome analysis has been carried out on the orthologous subset of AS genes for the three species. Gene-level analysis suggested that chicken genes show fewer AS events compared to human and mouse. An event-level analysis showed that the percentage of AS events in chicken is similar to that of human, which implies that a smaller number of chicken genes show greater transcript diversity. Overall, chicken genes were found to have fewer transcripts per gene and shorter introns than human and mouse genes. Conclusion In chicken, the majority of genes generate only two or three isoforms, compared to almost eight in human and six in mouse. We observed that intron definition is expressed strongly when compared to exon definition for chicken genome, based on 3% intron retention in chicken, compared to 2% in human and mouse. Splicing patterns with variant exons account for 33% of AS chicken orthologous genes compared to
Osmark, P; Hansson, O; Jonsson, Anna Elisabet;
Intronic variation in the TCF7L2 gene exhibits the strongest association to type 2 diabetes observed to date, but the mechanism whereby this genetic variation translates into altered biological function is largely unknown. A possible explanation is a genotype-dependent difference in the complex...... splicing pattern; however, this has not previously been characterised in pancreatic or insulin target tissues. Here, the detailed TCF7L2 splicing pattern in five human tissues is described and dependence on risk genotype explored....
Petersen, S. M.; Dandanell, M.; Rasmussen, L. J.;
/MSH2 mutations as pathogenic, whereas four MLH1/MSH2/MSH6 mutations are classified as neutral. This study supports the notion that in silico prediction tools and mini-gene assays are important for the classification of intronic variants, and thereby crucial for the genetic counseling of patients and......Background: Germ-line mutations in the DNA mismatch repair genes MLH1, MSH2, and MSH6 predispose to the development of colorectal cancer (Lynch syndrome or hereditary nonpolyposis colorectal cancer). These mutations include disease-causing frame-shift, nonsense, and splicing mutations as well as...... asses the effect on splicing. Results: We describe in silico and in vitro characterization of nine intronic MLH1, MSH2, or MSH6 mutations identified in Danish colorectal cancer patients, of which four mutations are novel. The analysis revealed aberrant splicing of five mutations (MLH1 c. 588 + 5G > A...
Full Text Available During heat shock and other proteotoxic stresses, cells regulate multiple steps in gene expression in order to globally repress protein synthesis and selectively upregulate stress response proteins. Splicing of several mRNAs is known to be inhibited during heat stress, often meditated by SRp38, but the extent and specificity of this effect have remained unclear. Here, we examined splicing regulation genome-wide during heat shock in mouse fibroblasts. We observed widespread retention of introns in transcripts from ∼1,700 genes, which were enriched for tRNA synthetase, nuclear pore, and spliceosome functions. Transcripts with retained introns were largely nuclear and untranslated. However, a group of 580+ genes biased for oxidation reduction and protein folding functions continued to be efficiently spliced. Interestingly, these unaffected transcripts are mostly cotranscriptionally spliced under both normal and stress conditions, whereas splicing-inhibited transcripts are mostly spliced posttranscriptionally. Altogether, our data demonstrate widespread repression of splicing in the mammalian heat stress response, disproportionately affecting posttranscriptionally spliced genes.
Conboy, John G.; Parra, Marilyn K.; Tan, Jeff S.; Mohandas, Narla; Conboy, John G.
In the protein 4.1R gene, alternative first exons splice differentially to alternative 3' splice sites far downstream in exon 2'/2 (E2'/2). We describe a novel intrasplicing mechanism by which exon 1A (E1A) splices exclusively to the distal E2'/2 acceptor via two nested splicing reactions regulated by novel properties of exon 1B (E1B). E1B behaves as an exon in the first step, using its consensus 5' donor to splice to the proximal E2'/2 acceptor. A long region of downstream intron is excised, juxtaposing E1B with E2'/2 to generate a new composite acceptor containing the E1B branchpoint/pyrimidine tract and E2 distal 3' AG-dinucleotide. Next, the upstream E1A splices over E1B to this distal acceptor, excising the remaining intron plus E1B and E2' to form mature E1A/E2 product. We mapped branch points for both intrasplicing reactions and demonstrated that mutation of the E1B 5' splice site or branchpoint abrogates intrasplicing. In the 4.1R gene, intrasplicing ultimately determines N-terminal protein structure and function. More generally, intrasplicing represents a new mechanism whereby alternative promoters can be coordinated with downstream alternative splicing.
Full Text Available Dentinogenesis imperfecta (DGI type II is an autosomal dominant disease characterized by a serious disorders in teeth. Mutations of dentin sialophosphoprotein (DSPP gene were revealed to be the causation of DGI type II (DGI-II. In this study, we identified a novel mutation (NG_011595.1:g.8662T>C, c.135+2T>C lying in the splice donor site of intron 3 of DSPP gene in a Chinese Han DGI-II pedigree. It was found in all affected subjects but not in unaffected ones or other unrelated healthy controls. The function of the mutant DSPP gene, which was predicted online and subsequently confirmed by in vitro splicing analysis, was the loss of splicing of intron 3, leading to the extended length of DSPP mRNA. For the first time, the functional non-splicing of intron was revealed in a novel DSPP mutation and was considered as the causation of DGI-II. It was also indicated that splicing was of key importance to the function of DSPP and this splice donor site might be a sensitive mutation hot spot. Our findings combined with other reports would facilitate the genetic diagnosis of DGI-II, shed light on its gene therapy and help to finally conquer human diseases.
Blechingberg, Jenny; Lykke-Andersen, Søren; Jensen, Torben Heick;
molecular mechanisms participating in alternative GFAP expression. Usage of a polyadenylation signal within the alternatively spliced exon 7a is essential to generate the GFAP kappa and GFAP kappa transcripts. The GFAP kappa mRNA is distinct from GFAP epsilon mRNA given that it also includes intron 7a....... Polyadenylation at the exon 7a site is stimulated by the upstream splice site. Moreover, exon 7a splice enhancer motifs supported both exon 7a splicing and polyadenylation. SR proteins increased the usage of the exon 7a polyadenylation signal but not the exon 7a splicing, whereas the polypyrimidine tract binding...... (PTB) protein enhanced both exon 7a polyadenylation and exon 7a splicing. Finally, increasing transcription by the VP16 trans-activator did not affect the frequency of use of the exon 7a polyadenylation signal whereas the exon 7a splicing frequency was decreased. Our data suggest a model with the...
Erik van Nimwegen
Full Text Available Recent large-scale cDNA sequencing efforts show that elaborate patterns of splice variation are responsible for much of the proteome diversity in higher eukaryotes. To obtain an accurate account of the repertoire of splice variants, and to gain insight into the mechanisms of alternative splicing, it is essential that cDNAs are very accurately mapped to their respective genomes. Currently available algorithms for cDNA-to-genome alignment do not reach the necessary level of accuracy because they use ad hoc scoring models that cannot correctly trade off the likelihoods of various sequencing errors against the probabilities of different gene structures. Here we develop a Bayesian probabilistic approach to cDNA-to-genome alignment. Gene structures are assigned prior probabilities based on the lengths of their introns and exons, and based on the sequences at their splice boundaries. A likelihood model for sequencing errors takes into account the rates at which misincorporation, as well as insertions and deletions of different lengths, occurs during sequencing. The parameters of both the prior and likelihood model can be automatically estimated from a set of cDNAs, thus enabling our method to adapt itself to different organisms and experimental procedures. We implemented our method in a fast cDNA-to-genome alignment program, SPA, and applied it to the FANTOM3 dataset of over 100,000 full-length mouse cDNAs and a dataset of over 20,000 full-length human cDNAs. Comparison with the results of four other mapping programs shows that SPA produces alignments of significantly higher quality. In particular, the quality of the SPA alignments near splice boundaries and SPA's mapping of the 5' and 3' ends of the cDNAs are highly improved, allowing for more accurate identification of transcript starts and ends, and accurate identification of subtle splice variations. Finally, our splice boundary analysis on the human dataset suggests the existence of a novel non
van Nimwegen, Erik; Paul, Nicodeme; Sheridan, Robert; Zavolan, Mihaela
Recent large-scale cDNA sequencing efforts show that elaborate patterns of splice variation are responsible for much of the proteome diversity in higher eukaryotes. To obtain an accurate account of the repertoire of splice variants, and to gain insight into the mechanisms of alternative splicing, it is essential that cDNAs are very accurately mapped to their respective genomes. Currently available algorithms for cDNA-to-genome alignment do not reach the necessary level of accuracy because they use ad hoc scoring models that cannot correctly trade off the likelihoods of various sequencing errors against the probabilities of different gene structures. Here we develop a Bayesian probabilistic approach to cDNA-to-genome alignment. Gene structures are assigned prior probabilities based on the lengths of their introns and exons, and based on the sequences at their splice boundaries. A likelihood model for sequencing errors takes into account the rates at which misincorporation, as well as insertions and deletions of different lengths, occurs during sequencing. The parameters of both the prior and likelihood model can be automatically estimated from a set of cDNAs, thus enabling our method to adapt itself to different organisms and experimental procedures. We implemented our method in a fast cDNA-to-genome alignment program, SPA, and applied it to the FANTOM3 dataset of over 100,000 full-length mouse cDNAs and a dataset of over 20,000 full-length human cDNAs. Comparison with the results of four other mapping programs shows that SPA produces alignments of significantly higher quality. In particular, the quality of the SPA alignments near splice boundaries and SPA's mapping of the 5′ and 3′ ends of the cDNAs are highly improved, allowing for more accurate identification of transcript starts and ends, and accurate identification of subtle splice variations. Finally, our splice boundary analysis on the human dataset suggests the existence of a novel non-canonical splice
Li, Long; Pintel, David J., E-mail: firstname.lastname@example.org
Translation of goose parvovirus (GPV) 72 kDa Rep 1 is initiated from unspliced P9-generated mRNAs in ORF1 from the first in-frame AUG (537 AUG); however, this AUG is bypassed in spliced P9-generated RNA: translation of the 52 kDa Rep 2 protein from spliced RNA is initiated in ORF2 at the next AUG downstream (650 AUG). Usage of the 537 AUG was restored in spliced RNA when the GPV intron was replaced with a chimeric SV40 intron, or following specific mutations of the GPV intron which did not appear in the final spliced mRNA. Additionally, 650 AUG usage was gained in unspliced RNA when the GPV intron splice sites were debilitated. Splicing-dependent regulation of translation initiation was mediated in cis by GPV RNA surrounding the target AUGs. Thus, nuclear RNA processing of GPV P9-generated pre-mRNAs has a complex, but significant, effect on alternative translation initiation of the GPV Rep proteins.
Abstract Background Genome-wide studies of intron dynamics in mammalian orthologous genes have found convincing evidence for loss of introns but very little for intron turnover. Similarly, large-scale analysis of intron dynamics in a few vertebrate genomes has identified only intron losses and no gains, indicating that intron gain is an extremely rare event in vertebrate evolution. These studies suggest that the intron-rich genomes of vertebrates do not allow intron gain. The aim of this stud...
Sahebi, Mahbod; Hanafi, Mohamed M; van Wijnen, Andre J; Azizi, Parisa; Abiri, Rambod; Ashkani, Sadegh; Taheri, Sima
Alternative pre-mRNA splicing provides a source of vast protein diversity by removing non-coding sequences (introns) and accurately linking different exonic regions in the correct reading frame. The regulation of alternative splicing is essential for various cellular functions in both pathological and physiological conditions. In eukaryotic cells, this process is commonly used to increase proteomic diversity and to control gene expression either co- or post-transcriptionally. Alternative splicing occurs within a megadalton-sized, multi-component machine consisting of RNA and proteins; during the splicing process, this complex undergoes dynamic changes via RNA-RNA, protein-protein and RNA-protein interactions. Co-transcriptional splicing functionally integrates the transcriptional machinery, thereby enabling the two processes to influence one another, whereas post-transcriptional splicing facilitates the coupling of RNA splicing with post-splicing events. This review addresses the structural aspects of spliceosomes and the mechanistic implications of their stepwise assembly on the regulation of pre-mRNA splicing. Moreover, the role of phosphorylation-based, signal-induced changes in the regulation of the splicing process is demonstrated. PMID:27154819
Park, Jung Woo; Graveley, Brenton R.
Alternative splicing is a powerful means of controlling gene expression and increasing protein diversity. Most genes express a limited number of mRNA isoforms, but there are several examples of genes that use alternative splicing to generate hundreds, thousands, and even tens of thousands of isoforms. Collectively such genes are considered to undergo complex alternative splicing. The best example is the Drosophila Down syndrome cell adhesion molecule (Dscam) gene, which can generate 38,016 is...
Hnilicová, Jarmila; Staněk, David
There are numerous data suggesting that two key steps in gene expression—transcription and splicing influence each other closely. For a long time it was known that chromatin modifications regulate transcription, but only recently it was shown that chromatin and histone modifications play a significant role in pre-mRNA splicing. Here we summarize interactions between splicing machinery and chromatin and discuss their potential functional significance. We focus mainly on histone acetylation and...
Axelrod, Felicia B.; Liebes, Leonard; Gold-von Simson, Gabrielle; Mendoza, Sandra; Mull, James; Leyne, Maire; Norcliffe-Kaufmann, Lucy; Kaufmann, Horacio; Slaugenhaupt, Susan A.
Familial dysautonomia (FD) is caused by an intronic splice mutation in the IKBKAP gene that leads to partial skipping of exon 20 and tissue-specific reduction in I-κ-B kinase complex associated protein/ elongation protein 1 (IKAP/ELP-1) expression. Kinetin (6-furfurylaminopurine) has been shown to improve splicing and increase wild-type IKBKAP mRNA and IKAP protein expression in FD cell lines and carriers. To determine if oral kinetin treatment could alter mRNA splicing in FD subjects and was...
Kirwin, Susan M; Manolakos, Athena; Barnett, Sarah Swain; Gonzalez, Iris L
Barth syndrome is caused by mutations in the TAZ (tafazzin) gene on human chromosome Xq28. The human tafazzin gene produces four major mRNA splice variants; two of which have been shown to be functional (TAZ lacking exon 5 and full-length) in complementation studies with yeast and Drosophila. This study characterizes the multiple alternative splice variants of TAZ mRNA and their proportions in blood samples from a cohort of individuals with Barth syndrome (BTHS). Because it has been reported that collection and processing methods can affect the expression of various genes, we tested and chose a stabilizing medium for collecting, shipping and processing of the blood samples of these individuals. In both healthy controls and in BTHS individuals, we found a greater variety of alternatively spliced forms than previously described, with a sizeable proportion of minor splice variants besides the four dominant isoforms. Individuals with certain exonic and intronic splice mutations produce additional mutant mRNAs that could be translated into two or more proteins with different amino acid substitutions in a single individual. A fraction of the minor splice variants is predicted to be non-productive. PMID:24342716
Irimia, Manuel; Rukov, Jakob L; Penny, David;
Alternative splicing (AS) contributes to increased transcriptome and proteome diversity in various eukaryotic lineages. Previous studies showed low levels of conservation of alternatively spliced (cassette) exons within mammals and within dipterans. We report a strikingly different pattern in...... patterns of splicing. The functionality of the vast majority of cassette exons is underscored by various other features. We suggest that differences in conservation between lineages reflect differences in levels of functionality and further suggest that these differences are due to differences in intron...... length and the strength of consensus boundaries across lineages. Finally, we demonstrate an inverse relationship between AS and gene duplication, suggesting that the latter may be primarily responsible for the emergence of new functional transcripts in nematodes. Udgivelsesdato: 2008-Feb...
Känsäkoski, Johanna; Jääskeläinen, Jarmo; Jääskeläinen, Tiina; Tommiska, Johanna; Saarinen, Lilli; Lehtonen, Rainer; Hautaniemi, Sampsa; Frilander, Mikko J.; Palvimo, Jorma J.; Toppari, Jorma; Raivio, Taneli
Mutations in the X-linked androgen receptor (AR) gene underlie complete androgen insensitivity syndrome (CAIS), the most common cause of 46,XY sex reversal. Molecular genetic diagnosis of CAIS, however, remains uncertain in patients who show normal coding region of AR. Here, we describe a novel mechanism of AR disruption leading to CAIS in two 46,XY sisters. We analyzed whole-genome sequencing data of the patients for pathogenic variants outside the AR coding region. Patient fibroblasts from the genital area were used for AR cDNA analysis and protein quantification. Analysis of the cDNA revealed aberrant splicing of the mRNA caused by a deep intronic mutation (c.2450-118A>G) in the intron 6 of AR. The mutation creates a de novo 5′ splice site and a putative exonic splicing enhancer motif, which leads to the preferential formation of two aberrantly spliced mRNAs (predicted to include a premature stop codon). Patient fibroblasts contained no detectable AR protein. Our results show that patients with CAIS and normal AR coding region need to be examined for deep intronic mutations that can lead to pseudoexon activation. PMID:27609317
Li, Cheng-Fang; Costa, Maria; Bassi, Gurminder; Lai, Yiu-Kay; Michel, François
A survey of sequence databases revealed 10 instances of subgroup IIB1 mitochondrial ribosomal introns with 1 to 33 additional nucleotides inserted between the 5' exon and the consensus sequence at the intron 5' end. These 10 introns depart further from the IIB1 consensus in their predicted domain VI structure: In contrast to its basal helix and distal GNRA terminal loop, the middle part of domain VI is highly variable and lacks the bulging A that serves as the branchpoint in lariat formation. In vitro experiments using two closely related IIB1 members inserted at the same ribosomal RNA site in the basidiomycete fungi Grifola frondosa and Pycnoporellus fulgens revealed that both ribozymes are capable of efficient self-splicing. However, whereas the Grifola intron was excised predominantly as a lariat, the Pycnoporellus intron, which possesses six additional nucleotides at the 5' end, yielded only linear products, consistent with its predicted domain VI structure. Strikingly, all of the introns with 5' terminal insertions lack the EBS2 exon-binding site. Moreover, several of them are part of the small subset of group II introns that encode potentially functional homing endonucleases of the LAGLIDADG family rather than reverse transcriptases. Such coincidences suggest causal relationships between the shift to DNA-based mobility, the loss of one of the two ribozyme sites for binding the 5' exon, and the exclusive use of hydrolysis to initiate splicing. PMID:21613530
Vitting-Seerup, Kristoffer; Porse, Bo Torben; Sandelin, Albin;
RNA-seq data is currently underutilized, in part because it is difficult to predict the functional impact of alternate transcription events. Recent software improvements in full-length transcript deconvolution prompted us to develop spliceR, an R package for classification of alternative splicing...
Full Text Available BACKGROUND: Gamma-aminobutyric acid type B (GABAB receptors decrease neural activity through G protein signaling. There are two subunits, GABAB1 and GABAB2. Alternative splicing provides GABAB1 with structural and functional diversity. cDNA microarrays showed strong signals from human brain RNA using GABAB1 intron 4 region probes. Therefore, we predicted the existence of novel splice variants. METHODOLOGY/PRINCIPAL FINDINGS: Based on the probe sequence analysis, we proposed two possible splice variants, GABAB1j and GABAB1k. The existence of human GABAB1j was verified by quantitative real-time PCR, and mouse GABAB1j was found from a microarray probe set based on human GABAB1j sequence. GABAB1j open reading frames (ORF and expression patterns are not conserved across species, and they do not have any important functional domains except sushi domains. Thus, we focused on another possible splice variant, GABAB1k. After obtaining PCR evidence for GABAB1k existence from human, mouse, and rat, it was cloned from human and mouse by PCR along with three additional isoforms, GABAB1l, GABAB1m, and GABAB1n. Their expression levels by quantitative real-time PCR are relatively low in brain although they may be expressed in specific cell types. GABAB1l and GABAB1m inhibit GABAB receptor-induced G protein-activated inwardly rectifying K(+ channel (GIRK currents at Xenopus oocyte two-electrode voltage clamp system. CONCLUSIONS/SIGNIFICANCE: This study supports previous suggestions that intron 4 of GABAB1 gene is a frequent splicing spot across species. Like GABAB1e, GABAB1l and GABAB1m do not have transmembrane domains but have a dimerization motif. So, they also could be secreted and bind GABAB2 dominantly instead of GABAB1a. However, only GABAB1l and GABAB1m are N- and C-terminal truncated splicing variants and impair receptor function. This suggests that the intron 4 containing N-terminal truncation is necessary for the inhibitory action of the new splice
Sogin, M L; Ingold, A; Karlok, M;
Previous work has demonstrated the presence of a self-splicing intron in the large subunit ribosomal RNA coding region in some strains of the ciliate protozoan Tetrahymena. Sequence comparisons of the intron regions from six Tetrahymena species showed these to fall into three homology groups. In an...... attempt to evaluate the evolutionary origins of the intervening sequences, we have now determined complete small subunit ribosomal RNA gene sequences from 13 species of Tetrahymena and the absolute number of nucleotide differences between the sequences was used to construct a phylogenetic tree. This...
Scott P Kallgren
Full Text Available Heterochromatin preferentially assembles at repetitive DNA elements, playing roles in transcriptional silencing, recombination suppression, and chromosome segregation. The RNAi machinery is required for heterochromatin assembly in a diverse range of organisms. In fission yeast, RNA splicing factors are also required for pericentric heterochromatin assembly, and a prevailing model is that splicing factors provide a platform for siRNA generation independently of their splicing activity. Here, by screening the fission yeast deletion library, we discovered four novel splicing factors that are required for pericentric heterochromatin assembly. Sequencing total cellular RNAs from the strongest of these mutants, cwf14Δ, showed intron retention in mRNAs of several RNAi factors. Moreover, introducing cDNA versions of RNAi factors significantly restored pericentric heterochromatin in splicing mutants. We also found that mutations of splicing factors resulted in defective telomeric heterochromatin assembly and mis-splicing the mRNA of shelterin component Tpz1, and that replacement of tpz1+ with its cDNA partially rescued heterochromatin defects at telomeres in splicing mutants. Thus, proper splicing of RNAi and shelterin factors contributes to heterochromatin assembly at pericentric regions and telomeres.
Full Text Available Spliceosome-mediated RNA trans-splicing has become an emergent tool for the repair of mutated pre-mRNAs in the treatment of genetic diseases. RNA trans-splicing molecules (RTMs are designed to induce a specific trans-splicing reaction via a binding domain for a respective target pre-mRNA region. A previously established reporter-based screening system allows us to analyze the impact of various factors on the RTM trans-splicing efficiency in vitro. Using this system, we are further able to investigate the potential of antisense RNAs (AS RNAs, presuming to improve the trans-splicing efficiency of a selected RTM, specific for intron 102 of COL7A1. Mutations in the COL7A1 gene underlie the dystrophic subtype of the skin blistering disease epidermolysis bullosa (DEB. We have shown that co-transfections of the RTM and a selected AS RNA, interfering with competitive splicing elements on a COL7A1-minigene (COL7A1-MG, lead to a significant increase of the RNA trans-splicing efficiency. Thereby, accurate trans-splicing between the RTM and the COL7A1-MG is represented by the restoration of full-length green fluorescent protein GFP on mRNA and protein level. This mechanism can be crucial for the improvement of an RTM-mediated correction, especially in cases where a high trans-splicing efficiency is required.
彭司华; 彭小宁; 庄树林; 杜维; 陈良标
Motivation: It was found that high accuracy splicing-site recognition of rice ( Oryza satlva L. ) DNA sequence is especially difficult. We described a new method for the splicing-site recognition of rice DNA sequences. Method: Based on the intron in eukaryotic organisms conforming to the principle of GT-AG, we used support vector machines (SVM) to predict the splicing sites. By machine learning, we built a model and used it to test the effect of the test data set of true and pseudo splicing sites. Results : The prediction accuracy we obtained was 87.53% at the true 5' end splicing site and 87.37% at the true 3' end splicing sites. The results suggested that the SVM approach could achieve higher accuracy than the previous approaches.
Hebsgaard, Stefan M.; Korning, Peter G.; Tolstrup, Niels;
experimentally observed in A.thaliana transformants. Predictions for alternatively spliced genes are also presented, together with examples of genes from other dicots, monocots and algae. The method has been made available through electronic mail (NetPlantGene@cbs.dtu.dk), or the WWW at http://www.cbs.dtu.dk/NetPlantGene.html......Artificial neural networks have been combined with a rule based system to predict intron splice sites in the dicot plant Arabidopsis thaliana. A two step prediction scheme, where a global prediction of the coding potential regulates a cutoff level for a local predicition of splice sites, is refined...... by rules based on splice site confidence values, prediction scores, coding context and distances between potential splice sites. In this approach, the prediction of splice sites mutually affect each other in a non-local manner. The combined approach drastically reduces the large amount of false...
WANG, Juan; Jie ZHANG; Li, Kaibo; Zhao, Wei; Cui, Qinghua
RNA splicing is an important aspect of gene regulation in many organisms. Splicing of RNA is regulated by complicated mechanisms involving numerous RNA-binding proteins and the intricate network of interactions among them. Mutations in cis-acting splicing elements or its regulatory proteins have been shown to be involved in human diseases. Defects in pre-mRNA splicing process have emerged as a common disease-causing mechanism. Therefore, a database integrating RNA splicing and disease associa...
Collemare, J.; Beenen, H.G.; Crous, P.W.; Wit, de, Joost; Burgt, van der, G.J.H.M.
Spliceosomal introns are key components of the eukaryotic gene structure. Although they contributed to the emergence of eukaryotes, their origin remains elusive. In fungi, they might originate from the multiplication of invasive introns named Introner-Like Elements (ILEs). However, so far ILEs have been observed in six fungal species only, including Fulvia fulva and Dothistroma septosporum (Dothideomycetes), arguing against ILE insertion as a general mechanism for intron gain. Here, we identi...
Antonio Garcia-España; Roso Mares; Tung-Tien Sun; Rob Desalle
BACKGROUND: Although large scale informatics studies on introns can be useful in making broad inferences concerning patterns of intron gain and loss, more specific questions about intron evolution at a finer scale can be addressed using a gene family where structure and function are well known. Genome wide surveys of tetraspanins from a broad array of organisms with fully sequenced genomes are an excellent means to understand specifics of intron evolution. Our approach incorporated several ne...
Yeo, Gene Wei-Ming
Silencing of splicing regulators by RNA interference, combined with splicing-specific microarrays, has revealed a complex network of distinct alternative splicing events in Drosophila, while a high-throughput screen of more than 6,000 compounds has identified drugs that interfere specifically and directly with one class of splicing regulators in human cells.
Swanson, M S; Dreyfuss, G
The binding of hnRNP proteins to pre-mRNAs in nuclear extracts, and as isolated proteins, was studied by using monoclonal antibody immunopurification of hnRNP proteins bound to RNase T1-generated fragments. Several major hnRNP proteins, A1, C and D, bind specifically to the 3' end of introns within a region containing the conserved polypyrimidine stretch between the branch site and the 3' splice site. Mutations which alter the conserved 3' splice site dinucleotide AG strongly impair or abolis...
Arman, Ahmet; Ozon, Alev; Isguven, Pinar S; Coker, Ajda; Peker, Ismail; Yordam, Nursen
Growth hormone (GH) is involved in growth, and fat and carbohydrate metabolism. Interaction of GH with the GH receptor (GHR) is necessary for systemic and local production of insulin-like growth factor-I (IGF-I) which mediates GH actions. Mutations in the GHR cause severe postnatal growth failure; the disorder is an autosomal recessive genetic disease resulting in GH insensitivity, called Laron syndrome. It is characterized by dwarfism with elevated serum GH and low levels of IGF-I. We analyzed the GHR gene for mutations and polymorphisms in eight patients with Laron-type dwarfism from six families. We found three missense mutations (S40L, V125A, I526L), one nonsense mutation (W157X), and one splice site mutation in the extracellular domain of GHR. Furthermore, G168G and exon 3 deletion polymorphisms were detected in patients with Laron syndrome. The splice site mutation, which is a novel mutation, was located at the donor splice site of exon 2/ intron 2 within GHR. Although this mutation changed the highly conserved donor splice site consensus sequence GT to GGT by insertion of a G residue, the intron splicing between exon 2 and exon 3 was detected in the patient. These results imply that the splicing occurs arthe GT site in intron 2, leaving the extra inserted G residue at the end of exon 2, thus changing the open reading frame of GHR resulting in a premature termination codon in exon 3. PMID:18404972
Human genes typically contain multiple introns, and in many cases the exons can be joined more than one way to generate multiple rnRNAs, encoding distinct protein isoforms. This process is called alternative splicing. The article summarized the human cytochrome P450 pre-mRNA alternative splicing and their regulatory mechanism and impacts on biological functions.
Yablon, Andrew D
This book is an up-to-date treatment of optical fiber fusion splicing incorporating all the recent innovations in the field. It provides a toolbox of general strategies and specific techniques that the reader can apply when optimizing fusion splices between novel fibers. It specifically addresses considerations important for fusion splicing of contemporary specialty fibers including dispersion compensating fiber, erbium-doped gain fiber, polarization maintaining fiber, and microstructured fiber. Finally, it discusses the future of optical fiber fusion splicing including silica and non-silica based optical fibers as well as the trend toward increasing automation. Whilst serving as a self-contained reference work, abundant citations from the technical literature will enable readers to readily locate primary sources.
Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis
Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272
Varadwaj, Pritish; Purohit, Neetesh; Arora, Bhumika
Automatic identification and annotation of exon and intron region of gene, from DNA sequences has been an important research area in field of computational biology. Several approaches viz. Hidden Markov Model (HMM), Artificial Intelligence (AI) based machine learning and Digital Signal Processing (DSP) techniques have extensively and independently been used by various researchers to cater this challenging task. In this work, we propose a Support Vector Machine based kernel learning approach for detection of splice sites (the exon-intron boundary) in a gene. Electron-Ion Interaction Potential (EIIP) values of nucleotides have been used for mapping character sequences to corresponding numeric sequences. Radial Basis Function (RBF) SVM kernel is trained using EIIP numeric sequences. Furthermore this was tested on test gene dataset for detection of splice site by window (of 12 residues) shifting. Optimum values of window size, various important parameters of SVM kernel have been optimized for a better accuracy. Receiver Operating Characteristic (ROC) curves have been utilized for displaying the sensitivity rate of the classifier and results showed 94.82% accuracy for splice site detection on test dataset.
Tuhin Kumar Guha
Full Text Available In Chaetomium thermophilum (DSM 1495 within the mitochondrial DNA (mtDNA small ribosomal subunit (rns gene a group IIA1 intron interrupts an open reading frame (ORF encoded within a group I intron (mS1247. This arrangement offers the opportunity to examine if the nested group II intron could be utilized as a regulatory element for the expression of the homing endonuclease (HEase. Constructs were generated where the codon-optimized ORF was interrupted with either the native group IIA1 intron or a group IIB type intron. This study showed that the expression of the HEase (in vivo in Escherichia coli can be regulated by manipulating the splicing efficiency of the HEase ORF-embedded group II introns. Exogenous magnesium chloride (MgCl2 stimulated the expression of a functional HEase but the addition of cobalt chloride (CoCl2 to growth media antagonized the expression of HEase activity. Ultimately the ability to attenuate HEase activity might be useful in precision genome engineering, minimizing off target activities, or where pathways have to be altered during a specific growth phase.
James R Carter
Full Text Available In portions of South Asia, vectors and patients co-infected with dengue (DENV and chikungunya (CHIKV are on the rise, with the potential for this occurrence in other regions of the world, for example the United States. Therefore, we engineered an antiviral approach that suppresses the replication of both arboviruses in mosquito cells using a single antiviral group I intron. We devised unique configurations of internal, external, and guide sequences that permit homologous recognition and splicing with conserved target sequences in the genomes of both viruses using a single trans-splicing Group I intron, and examined their effectiveness to suppress infections of DENV and CHIKV in mosquito cells when coupled with a proapoptotic 3' exon, ΔN Bax. RT-PCR demonstrated the utility of these introns in trans-splicing the ΔN Bax sequence downstream of either the DENV or CHIKV target site in transformed Aedes albopictus C6/36 cells, independent of the order in which the virus specific targeting sequences were inserted into the construct. This trans-splicing reaction forms DENV or CHIKV ΔN Bax RNA fusions that led to apoptotic cell death as evidenced by annexin V staining, caspase, and DNA fragmentation assays. TCID50-IFA analyses demonstrate effective suppression of DENV and CHIKV infections by our anti-arbovirus group I intron approach. This represents the first report of a dual-acting Group I intron, and demonstrates that we can target DENV and CHIKV RNAs in a sequence specific manner with a single, uniquely configured CHIKV/DENV dual targeting group I intron, leading to replication suppression of both arboviruses, and thus providing a promising single antiviral for the transgenic suppression of multiple arboviruses.
Gabunilas, Jason; Chanfreau, Guillaume
In Saccharomyces cerevisiae, splicing is critical for expression of ribosomal protein genes (RPGs), which are among the most highly expressed genes and are tightly regulated according to growth and environmental conditions. However, knowledge of the precise mechanisms by which RPG pre-mRNA splicing is regulated on a gene-by-gene basis is lacking. Here we show that Rpl22p has an extraribosomal role in the inhibition of splicing of the RPL22B pre-mRNA transcript. A stem loop secondary structure within the intron is necessary for pre-mRNA binding by Rpl22p in vivo and splicing inhibition in vivo and in vitro and can rescue splicing inhibition in vitro when added in trans to splicing reactions. Splicing inhibition by Rpl22p may be partly attributed to the reduction of co-transcriptional U1 snRNP recruitment to the pre-mRNA at the RPL22B locus. We further demonstrate that the inhibition of RPL22B pre-mRNA splicing contributes to the down-regulation of mature transcript during specific stress conditions, and provide evidence hinting at a regulatory role for this mechanism in conditions of suppressed ribosome biogenesis. These results demonstrate an autoregulatory mechanism that fine-tunes the expression of the Rpl22 protein and by extension Rpl22p paralog composition according to the cellular demands for ribosome biogenesis. PMID:27097027
Sorenson, Matthew R; Jha, Deepak K; Ucles, Stefanie A; Flood, Danielle M; Strahl, Brian D; Stevens, Scott W; Kress, Tracy L
Co-transcriptional splicing takes place in the context of a highly dynamic chromatin architecture, yet the role of chromatin restructuring in coordinating transcription with RNA splicing has not been fully resolved. To further define the contribution of histone modifications to pre-mRNA splicing in Saccharomyces cerevisiae, we probed a library of histone point mutants using a reporter to monitor pre-mRNA splicing. We found that mutation of H3 lysine 36 (H3K36) - a residue methylated by Set2 during transcription elongation - exhibited phenotypes similar to those of pre-mRNA splicing mutants. We identified genetic interactions between genes encoding RNA splicing factors and genes encoding the H3K36 methyltransferase Set2 and the demethylase Jhd1 as well as point mutations of H3K36 that block methylation. Consistent with the genetic interactions, deletion of SET2, mutations modifying the catalytic activity of Set2 or H3K36 point mutations significantly altered expression of our reporter and reduced splicing of endogenous introns. These effects were dependent on the association of Set2 with RNA polymerase II and H3K36 dimethylation. Additionally, we found that deletion of SET2 reduces the association of the U2 and U5 snRNPs with chromatin. Thus, our study provides the first evidence that H3K36 methylation plays a role in co-transcriptional RNA splicing in yeast. PMID:26821844
Full Text Available Splicing is an important process for regulation of gene expression in eukaryotes, and it has important functional links to other steps of gene expression. Two examples of these linkages include Ceg1, a component of the mRNA capping enzyme, and the chromatin elongation factors Spt4-5, both of which have recently been shown to play a role in the normal splicing of several genes in the yeast Saccharomyces cerevisiae. Using a genomic approach to characterize the roles of Spt4-5 in splicing, we used splicing-sensitive DNA microarrays to identify specific sets of genes that are mis-spliced in ceg1, spt4, and spt5 mutants. In the context of a complex, nested, experimental design featuring 22 dye-swap array hybridizations, comprising both biological and technical replicates, we applied five appropriate statistical models for assessing differential expression between wild-type and the mutants. To refine selection of differential expression genes, we then used a robust model-synthesizing approach, Differential Expression via Distance Synthesis, to integrate all five models. The resultant list of differentially expressed genes was then further analyzed with regard to select attributes: we found that highly transcribed genes with long introns were most sensitive to spt mutations. QPCR confirmation of differential expression was established for the limited number of genes evaluated. In this paper, we showcase splicing array technology, as well as powerful, yet general, statistical methodology for assessing differential expression, in the context of a real, complex experimental design. Our results suggest that the Spt4-Spt5 complex may help coordinate splicing with transcription under conditions that present kinetic challenges to spliceosome assembly or function.
Straniero, Letizia; Rimoldi, Valeria; Soldà, Giulia; Mauri, Lucia; Manfredini, Emanuela; Andreucci, Elena; Bargiacchi, Sara; Penco, Silvana; Gesu, Giovanni P; Del Longo, Alessandra; Piozzi, Elena; Asselta, Rosanna; Primignani, Paola
Oculocutaneous albinism (OCA) is characterized by hypopigmentation of the skin, hair and eye, and by ophthalmologic abnormalities caused by a deficiency in melanin biosynthesis. OCA type IV (OCA4) is one of the four commonly recognized forms of albinism, and is determined by mutation in the SLC45A2 gene. Here, we investigated the genetic basis of OCA4 in an Italian child. The mutational screening of the SLC45A2 gene identified two novel potentially pathogenic splicing mutations: a synonymous transition (c.888G>A) involving the last nucleotide of exon 3 and a single-nucleotide insertion (c.1156+2dupT) within the consensus sequence of the donor splice site of intron 5. As computer-assisted analysis for mutant splice-site prediction was not conclusive, we investigated the effects on pre-mRNA splicing of these two variants by using an in vitro minigene approach. Production of mutant transcripts in HeLa cells demonstrated that both mutations cause the almost complete abolishment of the physiologic donor splice site, with the concomitant unmasking of cryptic donor splice sites. To our knowledge, this work represents the first in-depth molecular characterization of splicing defects in a OCA4 patient. PMID:26016411
Ismail, S I; Rohll, J B; Kingsman, S M; Kingsman, A J; Uden, M
Normal mRNA polyadenylation signals are composed of an AAUAAA motif and G/U box spaced 20 to 30 bp apart. If this spacing is increased further, then polyadenylation is disrupted. Previously it has been demonstrated that insertion of an intron will similarly disrupt this signal even though such introns are removed during a nuclear splicing reaction (X. Liu and J. Mertz, Nucleic Acids Res. 21:5256-5263, 1993). This observation has led to the suggestion that polyadenylation site selection is undertaken prior to intron excision. We now present results that both support and extend these observations and in doing so create a novel class of retroviral expression vector with improved qualities. We found that when an intron-disrupted polyadenylation signal is inserted within a retroviral expression vector, such a signal, although reformed in the producer cell, remains benign until transduction, where it is then preferentially used. Thus, we demonstrate that upon transduction these vectors now produce a majority of shortened subgenomic species and as a consequence have a reduced tendency for subsequent mobilization from transduced cells. In addition, we demonstrate that the use of this internal signal leads to enhanced expression from such vectors and that this is achieved without any loss in titer. Therefore, split polyadenylation signals confer enhanced performance and improved safety upon retroviral expression vectors into which they are inserted. Such split signals may prove useful for the future optimization of retroviral vectors in gene therapy. PMID:11119589
van der Lelij, Petra; Stocsits, Roman R; Ladurner, Rene; Petzold, Georg; Kreidl, Emanuel; Koch, Birgit; Schmitz, Julia; Neumann, Beate; Ellenberg, Jan; Peters, Jan-Michael
Although splicing is essential for the expression of most eukaryotic genes, inactivation of splicing factors causes specific defects in mitosis. The molecular cause of this defect is unknown. Here, we show that the spliceosome subunits SNW1 and PRPF8 are essential for sister chromatid cohesion in human cells. A transcriptome-wide analysis revealed that SNW1 or PRPF8 depletion affects the splicing of specific introns in a subset of pre-mRNAs, including pre-mRNAs encoding the cohesion protein s...
Mario Gustavo Mayer
Full Text Available Since the discovery that genes are split into intron and exons, the studies of the mechanisms involved in splicing pointed to presence of consensus signals in an attempt to generalize the process for all living cells. However, as discussed in the present review, splicing is a theme full of variations. The trans-splicing of pre-mRNAs, the joining of exons from distinct transcripts, is one of these variations with broad distribution in the phylogenetic tree. The biological meaning of this phenomenon is discussed encompassing reactions resembling a possible noise to mechanisms of gene expression regulation. All of them however, can contribute to the generation of life diversity.
A.T.M. Golam Bari
Full Text Available Splice site prediction in DNA sequence is a basic search problem for finding exon/intron and intron/exon boundaries. Removing introns and then joining the exons together forms the mRNA sequence. These sequences are the input of the translation process. It is a necessary step in the central dogma of molecular biology. The main task of splice site prediction is to find out the exact GT and AG ended sequences. Then it identifies the true and false GT and AG ended sequences among those candidate sequences. In this paper, we survey research works on splice site prediction based on support vector machine (SVM. The basic difference between these research works is nucleotide encoding technique and SVM kernel selection. Some methods encode the DNA sequence in a sparse way whereas others encode in a probabilistic manner. The encoded sequences serve as input of SVM. The task of SVM is to classify them using its learning model. The accuracy of classification largely depends on the proper kernel selection for sequence data as well as a selection of kernel parameter. We observe each encoding technique and classify them according to their similarity. Then we discuss about kernel and their parameter selection. Our survey paper provides a basic understanding of encoding approaches and proper kernel selection of SVM for splice site prediction.
Chang You LI; Jia You CHU; Jian Kun YU; Xiao Qin HUANG; Xiao Juan LIU; Li SHI; Yan Chun CHE; Jiu Yong XIE
The splicing of many alternative exons in the precursor messenger RNA (pre-mRNA) is regulated by extracellular factors but the underlying molecular bases remain unclear. Here we report the differential regulation of Bcl-x pre-mRNA splicing by extracellular factors and their distinctrequirements for pre-mRNA elements. In K562 leukemia cells, treatment with interleukin-6 (IL-6) or granulocyte-macrophage colony stimulating factor (GM-CSF) reduced the proportion of the Bcl-xL variant mRNA while treatment with 12-O-tetradecanoylphorbol 13-acetate (TPA) had no effect. In U251 glioma cells, however, TPA efficientlyincreased the Bcl-xL level. These regulations were also seen for a transfected splicing reporter mini-gene. Further analyses of deletion mutants indicate that nucleotides 1-176 of the downstream intron are required for the IL-6 effect, whereas additional nucleotides 177-284 are essential for the GM-CSF effect. As for the TPA effect, only nucleotides 1-76 are required in the downstream intron. Thus, IL-6, GM-CSF and TPA differentially regulate Bcl-x splicing and require specific intronic pre-mRNA sequences for their respective effects.
Ferguson, M. L.; Coulon, A.; de Turris, V.; Palangat, M.; Chow, C. C.; Singer, R. H.; Larson, D. R.
The synthesis of pre-mRNA and the splicing of that pre-mRNA to form completed transcripts requires coordination between two large multi-subunit complexes (the transcription elongation complex and the spliceosome). How this coordination occurs in vivo is unknown. Here we report the first experimental observation of transcription and splicing occurring at the same gene in living cells. By utilizing the PP7/MS2 fluorescent RNA reporter system, we can directly observe two distinct regions of the nascent RNA, allowing us to measure the rise and fall time of the intron and exon of a reporter gene stably integrated into a human cell line. The reporter gene consists of a beta globin gene where we have inserted a 24 RNA hairpin cassette into the intron/exon. Upon synthesis, the RNA hairpins are tightly bound by fluorescently-labeled PP7/MS2 bacteriophage coat proteins. After gene induction, a single locus of active transcription in the nucleus shows fluorescence intensity changes characteristic of the synthesis and excision of the intron/exon. Using fluctuation analysis, we determine the elongation rate to be 1.5 kb/min. From the temporal cross correlation function, we determine that splicing of this gene must be co-transcriptional with a splicing time of ~100 seconds before termination and a ~200 second pause at termination. We propose that dual-color RNA imaging may be extended to investigate other mechanisms of transcription, gene regulation, and RNA processing.
Willis, I; Hottinger, H; Pearson, D.; Chisholm, V; Leupold, U; Söll, D
The nucleotide sequences of a Schizosaccharomyces pombe opal suppressor serine tRNA gene (sup9-e) and of 12 in vivo-generated mutant genes, which have lost the ability to suppress UGA mutations, have been determined. Analysis of the expression of these genes in Saccharomyces cerevisiae in vitro and in vivo systems has revealed defects in tRNA gene transcription and precursor tRNA processing. Single base changes in the D-loop, the intron and the extra arm affect the efficiency of splicing of t...
Full Text Available Abstract Background Regulation of the expression of particular genes can rely on mechanisms that are different from classical transcriptional and translational control. The LY6G5B and LY6G6D genes encode LY-6 domain proteins, whose expression seems to be regulated in an original fashion, consisting of an intron retention event which generates, through an early premature stop codon, a non-coding transcript, preventing expression in most cell lines and tissues. Results The MHC LY-6 non-coding transcripts have shown to be stable and very abundant in the cell, and not subject to Nonsense Mediated Decay (NMD. This retention event appears not to be solely dependent on intron features, because in the case of LY6G5B, when the intron is inserted in the artificial context of a luciferase expression plasmid, it is fully spliced but strongly stabilises the resulting luciferase transcript. In addition, by quantitative PCR we found that the retained and spliced forms are differentially expressed in tissues indicating an active regulation of the non-coding transcript. EST database analysis revealed that these genes have an alternative expression pathway with the formation of Transcription Induced Chimeras (TIC. This data was confirmed by RT-PCR, revealing the presence of different transcripts that would encode the chimeric proteins CSNKβ-LY6G5B and G6F-LY6G6D, in which the LY-6 domain would join to a kinase domain and an Ig-like domain, respectively. Conclusion In conclusion, the LY6G5B and LY6G6D intron-retained transcripts are not subjected to NMD and are more abundant than the properly spliced forms. In addition, these genes form chimeric transcripts with their neighbouring same orientation 5' genes. Of interest is the fact that the 5' genes (CSNKβ or G6F undergo differential splicing only in the context of the chimera (CSNKβ-LY6G5B or G6F-LY6G6C and not on their own.
Plocik, Alex M; Guthrie, Christine
Ribosomal proteins are essential to life. While the functions of ribosomal protein-encoding genes (RPGs) are highly conserved, the evolution of their regulatory mechanisms is remarkably dynamic. In Saccharomyces cerevisiae, RPGs are unusual in that they are commonly present as two highly similar gene copies and in that they are over-represented among intron-containing genes. To investigate the role of introns in the regulation of RPG expression, we constructed 16 S. cerevisiae strains with precise deletions of RPG introns. We found that several yeast introns function to repress rather than to increase steady-state mRNA levels. Among these, the RPS9A and RPS9B introns were required for cross-regulation of the two paralogous gene copies, which is consistent with the duplication of an autoregulatory circuit. To test for similar intron function in animals, we performed an experimental test and comparative analyses for autoregulation among distantly related animal RPS9 orthologs. Overexpression of an exogenous RpS9 copy in Drosophila melanogaster S2 cells induced alternative splicing and degradation of the endogenous copy by nonsense-mediated decay (NMD). Also, analysis of expressed sequence tag data from distantly related animals, including Homo sapiens and Ciona intestinalis, revealed diverse alternatively-spliced RPS9 isoforms predicted to elicit NMD. We propose that multiple forms of splicing regulation among RPS9 orthologs from various eukaryotes operate analogously to translational repression of the alpha operon by S4, the distant prokaryotic ortholog. Thus, RPS9 orthologs appear to have independently evolved variations on a fundamental autoregulatory circuit. PMID:22479208
Alex M Plocik
Full Text Available Ribosomal proteins are essential to life. While the functions of ribosomal protein-encoding genes (RPGs are highly conserved, the evolution of their regulatory mechanisms is remarkably dynamic. In Saccharomyces cerevisiae, RPGs are unusual in that they are commonly present as two highly similar gene copies and in that they are over-represented among intron-containing genes. To investigate the role of introns in the regulation of RPG expression, we constructed 16 S. cerevisiae strains with precise deletions of RPG introns. We found that several yeast introns function to repress rather than to increase steady-state mRNA levels. Among these, the RPS9A and RPS9B introns were required for cross-regulation of the two paralogous gene copies, which is consistent with the duplication of an autoregulatory circuit. To test for similar intron function in animals, we performed an experimental test and comparative analyses for autoregulation among distantly related animal RPS9 orthologs. Overexpression of an exogenous RpS9 copy in Drosophila melanogaster S2 cells induced alternative splicing and degradation of the endogenous copy by nonsense-mediated decay (NMD. Also, analysis of expressed sequence tag data from distantly related animals, including Homo sapiens and Ciona intestinalis, revealed diverse alternatively-spliced RPS9 isoforms predicted to elicit NMD. We propose that multiple forms of splicing regulation among RPS9 orthologs from various eukaryotes operate analogously to translational repression of the alpha operon by S4, the distant prokaryotic ortholog. Thus, RPS9 orthologs appear to have independently evolved variations on a fundamental autoregulatory circuit.
Full Text Available BACKGROUND: Although large scale informatics studies on introns can be useful in making broad inferences concerning patterns of intron gain and loss, more specific questions about intron evolution at a finer scale can be addressed using a gene family where structure and function are well known. Genome wide surveys of tetraspanins from a broad array of organisms with fully sequenced genomes are an excellent means to understand specifics of intron evolution. Our approach incorporated several new fully sequenced genomes that cover the major lineages of the animal kingdom as well as plants, protists and fungi. The analysis of exon/intron gene structure in such an evolutionary broad set of genomes allowed us to identify ancestral intron structure in tetraspanins throughout the eukaryotic tree of life. METHODOLOGY/PRINCIPAL FINDINGS: We performed a phylogenomic analysis of the intron/exon structure of the tetraspanin protein family. In addition, to the already characterized tetraspanin introns numbered 1 through 6 found in animals, three additional ancient, phase 0 introns we call 4a, 4b and 4c were found. These three novel introns in combination with the ancestral introns 1 to 6, define three basic tetraspanin gene structures which have been conserved throughout the animal kingdom. Our phylogenomic approach also allows the estimation of the time at which the introns of the 33 human tetraspanin paralogs appeared, which in many cases coincides with the concomitant acquisition of new introns. On the other hand, we observed that new introns (introns other than 1-6, 4a, b and c were not randomly inserted into the tetraspanin gene structure. The region of tetraspanin genes corresponding to the small extracellular loop (SEL accounts for only 10.5% of the total sequence length but had 46% of the new animal intron insertions. CONCLUSIONS/SIGNIFICANCE: Our results indicate that tests of intron evolution are strengthened by the phylogenomic approach with
Horner David S
Full Text Available Abstract Background Recent discoveries have highlighted the fact that alternative splicing and alternative transcripts are the rule, rather than the exception, in metazoan genes. Since multiple transcript and protein variants expressed by the same gene are, by definition, structurally distinct and need not to be functionally equivalent, the concept of gene orthology should be extended to the transcript level in order to describe evolutionary relationships between structurally similar transcript variants. In other words, the identification of true orthology relationships between gene products now should progress beyond primary sequence and "splicing orthology", consisting in ancestrally shared exon-intron structures, is required to define orthologous isoforms at transcript level. Results As a starting step in this direction, in this work we performed a large scale human- mouse gene comparison with a twofold goal: first, to assess if and to which extent traditional gene annotations such as RefSeq capture genuine splicing orthology; second, to provide a more detailed annotation and quantification of true human-mouse orthologous transcripts defined as transcripts of orthologous genes exhibiting the same splicing patterns. Conclusions We observed an identical exon/intron structure for 32% of human and mouse orthologous genes. This figure increases to 87% using less stringent criteria for gene structure similarity, thus implying that for about 13% of the human RefSeq annotated genes (and about 25% of the corresponding transcripts we could not identify any mouse transcript showing sufficient similarity to be confidently assigned as a splicing ortholog. Our data suggest that current gene and transcript data may still be rather incomplete - with several splicing variants still unknown. The observation that alternative splicing produces large numbers of alternative transcripts and proteins, some of them conserved across species and others truly species
Kamperis, Konstantinos; Siggaard, C; Herlin, Troels;
In order to elucidate the molecular basis and the clinical characteristics of X-linked recessive nephrogenic diabetes insipidus (CNDI) in a kindred of Danish descent, we performed direct sequencing of the arginine vasopressin receptor 2 (AVPR2) gene in five members of the family, as well as...... clinical investigations comprising a fluid deprivation test and a 1-deamino-8-D-arginine-vasopressin (dDAVP) infusion test in the study subject and his mother. We found a highly unusual, novel, de novo 1447A-->C point mutation (gDNA), involving the invariable splice acceptor of the second intron of the...
DeHaven, Alexander C; Norden, Ian S; Hoskins, Aaron A
The process of removing intronic sequences from a precursor to messenger RNA (pre-mRNA) to yield a mature mRNA transcript via splicing is an integral step in eukaryotic gene expression. Splicing is carried out by a cellular nanomachine called the spliceosome that is composed of RNA components and dozens of proteins. Despite decades of study, many fundamentals of spliceosome function have remained elusive. Recent developments in single-molecule fluorescence microscopy have afforded new tools to better probe the spliceosome and the complex, dynamic process of splicing by direct observation of single molecules. These cutting-edge technologies enable investigators to monitor the dynamics of specific splicing components, whole spliceosomes, and even cotranscriptional splicing within living cells. WIREs RNA 2016, 7:683-701. doi: 10.1002/wrna.1358 For further resources related to this article, please visit the WIREs website. PMID:27198613
Full Text Available Polychlorinated dibenzo-p-dioxins (PCDDs and related halogenated aromatic hydrocarbons (e.g., PCDFs, often called "dioxins", are ubiquitously present environmental contaminants. Some of them, notably 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD, are among the most toxic synthetic compounds known. The biological effects of dioxins are mediated via the aryl hydrocarbon receptor (AhR. Mutations in the AhR transactivation domain are linked to sensitivity to the acute lethality of TCDD. We present here a study of AhR gene polymorphism in normal and cancer human tissues affecting pre-mRNA splicing in the AhR gene-coding transactivation domain region (exon 10, intron 10, exon 11 region, previously shown to be associated with AhR dysfunction. We tested 126 pairs of normal and cancer tissue samples from liver, lung, stomach, kidney, mucous, breast, and pancreas of 49 males and 77 females (45-70 years of age. We used in vitro splicing assay, RT-PCR and sequencing methods. Our results showed that in an in vitro system it is possible to reconstitute cellular pre-mRNA splicing events. Tested cancer tissues did not contain mutations in the AhR transactivation domain region when the DNA sequences were compared with those from normal tissues. There were also no differences in AhR mRNA splice variants between normal and malignant breast tissues and no polymorphisms in the studied regions or cDNA.
Rocas, M; Jakubauskiene, E; Kanopka, A
Polychlorinated dibenzo-p-dioxins (PCDDs) and related halogenated aromatic hydrocarbons (e.g., PCDFs), often called "dioxins", are ubiquitously present environmental contaminants. Some of them, notably 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD), are among the most toxic synthetic compounds known. The biological effects of dioxins are mediated via the aryl hydrocarbon receptor (AhR). Mutations in the AhR transactivation domain are linked to sensitivity to the acute lethality of TCDD. We present here a study of AhR gene polymorphism in normal and cancer human tissues affecting pre-mRNA splicing in the AhR gene-coding transactivation domain region (exon 10, intron 10, exon 11 region), previously shown to be associated with AhR dysfunction. We tested 126 pairs of normal and cancer tissue samples from liver, lung, stomach, kidney, mucous, breast, and pancreas of 49 males and 77 females (45-70 years of age). We used in vitro splicing assay, RT-PCR and sequencing methods. Our results showed that in an in vitro system it is possible to reconstitute cellular pre-mRNA splicing events. Tested cancer tissues did not contain mutations in the AhR transactivation domain region when the DNA sequences were compared with those from normal tissues. There were also no differences in AhR mRNA splice variants between normal and malignant breast tissues and no polymorphisms in the studied regions or cDNA. PMID:22052373
Pistoni, Mariaelena; Ghigna, Claudia; Gabellini, Davide
Alternative splicing of pre-mRNAs is a major contributor to proteomic diversity and to the control of gene expression in higher eukaryotic cells. For this reasons, alternative splicing is tightly regulated in different tissues and developmental stages and its disruption can lead to a wide range of human disorders. The aim of this review is to focus on the relevance of alternative splicing for muscle function and muscle disease. We begin by giving a brief overview of alternative splicing, musc...
Wu, XianMing; Hurst, Laurence D.
Where in genes do pathogenic mutations tend to occur and does this provide clues as to the possible underlying mechanisms by which single nucleotide polymorphisms (SNPs) cause disease? As splice-disrupting mutations tend to occur predominantly at exon ends, known also to be hot spots of cis-exonic splice control elements, we examine the relationship between the relative density of such exonic cis-motifs and pathogenic SNPs. In particular, we focus on the intragene distribution of exonic splicing enhancers (ESE) and the covariance between them and disease-associated SNPs. In addition to showing that disease-causing genes tend to be genes with a high intron density, consistent with missplicing, five factors established as trends in ESE usage, are considered: relative position in exons, relative position in genes, flanking intron size, splice sites usage, and phase. We find that more than 76% of pathogenic SNPs are within 3–69 bp of exon ends where ESEs generally reside, this being 13% more than expected. Overall from enrichment of pathogenic SNPs at exon ends, we estimate that approximately 20–45% of SNPs affect splicing. Importantly, we find that within genes pathogenic SNPs tend to occur in splicing-relevant regions with low ESE density: they are found to occur preferentially in the terminal half of genes, in exons flanked by short introns and at the ends of phase (0,0) exons with 3′ non-“AGgt” splice site. We suggest the concept of the “fragile” exon, one home to pathogenic SNPs owing to its vulnerability to splice disruption owing to low ESE density. PMID:26545919
Martínez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Martínez-Montiel, Mónica; Gaspariano-Cholula, Mayra Patricia; Martínez-Contreras, Rebeca D
In eukaryotes, genes are frequently interrupted with noncoding sequences named introns. Alternative splicing is a nuclear mechanism by which these introns are removed and flanking coding regions named exons are joined together to generate a message that will be translated in the cytoplasm. This mechanism is catalyzed by a complex machinery known as the spliceosome, which is conformed by more than 300 proteins and ribonucleoproteins that activate and regulate the precision of gene expression when assembled. It has been proposed that several genetic diseases are related to defects in the splicing process, including cancer. For this reason, natural products that show the ability to regulate splicing have attracted enormous attention due to its potential use for cancer treatment. Some microbial metabolites have shown the ability to inhibit gene splicing and the molecular mechanism responsible for this inhibition is being studied for future applications. Here, we summarize the main types of natural products that have been characterized as splicing inhibitors, the recent advances regarding molecular and cellular effects related to these molecules, and the applications reported so far in cancer therapeutics. PMID:27610372
Garland, Jeffrey A.; Aalberts, Daniel P.
When eukaryotic genes are edited by the spliceosome, the first step in intron recognition is the binding of a U1 small nuclear RNA with the donor ( 5' ) splice site. We model this interaction thermodynamically to identify splice sites. Applied to a set of 65 annotated genes, our “finding with binding” method achieves a significant separation between real and false sites. Analyzing binding patterns allows us to discard a large number of decoy sites. Our results improve statistics-based methods for donor site recognition, demonstrating the promise of physical modeling to find functional elements in the genome.
Marcel, V; Fernandes, K; Terrier, O; LANE, D. P.; Bourdon, J-C
In addition to the tumor suppressor p53 protein, also termed p53α, the TP53 gene produces p53β and p53γ through alternative splicing of exons 9β and 9γ located within TP53 intron 9. Here we report that both TG003, a specific inhibitor of Cdc2-like kinases (Clk) that regulates the alternative splicing pre-mRNA pathway, and knockdown of SFRS1 increase expression of endogenous p53β and p53γ at mRNA and protein levels. Development of a TP53 intron 9 minigene shows that TG003 treatment and knockdo...
Daguenet, Elisabeth; Dujardin, Gwendal; Valcárcel, Juan
Removal of introns from pre-mRNA precursors (pre-mRNA splicing) is a necessary step for the expression of most genes in multicellular organisms, and alternative patterns of intron removal diversify and regulate the output of genomic information. Mutation or natural variation in pre-mRNA sequences, as well as in spliceosomal components and regulatory factors, has been implicated in the etiology and progression of numerous pathologies. These range from monogenic to multifactorial genetic diseases, including metabolic syndromes, muscular dystrophies, neurodegenerative and cardiovascular diseases, and cancer. Understanding the molecular mechanisms associated with splicing-related pathologies can provide key insights into the normal function and physiological context of the complex splicing machinery and establish sound basis for novel therapeutic approaches. PMID:26566663
Matioli Sergio R
Full Text Available Abstract Background Group I introns are found in the nuclear small subunit ribosomal RNA gene (SSU rDNA of some species of the genus Porphyra (Bangiales, Rhodophyta. Size polymorphisms in group I introns has been interpreted as the result of the degeneration of homing endonuclease genes (HEG inserted in peripheral loops of intron paired elements. In this study, intron size polymorphisms were characterized for different Porphyra spiralis var. amplifolia (PSA populations on the Southern Brazilian coast, and were used to infer genetic relationships and genetic structure of these PSA populations, in addition to cox2-3 and rbcL-S regions. Introns of different sizes were tested qualitatively for in vitro self-splicing. Results Five intron size polymorphisms within 17 haplotypes were obtained from 80 individuals representing eight localities along the distribution of PSA in the Eastern coast of South America. In order to infer genetic structure and genetic relationships of PSA, these polymorphisms and haplotypes were used as markers for pairwise Fst analyses, Mantel's test and median joining network. The five cox2-3 haplotypes and the unique rbcL-S haplotype were used as markers for summary statistics, neutrality tests Tajima's D and Fu's Fs and for median joining network analyses. An event of demographic expansion from a population with low effective number, followed by a pattern of isolation by distance was obtained for PSA populations with the three analyses. In vitro experiments have shown that introns of different lengths were able to self-splice from pre-RNA transcripts. Conclusion The findings indicated that degenerated HEGs are reminiscent of the presence of a full-length and functional HEG, once fixed for PSA populations. The cline of HEG degeneration determined the pattern of isolation by distance. Analyses with the other markers indicated an event of demographic expansion from a population with low effective number. The different degrees of
Hnilicová, Jarmila; Staněk, David
Roč. 2, č. 3 (2011), s. 182-188. ISSN 1949-1034 R&D Projects: GA ČR GAP305/10/0424; GA AV ČR KAN200520801 Institutional research plan: CEZ:AV0Z50520514 Keywords : chromatin * exon * alternative splicing * transcription * snRNP Subject RIV: EB - Genetics ; Molecular Biology
Ferdos Alaa El Din
Full Text Available Hereditary Hemorrhagic Telangiectasia syndrome (HHT or Rendu-Osler-Weber (ROW syndrome is an autosomal dominant vascular disorder. Two most common forms of HHT, HHT1 and HHT2, have been linked to mutations in the endoglin (ENG and activin receptor-like kinase 1 (ACVRL1or ALK1 genes respectively. This work was designed to examine the pathogenicity of 23 nucleotide variations in ACVRL1 gene detected in more than 400 patients. Among them, 14 missense mutations and one intronic variant were novels, and 8 missense mutations were previously identified with questionable implication in HHT2. The functionality of missense mutations was analyzed in response to BMP9 (specific ligand of ALK1, the maturation of the protein products and their localization were analyzed by western blot and fluorescence microscopy. The splicing impairment of the intronic and of two missense mutations was examined by minigene assay. Functional analysis showed that 18 out of 22 missense mutations were defective. Splicing analysis revealed that one missense mutation (c.733A>G, p.Ile245Val affects the splicing of the harboring exon 6. Similarly, the intronic mutation outside the consensus splicing sites (c.1048+5G>A in intron 7 was seen pathogenic by splicing study. Both mutations induce a frame shift creating a premature stop codon likely resulting in mRNA degradation by NMD surveillance mechanism. Our results confirm the haploinsufficiency model proposed for HHT2. The affected allele of ACVRL1 induces mRNA degradation or the synthesis of a protein lacking the receptor activity. Furthermore, our data demonstrate that functional and splicing analyses together, represent two robust diagnostic tools to be used by geneticists confronted with novel or conflicted ACVRL1 mutations.
Shieh, Grace S.
Abstract Background The packaging of DNA into chromatin regulates transcription from initiation through 3\\' end processing. One aspect of transcription in which chromatin plays a poorly understood role is the co-transcriptional splicing of pre-mRNA. Results Here we provide evidence that H2B monoubiquitylation (H2BK123ub1) marks introns in Saccharomyces cerevisiae. A genome-wide map of H2BK123ub1 in this organism reveals that this modification is enriched in coding regions and that its levels peak at the transcribed regions of two characteristic subgroups of genes. First, long genes are more likely to have higher levels of H2BK123ub1, correlating with the postulated role of this modification in preventing cryptic transcription initiation in ORFs. Second, genes that are highly transcribed also have high levels of H2BK123ub1, including the ribosomal protein genes, which comprise the majority of intron-containing genes in yeast. H2BK123ub1 is also a feature of introns in the yeast genome, and the disruption of this modification alters the intragenic distribution of H3 trimethylation on lysine 36 (H3K36me3), which functionally correlates with alternative RNA splicing in humans. In addition, the deletion of genes encoding the U2 snRNP subunits, Lea1 or Msl1, in combination with an htb-K123R mutation, leads to synthetic lethality. Conclusion These data suggest that H2BK123ub1 facilitates cross talk between chromatin and pre-mRNA splicing by modulating the distribution of intronic and exonic histone modifications.
Nozaki, Hisayoshi; Takahara, Manabu; Nakazawa, Atsushi; Kita, Yoko; Yamada, Takashi; Takano, Hiroyoshi; Kawano, Shigeyuki; Kato, Masahiro
Mobile group I introns sometimes contain an open reading frame (ORF) possibly encoding a site-specific DNA endonuclease. However, previous phylogenetic studies have not clearly deduced the evolutionary roles of the group I intron ORFs. In this paper, we examined the phylogeny of group IA2 introns inserted in the position identical to that of the chloroplast-encoded rbcL coding region (rbcL-462 introns) and their ORFs from 13 strains of five genera (Volvox, Pleodorina, Volvulina, Astrephomene, and Gonium) of the colonial Volvocales (Chlorophyceae) and a related unicellular green alga, Vitreochlamys. The rbcL-462 introns contained an intact or degenerate ORF of various sizes except for the Gonium multicoccum rbcL-462 intron. Partial amino acid sequences of some rbcL-462 intron ORFs exhibited possible homology to the endo/excinuclease amino acid terminal domain. The distribution of the rbcL-462 introns is sporadic in the phylogenetic trees of the colonial Volvocales based on the five chloroplast exon sequences (6021 bp). Phylogenetic analyses of the conserved intron sequences resolved that the G. multicoccum rbcL-462 intron had a phylogenetic position separate from those of other colonial volvocalean rbcL-462 introns, indicating the recent horizontal transmission of the intron in the G. multicoccum lineage. However, the combined data set from conserved intron sequences and ORFs from most of the rbcL-462 introns resolved robust phylogenetic relationships of the introns that were consistent with those of the host organisms. Therefore, most of the extant rbcL-462 introns may have been vertically inherited from the common ancestor of their host organisms, whereas such introns may have been lost in other lineages during evolution of the colonial Volvocales. In addition, apparently higher synonymous substitutions than nonsynonymous substitutions in the rbcL-462 intron ORFs indicated that the ORFs might evolve under functional constraint, which could result in homing of the
Bao-Long NIU; Zhi-Qi MENG; Yue-Zhi TAO; Shun-Lin LU; Hong-Biao WENG; Li-Hua HE; Wei-Feng SHEN
We have identified Bombyx mori transformer-2 gene (Bmtra-2) cDNA by blasting the EST database of B. mori. It was expressed in the whole life of the male and female silkworm and was observed as a band of 1.3 kb by Northern blot analysis. By comparing corresponding ESTs to the Bmtra-2 DNA sequence,it was revealed that there were eight exons and seven introns, and all splice sites of exons/introns conformed to the GT/AG rule. Bmtra-2 pre-mRNA can produce multiple mRNAs encoding six distinct isoforms of BmTRA-2 protein using an alternative splicing pathway during processing. Six types of Bmtra-2 cDNA clones were identified by reverse transcription-polymerase chain reaction. All isoforms of BmTRA-2 protein contain two arginine/serine-rich domains and one RNA recognition motif, showing striking organizational similarity to Drosophila TRA-2 proteins.
Pettitt, Jonathan; Müller, Berndt; Stansfield, Ian; Connolly, Bernadette
The trans-splicing of short spliced leader (SL) RNAs onto the 5′ ends of mRNAs occurs in a diverse range of taxa. In nematodes, all species so far characterized utilize a characteristic, conserved spliced leader, SL1, as well as variants that are employed in the resolution of operons. Here we report the identification of spliced leader trans-splicing in the basal nematode Trichinella spiralis, and show that this nematode does not possess a canonical SL1, but rather has at least 15 distinct sp...
Ye, Zhenqing; Chen, Zhong; Lan, Xun; Hara, Stephen; Sunkel, Benjamin; Huang, Tim H-M; Elnitski, Laura; Wang, Qianben; Jin, Victor X
Alternative splicing (AS), in higher eukaryotes, is one of the mechanisms of post-transcriptional regulation that generate multiple transcripts from the same gene. One particular mode of AS is the skipping event where an exon may be alternatively excluded or constitutively included in the resulting mature mRNA. Both transcript isoforms from this skipping event site, i.e. in which the exon is either included (inclusion isoform) or excluded (skipping isoform), are typically present in one cell, and maintain a subtle balance that is vital to cellular function and dynamics. However, how the prevailing conditions dictate which isoform is expressed and what biological factors might influence the regulation of this process remain areas requiring further exploration. In this study, we have developed a novel computational method, graph-based exon-skipping scanner (GESS), for de novo detection of skipping event sites from raw RNA-seq reads without prior knowledge of gene annotations, as well as for determining the dominant isoform generated from such sites. We have applied our method to publicly available RNA-seq data in GM12878 and K562 cells from the ENCODE consortium and experimentally validated several skipping site predictions by RT-PCR. Furthermore, we integrated other sequencing-based genomic data to investigate the impact of splicing activities, transcription factors (TFs) and epigenetic histone modifications on splicing outcomes. Our computational analysis found that splice sites within the skipping-isoform-dominated group (SIDG) tended to exhibit weaker MaxEntScan-calculated splice site strength around middle, 'skipping', exons compared to those in the inclusion-isoform-dominated group (IIDG). We further showed the positional preference pattern of splicing factors, characterized by enrichment in the intronic splice sites immediately bordering middle exons. Finally, our analysis suggested that different epigenetic factors may introduce a variable obstacle in the
Fukumura, Kazuhiro; Wakabayashi, Shunichi; Kataoka, Naoyuki; Sakamoto, Hiroshi; Suzuki, Yutaka; Nakai, Kenta; Mayeda, Akila; Inoue, Kunio
The exon junction complex (EJC) that is deposited onto spliced mRNAs upstream of exon-exon junctions plays important roles in multiple post-splicing gene expression events, such as mRNA export, surveillance, localization, and translation. However, a direct role for the human EJC in pre-mRNA splicing has not been fully understood. Using HeLa cells, we depleted one of the EJC core components, Y14, and the resulting transcriptome was analyzed by deep sequencing (RNA-Seq) and confirmed by RT-PCR. We found that Y14 is required for efficient and faithful splicing of a group of transcripts that is enriched in short intron-containing genes involved in mitotic cell-cycle progression. Tethering of EJC core components (Y14, eIF4AIII or MAGOH) to a model reporter pre-mRNA harboring a short intron showed that these core components are prerequisites for the splicing activation. Taken together, we conclude that the EJC core assembled on pre-mRNA is critical for efficient and faithful splicing of a specific subset of short introns in mitotic cell cycle-related genes. PMID:27490541
Carolina Concha; Fang Li; Maxwell J. Scott
Genetic control of sex determination in insects has been best characterized in Drosophila melanogaster, where the master gene Sxl codes for RNA that is sex specifically spliced to produce a functional protein only in females. SXL regulates the sex-specific splicing of transformer (tra) RNA which, in turn, regulates the splicing of dsx RNA to produce functional male and female proteins. In the Australian sheep blowfly (Lucilia cuprina), the tra gene (Lctra) is required for female development and Lctra transcripts are sex-specifically spliced such that only female Lctra mRNA codes for functional protein. In males, a factor encoded by the Y-linked male determining gene is thought to prevent the female-mode of splicing of Lctra RNA. To further our understanding of the sex determination regulatory hierarchy in L. cuprina, we have isolated the dsx gene (Lcdsx) from this species. We found that the Lcdsx transcripts are sex-specifically spliced in a similar manner as their counterparts in D. melanogaster, housefly and tephritids. The LcDSX proteins are well conserved and the male form of DSX contains a motif encoded by a male-specific exon that is within the female-specific intron. This intron/exon arrangement had previously been found only in the housefly dsx gene, suggesting this may be a unique feature of dsx genes of Calyptratae species.
Shchelkunova, Aleksandra; Ermolinsky, Boris; Boyle, Meghan; Mendez, Ivan; Lehker, Michael; Martirosyan, Karen S; Kazansky, Alexander V
STAT5B, a specific member of the STAT family, is intimately associated with prostate tumor progression. While the full form of STAT5B is thought to promote tumor progression, a naturally occurring truncated isoform acts as a tumor suppressor. We previously demonstrated that truncated STAT5 is generated by insertion of an alternatively spliced exon and results in the introduction of an early termination codon. Present approaches targeting STAT proteins based on inhibition of functional domains of STAT's, such as DNA-binding, cooperative binding (protein-protein interaction), dimerization and phosphorylation will halt the action of the entire gene, both the proto-oncogenic and tumor suppressor functions of Stat5B. In this report we develop a new approach aimed at inhibiting the expression of full-length STAT5B (a proto-oncogene) while simultaneously enhancing the expression of STAT5∆B (a tumor suppressor). We have demonstrated the feasibility of using steric-blocking splice-switching oligonucleotides (SSOs) with a complimentary sequence to the targeted exon-intron boundary to enhance alternative intron/exon retention (up to 10%). The functional effect of the intron/exon proportional tuning was validated by cell proliferation and clonogenic assays. The new scheme applies specific steric-blocking splice-switching oligonucleotides and opens an opportunity for anti-tumor treatment as well as for the alteration of functional abilities of other STAT proteins. PMID:23289016
Aleksandra Shchelkunova, Boris Ermolinsky, Meghan Boyle, Ivan Mendez, Michael Lehker, Karen S. Martirosyan, Alexander V. Kazansky
Full Text Available STAT5B, a specific member of the STAT family, is intimately associated with prostate tumor progression. While the full form of STAT5B is thought to promote tumor progression, a naturally occurring truncated isoform acts as a tumor suppressor. We previously demonstrated that truncated STAT5 is generated by insertion of an alternatively spliced exon and results in the introduction of an early termination codon. Present approaches targeting STAT proteins based on inhibition of functional domains of STAT's, such as DNA-binding, cooperative binding (protein-protein interaction, dimerization and phosphorylation will halt the action of the entire gene, both the proto-oncogenic and tumor suppressor functions of Stat5B.In this report we develop a new approach aimed at inhibiting the expression of full-length STAT5B (a proto-oncogene while simultaneously enhancing the expression of STAT5∆B (a tumor suppressor. We have demonstrated the feasibility of using steric-blocking splice-switching oligonucleotides (SSOs with a complimentary sequence to the targeted exon-intron boundary to enhance alternative intron/exon retention (up to 10%. The functional effect of the intron/exon proportional tuning was validated by cell proliferation and clonogenic assays. The new scheme applies specific steric-blocking splice-switching oligonucleotides and opens an opportunity for anti-tumor treatment as well as for the alteration of functional abilities of other STAT proteins.
Shchelkunova, Aleksandra; Ermolinsky, Boris; Boyle, Meghan; Mendez, Ivan; Lehker, Michael; Martirosyan, Karen S.; Kazansky, Alexander V.
STAT5B, a specific member of the STAT family, is intimately associated with prostate tumor progression. While the full form of STAT5B is thought to promote tumor progression, a naturally occurring truncated isoform acts as a tumor suppressor. We previously demonstrated that truncated STAT5 is generated by insertion of an alternatively spliced exon and results in the introduction of an early termination codon. Present approaches targeting STAT proteins based on inhibition of functional domains of STAT's, such as DNA-binding, cooperative binding (protein-protein interaction), dimerization and phosphorylation will halt the action of the entire gene, both the proto-oncogenic and tumor suppressor functions of Stat5B. In this report we develop a new approach aimed at inhibiting the expression of full-length STAT5B (a proto-oncogene) while simultaneously enhancing the expression of STAT5∆B (a tumor suppressor). We have demonstrated the feasibility of using steric-blocking splice-switching oligonucleotides (SSOs) with a complimentary sequence to the targeted exon-intron boundary to enhance alternative intron/exon retention (up to 10%). The functional effect of the intron/exon proportional tuning was validated by cell proliferation and clonogenic assays. The new scheme applies specific steric-blocking splice-switching oligonucleotides and opens an opportunity for anti-tumor treatment as well as for the alteration of functional abilities of other STAT proteins. PMID:23289016
Full Text Available Towards the global understanding of plant meiosis, it seems to be essential to decipher why all as yet sequenced plants need or at least encode for two different meiotic SPO11 genes. This is in contrast to mammals and fungi, where only one SPO11 is present. Both SPO11 in plants are essential for the initiation of double strand breaks (DSBs during the meiotic prophase. In nearly all eukaryotic organisms DSB induction by SPO11 leads to meiotic DSB repair, thereby ensuring the formation of a necessary number of crossovers (CO as physical connections between the allelic chromosomes. We aim to investigate the specific functions and evolution of both SPO11 genes in land plants. Therefore, we identified and cloned the respective orthologous genes from Brassica rapa, Carica papaya, Oryza sativa and Physcomitrella patens. In parallel we determined the full length cDNA sequences of SPO11-1 and -2 from all of these plants by RT-PCR. During these experiments we observed that the analyzed plants exhibit a pattern of aberrant splicing products of both SPO11 mRNAs. Such an aberrant splicing has previously been described for Arabidopsis and therefore seems to be conserved throughout evolution. Most of the splicing forms of SPO11-1 and -2 seem to be non functional as they either showed intron retention or shortened exons accompanied by a frameshift leading to premature termination codons (PTCs in most cases. Nevertheless, we could detect one putative functional alternatively spliced mRNA for SPO11-1 and -2 each, indicating that splicing of SPO11 does not depend only on the gene sequence but also on the plant species and that it might play a regulatory role.
Full Text Available BACKGROUND: As research into alternative splicing reveals the fundamental importance of this phenomenon in the genome expression of higher organisms, there is an increasing need for a standardized, consistent and unique identifier for alternatively spliced isoforms. Such an identifier would be useful to eliminate ambiguities in references to gene isoforms, and would allow for the reliable comparison of isoforms from different sources (e.g., known genes vs. computational predictions. Commonly used identifiers for gene transcripts prove to be unsuitable for this purpose. METHODOLOGY: We propose an algorithm to compute an isoform signature based on the arrangement of exons and introns in a primary transcript. The isoform signature uniquely identifies a transcript structure, and can therefore be used as a key in databases of alternatively spliced isoforms, or to compare alternative splicing predictions produced by different methods. In this paper we present the algorithm to generate isoform signatures, we provide some examples of its application, and we describe a web-based resource to generate isoform signatures and use them in database searches. CONCLUSIONS: Isoform signatures are simple, so that they can be easily generated and included in publications and databases, but flexible enough to unambiguously represent all possible isoform structures, including information about coding sequence position and variable transcription start and end sites. We believe that the adoption of isoform signatures can help establish a consistent, unambiguous nomenclature for alternative splicing isoforms. The system described in this paper is freely available at http://genome.ufl.edu/genesig/, and supplementary materials can be found at http://genome.ufl.edu/genesig-files/.
Chen, Weiguo; Liu, Yang; Li, Hongxing; Chang, Shuang; Shu, Dingming; Zhang, Huanmin; Chen, Feng; Xie, Qingmei
The group of avian sarcoma and leukosis virus (ASLV) in chickens contains six highly related subgroups, A to E and J. Four genetic loci, tva, tvb, tvc and tvj, encode for corresponding receptors that determine the susceptibility to the ASLV subgroups. The prevalence of ASLV in hosts may have imposed strong selection pressure toward resistance to ASLV infection, and the resistant alleles in all four receptor genes have been identified. In this study, two new alleles of the tva receptor gene, tva(r5) and tva(r6), with similar intronic deletions were identified in Chinese commercial broilers. These natural mutations delete the deduced branch point signal within the first intron, disrupting mRNA splicing of the tva receptor gene and leading to the retention of intron 1 and introduction of premature TGA stop codons in both the longer and shorter tva isoforms. As a result, decreased susceptibility to subgroup A ASLV in vitro and in vivo was observed in the subsequent analysis. In addition, we identified two groups of heterozygous allele pairs which exhibited quantitative differences in host susceptibility to ASLV-A. This study demonstrated that defective splicing of the tva receptor gene can confer genetic resistance to ASLV subgroup A in the host. PMID:25873518