Sample records for gene rearrangements duplication

  1. Balanced gene losses, duplications and intensive rearrangements led to an unusual regularly sized genome in Arbutus unedo chloroplasts.

    Directory of Open Access Journals (Sweden)

    Fernando Martínez-Alberola

    Full Text Available Completely sequenced plastomes provide a valuable source of information about the duplication, loss, and transfer events of chloroplast genes and phylogenetic data for resolving relationships among major groups of plants. Moreover, they can also be useful for exploiting chloroplast genetic engineering technology. Ericales account for approximately six per cent of eudicot diversity with 11,545 species from which only three complete plastome sequences are currently available. With the aim of increasing the number of ericalean complete plastome sequences, and to open new perspectives in understanding Mediterranean plant adaptations, a genomic study on the basis of the complete chloroplast genome sequencing of Arbutus unedo and an updated phylogenomic analysis of Asteridae was implemented. The chloroplast genome of A. unedo shows extensive rearrangements but a medium size (150,897 nt in comparison to most of angiosperms. A number of remarkable distinct features characterize the plastome of A. unedo: five-fold dismissing of the SSC region in relation to most angiosperms; complete loss or pseudogenization of a number of essential genes; duplication of the ndhH-D operon and its location within the two IRs; presence of large tandem repeats located near highly re-arranged regions and pseudogenes. All these features outline the primary evolutionary split between Ericaceae and other ericalean families. The newly sequenced plastome of A. unedo with the available asterid sequences allowed the resolution of some uncertainties in previous phylogenies of Asteridae.

  2. Genome-wide signatures of 'rearrangement hotspots' within segmental duplications in humans.

    Directory of Open Access Journals (Sweden)

    Mohammed Uddin

    Full Text Available The primary objective of this study was to create a genome-wide high resolution map (i.e., >100 bp of 'rearrangement hotspots' which can facilitate the identification of regions capable of mediating de novo deletions or duplications in humans. A hierarchical method was employed to fragment segmental duplications (SDs into multiple smaller SD units. Combining an end space free pairwise alignment algorithm with a 'seed and extend' approach, we have exhaustively searched 409 million alignments to detect complex structural rearrangements within the reference-guided assembly of the NA18507 human genome (18× coverage, including the previously identified novel 4.8 Mb sequence from de novo assembly within this genome. We have identified 1,963 rearrangement hotspots within SDs which encompass 166 genes and display an enrichment of duplicated gene nucleotide variants (DNVs. These regions are correlated with increased non-allelic homologous recombination (NAHR event frequency which presumably represents the origin of copy number variations (CNVs and pathogenic duplications/deletions. Analysis revealed that 20% of the detected hotspots are clustered within the proximal and distal SD breakpoints flanked by the pathogenic deletions/duplications that have been mapped for 24 NAHR-mediated genomic disorders. FISH Validation of selected complex regions revealed 94% concordance with in silico localization of the highly homologous derivatives. Other results from this study indicate that intra-chromosomal recombination is enhanced in genic compared with agenic duplicated regions, and that gene desert regions comprising SDs may represent reservoirs for creation of novel genes. The generation of genome-wide signatures of 'rearrangement hotspots', which likely serve as templates for NAHR, may provide a powerful approach towards understanding the underlying mutational mechanism(s for development of constitutional and acquired diseases.

  3. Lateral gene transfer, rearrangement, reconciliation

    NARCIS (Netherlands)

    Patterson, M.D.; Szollosi, G.; Daubin, V.; Tannier, E.


    Background. Models of ancestral gene order reconstruction have progressively integrated different evolutionary patterns and processes such as unequal gene content, gene duplications, and implicitly sequence evolution via reconciled gene trees. These models have so far ignored lateral gene transfer,

  4. Complete mtDNA sequences of two millipedes suggest a new model for mitochondrial gene rearrangements: Duplication and non-random loss

    Energy Technology Data Exchange (ETDEWEB)

    Lavrov, Dennis V.; Boore, Jeffrey L.; Brown, Wesley M.


    We determined the complete mtDNA sequences of the millipedes Narceus annularus and Thyropygus sp. (Arthropoda: Diplopoda) and identified in both genomes all 37 genes typical for metazoan mtDNA. The arrangement of these genes is identical in the two millipedes, but differs from that inferred to be ancestral for arthropods by the location of four genes/gene clusters. This novel gene arrangement is unusual for animal mtDNA, in that genes with opposite transcriptional polarities are clustered in the genome and the two clusters are separated by two non-coding regions. The only exception to this pattern is the gene for cysteine tRNA, which is located in the part of the genome that otherwise contains all genes with the opposite transcriptional polarity. We suggest that a mechanism involving complete mtDNA duplication followed by the loss of genes, predetermined by their transcriptional polarity and location in the genome, could generate this gene arrangement from the one ancestral for arthropods. The proposed mechanism has important implications for phylogenetic inferences that are drawn on the basis of gene arrangement comparisons.

  5. Insights into the evolutionary history of tubercle bacilli as disclosed by genetic rearrangements within a PE_PGRS duplicated gene pair

    Directory of Open Access Journals (Sweden)

    Kurepina Natalia


    Full Text Available Abstract Background The highly homologous PE_PGRS (Proline-glutamic acid_polymorphic GC-rich repetitive sequence genes are members of the PE multigene family which is found only in mycobacteria. PE genes are particularly abundant within the genomes of pathogenic mycobacteria where they seem to have expanded as a result of gene duplication events. PE_PGRS genes are characterized by their high GC content and extensive repetitive sequences, making them prone to recombination events and genetic variability. Results Comparative sequence analysis of Mycobacterium tuberculosis genes PE_PGRS17 (Rv0978c and PE_PGRS18 (Rv0980c revealed a striking genetic variation associated with this typical tandem duplicate. In comparison to the M. tuberculosis reference strain H37Rv, the variation (named the 12/40 polymorphism consists of an in-frame 12-bp insertion invariably accompanied by a set of 40 single nucleotide polymorphisms (SNPs that occurs either in PE_PGRS17 or in both genes. Sequence analysis of the paralogous genes in a representative set of worldwide distributed tubercle bacilli isolates revealed data which supported previously proposed evolutionary scenarios for the M. tuberculosis complex (MTBC and confirmed the very ancient origin of "M. canettii" and other smooth tubercle bacilli. Strikingly, the identified polymorphism appears to be coincident with the emergence of the post-bottleneck successful clone from which the MTBC expanded. Furthermore, the findings provide direct and clear evidence for the natural occurrence of gene conversion in mycobacteria, which appears to be restricted to modern M. tuberculosis strains. Conclusion This study provides a new perspective to explore the molecular events that accompanied the evolution, clonal expansion, and recent diversification of tubercle bacilli.

  6. Intraspecific rearrangement of duplicated mitochondrial control regions in the Luzon Tarictic Hornbill Penelopides manillae (Aves: Bucerotidae). (United States)

    Sammler, Svenja; Ketmaier, Valerio; Havenstein, Katja; Tiedemann, Ralph


    Philippine hornbills of the genera Aceros and Penelopides (Bucerotidae) are known to possess a large tandemly duplicated fragment in their mitochondrial genome, whose paralogous parts largely evolve in concert. In the present study, we surveyed the two distinguishable duplicated control regions in several individuals of the Luzon Tarictic Hornbill Penelopides manillae, compare their characteristics within and across individuals, and report on an intraspecific mitochondrial gene rearrangement found in one single specimen, i.e., an interchange between the two control regions. To our knowledge, this is the first observation of two distinct mitochondrial genome rearrangements within a bird species. We briefly discuss a possible evolutionary mechanism responsible for this pattern, and highlight potential implications for the application of control region sequences as a marker in population genetics and phylogeography.

  7. MSOAR 2.0: Incorporating tandem duplications into ortholog assignment based on genome rearrangement

    Directory of Open Access Journals (Sweden)

    Zhang Liqing


    Full Text Available Abstract Background Ortholog assignment is a critical and fundamental problem in comparative genomics, since orthologs are considered to be functional counterparts in different species and can be used to infer molecular functions of one species from those of other species. MSOAR is a recently developed high-throughput system for assigning one-to-one orthologs between closely related species on a genome scale. It attempts to reconstruct the evolutionary history of input genomes in terms of genome rearrangement and gene duplication events. It assumes that a gene duplication event inserts a duplicated gene into the genome of interest at a random location (i.e., the random duplication model. However, in practice, biologists believe that genes are often duplicated by tandem duplications, where a duplicated gene is located next to the original copy (i.e., the tandem duplication model. Results In this paper, we develop MSOAR 2.0, an improved system for one-to-one ortholog assignment. For a pair of input genomes, the system first focuses on the tandemly duplicated genes of each genome and tries to identify among them those that were duplicated after the speciation (i.e., the so-called inparalogs, using a simple phylogenetic tree reconciliation method. For each such set of tandemly duplicated inparalogs, all but one gene will be deleted from the concerned genome (because they cannot possibly appear in any one-to-one ortholog pairs, and MSOAR is invoked. Using both simulated and real data experiments, we show that MSOAR 2.0 is able to achieve a better sensitivity and specificity than MSOAR. In comparison with the well-known genome-scale ortholog assignment tool InParanoid, Ensembl ortholog database, and the orthology information extracted from the well-known whole-genome multiple alignment program MultiZ, MSOAR 2.0 shows the highest sensitivity. Although the specificity of MSOAR 2.0 is slightly worse than that of InParanoid in the real data experiments

  8. Complex rearrangements in patients with duplications of MECP2 can occur by fork stalling and template switching (United States)

    Carvalho, Claudia M.B.; Zhang, Feng; Liu, Pengfei; Patel, Ankita; Sahoo, Trilochan; Bacino, Carlos A.; Shaw, Chad; Peacock, Sandra; Pursley, Amber; Tavyev, Y. Jane; Ramocki, Melissa B.; Nawara, Magdalena; Obersztyn, Ewa; Vianna-Morgante, Angela M.; Stankiewicz, Pawel; Zoghbi, Huda Y.; Cheung, Sau Wai; Lupski, James R.


    Duplication at the Xq28 band including the MECP2 gene is one of the most common genomic rearrangements identified in neurodevelopmentally delayed males. Such duplications are non-recurrent and can be generated by a non-homologous end joining (NHEJ) mechanism. We investigated the potential mechanisms for MECP2 duplication and examined whether genomic architectural features may play a role in their origin using a custom designed 4-Mb tiling-path oligonucleotide array CGH assay. Each of the 30 patients analyzed showed a unique duplication varying in size from ∼250 kb to ∼2.6 Mb. Interestingly, in 77% of these non-recurrent duplications, the distal breakpoints grouped within a 215 kb genomic interval, located 47 kb telomeric to the MECP2 gene. The genomic architecture of this region contains both direct and inverted low-copy repeat (LCR) sequences; this same region undergoes polymorphic structural variation in the general population. Array CGH revealed complex rearrangements in eight patients; in six patients the duplication contained an embedded triplicated segment, and in the other two, stretches of non-duplicated sequences occurred within the duplicated region. Breakpoint junction sequencing was achieved in four duplications and identified an inversion in one patient, demonstrating further complexity. We propose that the presence of LCRs in the vicinity of the MECP2 gene may generate an unstable DNA structure that can induce DNA strand lesions, such as a collapsed fork, and facilitate a Fork Stalling and Template Switching event producing the complex rearrangements involving MECP2. PMID:19324899

  9. Analysis of Duplicate Genes in Soybean

    Institute of Scientific and Technical Information of China (English)

    C.M. Cai; K.J. Van; M.Y. Kim; S.H. Lee


    @@ Gene duplication is a major determinant of the size and gene complement of eukaryotic genomes (Lockton and Gaut, 2005). There are a number of different ways in which duplicate genes can arise (Sankoff, 2001), but the most spectacular method of gene duplication may be whole genome duplication via polyploidization.

  10. Mitochondrial genome sequences of Nematocera (lower Diptera): evidence of rearrangement following a complete genome duplication in a winter crane fly. (United States)

    Beckenbach, Andrew T


    The complete mitochondrial DNA sequences of eight representatives of lower Diptera, suborder Nematocera, along with nearly complete sequences from two other species, are presented. These taxa represent eight families not previously represented by complete mitochondrial DNA sequences. Most of the sequences retain the ancestral dipteran mitochondrial gene arrangement, while one sequence, that of the midge Arachnocampa flava (family Keroplatidae), has an inversion of the trnE gene. The most unusual result is the extensive rearrangement of the mitochondrial genome of a winter crane fly, Paracladura trichoptera (family Trichocera). The pattern of rearrangement indicates that the mechanism of rearrangement involved a tandem duplication of the entire mitochondrial genome, followed by random and nonrandom loss of one copy of each gene. Another winter crane fly retains the ancestral diperan gene arrangement. A preliminary mitochondrial phylogeny of the Diptera is also presented.

  11. Copy number gain at Xp22.31 includes complex duplication rearrangements and recurrent triplications. (United States)

    Liu, Pengfei; Erez, Ayelet; Nagamani, Sandesh C Sreenath; Bi, Weimin; Carvalho, Claudia M B; Simmons, Alexandra D; Wiszniewska, Joanna; Fang, Ping; Eng, Patricia A; Cooper, M Lance; Sutton, V Reid; Roeder, Elizabeth R; Bodensteiner, John B; Delgado, Mauricio R; Prakash, Siddharth K; Belmont, John W; Stankiewicz, Pawel; Berg, Jonathan S; Shinawi, Marwan; Patel, Ankita; Cheung, Sau Wai; Lupski, James R


    Genomic instability is a feature of the human Xp22.31 region wherein deletions are associated with X-linked ichthyosis, mental retardation and attention deficit hyperactivity disorder. A putative homologous recombination hotspot motif is enriched in low copy repeats that mediate recurrent deletion at this locus. To date, few efforts have focused on copy number gain at Xp22.31. However, clinical testing revealed a high incidence of duplication of Xp22.31 in subjects ascertained and referred with neurobehavioral phenotypes. We systematically studied 61 unrelated subjects with rearrangements revealing gain in copy number, using multiple molecular assays. We detected not only the anticipated recurrent and simple nonrecurrent duplications, but also unexpectedly identified recurrent triplications and other complex rearrangements. Breakpoint analyses enabled us to surmise the mechanisms for many of these rearrangements. The clinical significance of the recurrent duplications and triplications were assessed using different approaches. We cannot find any evidence to support pathogenicity of the Xp22.31 duplication. However, our data suggest that the Xp22.31 duplication may serve as a risk factor for abnormal phenotypes. Our findings highlight the need for more robust Xp22.31 triplication detection in that such further gain may be more penetrant than the duplications. Our findings reveal the distribution of different mechanisms for genomic duplication rearrangements at a given locus, and provide insights into aspects of strand exchange events between paralogous sequences in the human genome.

  12. Genomic evidence for adaptation by gene duplication. (United States)

    Qian, Wenfeng; Zhang, Jianzhi


    Gene duplication is widely believed to facilitate adaptation, but unambiguous evidence for this hypothesis has been found in only a small number of cases. Although gene duplication may increase the fitness of the involved organisms by doubling gene dosage or neofunctionalization, it may also result in a simple division of ancestral functions into daughter genes, which need not promote adaptation. Hence, the general validity of the adaptation by gene duplication hypothesis remains uncertain. Indeed, a genome-scale experiment found similar fitness effects of deleting pairs of duplicate genes and deleting individual singleton genes from the yeast genome, leading to the conclusion that duplication rarely results in adaptation. Here we contend that the above comparison is unfair because of a known duplication bias among genes with different fitness contributions. To rectify this problem, we compare homologous genes from the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe. We discover that simultaneously deleting a duplicate gene pair in S. cerevisiae reduces fitness significantly more than deleting their singleton counterpart in S. pombe, revealing post-duplication adaptation. The duplicates-singleton difference in fitness effect is not attributable to a potential increase in gene dose after duplication, suggesting that the adaptation is owing to neofunctionalization, which we find to be explicable by acquisitions of binary protein-protein interactions rather than gene expression changes. These results provide genomic evidence for the role of gene duplication in organismal adaptation and are important for understanding the genetic mechanisms of evolutionary innovation.

  13. Regional genomic instability predisposes to complex dystrophin gene rearrangements. (United States)

    Oshima, Junko; Magner, Daniel B; Lee, Jennifer A; Breman, Amy M; Schmitt, Eric S; White, Lisa D; Crowe, Carol A; Merrill, Michelle; Jayakar, Parul; Rajadhyaksha, Aparna; Eng, Christine M; del Gaudio, Daniela


    Mutations in the dystrophin gene (DMD) cause Duchenne and Becker muscular dystrophies and the majority of cases are due to DMD gene rearrangements. Despite the high incidence of these aberrations, little is known about their causative molecular mechanism(s). We examined 792 DMD/BMD clinical samples by oligonucleotide array-CGH and report on the junction sequence analysis of 15 unique deletion cases and three complex intragenic rearrangements to elucidate potential underlying mechanism(s). Furthermore, we present three cases with intergenic rearrangements involving DMD and neighboring loci. The cases with intragenic rearrangements include an inversion with flanking deleted sequences; a duplicated segment inserted in direct orientation into a deleted region; and a splicing mutation adjacent to a deletion. Bioinformatic analysis demonstrated that 7 of 12 breakpoints combined among 3 complex cases aligned with repetitive sequences, as compared to 4 of 30 breakpoints for the 15 deletion cases. Moreover, the inversion/deletion case may involve a stem-loop structure that has contributed to the initiation of this rearrangement. For the duplication/deletion and splicing mutation/deletion cases, the presence of the first mutation, either a duplication or point mutation, may have elicited the deletion events in an attempt to correct preexisting mutations. While NHEJ is one potential mechanism for these complex rearrangements, the highly complex junction sequence of the inversion/deletion case suggests the involvement of a replication-based mechanism. Our results support the notion that regional genomic instability, aided by the presence of repetitive elements, a stem-loop structure, and possibly preexisting mutations, may elicit complex rearrangements of the DMD gene.

  14. Duplicability of self-interacting human genes.

    LENUS (Irish Health Repository)

    Pérez-Bercoff, Asa


    BACKGROUND: There is increasing interest in the evolution of protein-protein interactions because this should ultimately be informative of the patterns of evolution of new protein functions within the cell. One model proposes that the evolution of new protein-protein interactions and protein complexes proceeds through the duplication of self-interacting genes. This model is supported by data from yeast. We examined the relationship between gene duplication and self-interaction in the human genome. RESULTS: We investigated the patterns of self-interaction and duplication among 34808 interactions encoded by 8881 human genes, and show that self-interacting proteins are encoded by genes with higher duplicability than genes whose proteins lack this type of interaction. We show that this result is robust against the system used to define duplicate genes. Finally we compared the presence of self-interactions amongst proteins whose genes have duplicated either through whole-genome duplication (WGD) or small-scale duplication (SSD), and show that the former tend to have more interactions in general. After controlling for age differences between the two sets of duplicates this result can be explained by the time since the gene duplication. CONCLUSIONS: Genes encoding self-interacting proteins tend to have higher duplicability than proteins lacking self-interactions. Moreover these duplicate genes have more often arisen through whole-genome rather than small-scale duplication. Finally, self-interacting WGD genes tend to have more interaction partners in general in the PIN, which can be explained by their overall greater age. This work adds to our growing knowledge of the importance of contextual factors in gene duplicability.

  15. Divergence of gene regulation through chromosomal rearrangements

    Directory of Open Access Journals (Sweden)

    Messing Joachim


    Full Text Available Abstract Background The molecular mechanisms that modify genome structures to give birth and death to alleles are still not well understood. To investigate the causative chromosomal rearrangements, we took advantage of the allelic diversity of the duplicated p1 and p2 genes in maize. Both genes encode a transcription factor involved in maysin synthesis, which confers resistance to corn earworm. However, p1 also controls accumulation of reddish pigments in floral tissues and has therefore acquired a new function after gene duplication. p1 alleles vary in their tissue-specific expression, which is indicated in their allele designation: the first suffix refers to red or white pericarp pigmentation and the second to red or white glume pigmentation. Results Comparing chromosomal regions comprising p1-ww[4Co63], P1-rw1077 and P1-rr4B2 alleles with that of the reference genome, P1-wr[B73], enabled us to reconstruct additive events of transposition, chromosome breaks and repairs, and recombination that resulted in phenotypic variation and chimeric regulatory signals. The p1-ww[4Co63] null allele is probably derived from P1-wr[B73] by unequal crossover between large flanking sequences. A transposon insertion in a P1-wr-like allele and NHEJ (non-homologous end-joining could have resulted in the formation of the P1-rw1077 allele. A second NHEJ event, followed by unequal crossover, probably led to the duplication of an enhancer region, creating the P1-rr4B2 allele. Moreover, a rather dynamic picture emerged in the use of polyadenylation signals by different p1 alleles. Interestingly, p1 alleles can be placed on both sides of a large retrotransposon cluster through recombination, while functional p2 alleles have only been found proximal to the cluster. Conclusions Allelic diversity of the p locus exemplifies how gene duplications promote phenotypic variability through composite regulatory signals. Transposition events increase the level of genomic complexity

  16. Gene conversion in human rearranged immunoglobulin genes. (United States)

    Darlow, John M; Stott, David I


    Over the past 20 years, many DNA sequences have been published suggesting that all or part of the V(H) segment of a rearranged immunoglobulin gene may be replaced in vivo. Two different mechanisms appear to be operating. One of these is very similar to primary V(D)J recombination, involving the RAG proteins acting upon recombination signal sequences, and this has recently been proven to occur. Other sequences, many of which show partial V(H) replacements with no addition of untemplated nucleotides at the V(H)-V(H) joint, have been proposed to occur by an unusual RAG-mediated recombination with the formation of hybrid (coding-to-signal) joints. These appear to occur in cells already undergoing somatic hypermutation in which, some authors are convinced, RAG genes are silenced. We recently proposed that the latter type of V(H) replacement might occur by homologous recombination initiated by the activity of AID (activation-induced cytidine deaminase), which is essential for somatic hypermutation and gene conversion. The latter has been observed in other species, but not in human Ig genes, so far. In this paper, we present a new analysis of sequences published as examples of the second type of rearrangement. This not only shows that AID recognition motifs occur in recombination regions but also that some sequences show replacement of central sections by a sequence from another gene, similar to gene conversion in the immunoglobulin genes of other species. These observations support the proposal that this type of rearrangement is likely to be AID-mediated rather than RAG-mediated and is consistent with gene conversion.

  17. Regulation of immunoglobulin gene rearrangement and expression. (United States)

    Taussig, M J; Sims, M J; Krawinkel, U


    The molecular genetic events leading to Ig expression and their control formed the topic of a recent EMBO workshop. This report by Michael Taussig, Martin Sims and Ulrich Krawinkel discusses contributions dealing with genes expressed in early pre-B cells, the mechanism of rearrangement, aberrant rearrangements seen in B cells of SCID mice, the feedback control of rearrangement as studied in transgenic mice, the control of Ig expression at the transcriptional and post-transcriptional levels, and class switching.

  18. Special Issue: Gene Conversion in Duplicated Genes

    Directory of Open Access Journals (Sweden)

    Hideki Innan


    Full Text Available Gene conversion is an outcome of recombination, causing non-reciprocal transfer of a DNA fragment. Several decades later than the discovery of crossing over, gene conversion was first recognized in fungi when non-Mendelian allelic distortion was observed. Gene conversion occurs when a double-strand break is repaired by using homologous sequences in the genome. In meiosis, there is a strong preference to use the orthologous region (allelic gene conversion, which causes non-Mendelian allelic distortion, but paralogous or duplicated regions can also be used for the repair (inter-locus gene conversion, also referred to as non-allelic and ectopic gene conversion. The focus of this special issue is the latter, interlocus gene conversion; the rate is lower than allelic gene conversion but it has more impact on phenotype because more drastic changes in DNA sequence are involved.

  19. Transfection of exogenous rotavirus rearranged RNA segments in cells infected with a WT rotavirus results in subsequent gene rearrangements. (United States)

    Duponchel, Sarah; Troupin, Cécile; Vu, Lan Trang; Schnuriger, Aurélie; Trugnan, Germain; Garbarg-Chenon, Antoine


    Group A rotaviruses, members of the family Reoviridae, are a major cause of infantile acute gastroenteritis. The rotavirus genome consists of 11 dsRNA segments. In some cases, an RNA segment is replaced by a rearranged RNA segment, which is derived from its standard counterpart by partial sequence duplication. It has been shown that some rearranged segments are preferentially encapsidated into viral progenies after serial passages in cell culture. Based on this characteristic, a reverse genetics system was used previously to introduce exogenous segment 7 rearrangements into an infectious rotavirus. This study extends this reverse genetics system to RNA segments 5 and 11. Transfection of exogenous rotavirus rearranged RNA segment 5 or 11 into cells infected with a WT helper rotavirus (bovine strain RF) resulted in subsequent gene rearrangements in the viral progeny. Whilst recombinant viruses were rescued with an exogenous rearranged segment 11, the exogenous segment was modified by a secondary rearrangement. The occurrence of spontaneous rearrangements of WT or exogenous segments is a major hindrance to the use of this reverse genetics approach. © 2014 The Authors.

  20. Osteosarcoma with apparent Ewing sarcoma gene rearrangement


    Mathias, Melissa; Chou, Alexander J; Meyers, Paul; Shukla, Neerav; Hameed, Meera; Agaram, Narasimhan; Wang, Lulu; Berger, Michael F.; Walsh, Michael; Kentsis, Alex


    Poorly differentiated round cell sarcomas present diagnostic challenges due to their variable morphology and lack of specific immunophenotypic markers. We present a case of a 15-year-old female with a tibial tumor that exhibited features of Ewing-like sarcoma, including apparent rearrangement of the EWSR1 gene. Hybridization capture-based next-generation DNA sequencing showed evidence of complex genomic rearrangements, absence of known pathogenic Ewing-like chromosome translocations, and dele...

  1. Yeast genome duplication was followed by asynchronous differentiation of duplicated genes

    DEFF Research Database (Denmark)

    Langkjær, Rikke Breinhold; Cliften, P.F.; Johnston, M.


    Gene redundancy has been observed in yeast, plant and human genomes, and is thought to be a consequence of whole-genome duplications(1-3). Baker's yeast, Saccharomyces cerevisiae, contains several hundred duplicated genes(1). Duplication(s) could have occurred before or after a given speciation. ...

  2. Frequent mitochondrial gene rearrangements at the hymenopteran nad3-nad5 junction. (United States)

    Dowton, Mark; Castro, Lyda R; Campbell, Sarah L; Bargon, Sharmilla D; Austin, Andrew D


    We characterized the organization of mitochondrial genes from a diverse range of hymenopterans. Of the 21 taxa characterized, 12 had distinct, derived organizations. Some rearrangements were consistent with the duplication-random loss mechanism, while others were not. Local inversions were relatively common, i.e., rearrangements characterized by the movement of genes from one mitochondrial strand to the other, opposite or close to their ancestral position. This type of rearrangement is inconsistent with the duplication/random loss model of mitochondrial gene rearrangement. Instead, they are best explained by the operation of recombination. Taxa with derived organizations were restricted to a single, monophyletic group of wasps, the Apocrita, which comprise about 90% of all hymenopterans.

  3. Genomic characterization of large rearrangements of the LDLR gene in Czech patients with familial hypercholesterolemia

    Directory of Open Access Journals (Sweden)

    Fajkus Jiří


    Full Text Available Abstract Background Mutations in the LDLR gene are the most frequent cause of Familial hypercholesterolemia, an autosomal dominant disease characterised by elevated concentrations of LDL in blood plasma. In many populations, large genomic rearrangements account for approximately 10% of mutations in the LDLR gene. Methods DNA diagnostics of large genomic rearrangements was based on Multiple Ligation dependent Probe Amplification (MLPA. Subsequent analyses of deletion and duplication breakpoints were performed using long-range PCR, PCR, and DNA sequencing. Results In set of 1441 unrelated FH patients, large genomic rearrangements were found in 37 probands. Eight different types of rearrangements were detected, from them 6 types were novel, not described so far. In all rearrangements, we characterized their exact extent and breakpoint sequences. Conclusions Sequence analysis of deletion and duplication breakpoints indicates that intrachromatid non-allelic homologous recombination (NAHR between Alu elements is involved in 6 events, while a non-homologous end joining (NHEJ is implicated in 2 rearrangements. Our study thus describes for the first time NHEJ as a mechanism involved in genomic rearrangements in the LDLR gene.

  4. Site-specific deletion and rearrangement of integron insert genes catalyzed by the integron DNA integrase. (United States)

    Collis, C M; Hall, R M


    Deletion of individual antibiotic resistance genes found within the variable region of integrons is demonstrated. Evidence for gene duplications and rearrangements resulting from the insertion of gene units at new locations is also presented. Deletion, duplication, and rearrangement occur only in the presence of the integron-encoded DNA integrase. These events are precise and involve loss or gain of one or more complete insert units or gene cassettes. This confirms the recent definition of gene cassettes as consisting of the gene coding sequences, all except the last 7 bases of the 59-base element found at the 3' end of the gene, and the core site located 5' to the gene (Hall et al., Mol. Microbiol. 5:1941-1959, 1991) and demonstrates that individual gene cassettes are functional units which can be independently mobilized. Both deletions and duplications can be generated by integrase-mediated cointegrate formation followed by integrase-mediated resolution involving a different pair of sites. However, deletion occurs 10 times more frequently than duplication, and we propose that the majority of deletion events are likely to involve integrase-dependent excision of the gene unit to generate a circular gene cassette. The implications of these findings in understanding the evolution of integrons and the spread of antibiotic resistance genes in bacterial populations is discussed. Images PMID:1311297

  5. Major Chromosomal Rearrangements Distinguish Willow and Poplar After the Ancestral "Salicoid" Genome Duplication. (United States)

    Hou, Jing; Ye, Ning; Dong, Zhongyuan; Lu, Mengzhu; Li, Laigeng; Yin, Tongming


    Populus (poplar) and Salix (willow) are sister genera in the Salicaceae family. In both lineages extant species are predominantly diploid. Genome analysis previously revealed that the two lineages originated from a common tetraploid ancestor. In this study, we conducted a syntenic comparison of the corresponding 19 chromosome members of the poplar and willow genomes. Our observations revealed that almost every chromosomal segment had a parallel paralogous segment elsewhere in the genomes, and the two lineages shared a similar syntenic pinwheel pattern for most of the chromosomes, which indicated that the two lineages diverged after the genome reorganization in the common progenitor. The pinwheel patterns showed distinct differences for two chromosome pairs in each lineage. Further analysis detected two major interchromosomal rearrangements that distinguished the karyotypes of willow and poplar. Chromosome I of willow was a conjunction of poplar chromosome XVI and the lower portion of poplar chromosome I, whereas willow chromosome XVI corresponded to the upper portion of poplar chromosome I. Scientists have suggested that Populus is evolutionarily more primitive than Salix. Therefore, we propose that, after the "salicoid" duplication event, fission and fusion of the ancestral chromosomes first give rise to the diploid progenitor of extant Populus species. During the evolutionary process, fission and fusion of poplar chromosomes I and XVI subsequently give rise to the progenitor of extant Salix species. This study contributes to an improved understanding of genome divergence after ancient genome duplication in closely related lineages of higher plants.

  6. Effect of Duplicate Genes on Mouse Genetic Robustness: An Update

    Directory of Open Access Journals (Sweden)

    Zhixi Su


    Full Text Available In contrast to S. cerevisiae and C. elegans, analyses based on the current knockout (KO mouse phenotypes led to the conclusion that duplicate genes had almost no role in mouse genetic robustness. It has been suggested that the bias of mouse KO database toward ancient duplicates may possibly cause this knockout duplicate puzzle, that is, a very similar proportion of essential genes (PE between duplicate genes and singletons. In this paper, we conducted an extensive and careful analysis for the mouse KO phenotype data and corroborated a strong effect of duplicate genes on mouse genetics robustness. Moreover, the effect of duplicate genes on mouse genetic robustness is duplication-age dependent, which holds after ruling out the potential confounding effect from coding-sequence conservation, protein-protein connectivity, functional bias, or the bias of duplicates generated by whole genome duplication (WGD. Our findings suggest that two factors, the sampling bias toward ancient duplicates and very ancient duplicates with a proportion of essential genes higher than that of singletons, have caused the mouse knockout duplicate puzzle; meanwhile, the effect of genetic buffering may be correlated with sequence conservation as well as protein-protein interactivity.

  7. Gene duplication as a major force in evolution

    Indian Academy of Sciences (India)

    Santoshkumar Magadum; Urbi Banerjee; Priyadharshini Murugan; Doddabhimappa Gangapur; Rajasekar Ravikesavan


    Gene duplication is an important mechanism for acquiring new genes and creating genetic novelty in organisms. Many new gene functions have evolved through gene duplication and it has contributed tremendously to the evolution of developmental programmes in various organisms. Gene duplication can result from unequal crossing over, retroposition or chromosomal (or genome) duplication. Understanding the mechanisms that generate duplicate gene copies and the subsequent dynamics among gene duplicates is vital because these investigations shed light on localized and genomewide aspects of evolutionary forces shaping intra-specific and inter-specific genome contents, evolutionary relationships, and interactions. Based on whole-genome analysis of Arabidopsis thaliana, there is compelling evidence that angiosperms underwent two whole-genome duplication events early during their evolutionary history. Recent studies have shown that these events were crucial for creation of many important developmental and regulatory genes found in extant angiosperm genomes. Recent studies also provide strong indications that even yeast (Saccharomyces cerevisiae), with its compact genome, is in fact an ancient tetraploid. Gene duplication can provide new genetic material for mutation, drift and selection to act upon, the result of which is specialized or new gene functions. Without gene duplication the plasticity of a genome or species in adapting to changing environments would be severely limited. Whether a duplicate is retained depends upon its function, its mode of duplication, (i.e. whether it was duplicated during a whole-genome duplication event), the species in which it occurs, and its expression rate. The exaptation of preexisting secondary functions is an important feature in gene evolution, just as it is in morphological evolution.

  8. Molecular trajectories leading to the alternative fates of duplicate genes.

    Directory of Open Access Journals (Sweden)

    Michael Marotta

    Full Text Available Gene duplication generates extra gene copies in which mutations can accumulate without risking the function of pre-existing genes. Such mutations modify duplicates and contribute to evolutionary novelties. However, the vast majority of duplicates appear to be short-lived and experience duplicate silencing within a few million years. Little is known about the molecular mechanisms leading to these alternative fates. Here we delineate differing molecular trajectories of a relatively recent duplication event between humans and chimpanzees by investigating molecular properties of a single duplicate: DNA sequences, gene expression and promoter activities. The inverted duplication of the Glutathione S-transferase Theta 2 (GSTT2 gene had occurred at least 7 million years ago in the common ancestor of African great apes and is preserved in chimpanzees (Pan troglodytes, whereas a deletion polymorphism is prevalent in humans. The alternative fates are associated with expression divergence between these species, and reduced expression in humans is regulated by silencing mutations that have been propagated between duplicates by gene conversion. In contrast, selective constraint preserved duplicate divergence in chimpanzees. The difference in evolutionary processes left a unique DNA footprint in which dying duplicates are significantly more similar to each other (99.4% than preserved ones. Such molecular trajectories could provide insights for the mechanisms underlying duplicate life and death in extant genomes.

  9. Many levels of control of V gene rearrangement frequency. (United States)

    Feeney, Ann J; Goebel, Peter; Espinoza, Celia R


    V, D, and J gene segments rearrange at very different frequencies. As with most biological systems, there are multiple levels of control of V gene recombination frequency, and here we review some of the work from our laboratory that addresses these various control mechanisms. One of the important factors that affect non-random V gene rearrangement frequency is the natural heterogeneity in recombination signal sequences (RSSs). Not only does variation in the heptamer and nonamer affect rearrangement, but variation in the spacer can also dramatically affect recombination. However, there are clearly other factors which control V gene rearrangement, as revealed by the fact that genes with identical RSSs can rearrange at different frequencies in vivo. Some of these other influences most likely affect the earliest stages of control--the change from an inaccessible state to an accessible state. Transcription factors can play a role in inducing these changes. Rearrangement of many VkappaI genes can be induced in a non-lymphoid cell line after ectopic expression of E2A, while neighboring VkappaII and VkappaIII genes do not rearrange, demonstrating that at least one level of control of induction of accessibility occurs at the level of the individual gene. Also, changes in chromatin structure can affect accessibility and might influence individual V gene rearrangement frequency.

  10. Benchmarking Transcriptome Quantification Methods for Duplicated Genes in Xenopus laevis. (United States)

    Kwon, Taejoon


    Xenopus is an important model organism for the study of genome duplication in vertebrates. With the full genome sequence of diploid Xenopus tropicalis available, and that of allotetraploid X. laevis close to being finished, we will be able to expand our understanding of how duplicated genes have evolved. One of the key features in the study of the functional consequence of gene duplication is how their expression patterns vary across different conditions, and RNA-seq seems to have enough resolution to discriminate the expression of highly similar duplicated genes. However, most of the current RNA-seq analysis methods were not designed to study samples with duplicate genes such as in X. laevis. Here, various computational methods to quantify gene expression in RNA-seq data were evaluated, using 2 independent X. laevis egg RNA-seq datasets and 2 reference databases for duplicated genes. The fact that RNA-seq can measure expression levels of similar duplicated genes was confirmed, but long paired-end reads are more informative than short single-end reads to discriminate duplicated genes. Also, it was found that bowtie, one of the most popular mappers in RNA-seq analysis, reports significantly smaller numbers of unique hits according to a mapping quality score compared to other mappers tested (BWA, GSNAP, STAR). Calculated from unique hits based on a mapping quality score, both expression levels and the expression ratio of duplicated genes can be estimated consistently among biological replicates, demonstrating that this method can successfully discriminate the expression of each copy of a duplicated gene pair. This comprehensive evaluation will be a useful guideline for studying gene expression of organisms with genome duplication using RNA-seq in the future.

  11. Inferring angiosperm phylogeny from EST data with widespread gene duplication


    Sanderson, Michael J.; McMahon, Michelle M.


    Background Most studies inferring species phylogenies use sequences from single copy genes or sets of orthologs culled from gene families. For taxa such as plants, with very high levels of gene duplication in their nuclear genomes, this has limited the exploitation of nuclear sequences for phylogenetic studies, such as those available in large EST libraries. One rarely used method of inference, gene tree parsimony, can infer species trees from gene families undergoing duplication and loss, bu...

  12. Gene duplication, modularity and adaptation in the evolution of the aflatoxin gene cluster

    Directory of Open Access Journals (Sweden)

    Jakobek Judy L


    duplicated copy may simply augment/supplement a specific pathway function (aflR/aflS and aflX/aflY or the duplicated copy may evolve a completely new function (aflT/aflQ and aflC/aflW. Gene modules that are contiguous in one species and noncontiguous in others point to possible rearrangements of cluster genes in the evolution of these species. Significantly higher mean Ka/Ks values in section Flavi compared to non-section Flavi species indicate increased positive selection acting in the evolution of genes in OMST and AF gene clusters.

  13. Gene duplication models for directed networks with limits on growth (United States)

    Enemark, Jakob; Sneppen, Kim


    Background: Duplication of genes is important for evolution of molecular networks. Many authors have therefore considered gene duplication as a driving force in shaping the topology of molecular networks. In particular it has been noted that growth via duplication would act as an implicit means of preferential attachment, and thereby provide the observed broad degree distributions of molecular networks. Results: We extend current models of gene duplication and rewiring by including directions and the fact that molecular networks are not a result of unidirectional growth. We introduce upstream sites and downstream shapes to quantify potential links during duplication and rewiring. We find that this in itself generates the observed scaling of transcription factors for genome sites in prokaryotes. The dynamical model can generate a scale-free degree distribution, p(k)\\propto 1/k^{\\gamma } , with exponent γ = 1 in the non-growing case, and with γ>1 when the network is growing. Conclusions: We find that duplication of genes followed by substantial recombination of upstream regions could generate features of genetic regulatory networks. Our steady state degree distribution is however too broad to be consistent with data, thereby suggesting that selective pruning acts as a main additional constraint on duplicated genes. Our analysis shows that gene duplication can only be a main cause for the observed broad degree distributions if there are also substantial recombinations between upstream regions of genes.

  14. Histone modification pattern evolution after yeast gene duplication

    Directory of Open Access Journals (Sweden)

    Zou Yangyun


    Full Text Available Abstract Background Gene duplication and subsequent functional divergence especially expression divergence have been widely considered as main sources for evolutionary innovations. Many studies evidenced that genetic regulatory network evolved rapidly shortly after gene duplication, thus leading to accelerated expression divergence and diversification. However, little is known whether epigenetic factors have mediated the evolution of expression regulation since gene duplication. In this study, we conducted detailed analyses on yeast histone modification (HM, the major epigenetics type in this organism, as well as other available functional genomics data to address this issue. Results Duplicate genes, on average, share more common HM-code patterns than random singleton pairs in their promoters and open reading frames (ORF. Though HM-code divergence between duplicates in both promoter and ORF regions increase with their sequence divergence, the HM-code in ORF region evolves slower than that in promoter region, probably owing to the functional constraints imposed on protein sequences. After excluding the confounding effect of sequence divergence (or evolutionary time, we found the evidence supporting the notion that in yeast, the HM-code may co-evolve with cis- and trans-regulatory factors. Moreover, we observed that deletion of some yeast HM-related enzymes increases the expression divergence between duplicate genes, yet the effect is lower than the case of transcription factor (TF deletion or environmental stresses. Conclusions Our analyses demonstrate that after gene duplication, yeast histone modification profile between duplicates diverged with evolutionary time, similar to genetic regulatory elements. Moreover, we found the evidence of the co-evolution between genetic and epigenetic elements since gene duplication, together contributing to the expression divergence between duplicate genes.

  15. Hominoid chromosomal rearrangements on 17q map to complex regions of segmental duplication


    Cardone, Maria Francesca; Jiang, Zhaoshi; D'Addabbo, Pietro; Archidiacono, Nicoletta; Rocchi, Mariano; Eichler, Evan E.; Ventura, Mario


    Background Chromosomal rearrangements, such as translocations and inversions, are recurrent phenomena during evolution, and both of them are involved in reproductive isolation and speciation. To better understand the molecular basis of chromosome rearrangements and their part in karyotype evolution, we have investigated the history of human chromosome 17 by comparative fluorescence in situ hybridization (FISH) and sequence analysis. Results Human bacterial artificial chromosome/p1 artificial ...

  16. Gene and genome duplication in Acanthamoeba polyphaga Mimivirus. (United States)

    Suhre, Karsten


    Gene duplication is key to molecular evolution in all three domains of life and may be the first step in the emergence of new gene function. It is a well-recognized feature in large DNA viruses but has not been studied extensively in the largest known virus to date, the recently discovered Acanthamoeba polyphaga Mimivirus. Here, I present a systematic analysis of gene and genome duplication events in the mimivirus genome. I found that one-third of the mimivirus genes are related to at least one other gene in the mimivirus genome, either through a large segmental genome duplication event that occurred in the more remote past or through more recent gene duplication events, which often occur in tandem. This shows that gene and genome duplication played a major role in shaping the mimivirus genome. Using multiple alignments, together with remote-homology detection methods based on Hidden Markov Model comparison, I assign putative functions to some of the paralogous gene families. I suggest that a large part of the duplicated mimivirus gene families are likely to interfere with important host cell processes, such as transcription control, protein degradation, and cell regulatory processes. My findings support the view that large DNA viruses are complex evolving organisms, possibly deeply rooted within the tree of life, and oppose the paradigm that viral evolution is dominated by lateral gene acquisition, at least in regard to large DNA viruses.

  17. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster

    Directory of Open Access Journals (Sweden)

    Dutartre Leslie


    Full Text Available Abstract Background The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA, are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(MBOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8 form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5 belonging to the same CYP71C subfamily. The origin of this cluster is unknown. Results We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. Conclusions These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2 at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.

  18. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster. (United States)

    Dutartre, Leslie; Hilliou, Frédérique; Feyereisen, René


    The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA), are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.

  19. Duplication and maintenance of the Myb genes of vertebrate animals

    Directory of Open Access Journals (Sweden)

    Colin J. Davidson


    Gene duplication is an important means of generating new genes. The major mechanisms by which duplicated genes are preserved in the face of purifying selection are thought to be neofunctionalization, subfunctionalization, and increased gene dosage. However, very few duplicated gene families in vertebrate species have been analyzed by functional tests in vivo. We have therefore examined the three vertebrate Myb genes (c-Myb, A-Myb, and B-Myb by cytogenetic map analysis, by sequence analysis, and by ectopic expression in Drosophila. We provide evidence that the vertebrate Myb genes arose by two rounds of regional genomic duplication. We found that ubiquitous expression of c-Myb and A-Myb, but not of B-Myb or Drosophila Myb, was lethal in Drosophila. Expression of any of these genes during early larval eye development was well tolerated. However, expression of c-Myb and A-Myb, but not of B-Myb or Drosophila Myb, during late larval eye development caused drastic alterations in adult eye morphology. Mosaic analysis implied that this eye phenotype was cell-autonomous. Interestingly, some of the eye phenotypes caused by the retroviral v-Myb oncogene and the normal c-Myb proto-oncogene from which v-Myb arose were quite distinct. Finally, we found that post-translational modifications of c-Myb by the GSK-3 protein kinase and by the Ubc9 SUMO-conjugating enzyme that normally occur in vertebrate cells can modify the eye phenotype caused by c-Myb in Drosophila. These results support a model in which the three Myb genes of vertebrates arose by two sequential duplications. The first duplication was followed by a subfunctionalization of gene expression, then neofunctionalization of protein function to yield a c/A-Myb progenitor. The duplication of this progenitor was followed by subfunctionalization of gene expression to give rise to tissue-specific c-Myb and A-Myb genes.

  20. Sterile DJH rearrangements reveal that distance between gene segments on the human Ig H chain locus influences their ability to rearrange

    DEFF Research Database (Denmark)

    Hansen, Tina Østergaard; Lange, Anders Blaabjerg; Barington, Torben


    Rearrangement of the Ig locus occurs in two steps. First, a JH gene is rearranged to a D gene followed by a VH gene rearranging to the DJH rearrangement. By next generation sequencing, we analyzed 9969 unique DJH rearrangements and 5919 unique VHDJH rearrangements obtained from peripheral blood B...... frequently than JH locus distal D genes, whereas VH locus proximal D genes were observed more frequently in nonproductive VHDJH rearrangements. We further demonstrate that the distance between VH, D, and JH gene segments influence their ability to rearrange within the human Ig locus....

  1. Gene duplication in the genome of parasitic Giardia lamblia

    Directory of Open Access Journals (Sweden)

    Flores Roberto


    Full Text Available Abstract Background Giardia are a group of widespread intestinal protozoan parasites in a number of vertebrates. Much evidence from G. lamblia indicated they might be the most primitive extant eukaryotes. When and how such a group of the earliest branching unicellular eukaryotes developed the ability to successfully parasitize the latest branching higher eukaryotes (vertebrates is an intriguing question. Gene duplication has long been thought to be the most common mechanism in the production of primary resources for the origin of evolutionary novelties. In order to parse the evolutionary trajectory of Giardia parasitic lifestyle, here we carried out a genome-wide analysis about gene duplication patterns in G. lamblia. Results Although genomic comparison showed that in G. lamblia the contents of many fundamental biologic pathways are simplified and the whole genome is very compact, in our study 40% of its genes were identified as duplicated genes. Evolutionary distance analyses of these duplicated genes indicated two rounds of large scale duplication events had occurred in G. lamblia genome. Functional annotation of them further showed that the majority of recent duplicated genes are VSPs (Variant-specific Surface Proteins, which are essential for the successful parasitic life of Giardia in hosts. Based on evolutionary comparison with their hosts, it was found that the rapid expansion of VSPs in G. lamblia is consistent with the evolutionary radiation of placental mammals. Conclusions Based on the genome-wide analysis of duplicated genes in G. lamblia, we found that gene duplication was essential for the origin and evolution of Giardia parasitic lifestyle. The recent expansion of VSPs uniquely occurring in G. lamblia is consistent with the increment of its hosts. Therefore we proposed a hypothesis that the increment of Giradia hosts might be the driving force for the rapid expansion of VSPs.

  2. Complexity of Gene Expression Evolution after Duplication: Protein Dosage Rebalancing

    Directory of Open Access Journals (Sweden)

    Igor B. Rogozin


    Full Text Available Ongoing debates about functional importance of gene duplications have been recently intensified by a heated discussion of the “ortholog conjecture” (OC. Under the OC, which is central to functional annotation of genomes, orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of gene ontology (GO annotations and expression profiles, among within-species paralogs compared to orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. Subsequent studies suggested that the OC appears to be generally valid when applied to mammalian evolution but the complete picture of evolution of gene expression also has to incorporate lineage-specific aspects of paralogy. The observed complexity of gene expression evolution after duplication can be explained through selection for gene dosage effect combined with the duplication-degeneration-complementation model. This paper discusses expression divergence of recent duplications occurring before functional divergence of proteins encoded by duplicate genes.

  3. The evolution of vertebrate somatostatin receptors and their gene regions involves extensive chromosomal rearrangements

    Directory of Open Access Journals (Sweden)

    Ocampo Daza Daniel


    Full Text Available Abstract Background Somatostatin and its related neuroendocrine peptides have a wide variety of physiological functions that are mediated by five somatostatin receptors with gene names SSTR1-5 in mammals. To resolve their evolution in vertebrates we have investigated the SSTR genes and a large number of adjacent gene families by phylogeny and conserved synteny analyses in a broad range of vertebrate species. Results We find that the SSTRs form two families that belong to distinct paralogons. We observe not only chromosomal similarities reflecting the paralogy relationships between the SSTR-bearing chromosome regions, but also extensive rearrangements between these regions in teleost fish genomes, including fusions and translocations followed by reshuffling through intrachromosomal rearrangements. These events obscure the paralogy relationships but are still tractable thanks to the many genomes now available. We have identified a previously unrecognized SSTR subtype, SSTR6, previously misidentified as either SSTR1 or SSTR4. Conclusions Two ancestral SSTR-bearing chromosome regions were duplicated in the two basal vertebrate tetraploidizations (2R. One of these ancestral SSTR genes generated SSTR2, -3 and -5, the other gave rise to SSTR1, -4 and -6. Subsequently SSTR6 was lost in tetrapods and SSTR4 in teleosts. Our study shows that extensive chromosomal rearrangements have taken place between related chromosome regions in teleosts, but that these events can be resolved by investigating several distantly related species.

  4. Androgen receptor gene mutation, rearrangement, polymorphism. (United States)

    Eisermann, Kurtis; Wang, Dan; Jing, Yifeng; Pascal, Laura E; Wang, Zhou


    Genetic aberrations of the androgen receptor (AR) caused by mutations, rearrangements, and polymorphisms result in a mutant receptor that has varied functions compared to wild type AR. To date, over 1,000 mutations have been reported in the AR with most of these being associated with androgen insensitivity syndrome (AIS). While mutations of AR associated with prostate cancer occur less often in early stage localized disease, mutations in castration-resistant prostate cancer (CRPC) patients treated with anti-androgens occur more frequently with 10-30% of these patients having some form of mutation in the AR. Resistance to anti-androgen therapy usually results from gain-of-function mutations in the LBD such as is seen with bicalutamide and more recently with enzalutamide (MDV3100). Thus, it is crucial to investigate these new AR mutations arising from drug resistance to anti-androgens and other small molecule pharmacological agents.

  5. Simultaneous identification of duplications and lateral gene transfers. (United States)

    Tofigh, Ali; Hallett, Michael; Lagergren, Jens


    The incongruency between a gene tree and a corresponding species tree can be attributed to evolutionary events such as gene duplication and gene loss. This paper describes a combinatorial model where so-called DTL-scenarios are used to explain the differences between a gene tree and a corresponding species tree taking into account gene duplications, gene losses, and lateral gene transfers (also known as horizontal gene transfers). The reasonable biological constraint that a lateral gene transfer may only occur between contemporary species leads to the notion of acyclic DTL-scenarios. Parsimony methods are introduced by defining appropriate optimization problems. We show that finding most parsimonious acyclic DTL-scenarios is NP-hard. However, by dropping the condition of acyclicity, the problem becomes tractable, and we provide a dynamic programming algorithm as well as a fixed-parameter tractable algorithm for finding most parsimonious DTL-scenarios.

  6. Characterization of 67 mitochondrial tRNA gene rearrangements in the Hymenoptera suggests that mitochondrial tRNA gene position is selectively neutral. (United States)

    Dowton, Mark; Cameron, Stephen L; Dowavic, Jessica I; Austin, Andy D; Whiting, Michael F


    We present entire sequences of two hymenopteran mitochondrial genomes and the major portion of three others. We combined these data with nine previously sequenced hymenopteran mitochondrial genomes. This allowed us to infer and analyze the evolution of the 67 mitochondrial gene rearrangements so far found in this order. All of these involve tRNA genes, whereas four also involve larger (protein-coding or ribosomal RNA) genes. We find that the vast majority of mitochondrial gene rearrangements are independently derived. A maximum of four of these rearrangements represent shared, derived organizations, whereas three are convergently derived. The remaining mitochondrial gene rearrangements represent new mitochondrial genome organizations. These data are consistent with the proposal that there are an enormous number of alternative mitochondrial genome organizations possible and that mitochondrial genome organization is, for the most part, selectively neutral. Nevertheless, some mitochondrial genes appear less mobile than others. Genes close to the noncoding region are generally more mobile but only marginally so. Some mitochondrial genes rearrange in a pattern consistent with the duplication/random loss model, but more mitochondrial genes move in a pattern inconsistent with this model. An increased rate of mitochondrial gene rearrangement is not tightly associated with the evolution of parasitism. Although parasitic lineages tend to have more mitochondrial gene rearrangements than nonparasitic lineages, there are exceptions (e.g., Orussus and Schlettererius). It is likely that only a small proportion of the total number of mitochondrial gene rearrangements that have occurred during the evolution of the Hymenoptera have been sampled in the present study.

  7. Exon duplications in the ATP7A gene

    DEFF Research Database (Denmark)

    Mogensen, Mie; Skjørringe, Tina; Kodama, Hiroko


    BACKGROUND: Menkes disease (MD) is an X-linked, fatal neurodegenerative disorder of copper metabolism, caused by mutations in the ATP7A gene. Thirty-three Menkes patients in whom no mutation had been detected with standard diagnostic tools were screened for exon duplications in the ATP7A gene...

  8. Gene Rearrangement Analysis of Orbital Lymphoid Infiltrating Disorders

    Institute of Scientific and Technical Information of China (English)

    Jianhua Yan; Zhongyao Wu; Shuqi Huang; Yongping Li


    Purpose: To determine whether the use of polymerase chain reaction for B-cell gene rearrangement in patients with orbital lymphoid infiltrate disorders could be useful in the diagnosis of lymphoma, especially, in differentiating benign lesion from malignant one. Methods: In addition to clinical, pathological, and immunohistochemical evaluations,48 cases of orbital lymphoid infiltrate disorders were examined for immunoglobulin heavy (IgH) gene rearrangement by means of PCR to amplify the FR3 region with formalin-fixed and paraffin-embedded tissues. Results: Gene rearrangement in the third frame-work of the IgH region was detected in specimens obtained from 15 cases of malignant lymphoma, 4 of reactive lymphoid hyperplasia and 3 of orbital pseudotumor. All of these patients showed a discrete band (100bp) which reflected monoclonal proliferation of B lymphocytes. 5 cases of malignant lymphoma, 6 of reactive lymphoid hyperplasia and 15 of orbital pseudotumor did not show a discrete band on PCR. Conclusions: The FR3 region gene rearrangement of Ig heavy in patients with orbital lymphoid infiltrate disorders may be an additional diagnostic tool in differentiating benign from malignant lymphoid diseases and in offering a useful adjunct for diagnosis in difficult or unclear cases. It is a reliable and practical method of gene diagnosis in orbital lymphoid infiltrate disorders and helps to identify the molecular mechanism of malignant lymphoma. Eye Science 2000; 16:15 ~ 21.

  9. Gene Rearrangement Analysis of Orbital Lymphoid Infilktrating Disorders

    Institute of Scientific and Technical Information of China (English)

    JianghuaYan; ZhongyaoWu; 等


    Purpose:To determine whether the use of polymerae chain reaction for B-cell gene rearrangement in patients with orbital lymphoid infiltrate disorders could be useful in the diagnosis of lymphoma,especially,in differentiating benign lesion from malignant one.Methoids:In addition to clinical,pathological,and immunohistochemical evaluatons,48 cases of orbital lymphoid infiltrate disorders were examined for immunoglobulin heavy (IgH) gene rearrangement by means of PCR to amplify the FR3 region with formalin-fixed and paraffin-embedded tissues.Results:Gene rearrangement in the third frame-work of the IgH region was detected in specimens obtained from 15 cases of malignant lymphoma,4 of reactive lymphoid hyperplasia and 3 of orbital pseudotumor.All of these patients showed a discrete band (100bp) which reflected monoclonal proliferation of B lymphocytes.5 cases of malignant lymphoma,6 of reactive lymphoid hyperplasia and 15 of orbital pseudotumor did not show a discrete band on PCR.Conclusions:The FR3 region gene rearrangement of Ig heavy in patients with orbital lymphoid infilktrate disorders may be an additional diagnostic tool in differentiating benign from malignant lymphoid diseases and in offering a useful adjunct for diagnosis in difficult or unclear case.It is a reliable and practical method of gene diagnosis in orbital lymphoid infiltrate disorders and helps to identife the molecular mechanism of malignant lymphoma.Eye Science 2000;16:15-21.


    Directory of Open Access Journals (Sweden)

    Khlestkina E.


    Full Text Available Gene duplication followed by subfunctionalization and neofunctionalization is of a great evolutionary importance. In plant genomes, duplicated genes may result from either polyploidization (homoeologous genes or segmental chromosome duplications (paralogous genes. In allohexaploid wheat Triticum aestivum L. (2n=6x=42, genome BBAADD, both homoeologous and paralogous copies were found for the regulatory gene Myc encoding MYC-like transcriptional factor in the biosynthesis of flavonoid pigments, anthocyanins, and for the structural gene F3h encoding one of the key enzymes of flavonoid biosynthesis, flavanone 3-hydroxylase. From the 5 copies (3 homoeologous and 2 paralogous of the Myc gene found in T. aestivum, only one plays a regulatory role in anthocyanin biosynthesis, interacting complementary with another transcriptional factor (MYB-like to confer purple pigmentation of grain pericarp in wheat. The role and functionality of the other 4 copies of the Myc gene remain unknown. From the 4 functional copies of the F3h gene in T. aestivum, three homoeologues have similar function. They are expressed in wheat organs colored with anthocyanins or in the endosperm, participating there in biosynthesis of uncolored flavonoid substances. The fourth copy (the B-genomic paralogue is transcribed neither in wheat organs colored with anthocyanins nor in seeds, however, it’s expression has been noticed in roots of aluminium-stressed plants, where the three homoeologous copies are not active. Functional diversification of the duplicated flavonoid biosynthesis genes in wheat may be a reason for maintenance of the duplicated copies and preventing them from pseudogenization.The study was supported by RFBR (11-04-92707. We also thank Ms. Galina Generalova for technical assistance.

  11. Prevalent role of gene features in determining evolutionary fates of whole-genome duplication duplicated genes in flowering plants. (United States)

    Jiang, Wen-kai; Liu, Yun-long; Xia, En-hua; Gao, Li-zhi


    The evolution of genes and genomes after polyploidization has been the subject of extensive studies in evolutionary biology and plant sciences. While a significant number of duplicated genes are rapidly removed during a process called fractionation, which operates after the whole-genome duplication (WGD), another considerable number of genes are retained preferentially, leading to the phenomenon of biased gene retention. However, the evolutionary mechanisms underlying gene retention after WGD remain largely unknown. Through genome-wide analyses of sequence and functional data, we comprehensively investigated the relationships between gene features and the retention probability of duplicated genes after WGDs in six plant genomes, Arabidopsis (Arabidopsis thaliana), poplar (Populus trichocarpa), soybean (Glycine max), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays). The results showed that multiple gene features were correlated with the probability of gene retention. Using a logistic regression model based on principal component analysis, we resolved evolutionary rate, structural complexity, and GC3 content as the three major contributors to gene retention. Cluster analysis of these features further classified retained genes into three distinct groups in terms of gene features and evolutionary behaviors. Type I genes are more prone to be selected by dosage balance; type II genes are possibly subject to subfunctionalization; and type III genes may serve as potential targets for neofunctionalization. This study highlights that gene features are able to act jointly as primary forces when determining the retention and evolution of WGD-derived duplicated genes in flowering plants. These findings thus may help to provide a resolution to the debate on different evolutionary models of gene fates after WGDs.

  12. Inferring angiosperm phylogeny from EST data with widespread gene duplication. (United States)

    Sanderson, Michael J; McMahon, Michelle M


    Most studies inferring species phylogenies use sequences from single copy genes or sets of orthologs culled from gene families. For taxa such as plants, with very high levels of gene duplication in their nuclear genomes, this has limited the exploitation of nuclear sequences for phylogenetic studies, such as those available in large EST libraries. One rarely used method of inference, gene tree parsimony, can infer species trees from gene families undergoing duplication and loss, but its performance has not been evaluated at a phylogenomic scale for EST data in plants. A gene tree parsimony analysis based on EST data was undertaken for six angiosperm model species and Pinus, an outgroup. Although a large fraction of the tentative consensus sequences obtained from the TIGR database of ESTs was assembled into homologous clusters too small to be phylogenetically informative, some 557 clusters contained promising levels of information. Based on maximum likelihood estimates of the gene trees obtained from these clusters, gene tree parsimony correctly inferred the accepted species tree with strong statistical support. A slight variant of this species tree was obtained when maximum parsimony was used to infer the individual gene trees instead. Despite the complexity of the EST data and the relatively small fraction eventually used in inferring a species tree, the gene tree parsimony method performed well in the face of very high apparent rates of duplication.

  13. Concerted evolution of duplicated protein-coding genes in Drosophila.


    Hickey, D. A.; Bally-Cuif, L.; Abukashawa, S; Payant, V; Benkel, B F


    Very rapid rates of gene conversion were observed between duplicated alpha-amylase-coding sequences in Drosophila melanogaster. This gene conversion process was also seen in the related species Drosophila erecta. Specifically, there is virtual sequence identity between the coding regions of the two genes within each species, while the sequence divergence between species is close to that expected based on their phylogenetic relationship. The flanking, noncoding regions are much more highly div...

  14. Characterization of genomic rearrangements of the alpha1-acid glycoprotein/orosomucoid gene in Ghanaians. (United States)

    Yuasa, I; Nakamura, H; Henke, L; Henke, J; Nakagawa, M; Irizawa, Y; Umetsu, K


    In this study, the structure of the alpha1-acid glycoprotein (AGP), or orosomucoid (ORM), gene was investigated in a Ghanaian mother and her child, who shared an unusual variant, ORM1 S2(C), found by isoelectric focusing. Three remarkable changes of nucleotide sequence were observed: (1) The two ORM1 alleles, ORMI*S and ORMI*S2(C), had the AGP2 gene-specific sequence at one and three regions, respectively, in exon 5 to intron 5. The variant allele originating from ORMi*S was characterized by a G-to-A transition, resulting in an amino acid change from valine to methionine, which is also detected in ORM1 F2, a form that is common in Europeans. (2) The AGP2 gene of the child, inherited from the father, was duplicated, as revealed by long-range polymerase chain reaction. (3) Three new mutations were observed in two exons of the AGP2 genes of the mother and child. All of these novel genomic rearrangements, which were not observed in Japanese subjects, may have arisen through point mutation, gene conversion, and unequal crossover events. It is likely that the rearrangement of the AGP gene has often occurred in Africans.

  15. Concerted evolution of duplicated protein-coding genes in Drosophila. (United States)

    Hickey, D A; Bally-Cuif, L; Abukashawa, S; Payant, V; Benkel, B F


    Very rapid rates of gene conversion were observed between duplicated alpha-amylase-coding sequences in Drosophila melanogaster. This gene conversion process was also seen in the related species Drosophila erecta. Specifically, there is virtual sequence identity between the coding regions of the two genes within each species, while the sequence divergence between species is close to that expected based on their phylogenetic relationship. The flanking, noncoding regions are much more highly diverged and do not appear to be subject to gene conversion. Comparison of amylase sequences between the two species provides a clear demonstration that recurrent gene conversion does indeed lead to the concerted evolution of the gene pair.

  16. Familial Lymphoproliferative Malignancies and Tandem Duplication of NF1 Gene

    Directory of Open Access Journals (Sweden)

    Gustavo Fernandes


    Full Text Available Background. Neurofibromatosis type 1 is a genetic disorder caused by loss-of-function mutations in a tumor suppressor gene (NF1 which codifies the protein neurofibromin. The frequent genetic alterations that modify neurofibromin function are deletions and insertions. Duplications are rare and phenotype in patients bearing duplication of NF1 gene is thought to be restricted to developmental abnormalities, with no reference to cancer susceptibility in these patients. We evaluated a patient who presented with few clinical signs of neurofibromatosis type 1 and a conspicuous personal and familiar history of different types of cancer, especially lymphoproliferative malignancies. The coding region of the NF-1 gene was analyzed by real-time polymerase chain reaction and direct sequencing. Multiplex ligation-dependent probe amplification was performed to detect the number of mutant copies. The NF1 gene analysis showed the following alterations: mosaic duplication of NF1, TRAF4, and MYO1D. Fluorescence in situ hybridization using probes (RP5-1002G3 and RP5-92689 flanking NF1 gene in 17q11.2 and CEP17 for 17q11.11.1 was performed. There were three signals (RP5-1002G3conRP5-92689 in the interphases analyzed and two signals (RP5-1002G3conRP5-92689 in 93% of cells. These findings show a tandem duplication of 17q11.2. Conclusion. The case suggests the possibility that NF1 gene duplication may be associated with a phenotype characterized by lymphoproliferative disorders.

  17. A role for gene duplication and natural variation of gene expression in the evolution of metabolism.

    Directory of Open Access Journals (Sweden)

    Daniel J Kliebenstein

    Full Text Available BACKGROUND: Most eukaryotic genomes have undergone whole genome duplications during their evolutionary history. Recent studies have shown that the function of these duplicated genes can diverge from the ancestral gene via neo- or sub-functionalization within single genotypes. An additional possibility is that gene duplicates may also undergo partitioning of function among different genotypes of a species leading to genetic differentiation. Finally, the ability of gene duplicates to diverge may be limited by their biological function. METHODOLOGY/PRINCIPAL FINDINGS: To test these hypotheses, I estimated the impact of gene duplication and metabolic function upon intraspecific gene expression variation of segmental and tandem duplicated genes within Arabidopsis thaliana. In all instances, the younger tandem duplicated genes showed higher intraspecific gene expression variation than the average Arabidopsis gene. Surprisingly, the older segmental duplicates also showed evidence of elevated intraspecific gene expression variation albeit typically lower than for the tandem duplicates. The specific biological function of the gene as defined by metabolic pathway also modulated the level of intraspecific gene expression variation. The major energy metabolism and biosynthetic pathways showed decreased variation, suggesting that they are constrained in their ability to accumulate gene expression variation. In contrast, a major herbivory defense pathway showed significantly elevated intraspecific variation suggesting that it may be under pressure to maintain and/or generate diversity in response to fluctuating insect herbivory pressures. CONCLUSION: These data show that intraspecific variation in gene expression is facilitated by an interaction of gene duplication and biological activity. Further, this plays a role in controlling diversity of plant metabolism.

  18. Recombination facilitates neofunctionalization of duplicate genes via originalization

    Directory of Open Access Journals (Sweden)

    Huang Ren


    Full Text Available Abstract Background Recently originalization was proposed to be an effective way of duplicate-gene preservation, in which recombination provokes the high frequency of original (or wild-type allele on both duplicated loci. Because the high frequency of wild-type allele might drive the arising and accumulating of advantageous mutation, it is hypothesized that recombination might enlarge the probability of neofunctionalization (Pneo of duplicate genes. In this article this hypothesis has been tested theoretically. Results Results show that through originalization recombination might not only shorten mean time to neofunctionalizaiton, but also enlarge Pneo. Conclusions Therefore, recombination might facilitate neofunctionalization via originalization. Several extensive applications of these results on genomic evolution have been discussed: 1. Time to nonfunctionalization can be much longer than a few million generations expected before; 2. Homogenization on duplicated loci results from not only gene conversion, but also originalization; 3. Although the rate of advantageous mutation is much small compared with that of degenerative mutation, Pneo cannot be expected to be small.

  19. The Phenotypic Plasticity of Duplicated Genes in Saccharomyces cerevisiae and the Origin of Adaptations

    Directory of Open Access Journals (Sweden)

    Florian Mattenberger


    Full Text Available Gene and genome duplication are the major sources of biological innovations in plants and animals. Functional and transcriptional divergence between the copies after gene duplication has been considered the main driver of innovations . However, here we show that increased phenotypic plasticity after duplication plays a more major role than thought before in the origin of adaptations. We perform an exhaustive analysis of the transcriptional alterations of duplicated genes in the unicellular eukaryote Saccharomyces cerevisiae when challenged with five different environmental stresses. Analysis of the transcriptomes of yeast shows that gene duplication increases the transcriptional response to environmental changes, with duplicated genes exhibiting signatures of adaptive transcriptional patterns in response to stress. The mechanism of duplication matters, with whole-genome duplicates being more transcriptionally altered than small-scale duplicates. The predominant transcriptional pattern follows the classic theory of evolution by gene duplication; with one gene copy remaining unaltered under stress, while its sister copy presents large transcriptional plasticity and a prominent role in adaptation. Moreover, we find additional transcriptional profiles that are suggestive of neo- and subfunctionalization of duplicate gene copies. These patterns are strongly correlated with the functional dependencies and sequence divergence profiles of gene copies. We show that, unlike singletons, duplicates respond more specifically to stress, supporting the role of natural selection in the transcriptional plasticity of duplicates. Our results reveal the underlying transcriptional complexity of duplicated genes and its role in the origin of adaptations.

  20. DNA rearrangement causes multiple changes in gene expression at the amylase locus in Drosophila melanogaster. (United States)

    Hickey, D A; Benkel, B F; Abukashawa, S; Haus, S


    A spontaneous null mutation at the alpha-amylase locus in Drosophila melanogaster was recovered from a laboratory population. The mutant strain was found to lack amylase enzyme production and to produce low, but detectable, levels of amylase mRNA. Moreover, the null strain is also lacking the glucose repression of amylase mRNA production which is seen in wild-type strains. The mutant phenotype correlates with a rearrangement in genomic DNA which, in turn, corresponds to a simple inversion in the arrangement observed most frequently in North American populations of D. melanogaster, including the common laboratory strain, Oregon-R. These results have implications for our understanding of both the evolution of the duplicated amylase gene structure and the regulation of amylase gene expression.

  1. Signals of historical interlocus gene conversion in human segmental duplications.

    Directory of Open Access Journals (Sweden)

    Beth L Dumont

    Full Text Available Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC. Here, we use high quality multiple sequence alignments from well-annotated segmental duplications to systematically identify IGC signals in the human reference genome. Our analysis combines two complementary methods: (i a paralog quartet method that uses DNA sequence simulations to identify a statistical excess of sites consistent with inter-paralog exchange, and (ii the alignment-based method implemented in the GENECONV program. One-quarter (25.4% of the paralog families in our analysis harbor clear IGC signals by the quartet approach. Using GENECONV, we identify 1477 gene conversion tracks that cumulatively span 1.54 Mb of the genome. Our analyses confirm the previously reported high rates of IGC in subtelomeric regions and Y-chromosome palindromes, and identify multiple novel IGC hotspots, including the pregnancy specific glycoproteins and the neuroblastoma breakpoint gene families. Although the duplication history of a paralog family is described by a single tree, we show that IGC has introduced incredible site-to-site variation in the evolutionary relationships among paralogs in the human genome. Our findings indicate that IGC has left significant footprints in patterns of sequence diversity across segmental duplications in the human genome, out-pacing the contributions of single base mutation by orders of magnitude. Collectively, the IGC signals we report comprise a catalog that will provide a critical reference for interpreting observed patterns of DNA sequence variation across duplicated genomic regions, including targets of recent adaptive evolution in humans.

  2. Rearrangements of immunoglobulin genes during differentiation and evolution. (United States)

    Honjo, T; Nakai, S; Nishida, Y; Kataoka, T; Yamawaki-Kataoka, Y; Takahashi, N; Obata, M; Shimizu, A; Yaoita, Y; Nikaido, T; Ishida, N


    Immunoglobulin genes are shown to undergo dynamic rearrangements during differentiation as well as evolution. We have demonstrated that a complete immunoglobulin heavy chain gene is formed by at least two types of DNA rearrangement during B cell differentiation. The first type of rearrangement is V-D-J recombination to complete a variable region sequence and the second type is S-S recombination to switch a constant region sequence. Both types of recombination are accompanied by deletion of the intervening DNA segment. Structure and organization of CH genes are elucidated by molecular cloning and nucleotide sequence determination. Organization of H chain genes is summarized as VH-(unknown distance)-JH-(6.5 kb)-C mu-(4.5 kb)-C delta-(unknown distance)-C gamma 3-(34 kb)-C gamma 1-(21 kb)-C gamma 2b-(15 kb)-C gamma 2a-(14.5 kb)-C epsilon-(12.5 kb)-C alpha. The S-S recombination takes place at the S region which is located at the 5' side of each CH gene. Nucleotide sequence of the S region comprises tandem repetition of closely related sequences. The S-S recombination seems to be mediated by short common sequences shared among S regions. A sister chromatid exchange model was proposed as a mechanism for S-S recombination. Comparison of nucleotide sequences of CH genes indicates that immunoglobulin genes have scrambled by intervening sequence-mediated domain transfer during their evolution.


    Institute of Scientific and Technical Information of China (English)

    Guo Sutang; Liu Yongchang; Sun Junning


    Objective:To investigate the pattern of clonal rearrangement of immunoglobulin heavy chain gene (IGH) and T-cell receptor γ gene (TCRγ) of NonHodgkin's lymphoma (NHL). Methods: Bone marrow smears of 211 patients of NHL were detected by PCR, the rearranged IGH and TCRγ gene was amplified using oligonucleotide primers. Results: The clonal rearrangement of IGH gene was detectable in 51.2%(108/211); the clonal rearrangement of TCRγ gene was detectable in 21.3% (45/211); both IGH and TCRγwas detectable in 5.7% (12/211);no clonal rearrangement in 21.8% (46/211). And compared clonal gene rearrangement with pathological type and primary site of tumor. Ten patients of NHL were investigated serially. 5/10 patients still had clonal gene rearrangement at clinical complete remission. Conclusion: It demonstrated that this assay may be useful in monitoring the minimal residual disease (MRD) and in evaluating effectiveness of therapy.

  4. Complete nucleotide sequence and gene rearrangement of the mitochondrial genome of Occidozyga martensii

    Indian Academy of Sciences (India)

    En Li; Xiaoqiang Li; Xiaobing Wu; Ge Feng; Man Zhang; Haitao Shi; Lijun Wang; Jianping Jiang


    In this study, the complete nucleotide sequence (18,321 bp) of the mitochondrial (mt) genome of the round-tongued floating frog, Occidozyga martensii was determined. Although, the base composition and codon usage of O. martensii conformed to the typical vertebrate patterns, this mt genome contained 23 tRNAs (a tandem duplication of tRNA-Met gene). The LTPF tRNA-gene cluster, and the derived position of the ND5 gene downstream of the control region, were present in this mitogenome. Moreover, we found that in the WANCY tRNA-gene cluster, the tRNA-Asn gene was located between the tRNA-Tyr and COI genes instead of between the tRNA-Ala and tRNA-Cys genes, which is a novel mtDNA gene rearrangement in vertebrates. Based on the concatenated nucleotide sequences of the 13 protein-coding genes, phylogenetic analysis (BI, ML, MP) was performed to further clarify the phylogenetic relations of this species within anurans.

  5. Evolution of the duplicated intracellular lipid-binding protein genes of teleost fishes. (United States)

    Venkatachalam, Ananda B; Parmar, Manoj B; Wright, Jonathan M


    Increasing organismal complexity during the evolution of life has been attributed to the duplication of genes and entire genomes. More recently, theoretical models have been proposed that postulate the fate of duplicated genes, among them the duplication-degeneration-complementation (DDC) model. In the DDC model, the common fate of a duplicated gene is lost from the genome owing to nonfunctionalization. Duplicated genes are retained in the genome either by subfunctionalization, where the functions of the ancestral gene are sub-divided between the sister duplicate genes, or by neofunctionalization, where one of the duplicate genes acquires a new function. Both processes occur either by loss or gain of regulatory elements in the promoters of duplicated genes. Here, we review the genomic organization, evolution, and transcriptional regulation of the multigene family of intracellular lipid-binding protein (iLBP) genes from teleost fishes. Teleost fishes possess many copies of iLBP genes owing to a whole genome duplication (WGD) early in the teleost fish radiation. Moreover, the retention of duplicated iLBP genes is substantially higher than the retention of all other genes duplicated in the teleost genome. The fatty acid-binding protein genes, a subfamily of the iLBP multigene family in zebrafish, are differentially regulated by peroxisome proliferator-activated receptor (PPAR) isoforms, which may account for the retention of iLBP genes in the zebrafish genome by the process of subfunctionalization of cis-acting regulatory elements in iLBP gene promoters.

  6. Hox gene duplications correlate with posterior heteronomy in scorpions. (United States)

    Sharma, Prashant P; Schwager, Evelyn E; Extavour, Cassandra G; Wheeler, Ward C


    The evolutionary success of the largest animal phylum, Arthropoda, has been attributed to tagmatization, the coordinated evolution of adjacent metameres to form morphologically and functionally distinct segmental regions called tagmata. Specification of regional identity is regulated by the Hox genes, of which 10 are inferred to be present in the ancestor of arthropods. With six different posterior segmental identities divided into two tagmata, the bauplan of scorpions is the most heteronomous within Chelicerata. Expression domains of the anterior eight Hox genes are conserved in previously surveyed chelicerates, but it is unknown how Hox genes regionalize the three tagmata of scorpions. Here, we show that the scorpion Centruroides sculpturatus has two paralogues of all Hox genes except Hox3, suggesting cluster and/or whole genome duplication in this arachnid order. Embryonic anterior expression domain boundaries of each of the last four pairs of Hox genes (two paralogues each of Antp, Ubx, abd-A and Abd-B) are unique and distinguish segmental groups, such as pectines, book lungs and the characteristic tail, while maintaining spatial collinearity. These distinct expression domains suggest neofunctionalization of Hox gene paralogues subsequent to duplication. Our data reconcile previous understanding of Hox gene function across arthropods with the extreme heteronomy of scorpions.

  7. Profiling of gene duplication patterns of sequenced teleost genomes: evidence for rapid lineage-specific genome expansion mediated by recent tandem duplications

    Directory of Open Access Journals (Sweden)

    Lu Jianguo


    Full Text Available Abstract Background Gene duplication has had a major impact on genome evolution. Localized (or tandem duplication resulting from unequal crossing over and whole genome duplication are believed to be the two dominant mechanisms contributing to vertebrate genome evolution. While much scrutiny has been directed toward discerning patterns indicative of whole-genome duplication events in teleost species, less attention has been paid to the continuous nature of gene duplications and their impact on the size, gene content, functional diversity, and overall architecture of teleost genomes. Results Here, using a Markov clustering algorithm directed approach we catalogue and analyze patterns of gene duplication in the four model teleost species with chromosomal coordinates: zebrafish, medaka, stickleback, and Tetraodon. Our analyses based on set size, duplication type, synonymous substitution rate (Ks, and gene ontology emphasize shared and lineage-specific patterns of genome evolution via gene duplication. Most strikingly, our analyses highlight the extraordinary duplication and retention rate of recent duplicates in zebrafish and their likely role in the structural and functional expansion of the zebrafish genome. We find that the zebrafish genome is remarkable in its large number of duplicated genes, small duplicate set size, biased Ks distribution toward minimal mutational divergence, and proportion of tandem and intra-chromosomal duplicates when compared with the other teleost model genomes. The observed gene duplication patterns have played significant roles in shaping the architecture of teleost genomes and appear to have contributed to the recent functional diversification and divergence of important physiological processes in zebrafish. Conclusions We have analyzed gene duplication patterns and duplication types among the available teleost genomes and found that a large number of genes were tandemly and intrachromosomally duplicated, suggesting

  8. Evolution of the mitochondrial genome in snakes: Gene rearrangements and phylogenetic relationships

    Directory of Open Access Journals (Sweden)

    Zhou Kaiya


    Full Text Available Abstract Background Snakes as a major reptile group display a variety of morphological characteristics pertaining to their diverse behaviours. Despite abundant analyses of morphological characters, molecular studies using mitochondrial and nuclear genes are limited. As a result, the phylogeny of snakes remains controversial. Previous studies on mitochondrial genomes of snakes have demonstrated duplication of the control region and translocation of trnL to be two notable features of the alethinophidian (all serpents except blindsnakes and threadsnakes mtDNAs. Our purpose is to further investigate the gene organizations, evolution of the snake mitochondrial genome, and phylogenetic relationships among several major snake families. Results The mitochondrial genomes were sequenced for four taxa representing four different families, and each had a different gene arrangement. Comparative analyses with other snake mitochondrial genomes allowed us to summarize six types of mitochondrial gene arrangement in snakes. Phylogenetic reconstruction with commonly used methods of phylogenetic inference (BI, ML, MP, NJ arrived at a similar topology, which was used to reconstruct the evolution of mitochondrial gene arrangements in snakes. Conclusion The phylogenetic relationships among the major families of snakes are in accordance with the mitochondrial genomes in terms of gene arrangements. The gene arrangement in Ramphotyphlops braminus mtDNA is inferred to be ancestral for snakes. After the divergence of the early Ramphotyphlops lineage, three types of rearrangements occurred. These changes involve translocations within the IQM tRNA gene cluster and the duplication of the CR. All phylogenetic methods support the placement of Enhydris plumbea outside of the (Colubridae + Elapidae cluster, providing mitochondrial genomic evidence for the familial rank of Homalopsidae.

  9. A salmonid EST genomic study: genes, duplications, phylogeny and microarrays

    Directory of Open Access Journals (Sweden)

    Brahmbhatt Sonal


    Full Text Available Abstract Background Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most widely studied groups of fish. Results 298,304 expressed sequence tags (ESTs from Atlantic salmon (69% of the total, 11,664 chinook, 10,813 sockeye, 10,051 brook trout, 10,975 grayling, 8,630 lake whitefish, and 3,624 northern pike ESTs were obtained in this study and have been deposited into the public databases. Contigs were built and putative full-length Atlantic salmon clones have been identified. A database containing ESTs, assemblies, consensus sequences, open reading frames, gene predictions and putative annotation is available. The overall similarity between Atlantic salmon ESTs and those of rainbow trout, chinook, sockeye, brook trout, grayling, lake whitefish, northern pike and rainbow smelt is 93.4, 94.2, 94.6, 94.4, 92.5, 91.7, 89.6, and 86.2% respectively. An analysis of 78 transcript sets show Salmo as a sister group to Oncorhynchus and Salvelinus within Salmoninae, and Thymallinae as a sister group to Salmoninae and Coregoninae within Salmonidae. Extensive gene duplication is consistent with a genome duplication in the common ancestor of salmonids. Using all of the available EST data, a new expanded salmonid cDNA microarray of 32,000 features was created. Cross-species hybridizations to this cDNA microarray indicate that this resource will be useful for studies of all 68 salmonid species. Conclusion An extensive collection and analysis of salmonid RNA putative transcripts indicate that Pacific salmon, Atlantic salmon and charr are 94–96% similar while the more distant whitefish, grayling, pike and smelt are 93, 92, 89 and 86% similar to salmon. The salmonid transcriptome reveals a complex history of gene duplication that is

  10. Diagnostic significance of TCR gene clonal rearrangement analysis in early mycosis fungoides

    Institute of Scientific and Technical Information of China (English)

    Chen Xu; Chuan Wan; Lin Wang; Han-Jun Yang; Yuan Tang; Wei-Ping Liu


    Mycosis fungoides (MF), the most common type of cutaneous T-cell lymphoma, has various unspecific clinical and histological characteristics. Its eariy diagnosis is challenging. The application of T-cell receptor (TCR) gene clonal rearrangement to the diagnosis of MF has been widely studied. In this study, we used polymerase chain reaction (PCR) to investigate the diagnostic significance of detecting TCR-γ and -β gene clonal rearrangement in the eady diagnosis of mycosis fungoides. PCR for TCR-γ and TCR-β gene rearrangement was performed on 19 patients with suspected early MF, 6 with typical MF, and 6 with chronic dermatitis. Of the 19 patients with suspected eady MF, 13 had TCR-~ gene clonal rearrangement, whereas none had TCR-β gene clonal rearrangement. All patients with typical MF had TCR gene clonal rearrangement, in which 4 showed TCR-γ clonal rearrangement, 1 showed TCR-β gene clonal rearrangements, and 1 showed both. No patients with chronic dermatitis had TCR gene clonal rearrangement. These results indicate that TCR gene clonal rearrangement analysis is a useful tool in diagnosing early MF. TCR-γ gene is recommended to the routine analysis, whereas TCR-β gene has potential in combination toward intractable cases.

  11. Molecular screening of pituitary adenomas for gene mutations and rearrangements

    Energy Technology Data Exchange (ETDEWEB)

    Herman, V.; Drazin, N.Z.; Gonskey, R.; Melmed, S. (Cedars-Sinai Medical Center, Los Angeles, CA (United States))


    Although pituitary tumors arise as benign monoclonal neoplasms, genetic alterations have not readily been identified in these adenomas. The authors studied restriction fragment abnormalities involving the GH gene locus, and mutations in the p53 and H-, K-, and N-ras genes in 22 human GH cell adenomas. Twenty two nonsecretory adenomas were also examined for p53 and ras gene mutations. Seven prolactinoma DNA samples were tested for deletions in the multiple endocrine neoplasia-1 (MEN-1) locus, as well as for rearrangements in the hst gene, a member of the fibroblast growth factor family. In DNA from GH-cell adenomas, identical GH restriction patterns were detected in both pituitary and lymphocyte DNA in all patients and in one patient with a mixed GH-TSH cell adenoma. Using polymerase chain reaction (PCR)-single stranded conformation polymorphism analysis, no mutations were detected in exons 5, 6, 7 and 8 of the p53 gene in GH cell adenomas nor in 22 nonsecretory adenomas. Codons 12/13 and 61 of H-ras, K-ras, and N-ras genes were also intact on GH cell adenomas and in nonsecretory adenomas. Site-specific probes for chromosome 11q13 including, PYGM, D11S146, and INT2 were used in 7 sporadic PRL-secreting adenomas to detect deletions of the MEN-1 locus on chromosome 11. One patient was identified with a loss of 11p, and the remaining 6 patients did not demonstrate loss of heterozygosity in the pituitary 11q13 locus, compared to lymphocyte DNA. None of these patients demonstrated hst gene rearrangements which also maps to this locus. These results show that p53 and ras gene mutations are not common events in the pathogenesis of acromegaly and nonsecretory tumors. Although hst gene rearrangements and deletions of 11q13 are not associated with sporadic PRl-cell adenoma formation, a single patient was detected with a partial loss of chromosome 11, including the putative MEN-1 site. 31 refs., 5 figs., 2 tabs.

  12. Matrix Gla protein and osteocalcin: from gene duplication to neofunctionalization. (United States)

    Cancela, M Leonor; Laizé, Vincent; Conceição, Natércia


    Osteocalcin (OC or bone Gla protein, BGP) and matrix Gla protein (MGP) are two members of the growing family of vitamin K-dependent (VKD) proteins. They were the first VKD proteins found not to be involved in coagulation and synthesized outside the liver. Both proteins were isolated from bone although it is now known that only OC is synthesized by bone cells under normal physiological conditions, but since both proteins can bind calcium and hydroxyapatite, they can also accumulate in bone. Both OC and MGP share similar structural features, both in terms of protein domains and gene organization. OC gene is likely to have appeared from MGP through a tandem gene duplication that occurred concomitantly with the appearance of the bony vertebrates. Despite their relatively close relationship and the fact that both can bind calcium and affect mineralization, their functions are not redundant and they also have other unrelated functions. Interestingly, these two proteins appear to have followed quite different evolutionary strategies in order to acquire novel functionalities, with OC following a gene duplication strategy while MGP variability was obtained mostly by the use of multiple promoters and alternative splicing, leading to proteins with additional functional characteristics and alternative gene regulatory pathways. Copyright © 2014 Elsevier Inc. All rights reserved.

  13. Gene duplications in prokaryotes can be associated with environmental adaptation

    Directory of Open Access Journals (Sweden)

    Lempicki Richard A


    Full Text Available Abstract Background Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Results Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Conclusions Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive

  14. [Duplication of DNA--a mechanism for the development of new functionality of genes]. (United States)

    Maślanka, Roman; Zadrąg-Tęcza, Renata


    The amplification of DNA is considered as a mechanism for rapid evolution of organisms. Duplication can be especially advantageous in the case of changing environmental conditions. Whole genome duplication maintains the proper balance between gene expression. This seems to be the main reason why WGD is more favorable than duplication of the fragments of DNA. The polyploidy status disappear as a result of the loss of the majority of duplicated genes. The preservation of duplicated genes is associated with the development of their new functions. Polyploidization is often noted for plants. However due to sequencing technique, the duplications episodes are more frequently reports also for the other systematic taxa, including animals. The occurrence of ancient genome duplication is also considered for yeast Saccharomyces cerevisiae. The existence of two active copies of ribosomal protein genes can be a confirmation of this process. Development of the fermentation process might be one of the probable causes of the yeast genome duplication.

  15. Evolutionary Fates and Dynamic Functionalization of Young Duplicate Genes in Arabidopsis Genomes1[OPEN (United States)

    Wang, Jun; Tao, Feng; Marowsky, Nicholas C.; Fan, Chuanzhu


    Gene duplication is a primary means to generate genomic novelties, playing an essential role in speciation and adaptation. Particularly in plants, a high abundance of duplicate genes has been maintained for significantly long periods of evolutionary time. To address the manner in which young duplicate genes were derived primarily from small-scale gene duplication and preserved in plant genomes and to determine the underlying driving mechanisms, we generated transcriptomes to produce the expression profiles of five tissues in Arabidopsis thaliana and the closely related species Arabidopsis lyrata and Capsella rubella. Based on the quantitative analysis metrics, we investigated the evolutionary processes of young duplicate genes in Arabidopsis. We determined that conservation, neofunctionalization, and specialization are three main evolutionary processes for Arabidopsis young duplicate genes. We explicitly demonstrated the dynamic functionalization of duplicate genes along the evolutionary time scale. Upon origination, duplicates tend to maintain their ancestral functions; but as they survive longer, they might be likely to develop distinct and novel functions. The temporal evolutionary processes and functionalization of plant duplicate genes are associated with their ancestral functions, dynamic DNA methylation levels, and histone modification abundances. Furthermore, duplicate genes tend to be initially expressed in pollen and then to gain more interaction partners over time. Altogether, our study provides novel insights into the dynamic retention processes of young duplicate genes in plant genomes. PMID:27485883

  16. Expression Divergence of Duplicate Genes in the Protein Kinase Superfamily in Pacific Oyster. (United States)

    Gao, Dahai; Ko, Dennis C; Tian, Xinmin; Yang, Guang; Wang, Liuyang


    Gene duplication has been proposed to serve as the engine of evolutionary innovation. It is well recognized that eukaryotic genomes contain a large number of duplicated genes that evolve new functions or expression patterns. However, in mollusks, the evolutionary mechanisms underlying the divergence and the functional maintenance of duplicate genes remain little understood. In the present study, we performed a comprehensive analysis of duplicate genes in the protein kinase superfamily using whole genome and transcriptome data for the Pacific oyster. A total of 64 duplicated gene pairs were identified based on a phylogenetic approach and the reciprocal best BLAST method. By analyzing gene expression from RNA-seq data from 69 different developmental and stimuli-induced conditions (nine tissues, 38 developmental stages, eight dry treatments, seven heat treatments, and seven salty treatments), we found that expression patterns were significantly correlated for a number of duplicate gene pairs, suggesting the conservation of regulatory mechanisms following divergence. Our analysis also identified a subset of duplicate gene pairs with very high expression divergence, indicating that these gene pairs may have been subjected to transcriptional subfunctionalization or neofunctionalization after the initial duplication events. Further analysis revealed a significant correlation between expression and sequence divergence (as revealed by synonymous or nonsynonymous substitution rates) under certain conditions. Taken together, these results provide evidence for duplicate gene sequence and expression divergence in the Pacific oyster, accompanying its adaptation to harsh environments. Our results provide new insights into the evolution of duplicate genes and their expression levels in the Pacific oyster.

  17. Neutral and Non-Neutral Evolution of Duplicated Genes with Gene Conversion

    Directory of Open Access Journals (Sweden)

    Jeffrey A. Fawcett


    Full Text Available Gene conversion is one of the major mutational mechanisms involved in the DNA sequence evolution of duplicated genes. It contributes to create unique patters of DNA polymorphism within species and divergence between species. A typical pattern is so-called concerted evolution, in which the divergence between duplicates is maintained low for a long time because of frequent exchanges of DNA fragments. In addition, gene conversion affects the DNA evolution of duplicates in various ways especially when selection operates. Here, we review theoretical models to understand the evolution of duplicates in both neutral and non-neutral cases. We also explain how these theories contribute to interpreting real polymorphism and divergence data by using some intriguing examples.

  18. Duplication and Diversification of the Hypoxia-Inducible IGFBP-1 Gene in Zebrafish

    DEFF Research Database (Denmark)

    Kamei, Hiroyasu; Lu, Ling; Jiao, Shuang


    Background: Gene duplication is the primary force of new gene evolution. Deciphering whether a pair of duplicated genes has evolved divergent functions is often challenging. The zebrafish is uniquely positioned to provide insight into the process of functional gene evolution due to its amenabilit...

  19. Divergence of Recently Duplicated Mg-Type MADS-Box Genes in Petunia

    NARCIS (Netherlands)

    Bemer, M.; Gordon, J.; Weterings, K.; Angenent, G.C.


    The MADS-box transcription factor family has expanded considerably in plants via gene and genome duplications and can be subdivided into type I and MIKC-type genes. The two gene classes show a different evolutionary history. Whereas the MIKC-type genes originated during ancient genome duplications,

  20. Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene

    Directory of Open Access Journals (Sweden)

    Shomron Noam


    Full Text Available Abstract Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.

  1. Accelerated evolution after gene duplication: a time-dependent process affecting just one copy. (United States)

    Pegueroles, Cinta; Laurie, Steve; Albà, M Mar


    Gene duplication is widely regarded as a major mechanism modeling genome evolution and function. However, the mechanisms that drive the evolution of the two, initially redundant, gene copies are still ill defined. Many gene duplicates experience evolutionary rate acceleration, but the relative contribution of positive selection and random drift to the retention and subsequent evolution of gene duplicates, and for how long the molecular clock may be distorted by these processes, remains unclear. Focusing on rodent genes that duplicated before and after the mouse and rat split, we find significantly increased sequence divergence after duplication in only one of the copies, which in nearly all cases corresponds to the novel daughter copy, independent of the mechanism of duplication. We observe that the evolutionary rate of the accelerated copy, measured as the ratio of nonsynonymous to synonymous substitutions, is on average 5-fold higher in the period spanning 4-12 My after the duplication than it was before the duplication. This increase can be explained, at least in part, by the action of positive selection according to the results of the maximum likelihood-based branch-site test. Subsequently, the rate decelerates until purifying selection completely returns to preduplication levels. Reversion to the original rates has already been accomplished 40.5 My after the duplication event, corresponding to a genetic distance of about 0.28 synonymous substitutions per site. Differences in tissue gene expression patterns parallel those of substitution rates, reinforcing the role of neofunctionalization in explaining the evolution of young gene duplicates.

  2. Restriction and Recruitment—Gene Duplication and the Origin and Evolution of Snake Venom Toxins


    Hargreaves, Adam D; Swain, Martin T.; Matthew J. Hegarty; Logan, Darren W; Mulley, John F


    Snake venom has been hypothesized to have originated and diversified through a process that involves duplication of genes encoding body proteins with subsequent recruitment of the copy to the venom gland, where natural selection acts to develop or increase toxicity. However, gene duplication is known to be a rare event in vertebrate genomes, and the recruitment of duplicated genes to a novel expression domain (neofunctionalization) is an even rarer process that requires the evolution of novel...

  3. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling. (United States)

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh


    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  4. Targeting FLT3 in primary MLL-gene-rearranged infant acute lymphoblastic leukemia. (United States)

    Stam, Ronald W; den Boer, Monique L; Schneider, Pauline; Nollau, Peter; Horstmann, Martin; Beverloo, H Berna; van der Voort, Ella; Valsecchi, Maria G; de Lorenzo, Paola; Sallan, Stephen E; Armstrong, Scott A; Pieters, Rob


    Acute lymphoblastic leukemia (ALL) in infants is characterized by rearrangements of the mixed lineage leukemia (MLL) gene, drug resistance, and a poor treatment outcome. Therefore, novel therapeutic strategies are needed to improve prognosis. Recently, we showed that FLT3 is highly expressed in MLL rearranged ALL (MLL). Here we demonstrate FLT3 expression in infants with MLL (n = 41) to be significantly higher compared to both infant (n = 8; P < .001) and noninfant patients with ALL (n = 23; P = .001) carrying germline MLL genes. Furthermore, leukemic cells from infants with MLL were significantly more sensitive to the Fms-like tyrosine kinase 3 (FLT3) inhibitor PKC412 (N-benzoyl staurosporine) than noninfant ALL cells, and at least as sensitive as internal tandem duplication-positive (ITD+) AML cells. Surprisingly, activation loop mutations only occurred in about 3% (1 of 36) of the cases and no FLT3/ITDs were observed. However, measuring FLT3 phosphorylation in infants with MLL expressing varying levels of wild-type FLT3 revealed that high-level FLT3 expression is associated with ligand-independent FLT3 activation. This suggests that infant MLL cells displaying activated FLT3 as a result of overexpression can be targeted by FLT3 inhibitors such as PKC412. However, at concentrations of PKC412 minimally required to fully inhibit FLT3 phosphorylation, the cytotoxic effects were only fractional. Thus, PKC412-induced apoptosis in infant MLL cells is unlikely to be a consequence of FLT3 inhibition alone but may involve inhibition of multiple other kinases by this drug.

  5. Are duplicated genes responsible for anthracnose resistance in common bean? (United States)

    Costa, Larissa Carvalho; Nalin, Rafael Storto; Ramalho, Magno Antonio Patto; de Souza, Elaine Aparecida


    The race 65 of Colletotrichum lindemuthianum, etiologic agent of anthracnose in common bean, is distributed worldwide, having great importance in breeding programs for anthracnose resistance. Several resistance alleles have been identified promoting resistance to this race. However, the variability that has been detected within race has made it difficult to obtain cultivars with durable resistance, because cultivars may have different reactions to each strain of race 65. Thus, this work aimed at studying the resistance inheritance of common bean lines to different strains of C. lindemuthianum, race 65. We used six C. lindemuthianum strains previously characterized as belonging to the race 65 through the international set of differential cultivars of anthracnose and nine commercial cultivars, adapted to the Brazilian growing conditions and with potential ability to discriminate the variability within this race. To obtain information on the resistance inheritance related to nine commercial cultivars to six strains of race 65, these cultivars were crossed two by two in all possible combinations, resulting in 36 hybrids. Segregation in the F2 generations revealed that the resistance to each strain is conditioned by two independent genes with the same function, suggesting that they are duplicated genes, where the dominant allele promotes resistance. These results indicate that the specificity between host resistance genes and pathogen avirulence genes is not limited to races, it also occurs within strains of the same race. Further research may be carried out in order to establish if the alleles identified in these cultivars are different from those described in the literature.

  6. Duplication of pilus gene complexes of Haemophilus influenzae biogroup aegyptius. (United States)

    Read, T D; Dowdell, M; Satola, S W; Farley, M M


    Brazilian purpuric fever (BPF) is a recently described pediatric septicemia caused by a strain of Haemophilus influenzae biogroup aegyptius. The pilus specified by this bacterium may be important in BPF pathogenesis, enhancing attachment to host tissue. Here, we report the cloning of two haf (for H. influenzae biogroup aegyptius fimbriae) gene clusters from a cosmid library of strain F3031. We sequenced a 6.8-kb segment of the haf1 cluster and identified five genes (hafA to hafE). The predicted protein products, HafA to HafD, are 72, 95, 98, and 90% similar, respectively, to HifA to HifD of the closely related H. influenzae type b pilus. Strikingly, the putative pilus adhesion, HifE, shares only 44% identity with HafE, suggesting that the proteins may differ in receptor specificity. Insertion of a mini-gammadelta transposon in the hafE gene eliminated hemadsorption. The nucleotide sequences of the haf1 and haf2 clusters are more than 99% identical. Using the recently published sequence of the H. influenzae Rd genome, we determined that the haf1 complex lies at a unique position in the chromosome between the pmbA gene and a hypothetical open reading frame, HI1153. The location of the haf2 cluster, inserted between the purE and pepN genes, is analogous to the hif genes on H. influenzae type b. BPF fimbrial phase switching appears to involve slip-strand mispairing of repeated dinucleotides in the pilus promoter. The BPF-associated H. influenzae biogroup aegyptius pilus system generally resembles other H. influenzae, but the possession of a second fimbrial gene cluster, which appears to have arisen by a recent duplication event, and the novel sequence of the HafE adhesin may be significant in the unusual pathogenesis of BPF.

  7. Comparative Inference of Duplicated Genes Produced by Polyploidization in Soybean Genome

    Directory of Open Access Journals (Sweden)

    Yanmei Yang


    Full Text Available Soybean (Glycine max is one of the most important crop plants for providing protein and oil. It is important to investigate soybean genome for its economic and scientific value. Polyploidy is a widespread and recursive phenomenon during plant evolution, and it could generate massive duplicated genes which is an important resource for genetic innovation. Improved sequence alignment criteria and statistical analysis are used to identify and characterize duplicated genes produced by polyploidization in soybean. Based on the collinearity method, duplicated genes by whole genome duplication account for 70.3% in soybean. From the statistical analysis of the molecular distances between duplicated genes, our study indicates that the whole genome duplication event occurred more than once in the genome evolution of soybean, which is often distributed near the ends of chromosomes.

  8. Adaptive evolution of genes duplicated from the Drosophila pseudoobscura neo-X chromosome. (United States)

    Meisel, Richard P; Hilldorfer, Benedict B; Koch, Jessica L; Lockton, Steven; Schaeffer, Stephen W


    Drosophila X chromosomes are disproportionate sources of duplicated genes, and these duplications are usually the result of retrotransposition of X-linked genes to the autosomes. The excess duplication is thought to be driven by natural selection for two reasons: X chromosomes are inactivated during spermatogenesis, and the derived copies of retroposed duplications tend to be testis expressed. Therefore, autosomal derived copies of retroposed genes provide a mechanism for their X-linked paralogs to "escape" X inactivation. Once these duplications have fixed, they may then be selected for male-specific functions. Throughout the evolution of the Drosophila genus, autosomes have fused with X chromosomes along multiple lineages giving rise to neo-X chromosomes. There has also been excess duplication from the two independent neo-X chromosomes that have been examined--one that occurred prior to the common ancestor of the willistoni species group and another that occurred along the lineage leading to Drosophila pseudoobscura. To determine what role natural selection plays in the evolution of genes duplicated from the D. pseudoobscura neo-X chromosome, we analyzed DNA sequence divergence between paralogs, polymorphism within each copy, and the expression profiles of these duplicated genes. We found that the derived copies of all duplicated genes have elevated nonsynonymous polymorphism, suggesting that they are under relaxed selective constraints. The derived copies also tend to have testis- or male-biased expression profiles regardless of their chromosome of origin. Genes duplicated from the neo-X chromosome appear to be under less constraints than those duplicated from other chromosome arms. We also find more evidence for historical adaptive evolution in genes duplicated from the neo-X chromosome, suggesting that they are under a unique selection regime in which elevated nonsynonymous polymorphism provides a large reservoir of functional variants, some of which are fixed

  9. The Evolutionary Relationship between Alternative Splicing and Gene Duplication (United States)

    Iñiguez, Luis P.; Hernández, Georgina


    The protein diversity that exists today has resulted from various evolutionary processes. It is well known that gene duplication (GD) along with the accumulation of mutations are responsible, among other factors, for an increase in the number of different proteins. The gene structure in eukaryotes requires the removal of non-coding sequences, introns, to produce mature mRNAs. This process, known as cis-splicing, referred to here as splicing, is regulated by several factors which can lead to numerous splicing arrangements, commonly designated as alternative splicing (AS). AS, producing several transcripts isoforms form a single gene, also increases the protein diversity. However, the evolution and manner for increasing protein variation differs between AS and GD. An important question is how are patterns of AS affected after a GD event. Here, we review the current knowledge of AS and GD, focusing on their evolutionary relationship. These two processes are now considered the main contributors to the increasing protein diversity and therefore their relationship is a relevant, yet understudied, area of evolutionary study. PMID:28261262

  10. Evolution of vertebrate central nervous system is accompanied by novel expression changes of duplicate genes. (United States)

    Chen, Yuan; Ding, Yun; Zhang, Zuming; Wang, Wen; Chen, Jun-Yuan; Ueno, Naoto; Mao, Bingyu


    The evolution of the central nervous system (CNS) is one of the most striking changes during the transition from invertebrates to vertebrates. As a major source of genetic novelties, gene duplication might play an important role in the functional innovation of vertebrate CNS. In this study, we focused on a group of CNS-biased genes that duplicated during early vertebrate evolution. We investigated the tempo-spatial expression patterns of 33 duplicate gene families and their orthologs during the embryonic development of the vertebrate Xenopus laevis and the cephalochordate Brachiostoma belcheri. Almost all the identified duplicate genes are differentially expressed in the CNS in Xenopus embryos, and more than 50% and 30% duplicate genes are expressed in the telencephalon and mid-hindbrain boundary, respectively, which are mostly considered as two innovations in the vertebrate CNS. Interestingly, more than 50% of the amphioxus orthologs do not show apparent expression in the CNS in amphioxus embryos as detected by in situ hybridization, indicating that some of the vertebrate CNS-biased duplicate genes might arise from non-CNS genes in invertebrates. Our data accentuate the functional contribution of gene duplication in the CNS evolution of vertebrate and uncover an invertebrate non-CNS history for some vertebrate CNS-biased duplicate genes. Copyright © 2011. Published by Elsevier Ltd.

  11. Diverged Copies of the Seed Regulatory Opaque-2 Gene by a Segmental Duplication in the Progenitor Genome of Rice,Sorghum,and Maize

    Institute of Scientific and Technical Information of China (English)

    Jian-Hong Xu; Joachim Messing


    Comparative analyses of the sequence of entire genomes have shown that gene duplications,chromosomal segmental duplications.or even whole genome duplications(WGD)have played prominent roles in the evolution of many eukaryotic species.Here,we used the ancient duplication of a well known transcription factor in maize,encoded by the Opaque-2(02)IOCUS,to examine the generaI features of divergences of chromosomaI segmentaI duplications in a lineagespecific manner.We took advantage of contiguous chromosomal sequence information in rice(Oryza sativa,Nipponbare).sorghum(Sorghum bicoloc Btx623),and maize(Zea mays,B73)that were aligned by conserved gene order(synteny).This analysis showed that the maize O2 locus is contained within a 1.25 million base-pair(Mb)segment on chromosome 7.which was duplicated≈56 million years ago(mya)before the split of rice and maize 50 mya.The duplicated region on chromosome 1 is only half the size and contains the maize OHP gene.which does not restore the o2 mutation although it encodes a protein with the same DNA and protein binding properties in endosperm.The segmental duplication iS not only found in rice,but also in sorghum,which split from maize 11.9 mya.A detailed analysis of the duplicated regions provided examples for complex rearrangements including deletions.duplications,conversions,inversions,and translocations.Furthermore,the rice and sorghum genomes appeared to be more stable than the maize genome,probably because maize underwent allotetraploidization and then diploidization.

  12. Duplication of OsHAP family genes and their association with heading date in rice. (United States)

    Li, Qiuping; Yan, Wenhao; Chen, Huaxia; Tan, Cong; Han, Zhongmin; Yao, Wen; Li, Guangwei; Yuan, Mengqi; Xing, Yongzhong


    Heterotrimeric Heme Activator Protein (HAP) family genes are involved in the regulation of flowering in plants. It is not clear how many HAP genes regulate heading date in rice. In this study, we identified 35 HAP genes, including seven newly identified genes, and performed gene duplication and candidate gene-based association analyses. Analyses showed that segmental duplication and tandem duplication are the main mechanisms of HAP gene duplication. Expression profiling and functional identification indicated that duplication probably diversifies the functions of HAP genes. A nucleotide diversity analysis revealed that 13 HAP genes underwent selection. A candidate gene-based association analysis detected four HAP genes related to heading date. An investigation of transgenic plants or mutants of 23 HAP genes confirmed that overexpression of at least four genes delayed heading date under long-day conditions, including the previously cloned Ghd8/OsHAP3H. Our results indicate that the large number of HAP genes in rice was mainly produced by gene duplication, and a few HAP genes function to regulate heading date. Selection of HAP genes is probably caused by their diverse functions rather than regulation of heading.

  13. The major resistance gene cluster in lettuce is highly duplicated and spans several megabases. (United States)

    Meyers, B C; Chin, D B; Shen, K A; Sivaramakrishnan, S; Lavelle, D O; Zhang, Z; Michelmore, R W


    At least 10 Dm genes conferring resistance to the oomycete downy mildew fungus Bremia lactucae map to the major resistance cluster in lettuce. We investigated the structure of this cluster in the lettuce cultivar Diana, which contains Dm3. A deletion breakpoint map of the chromosomal region flanking Dm3 was saturated with a variety of molecular markers. Several of these markers are components of a family of resistance gene candidates (RGC2) that encode a nucleotide binding site and a leucine-rich repeat region. These motifs are characteristic of plant disease resistance genes. Bacterial artificial chromosome clones were identified by using duplicated restriction fragment length polymorphism markers from the region, including the nucleotide binding site-encoding region of RGC2. Twenty-two distinct members of the RGC2 family were characterized from the bacterial artificial chromosomes; at least two additional family members exist. The RGC2 family is highly divergent; the nucleotide identity was as low as 53% between the most distantly related copies. These RGC2 genes span at least 3.5 Mb. Eighteen members were mapped on the deletion breakpoint map. A comparison between the phylogenetic and physical relationships of these sequences demonstrated that closely related copies are physically separated from one another and indicated that complex rearrangements have shaped this region. Analysis of low-copy genomic sequences detected no genes, including RGC2, in the Dm3 region, other than sequences related to retrotransposons and transposable elements. The related but divergent family of RGC2 genes may act as a resource for the generation of new resistance phenotypes through infrequent recombination or unequal crossing over.

  14. Divergence of gene body DNA methylation and evolution of plant duplicate genes.

    Directory of Open Access Journals (Sweden)

    Jun Wang

    Full Text Available It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes.

  15. Phylogenetics of Lophotrochozoan bHLH Genes and the Evolution of Lineage-Specific Gene Duplicates (United States)

    Bao, Yongbo


    The gain and loss of genes encoding transcription factors is of importance to understanding the evolution of gene regulatory complexity. The basic helix–loop–helix (bHLH) genes encode a large superfamily of transcription factors. We systematically classify the bHLH genes from five mollusc, two annelid and one brachiopod genomes, tracing the pattern of bHLH gene evolution across these poorly studied Phyla. In total, 56–88 bHLH genes were identified in each genome, with most identifiable as members of previously described bilaterian families, or of new families we define. Of such families only one, Mesp, appears lost by all these species. Additional duplications have also played a role in the evolution of the bHLH gene repertoire, with many new lophotrochozoan-, mollusc-, bivalve-, or gastropod-specific genes defined. Using a combination of transcriptome mining, RT-PCR, and in situ hybridization we compared the expression of several of these novel genes in tissues and embryos of the molluscs Crassostrea gigas and Patella vulgata, finding both conserved expression and evidence for neofunctionalization. We also map the positions of the genes across these genomes, identifying numerous gene linkages. Some reflect recent paralog divergence by tandem duplication, others are remnants of ancient tandem duplications dating to the lophotrochozoan or bilaterian common ancestors. These data are built into a model of the evolution of bHLH genes in molluscs, showing formidable evolutionary stasis at the family level but considerable within-family diversification by tandem gene duplication. PMID:28338988

  16. Global analysis of human duplicated genes reveals the relative importance of whole-genome duplicates originated in the early vertebrate evolution. (United States)

    Acharya, Debarun; Ghosh, Tapash C


    Gene duplication is a genetic mutation that creates functionally redundant gene copies that are initially relieved from selective pressures and may adapt themselves to new functions with time. The levels of gene duplication may vary from small-scale duplication (SSD) to whole genome duplication (WGD). Studies with yeast revealed ample differences between these duplicates: Yeast WGD pairs were functionally more similar, less divergent in subcellular localization and contained a lesser proportion of essential genes. In this study, we explored the differences in evolutionary genomic properties of human SSD and WGD genes, with the identifiable human duplicates coming from the two rounds of whole genome duplication occurred early in vertebrate evolution. We observed that these two groups of duplicates were also dissimilar in terms of their evolutionary and genomic properties. But interestingly, this is not like the same observed in yeast. The human WGDs were found to be functionally less similar, diverge more in subcellular level and contain a higher proportion of essential genes than the SSDs, all of which are opposite from yeast. Additionally, we explored that human WGDs were more divergent in their gene expression profile, have higher multifunctionality and are more often associated with disease, and are evolutionarily more conserved than human SSDs. Our study suggests that human WGD duplicates are more divergent and entails the adaptation of WGDs to novel and important functions that consequently lead to their evolutionary conservation in the course of evolution.

  17. Consensus properties and their large-scale applications for the gene duplication problem. (United States)

    Moon, Jucheol; Lin, Harris T; Eulenstein, Oliver


    Solving the gene duplication problem is a classical approach for species tree inference from gene trees that are confounded by gene duplications. This problem takes a collection of gene trees and seeks a species tree that implies the minimum number of gene duplications. Wilkinson et al. posed the conjecture that the gene duplication problem satisfies the desirable Pareto property for clusters. That is, for every instance of the problem, all clusters that are commonly present in the input gene trees of this instance, called strict consensus, will also be found in every solution to this instance. We prove that this conjecture does not generally hold. Despite this negative result we show that the gene duplication problem satisfies a weaker version of the Pareto property where the strict consensus is found in at least one solution (rather than all solutions). This weaker property contributes to our design of an efficient scalable algorithm for the gene duplication problem. We demonstrate the performance of our algorithm in analyzing large-scale empirical datasets. Finally, we utilize the algorithm to evaluate the accuracy of standard heuristics for the gene duplication problem using simulated datasets.

  18. Buffering by gene duplicates: an analysis of molecular correlates and evolutionary conservation

    Directory of Open Access Journals (Sweden)

    Vogel Christine


    Full Text Available Abstract Background One mechanism to account for robustness against gene knockouts or knockdowns is through buffering by gene duplicates, but the extent and general correlates of this process in organisms is still a matter of debate. To reveal general trends of this process, we provide a comprehensive comparison of gene essentiality, duplication and buffering by duplicates across seven bacteria (Mycoplasma genitalium, Bacillus subtilis, Helicobacter pylori, Haemophilus influenzae, Mycobacterium tuberculosis, Pseudomonas aeruginosa, Escherichia coli, and four eukaryotes (Saccharomyces cerevisiae (yeast, Caenorhabditis elegans (worm, Drosophila melanogaster (fly, Mus musculus (mouse. Results In nine of the eleven organisms, duplicates significantly increase chances of survival upon gene deletion (P-value ≤ 0.05, but only by up to 13%. Given that duplicates make up to 80% of eukaryotic genomes, the small contribution is surprising and points to dominant roles of other buffering processes, such as alternative metabolic pathways. The buffering capacity of duplicates appears to be independent of the degree of gene essentiality and tends to be higher for genes with high expression levels. For example, buffering capacity increases to 23% amongst highly expressed genes in E. coli. Sequence similarity and the number of duplicates per gene are weak predictors of the duplicate's buffering capacity. In a case study we show that buffering gene duplicates in yeast and worm are somewhat more similar in their functions than non-buffering duplicates and have increased transcriptional and translational activity. Conclusion In sum, the extent of gene essentiality and buffering by duplicates is not conserved across organisms and does not correlate with the organisms' apparent complexity. This heterogeneity goes beyond what would be expected from differences in experimental approaches alone. Buffering by duplicates contributes to robustness in several organisms

  19. Subfunctionalization of duplicated zebrafish pax6 genes by cis-regulatory divergence

    National Research Council Canada - National Science Library

    Kleinjan, Dirk A; Bancewicz, Ruth M; Gautier, Philippe; Dahm, Ralf; Schonthaler, Helia B; Damante, Giuseppe; Seawright, Anne; Hever, Ann M; Yeyati, Patricia L; van Heyningen, Veronica; Coutinho, Pedro


    Gene duplication is a major driver of evolutionary divergence. In most vertebrates a single PAX6 gene encodes a transcription factor required for eye, brain, olfactory system, and pancreas development...

  20. Prevalence of chromosomal rearrangements involving non-ETS genes in prostate cancer. (United States)

    Kluth, Martina; Galal, Rami; Krohn, Antje; Weischenfeldt, Joachim; Tsourlakis, Christina; Paustian, Lisa; Ahrary, Ramin; Ahmed, Malik; Scherzai, Sekander; Meyer, Anne; Sirma, Hüseyin; Korbel, Jan; Sauter, Guido; Schlomm, Thorsten; Simon, Ronald; Minner, Sarah


    Prostate cancer is characterized by structural rearrangements, most frequently including translocations between androgen-dependent genes and members of the ETS family of transcription factor like TMPRSS2:ERG. In a recent whole genome sequencing study we identified 140 gene fusions that were unrelated to ETS genes in 11 prostate cancers. The aim of the present study was to estimate the prevalence of non-ETS gene fusions. We randomly selected 27 of these rearrangements and analyzed them by fluorescence in situ hybridization (FISH) in a tissue microarray format containing 500 prostate cancers. Using break-apart FISH probes for one fusion partner each, we found rearrangements of 13 (48%) of the 27 analyzed genes in 300-400 analyzable cancers per gene. Recurrent breakage, often accompanied by partial deletion of the genes, was found for NCKAP5, SH3BGR and TTC3 in 3 (0.8%) tumors each, as well as for ARNTL2 and ENOX1 in 2 (0.5%) cancers each. One rearranged tumor sample was observed for each of VCL, ZNF578, IMMP2L, SLC16A12, PANK1, GPHN, LRP1 and ZHX2. Balanced rearrangements, indicating possible gene fusion, were found for ZNF578, SH3BGR, LPR12 and ZHX2 in individual cancers only. The results of the present study confirm that rearrangements involving non-ETS genes occur in prostate cancer, but demonstrate that they are highly individual and typically non-recurrent.

  1. Large scale calcium channel gene rearrangements in episodic ataxia and hemiplegic migraine: implications for diagnostic testing. (United States)

    Labrum, R W; Rajakulendran, S; Graves, T D; Eunson, L H; Bevan, R; Sweeney, M G; Hammans, S R; Tubridy, N; Britton, T; Carr, L J; Ostergaard, J R; Kennedy, C R; Al-Memar, A; Kullmann, D M; Schorge, S; Temple, K; Davis, M B; Hanna, M G


    Episodic ataxia type 2 (EA2) and familial hemiplegic migraine type 1 (FHM1) are autosomal dominant disorders characterised by paroxysmal ataxia and migraine, respectively. Point mutations in CACNA1A, which encodes the neuronal P/Q-type calcium channel, have been detected in many cases of EA2 and FHM1. The genetic basis of typical cases without CACNA1A point mutations is not fully known. Standard DNA sequencing methods may miss large scale genetic rearrangements such as deletions and duplications. The authors investigated whether large scale genetic rearrangements in CACNA1A can cause EA2 and FHM1. The authors used multiplex ligation dependent probe amplification (MLPA) to screen for intragenic CACNA1A rearrangements. The authors identified five previously unreported large scale deletions in CACNA1A in seven families with episodic ataxia and in one case with hemiplegic migraine. One of the deletions (exon 6 of CACNA1A) segregated with episodic ataxia in a four generation family with eight affected individuals previously mapped to 19p13. In addition, the authors identified the first pathogenic duplication in CACNA1A in an index case with isolated episodic diplopia without ataxia and in a first degree relative with episodic ataxia. Large scale deletions and duplications can cause CACNA1A associated channelopathies. Direct DNA sequencing alone is not sufficient as a diagnostic screening test.

  2. Characterization of a Complex Chromosomal Rearrangement Involving a de novo Duplication of 9p and 9q and a Deletion of 9q. (United States)

    Martín-De Saro, Mónica D; Valdés-Miranda, Juan M; Plaza-Benhumea, Lautaro; Pérez-Cabrera, Adrián; Gonzalez-Huerta, Luz M; Guevara-Yañez, Roberto; Cuevas-Covarrubias, Sergio A


    Rearrangements of the distal region of 9p are important chromosome imbalances in human beings. Trisomy 9p is the fourth most frequent chromosome anomaly and is a clinically recognizable syndrome. Kleefstra syndrome, previously named 9q subtelomeric deletion syndrome, is either caused by a submicroscopic deletion in 9q34.3 or an intragenic mutation of EHMT1. We report a Mexican male patient with abnormal development, dysmorphism, systemic anomalies and a complex chromosomal rearrangement (CCR). GTG-banding revealed a 46,XY,add(9)(q34.3) karyotype, whereas array analysis resulted in arr[hg19] 9p24.3p23(203,861-11,842,172)×3, 9q34.3(138,959,881-139,753,294)×3, 9q34.3(139,784,913-141,020,389)×1. Array and karyotype analyses were normal in both parents. Partial duplication of 9p is one of the most commonly detected autosomal structural abnormalities in liveborn infants. A microdeletion in 9q34.3 corresponds to Kleefstra syndrome, whereas a microduplication in 9q34.3 shows a great clinical variability. Here, we present a CCR in a patient with multiple congenital anomalies who represents the first case with partial 9p trisomy, partial 9q trisomy and partial 9q monosomy. © 2016 S. Karger AG, Basel.

  3. The evolutionary fate of alternatively spliced homologous exons after gene duplication. (United States)

    Abascal, Federico; Tress, Michael L; Valencia, Alfonso


    Alternative splicing and gene duplication are the two main processes responsible for expanding protein functional diversity. Although gene duplication can generate new genes and alternative splicing can introduce variation through alternative gene products, the interplay between the two processes is complex and poorly understood. Here, we have carried out a study of the evolution of alternatively spliced exons after gene duplication to better understand the interaction between the two processes. We created a manually curated set of 97 human genes with mutually exclusively spliced homologous exons and analyzed the evolution of these exons across five distantly related vertebrates (lamprey, spotted gar, zebrafish, fugu, and coelacanth). Most of these exons had an ancient origin (more than 400 Ma). We found examples supporting two extreme evolutionary models for the behaviour of homologous axons after gene duplication. We observed 11 events in which gene duplication was accompanied by splice isoform separation, that is, each paralog specifically conserved just one distinct ancestral homologous exon. At other extreme, we identified genes in which the homologous exons were always conserved within paralogs, suggesting that the alternative splicing event cannot easily be separated from the function in these genes. That many homologous exons fall in between these two extremes highlights the diversity of biological systems and suggests that the subtle balance between alternative splicing and gene duplication is adjusted to the specific cellular context of each gene. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  4. Gene duplication and divergence affecting drug content in Cannabis sativa. (United States)

    Weiblen, George D; Wenger, Jonathan P; Craft, Kathleen J; ElSohly, Mahmoud A; Mehmedic, Zlatko; Treiber, Erin L; Marks, M David


    Cannabis sativa is an economically important source of durable fibers, nutritious seeds, and psychoactive drugs but few economic plants are so poorly understood genetically. Marijuana and hemp were crossed to evaluate competing models of cannabinoid inheritance and to explain the predominance of tetrahydrocannabinolic acid (THCA) in marijuana compared with cannabidiolic acid (CBDA) in hemp. Individuals in the resulting F2 population were assessed for differential expression of cannabinoid synthase genes and were used in linkage mapping. Genetic markers associated with divergent cannabinoid phenotypes were identified. Although phenotypic segregation and a major quantitative trait locus (QTL) for the THCA/CBDA ratio were consistent with a simple model of codominant alleles at a single locus, the diversity of THCA and CBDA synthase sequences observed in the mapping population, the position of enzyme coding loci on the map, and patterns of expression suggest multiple linked loci. Phylogenetic analysis further suggests a history of duplication and divergence affecting drug content. Marijuana is distinguished from hemp by a nonfunctional CBDA synthase that appears to have been positively selected to enhance psychoactivity. An unlinked QTL for cannabinoid quantity may also have played a role in the recent escalation of drug potency.

  5. Gene duplications and losses among vertebrate deoxyribonucleoside kinases of the non-TK1 Family

    DEFF Research Database (Denmark)

    Mutahir, Zeeshan; Christiansen, Louise Slot; Clausen, Anders R.;


    , among vertebrates only four mammalian dNKs have been studied for their substrate specificity and kinetic properties. However, some vertebrates, such as fish, frogs, and birds, apparently possess a duplicated homolog of deoxycytidine kinase (dCK). In this study, we characterized a family of d......CK/deoxyguanosine kinase (dGK)-like enzymes from a frog Xenopus laevis and a bird Gallus gallus. We showed that X. laevis has a duplicated dCK gene and a dGK gene, whereas G. gallus has a duplicated dCK gene but has lost the dGK gene. We cloned, expressed, purified, and subsequently determined the kinetic parameters...

  6. The roles of whole-genome and small-scale duplications in the functional specialization of Saccharomyces cerevisiae genes.

    Directory of Open Access Journals (Sweden)

    Mario A Fares

    Full Text Available Researchers have long been enthralled with the idea that gene duplication can generate novel functions, crediting this process with great evolutionary importance. Empirical data shows that whole-genome duplications (WGDs are more likely to be retained than small-scale duplications (SSDs, though their relative contribution to the functional fate of duplicates remains unexplored. Using the map of genetic interactions and the re-sequencing of 27 Saccharomyces cerevisiae genomes evolving for 2,200 generations we show that SSD-duplicates lead to neo-functionalization while WGD-duplicates partition ancestral functions. This conclusion is supported by: (a SSD-duplicates establish more genetic interactions than singletons and WGD-duplicates; (b SSD-duplicates copies share more interaction-partners than WGD-duplicates copies; (c WGD-duplicates interaction partners are more functionally related than SSD-duplicates partners; (d SSD-duplicates gene copies are more functionally divergent from one another, while keeping more overlapping functions, and diverge in their sub-cellular locations more than WGD-duplicates copies; and (e SSD-duplicates complement their functions to a greater extent than WGD-duplicates. We propose a novel model that uncovers the complexity of evolution after gene duplication.

  7. The Roles of Whole-Genome and Small-Scale Duplications in the Functional Specialization of Saccharomyces cerevisiae Genes (United States)

    Fares, Mario A.; Keane, Orla M.; Toft, Christina; Carretero-Paulet, Lorenzo; Jones, Gary W.


    Researchers have long been enthralled with the idea that gene duplication can generate novel functions, crediting this process with great evolutionary importance. Empirical data shows that whole-genome duplications (WGDs) are more likely to be retained than small-scale duplications (SSDs), though their relative contribution to the functional fate of duplicates remains unexplored. Using the map of genetic interactions and the re-sequencing of 27 Saccharomyces cerevisiae genomes evolving for 2,200 generations we show that SSD-duplicates lead to neo-functionalization while WGD-duplicates partition ancestral functions. This conclusion is supported by: (a) SSD-duplicates establish more genetic interactions than singletons and WGD-duplicates; (b) SSD-duplicates copies share more interaction-partners than WGD-duplicates copies; (c) WGD-duplicates interaction partners are more functionally related than SSD-duplicates partners; (d) SSD-duplicates gene copies are more functionally divergent from one another, while keeping more overlapping functions, and diverge in their sub-cellular locations more than WGD-duplicates copies; and (e) SSD-duplicates complement their functions to a greater extent than WGD–duplicates. We propose a novel model that uncovers the complexity of evolution after gene duplication. PMID:23300483

  8. Functionally recurrent rearrangements of the MAST kinase and Notch gene families in breast cancer. (United States)

    Robinson, Dan R; Kalyana-Sundaram, Shanker; Wu, Yi-Mi; Shankar, Sunita; Cao, Xuhong; Ateeq, Bushra; Asangani, Irfan A; Iyer, Matthew; Maher, Christopher A; Grasso, Catherine S; Lonigro, Robert J; Quist, Michael; Siddiqui, Javed; Mehra, Rohit; Jing, Xiaojun; Giordano, Thomas J; Sabel, Michael S; Kleer, Celina G; Palanisamy, Nallasivam; Natrajan, Rachael; Lambros, Maryou B; Reis-Filho, Jorge S; Kumar-Sinha, Chandan; Chinnaiyan, Arul M


    Breast cancer is a heterogeneous disease that has a wide range of molecular aberrations and clinical outcomes. Here we used paired-end transcriptome sequencing to explore the landscape of gene fusions in a panel of breast cancer cell lines and tissues. We observed that individual breast cancers have a variety of expressed gene fusions. We identified two classes of recurrent gene rearrangements involving genes encoding microtubule-associated serine-threonine kinase (MAST) and members of the Notch family. Both MAST and Notch-family gene fusions have substantial phenotypic effects in breast epithelial cells. Breast cancer cell lines harboring Notch gene rearrangements are uniquely sensitive to inhibition of Notch signaling, and overexpression of MAST1 or MAST2 gene fusions has a proliferative effect both in vitro and in vivo. These findings show that recurrent gene rearrangements have key roles in subsets of carcinomas and suggest that transcriptome sequencing could identify individuals with rare, targetable gene fusions.

  9. Tandem gene arrays in Trypanosoma brucei: Comparative phylogenomic analysis of duplicate sequence variation

    Directory of Open Access Journals (Sweden)

    Jackson Andrew P


    Full Text Available Abstract Background The genome sequence of the protistan parasite Trypanosoma brucei contains many tandem gene arrays. Gene duplicates are created through tandem duplication and are expressed through polycistronic transcription, suggesting that the primary purpose of long, tandem arrays is to increase gene dosage in an environment where individual gene promoters are absent. This report presents the first account of the tandem gene arrays in the T. brucei genome, employing several related genome sequences to establish how variation is created and removed. Results A systematic survey of tandem gene arrays showed that substantial sequence variation existed across the genome; variation from different regions of an array often produced inconsistent phylogenetic affinities. Phylogenetic relationships of gene duplicates were consistent with concerted evolution being a widespread homogenising force. However, tandem duplicates were not usually identical; therefore, any homogenising effect was coincident with divergence among duplicates. Allelic gene conversion was detected using various criteria and was apparently able to both remove and introduce sequence variation. Tandem arrays containing structural heterogeneity demonstrated how sequence homogenisation and differentiation can occur within a single locus. Conclusion The use of multiple genome sequences in a comparative analysis of tandem gene arrays identified substantial sequence variation among gene duplicates. The distribution of sequence variation is determined by a dynamic balance of conservative and innovative evolutionary forces. Gene trees from various species showed that intraspecific duplicates evolve in concert, perhaps through frequent gene conversion, although this does not prevent sequence divergence, especially where structural heterogeneity physically separates a duplicate from its neighbours. In describing dynamics of sequence variation that have consequences beyond gene dosage, this

  10. Characterization of a complex rearrangement involving duplication and deletion of 9p in an infant with craniofacial dysmorphism and cardiac anomalies

    Directory of Open Access Journals (Sweden)

    Di Bartolo Daniel L


    Full Text Available Abstract Partial duplication and partial deletion of the short arm of chromosome 9 have each been reported in the literature as clinically recognizable syndromes. We present clinical, cytogenetic, and molecular findings on a five-week-old female infant with concomitant duplication and terminal deletion of the short arm of chromosome 9. To our knowledge ten such cases have previously been reported. Conventional cytogenetic analysis identified additional material on chromosome 9 at band p23. FISH analysis aided in determining the additional material consisted of an inverted duplication with a terminal deletion of the short arm. Microarray analysis confirmed this interpretation and further characterized the abnormality as a duplication of about 32.7 Mb, from 9p23 to 9p11.2, and a terminal deletion of about 11.5 Mb, from 9p24.3 to 9p23. The infant displayed characteristic features of Duplication 9p Syndrome (hypotonia, bulbous nose, single transverse palmar crease, cranial anomalies, as well as features associated with Deletion 9p Syndrome (flat nasal bridge, long philtrum, cardiac anomalies despite the deletion being distal to the reported critical region for this syndrome. This case suggests that there are genes or regulatory elements that lie outside of the reported critical region responsible for certain phenotypic features associated with Deletion 9p Syndrome. It also underscores the importance of utilizing array technology to precisely define abnormalities involving the short arm of 9p in order to further refine genotype/phenotype associations and to identify additional cases of duplication/deletion.

  11. Effect of Incomplete Lineage Sorting On Tree-Reconciliation-Based Inference of Gene Duplication. (United States)

    Zheng, Yu; Zhang, Louxin


    In the tree reconciliation approach to infer the duplication history of a gene family, the gene (family) tree is compared to the corresponding species tree. Incomplete lineage sorting (ILS) gives rise to stochastic variation in the topology of a gene tree and hence likely introduces false duplication events when a tree reconciliation method is used. We quantify the effect of ILS on gene duplication inference in a species tree in terms of the expected number of false duplication events inferred from reconciling a random gene tree, which occurs with a probability predicted in coalescent theory, and the species tree. We computationally examine the relationship between the effect of ILS on duplication inference in a species tree and its topological parameters. Our findings suggest that ILS may cause non-negligible bias on duplication inference, particularly on an asymmetric species tree. Hence, when gene duplication is inferred via tree reconciliation or any other approach that takes gene tree topology into account, the ILS-induced bias should be examined cautiously.

  12. Pinda: a web service for detection and analysis of intraspecies gene duplication events. (United States)

    Kontopoulos, Dimitrios-Georgios; Glykos, Nicholas M


    We present Pinda, a Web service for the detection and analysis of possible duplications of a given protein or DNA sequence within a source species. Pinda fully automates the whole gene duplication detection procedure, from performing the initial similarity searches, to generating the multiple sequence alignments and the corresponding phylogenetic trees, to bootstrapping the trees and producing a Z-score-based list of duplication candidates for the input sequence. Pinda has been cross-validated using an extensive set of known and bibliographically characterized duplication events. The service facilitates the automatic and dependable identification of gene duplication events, using some of the most successful bioinformatics software to perform an extensive analysis protocol. Pinda will prove of use for the analysis of newly discovered genes and proteins, thus also assisting the study of recently sequenced genomes. The service's location is The source code is freely available via

  13. Maternal acute lymphoctic leukemia with rearrangement of the mixed lineage leukemia gene occurring during pregnancy. (United States)

    Aljurf, Mahmoud; Nassar, Amr; Saleh, Abu J; Almhareb, Fahed; Alzahrani, Hazzaa; Walter, Claudia; Bakr, Mohammad; Ahmed, Syed Osman; Chaudhri, Naeem


    Acute lymphoblastic leukemia (ALL) is a relatively rare disease during pregnancy, accounting for about 15% of all cases of pregnancy-associated leukemia. Although mixed lineage leukemia gene (MLL) rearrangement is the dominant genetic aberration in infantile acute leukemia, the occurrence of MLL gene rearrangement in maternal ALL occurring during pregnancy has not been reported. Out of 31 cases of maternal leukemia diagnosed during pregnancy at our institution, 5 were ALL cases. Three of the 5 patients had MLL gene rearrangement. The data for these 5 patients are presented in this report. We believe that the association of MLL gene rearrangement with maternal leukemia is biologically plausible and this observation needs to be validated in a larger cohort of pregnancy-associated maternal leukemia cases.

  14. The rearrangement of the human alpha(1)-acid glycoprotein/orosomucoid gene: evidence for tandemly triplicated genes consisting of two AGP1 and one AGP2. (United States)

    Nakamura, H; Yuasa, I; Umetsu, K; Nakagawa, M; Nanba, E; Kimura, K


    The human alpha(1)-acid glycoprotein (AGP) or orosomucoid (ORM) is controlled by the two tandemly arranged genes, AGP1 and AGP2. The further duplication of the AGP1 gene has been suggested by a few duplicated ORM1 locus haplotypes including ORM1*F1. S and ORM1*B9. S, detected by isoelectric focusing. To clarify the triplication of the AGP gene, 39 DNA samples from Japanese subjects were studied by the long-range PCR of intergenic regions. The analysis of PCR products showed that the tandemly triplicated genes, AGP1A-AGP1B-AGP2, occurred on about 20% of chromosomes. These composites were divided into ORM1A*F1-ORM1B*S-ORM2*M and ORM1A*B9-ORM1B*S-ORM2*M by allelic variations. Furthermore, the former was classified into a few haplotypes by three synonymous sequence variations, which might have arisen through gene conversion-like events. The recombination breakpoints existed between the 5' flanking region and intron 2 of the AGP1B gene. Thus, it is likely that the rearrangement of the AGP gene has often occurred.

  15. Comparative study of human mitochondrial proteome reveals extensive protein subcellular relocalization after gene duplications

    Directory of Open Access Journals (Sweden)

    Huang Yong


    Full Text Available Abstract Background Gene and genome duplication is the principle creative force in evolution. Recently, protein subcellular relocalization, or neolocalization was proposed as one of the mechanisms responsible for the retention of duplicated genes. This hypothesis received support from the analysis of yeast genomes, but has not been tested thoroughly on animal genomes. In order to evaluate the importance of subcellular relocalizations for retention of duplicated genes in animal genomes, we systematically analyzed nuclear encoded mitochondrial proteins in the human genome by reconstructing phylogenies of mitochondrial multigene families. Results The 456 human mitochondrial proteins selected for this study were clustered into 305 gene families including 92 multigene families. Among the multigene families, 59 (64% consisted of both mitochondrial and cytosolic (non-mitochondrial proteins (mt-cy families while the remaining 33 (36% were composed of mitochondrial proteins (mt-mt families. Phylogenetic analyses of mt-cy families revealed three different scenarios of their neolocalization following gene duplication: 1 relocalization from mitochondria to cytosol, 2 from cytosol to mitochondria and 3 multiple subcellular relocalizations. The neolocalizations were most commonly enabled by the gain or loss of N-terminal mitochondrial targeting signals. The majority of detected subcellular relocalization events occurred early in animal evolution, preceding the evolution of tetrapods. Mt-mt protein families showed a somewhat different pattern, where gene duplication occurred more evenly in time. However, for both types of protein families, most duplication events appear to roughly coincide with two rounds of genome duplications early in vertebrate evolution. Finally, we evaluated the effects of inaccurate and incomplete annotation of mitochondrial proteins and found that our conclusion of the importance of subcellular relocalization after gene duplication on

  16. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution. (United States)

    Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A


    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  17. Extensive local gene duplication and functional divergence among paralogs in Atlantic salmon. (United States)

    Warren, Ian A; Ciborowski, Kate L; Casadei, Elisa; Hazlerigg, David G; Martin, Sam; Jordan, William C; Sumner, Seirian


    Many organisms can generate alternative phenotypes from the same genome, enabling individuals to exploit diverse and variable environments. A prevailing hypothesis is that such adaptation has been favored by gene duplication events, which generate redundant genomic material that may evolve divergent functions. Vertebrate examples of recent whole-genome duplications are sparse although one example is the salmonids, which have undergone a whole-genome duplication event within the last 100 Myr. The life-cycle of the Atlantic salmon, Salmo salar, depends on the ability to produce alternating phenotypes from the same genome, to facilitate migration and maintain its anadromous life history. Here, we investigate the hypothesis that genome-wide and local gene duplication events have contributed to the salmonid adaptation. We used high-throughput sequencing to characterize the transcriptomes of three key organs involved in regulating migration in S. salar: Brain, pituitary, and olfactory epithelium. We identified over 10,000 undescribed S. salar sequences and designed an analytic workflow to distinguish between paralogs originating from local gene duplication events or from whole-genome duplication events. These data reveal that substantial local gene duplications took place shortly after the whole-genome duplication event. Many of the identified paralog pairs have either diverged in function or become noncoding. Future functional genomics studies will reveal to what extent this rich source of divergence in genetic sequence is likely to have facilitated the evolution of extreme phenotypic plasticity required for an anadromous life-cycle.

  18. Original tandem duplication in FXIIIA gene with splicing site modification and four amino acids insertion causes factor XIII deficiency. (United States)

    Louhichi, Nacim; Haj Salem, Ikhlass; Medhaffar, Moez; Miled, Nabil; Hadji, Ahmad F; Keskes, Leila; Fakhfakh, Faiza


    : Recessive mutations of F13A gene are reported to be responsible of FXIIIA subunit deficiency (FXIIIA). In all, some intronic nucleotide changes identified in this gene were investigated by in-silico analysis and occasionally supported by experimental data or reported in some cases as a polymorphism. To determine the molecular defects responsible of congenital factor XIII deficiency in Libyan patient, molecular analysis was performed by direct DNA sequencing of the coding regions and splice junctions of the FXIIIA subunit gene (F13A). A splicing minigene assay was used to study the effect of this mutation. Bioinformatics exploration was fulfilled to conceive consequences on protein. A 12-bp duplication straddling the border of intron 9 and exon 10 leads to two 3' acceptor splice sites, resulting in silencing of the downstream wild 3' splice site. It caused an in-frame insertion of 12 nucleotides into mRNA and four amino acids into protein. Bioinformatic analysis predicts that the insertion of four amino acids affects the site 3 of calcium binding site, which disturbs the smooth function of the FXIIIA peptide causing the factor XIII deficiency. This study showed that a small duplication seems to weaken the original 3' splice site and enhance the activation of a new splice site responsible for an alternative splicing. It would be interesting to examine the underlying molecular mechanism involved in this rearrangement.

  19. Immunoglobulin gene rearrangements and the pathogenesis of multiple myeloma

    NARCIS (Netherlands)

    Gonzalez, David; van der Burg, Mirjam; Garcia-Sanz, Ramon; Fenton, James A.; Langerak, Anton W.; Gonzalez, Marcos; van Dongen, Jacques J. M.; Miguel, Jesus F. San; Morgan, Gareth J.


    The ability to rearrange the germ-line DNA to generate antibody diversity is an essential prerequisite for the production of a functional repertoire. While this is essential to prevent infections, it also represents the "Achilles heal" of the B-cell lineage, occasionally leading to malignant transfo

  20. Distinct Defects in Spine Formation or Pruning in Two Gene Duplication Mouse Models of Autism. (United States)

    Wang, Miao; Li, Huiping; Takumi, Toru; Qiu, Zilong; Xu, Xiu; Yu, Xiang; Bian, Wen-Jie


    Autism spectrum disorder (ASD) encompasses a complex set of developmental neurological disorders, characterized by deficits in social communication and excessive repetitive behaviors. In recent years, ASD is increasingly being considered as a disease of the synapse. One main type of genetic aberration leading to ASD is gene duplication, and several mouse models have been generated mimicking these mutations. Here, we studied the effects of MECP2 duplication and human chromosome 15q11-13 duplication on synaptic development and neural circuit wiring in the mouse sensory cortices. We showed that mice carrying MECP2 duplication had specific defects in spine pruning, while the 15q11-13 duplication mouse model had impaired spine formation. Our results demonstrate that spine pathology varies significantly between autism models and that distinct aspects of neural circuit development may be targeted in different ASD mutations. Our results further underscore the importance of gene dosage in normal development and function of the brain.

  1. Combined mutation and rearrangement screening by quantitative PCR high-resolution melting: is it relevant for hereditary recurrent Fever genes?

    Directory of Open Access Journals (Sweden)

    Nathalie Pallares-Ruiz

    Full Text Available The recent identification of genes implicated in hereditary recurrent fevers has allowed their specific diagnosis. So far however, only punctual mutations have been identified and a significant number of patients remain with no genetic confirmation of their disease after routine molecular approaches such as sequencing. The possible involvement of sequence rearrangements in these patients has only been examined in familial Mediterranean fever and was found to be unlikely. To assess the existence of larger genetic alterations in 3 other concerned genes, MVK (Mevalonate kinase, NLRP3 (Nod like receptor family, pyrin domain containing 3 and TNFRSF1A (TNF receptor superfamily 1A, we adapted the qPCR-HRM method to study possible intragenic deletions and duplications. This single-tube approach, combining both qualitative (mutations and quantitative (rearrangement screening, has proven effective in Lynch syndrome diagnosis. Using this approach, we studied 113 unselected (prospective group and 88 selected (retrospective group patients and identified no intragenic rearrangements in the 3 genes. Only qualitative alterations were found with a sensitivity similar to that obtained using classical molecular techniques for screening punctual mutations. Our results support that deleterious copy number alterations in MVK, NLRP3 and TNFRSF1A are rare or absent from the mutational spectrum of hereditary recurrent fevers, and demonstrate that a routine combined method such as qPCR-HRM provides no further help in genetic diagnosis. However, quantitative approaches such as qPCR or SQF-PCR did prove to be quick and effective and could still be useful after non contributory punctual mutation screening in the presence of clinically evocative signs.

  2. Gene duplication and divergence of long wavelength-sensitive opsin genes in the guppy, Poecilia reticulata. (United States)

    Watson, Corey T; Gray, Suzanne M; Hoffmann, Margarete; Lubieniecki, Krzysztof P; Joy, Jeffrey B; Sandkam, Ben A; Weigel, Detlef; Loew, Ellis; Dreyer, Christine; Davidson, William S; Breden, Felix


    Female preference for male orange coloration in the genus Poecilia suggests a role for duplicated long wavelength-sensitive (LWS) opsin genes in facilitating behaviors related to mate choice in these species. Previous work has shown that LWS gene duplication in this genus has resulted in expansion of long wavelength visual capacity as determined by microspectrophotometry (MSP). However, the relationship between LWS genomic repertoires and expression of LWS retinal cone classes within a given species is unclear. Our previous study in the related species, Xiphophorus helleri, was the first characterization of the complete LWS opsin genomic repertoire in conjunction with MSP expression data in the family Poeciliidae, and revealed the presence of four LWS loci and two distinct LWS cone classes. In this study we characterized the genomic organization of LWS opsin genes by BAC clone sequencing, and described the full range of cone cell types in the retina of the colorful Cumaná guppy, Poecilia reticulata. In contrast to X. helleri, MSP data from the Cumaná guppy revealed three LWS cone classes. Comparisons of LWS genomic organization described here for Cumaná to that of X. helleri indicate that gene divergence and not duplication was responsible for the evolution of a novel LWS haplotype in the Cumaná guppy. This lineage-specific divergence is likely responsible for a third additional retinal cone class not present in X. helleri, and may have facilitated the strong sexual selection driven by female preference for orange color patterns associated with the genus Poecilia.

  3. Marfan syndrome with a complex chromosomal rearrangement including deletion of the FBN1 gene

    Directory of Open Access Journals (Sweden)

    Colovati Mileny ES


    Full Text Available Abstract Background The majority of Marfan syndrome (MFS cases is caused by mutations in the fibrillin-1 gene (FBN1, mapped to chromosome 15q21.1. Only few reports on deletions including the whole FBN1 gene, detected by molecular cytogenetic techniques, were found in literature. Results We report here on a female patient with clinical symptoms of the MFS spectrum plus craniostenosis, hypothyroidism and intellectual deficiency who presents a 1.9 Mb deletion, including the FBN1 gene and a complex rearrangement with eight breakpoints involving chromosomes 6, 12 and 15. Discussion This is the first report of MFS with a complex chromosome rearrangement involving a deletion of FBN1 and contiguous genes. In addition to the typical clinical findings of the Marfan syndrome due to FBN1 gene haploinsufficiency, the patient presents features which may be due to the other gene deletions and possibly to the complex chromosome rearrangement.

  4. Rates of gene rearrangement and nucleotide substitution are correlated in the mitochondrial genomes of insects. (United States)

    Shao, Renfu; Dowton, Mark; Murrell, Anna; Barker, Stephen C


    A number of studies indicated that lineages of animals with high rates of mitochondrial (mt) gene rearrangement might have high rates of mt nucleotide substitution. We chose the hemipteroid assemblage and the Insecta to test the idea that rates of mt gene rearrangement and mt nucleotide substitution are correlated. For this purpose, we sequenced the mt genome of a lepidopsocid from the Psocoptera, the only order of hemipteroid insects for which an entire mtDNA sequence is not available. The mt genome of this lepidopsocid is circular, 16,924 bp long, and contains 37 genes and a putative control region; seven tRNA genes and a protein-coding gene in this genome have changed positions relative to the ancestral arrangement of mt genes of insects. We then compared the relative rates of nucleotide substitution among species from each of the four orders of hemipteroid insects and among the 20 insects whose mt genomes have been sequenced entirely. All comparisons among the hemipteroid insects showed that species with higher rates of gene rearrangement also had significantly higher rates of nucleotide substitution statistically than did species with lower rates of gene rearrangement. In comparisons among the 20 insects, where the mt genomes of the two species differed by more than five breakpoints, the more rearranged species always had a significantly higher rate of nucleotide substitution than the less rearranged species. However, in comparisons where the mt genomes of two species differed by five or less breakpoints, the more rearranged species did not always have a significantly higher rate of nucleotide substitution than the less rearranged species. We tested the statistical significance of the correlation between the rates of mt gene rearrangement and mt nucleotide substitution with nine pairs of insects that were phylogenetically independent from one another. We found that the correlation was positive and statistically significant (R2 = 0.73, P = 0.01; Rs = 0.67, P

  5. Methods for identifying and mapping recent segmental and gene duplications in eukaryotic genomes. (United States)

    Khaja, Razi; MacDonald, Jeffrey R; Zhang, Junjun; Scherer, Stephen W


    The aim of this chapter is to provide instruction for analyzing and mapping recent segmental and gene duplications in eukaryotic genomes. We describe a bioinformatics-based approach utilizing computational tools to manage eukaryotic genome sequences to characterize and understand the evolutionary fates and trajectories of duplicated genes. An introduction to bioinformatics tools and programs such as BLAST, Perl, BioPerl, and the GFF specification provides the necessary background to complete this analysis for any eukaryotic genome of interest.

  6. X-linked myotubular myopathy due to a complex rearrangement involving a duplication of MTM1 exon 10

    NARCIS (Netherlands)

    Trump, N.; Cullup, T.; Verheij, J. B. G. M.; Manzur, A.; Muntoni, F.; Abbs, S.; Jungbluth, H.


    X-linked myotubular myopathy is a predominantly severe congenital myopathy with central nuclei on muscle biopsy due to mutations in the MTM1 gene encoding myotubularin. We report a boy with typical features of X-linked myotubular myopathy. Sequencing of the MTM1 gene did not reveal any causative mut

  7. Dynamics of gene duplication in the genomes of chlorophyll d-producing cyanobacteria: implications for the ecological niche. (United States)

    Miller, Scott R; Wood, A Michelle; Blankenship, Robert E; Kim, Maria; Ferriera, Steven


    Gene duplication may be an important mechanism for the evolution of new functions and for the adaptive modulation of gene expression via dosage effects. Here, we analyzed the fate of gene duplicates for two strains of a novel group of cyanobacteria (genus Acaryochloris) that produces the far-red light absorbing chlorophyll d as its main photosynthetic pigment. The genomes of both strains contain an unusually high number of gene duplicates for bacteria. As has been observed for eukaryotic genomes, we find that the demography of gene duplicates can be well modeled by a birth-death process. Most duplicated Acaryochloris genes are of comparatively recent origin, are strain-specific, and tend to be located on different genetic elements. Analyses of selection on duplicates of different divergence classes suggest that a minority of paralogs exhibit near neutral evolutionary dynamics immediately following duplication but that most duplicate pairs (including those which have been retained for long periods) are under strong purifying selection against amino acid change. The likelihood of duplicate retention varied among gene functional classes, and the pronounced differences between strains in the pool of retained recent duplicates likely reflects differences in the nutrient status and other characteristics of their respective environments. We conclude that most duplicates are quickly purged from Acaryochloris genomes and that those which are retained likely make important contributions to organism ecology by conferring fitness benefits via gene dosage effects. The mechanism of enhanced duplication may involve homologous recombination between genetic elements mediated by paralogous copies of recA.

  8. Investigation of T-cell receptor-γ gene rearrangement in gastrointestinal lymphomas by PCR-SSCP analysis

    Institute of Scientific and Technical Information of China (English)

    Xi-Qun Han; Li He; Lan-Ying Shong; Hui-Yong Jiang; Mei-Gang Zhu; Tong Zhao


    AIM: To analyze the characterization of T-cell receptor-γ (TCR-γ) gene rearrangement in the gastrointestinal lymphomas and evaluate the value of PCR-SSCP analysis in gastrointestinal lymphomas investigation.METHODS: TCR-γgene rearrangement segments of gastrointestinal lymphomas were cloned and sequenced.Single clone plasmid and mixed clone plsamids were subsequently submitted to PCR-SSCP analysis to investigate the relationship between the number of amplified clones and band patterns of the amplified products. The PCR products of TCR-γgene rearrangement of 40 gastrointestinal lymphomas were electrophoresed on agarose gels and the positive cases on agarose gels were studied by SSCP analysis.RESULTS: The sequencing showed that TCR-γ gene rearrangement of the gastrointestinal lymphomas included functional gene and pseudogene with extensive variety in the junctional regions. In SSCP analysis, the number of the single-stranded bands was about two times of the number of amplified clones, and double-stranded band became broad with the increased number of the amplified clones. Thirteen of the 25 B-cell gastrointestinal lymphomas and 14 of the 15 gastrointestinal T-cell lymphomas were positive detected on agarose gel electrophoresis. Of the positive cases detected by SSCP analysis, 3 B-cell lymphomas and 13 T-cell lymphomas showed positive bands. The other cases showed only smears. The rearranged pattern included 13 monoallelic gene rearrangements and 3 biallelic or oligoclonal gene rearrangements.CONCLUSION: The pattern of TCR-γ, gene rearrangement in gastrointestinal lymphomas are similar to that of the nodular lymphomas. PCR-SSCP analysis for TCR-γ gene rearrangement can be applied both for adjuvant diagnosis of gastrointestinal lymphomas and analysis of the gene rearrangement pattern. The ratio of TCR-γ gene rearrangements occurred in T-cell gastrointestinal lymphomas is significantly higher than that in B-cell gastrointestinal lymphomas. The gene rearrangement

  9. Multiplex ligation-dependent probe amplification for rapid detection of deletions and duplications in the dystrophin gene

    Institute of Scientific and Technical Information of China (English)


    Objective:Duchenne muscular dystrophy (DMD) and Becker muscular dystrophy (BMD) are X-linked disorders caused by mutations in the dystrophin gene. The majority of recognized mutations are copy number changes of individual exons. The objective of the present study was to assess the multiplex ligation-dependent probe amplification (MLPA) effects of detection of gene mutations. Methods: Samples of 20 control males and 80 males and their mothers referred to our diagnostic facility on the clinical suspicion of DMD or BMD were tested by MLPA and multiplex PCR. Results: The mean DQs for all peak of 20 control male samples was 1.02 (range from 0.83 to 1.21) by MLPA. Deletions or duplications were identified in 6 out of 31 families that had been previously tested as negative by multiplex PCR. One case of complex rearrangement involving a duplication of two regions: dupEX3-9 and dupEX 17-41 were found by MLPA. Conclusions: MLPA is a highly sensitive method and rapid alternative to multiplex PCR for detection of DMD and BMD.

  10. Deletion of the immunoglobulin kappa chain intron enhancer abolishes kappa chain gene rearrangement in cis but not lambda chain gene rearrangement in trans.


    Takeda, S; Zou, Y R; Bluethmann, H; Kitamura, D; Muller, U.; Rajewsky, K


    Immunoglobulins (Ig) secreted from a plasma cell contain either kappa or lambda light chains, but not both. This phenomenon is termed isotypic kappa-lambda exclusion. While kappa-producing cells have their lambda chain genes in germline configuration, in most lambda-producing cells the kappa chain genes are either non-productively rearranged or deleted. To investigate the molecular mechanism for isotypic kappa-lambda exclusion, in particular the role of the Ig kappa intron enhancer, we replac...

  11. Divergence of recently duplicated M{gamma}-type MADS-box genes in Petunia. (United States)

    Bemer, Marian; Gordon, Jonathan; Weterings, Koen; Angenent, Gerco C


    The MADS-box transcription factor family has expanded considerably in plants via gene and genome duplications and can be subdivided into type I and MIKC-type genes. The two gene classes show a different evolutionary history. Whereas the MIKC-type genes originated during ancient genome duplications, as well as during more recent events, the type I loci appear to experience high turnover with many recent duplications. This different mode of origin also suggests a different fate for the type I duplicates, which are thought to have a higher chance to become silenced or lost from the genome. To get more insight into the evolution of the type I MADS-box genes, we isolated nine type I genes from Petunia, which belong to the Mgamma subclass, and investigated the divergence of their coding and regulatory regions. The isolated genes could be subdivided into two categories: two genes were highly similar to Arabidopsis Mgamma-type genes, whereas the other seven genes showed less similarity to Arabidopsis genes and originated more recently. Two of the recently duplicated genes were found to contain deleterious mutations in their coding regions, and expression analysis revealed that a third paralog was silenced by mutations in its regulatory region. However, in addition to the three genes that were subjected to nonfunctionalization, we also found evidence for neofunctionalization of one of the Petunia Mgamma-type genes. Our study shows a rapid divergence of recently duplicated Mgamma-type MADS-box genes and suggests that redundancy among type I paralogs may be less common than expected.

  12. Restriction and recruitment-gene duplication and the origin and evolution of snake venom toxins. (United States)

    Hargreaves, Adam D; Swain, Martin T; Hegarty, Matthew J; Logan, Darren W; Mulley, John F


    Snake venom has been hypothesized to have originated and diversified through a process that involves duplication of genes encoding body proteins with subsequent recruitment of the copy to the venom gland, where natural selection acts to develop or increase toxicity. However, gene duplication is known to be a rare event in vertebrate genomes, and the recruitment of duplicated genes to a novel expression domain (neofunctionalization) is an even rarer process that requires the evolution of novel combinations of transcription factor binding sites in upstream regulatory regions. Therefore, although this hypothesis concerning the evolution of snake venom is very unlikely and should be regarded with caution, it is nonetheless often assumed to be established fact, hindering research into the true origins of snake venom toxins. To critically evaluate this hypothesis, we have generated transcriptomic data for body tissues and salivary and venom glands from five species of venomous and nonvenomous reptiles. Our comparative transcriptomic analysis of these data reveals that snake venom does not evolve through the hypothesized process of duplication and recruitment of genes encoding body proteins. Indeed, our results show that many proposed venom toxins are in fact expressed in a wide variety of body tissues, including the salivary gland of nonvenomous reptiles and that these genes have therefore been restricted to the venom gland following duplication, not recruited. Thus, snake venom evolves through the duplication and subfunctionalization of genes encoding existing salivary proteins. These results highlight the danger of the elegant and intuitive "just-so story" in evolutionary biology.

  13. Exonic rearrangements in the known Parkinson's disease-causing genes are a rare cause of the disease in South African patients. (United States)

    van der Merwe, Celia; Carr, Jonathan; Glanzmann, Brigitte; Bardien, Soraya


    Parkinson's disease (PD) is a neurodegenerative movement disorder characterized by the loss of dopaminergic neurons in the substantia nigra of the midbrain. To date, a number of PD-causing genes have been found, including SNCA, LRRK2, VPS35, PARK2, PINK1, DJ-1, ATP13A2, and most recently CHCHD2. Mutations in these genes range from point mutations to larger exonic rearrangements including deletions and duplications. This study aimed to detect possible copy number variation (CNV) in the known PD-causing genes in a cohort of South African patients with PD. Multiplex Ligation-dependent Probe Amplification (MLPA) analysis was performed on a total of 210 South African PD patients, and possible CNVs were verified using quantitative real time PCR. No homozygous or compound heterozygous exon rearrangements in the genes analysed were found in the patient group. A heterozygous PARK2 exon 4 deletion was found in a sporadic patient with an age at onset of 51 years. Sanger sequencing did not reveal any additional mutations in PARK2 in this patient. Combining our results with that of previous studies in a South African cohort, the frequency of exonic rearrangements in the known PD-causing genes is only 1.8% (8/439 patients). In conclusion, CNV in the known PD-causing genes are a rare cause of PD in a South African cohort, and there may be as yet unknown genetic causes of PD that are specific to patients of African ethnicity.

  14. Subfunctionalization reduces the fitness cost of gene duplication in humans by buffering dosage imbalances

    Directory of Open Access Journals (Sweden)

    Fernández Ariel


    Full Text Available Abstract Background Driven essentially by random genetic drift, subfunctionalization has been identified as a possible non-adaptive mechanism for the retention of duplicate genes in small-population species, where widespread deleterious mutations are likely to cause complementary loss of subfunctions across gene copies. Through subfunctionalization, duplicates become indispensable to maintain the functional requirements of the ancestral locus. Yet, gene duplication produces a dosage imbalance in the encoded proteins and thus, as investigated in this paper, subfunctionalization must be subject to the selective forces arising from the fitness bottleneck introduced by the duplication event. Results We show that, while arising from random drift, subfunctionalization must be inescapably subject to selective forces, since the diversification of expression patterns across paralogs mitigates duplication-related dosage imbalances in the concentrations of encoded proteins. Dosage imbalance effects become paramount when proteins rely on obligatory associations to maintain their structural integrity, and are expected to be weaker when protein complexation is ephemeral or adventitious. To establish the buffering effect of subfunctionalization on selection pressure, we determine the packing quality of encoded proteins, an established indicator of dosage sensitivity, and correlate this parameter with the extent of paralog segregation in humans, using species with larger population -and more efficient selection- as controls. Conclusions Recognizing the role of subfunctionalization as a dosage-imbalance buffer in gene duplication events enabled us to reconcile its mechanistic nonadaptive origin with its adaptive role as an enabler of the evolution of genetic redundancy. This constructive role was established in this paper by proving the following assertion: If subfunctionalization is indeed adaptive, its effect on paralog segregation should scale with the dosage

  15. Detecting functional divergence after gene duplication through evolutionary changes in posttranslational regulatory sequences. (United States)

    Nguyen Ba, Alex N; Strome, Bob; Hua, Jun Jie; Desmond, Jonathan; Gagnon-Arsenault, Isabelle; Weiss, Eric L; Landry, Christian R; Moses, Alan M


    Gene duplication is an important evolutionary mechanism that can result in functional divergence in paralogs due to neo-functionalization or sub-functionalization. Consistent with functional divergence after gene duplication, recent studies have shown accelerated evolution in retained paralogs. However, little is known in general about the impact of this accelerated evolution on the molecular functions of retained paralogs. For example, do new functions typically involve changes in enzymatic activities, or changes in protein regulation? Here we study the evolution of posttranslational regulation by examining the evolution of important regulatory sequences (short linear motifs) in retained duplicates created by the whole-genome duplication in budding yeast. To do so, we identified short linear motifs whose evolutionary constraint has relaxed after gene duplication with a likelihood-ratio test that can account for heterogeneity in the evolutionary process by using a non-central chi-squared null distribution. We find that short linear motifs are more likely to show changes in evolutionary constraints in retained duplicates compared to single-copy genes. We examine changes in constraints on known regulatory sequences and show that for the Rck1/Rck2, Fkh1/Fkh2, Ace2/Swi5 paralogs, they are associated with previously characterized differences in posttranslational regulation. Finally, we experimentally confirm our prediction that for the Ace2/Swi5 paralogs, Cbk1 regulated localization was lost along the lineage leading to SWI5 after gene duplication. Our analysis suggests that changes in posttranslational regulation mediated by short regulatory motifs systematically contribute to functional divergence after gene duplication.

  16. Temporal pattern of loss/persistence of duplicate genes involved in signal transduction and metabolic pathways after teleost-specific genome duplication

    Directory of Open Access Journals (Sweden)

    Sato Yukuto


    Full Text Available Abstract Background Recent genomic studies have revealed a teleost-specific third-round whole genome duplication (3R-WGD event occurred in a common ancestor of teleost fishes. However, it is unclear how the genes duplicated in this event were lost or persisted during the diversification of teleosts, and therefore, how many of the duplicated genes contribute to the genetic differences among teleosts. This subject is also important for understanding the process of vertebrate evolution through WGD events. We applied a comparative evolutionary approach to this question by focusing on the genes involved in long-term potentiation, taste and olfactory transduction, and the tricarboxylic acid cycle, based on the whole genome sequences of four teleosts; zebrafish, medaka, stickleback, and green spotted puffer fish. Results We applied a state-of-the-art method of maximum-likelihood phylogenetic inference and conserved synteny analyses to each of 130 genes involved in the above biological systems of human. These analyses identified 116 orthologous gene groups between teleosts and tetrapods, and 45 pairs of 3R-WGD-derived duplicate genes among them. This suggests that more than half [(45×2/(116+45] = 56.5% of the loci, probably more than ten thousand genes, present in a common ancestor of the four teleosts were still duplicated after the 3R-WGD. The estimated temporal pattern of gene loss suggested that, after the 3R-WGD, many (71/116 of the duplicated genes were rapidly lost during the initial 75 million years (MY, whereas on average more than half (27.3/45 of the duplicated genes remaining in the ancestor of the four teleosts (45/116 have persisted for about 275 MY. The 3R-WGD-derived duplicates that have persisted for a long evolutionary periods of time had significantly larger number of interacting partners and longer length of protein coding sequence, implying that they tend to be more multifunctional than the singletons after the 3R-WGD. Conclusion

  17. Whole-gene positive selection, elevated synonymous substitution rates, duplication, and indel evolution of the chloroplast clpP1 gene.

    Directory of Open Access Journals (Sweden)

    Per Erixon

    Full Text Available BACKGROUND: Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. METHODOLOGY/PRINCIPLE FINDINGS: We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family. Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. CONCLUSIONS/SIGNIFICANCE: We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the

  18. A young Drosophila duplicate gene plays essential roles in spermatogenesis by regulating several Y-linked male fertility genes.

    Directory of Open Access Journals (Sweden)

    Yun Ding

    Full Text Available Gene duplication is supposed to be the major source for genetic innovations. However, how a new duplicate gene acquires functions by integrating into a pathway and results in adaptively important phenotypes has remained largely unknown. Here, we investigated the biological roles and the underlying molecular mechanism of the young kep1 gene family in the Drosophila melanogaster species subgroup to understand the origin and evolution of new genes with new functions. Sequence and expression analysis demonstrates that one of the new duplicates, nsr (novel spermatogenesis regulator, exhibits positive selection signals and novel subcellular localization pattern. Targeted mutagenesis and whole-transcriptome sequencing analysis provide evidence that nsr is required for male reproduction associated with sperm individualization, coiling, and structural integrity of the sperm axoneme via regulation of several Y chromosome fertility genes post-transcriptionally. The absence of nsr-like expression pattern and the presence of the corresponding cis-regulatory elements of the parental gene kep1 in the pre-duplication species Drosophila yakuba indicate that kep1 might not be ancestrally required for male functions and that nsr possibly has experienced the neofunctionalization process, facilitated by changes of trans-regulatory repertories. These findings not only present a comprehensive picture about the evolution of a new duplicate gene but also show that recently originated duplicate genes can acquire multiple biological roles and establish novel functional pathways by regulating essential genes.

  19. Complete mitochondrial genomes of two gelechioids, Mesophleps albilinella and Dichomeris ustalella (Lepidoptera: Gelechiidae), with a description of gene rearrangement in Lepidoptera. (United States)

    Park, Jeong Sun; Kim, Min Jee; Jeong, Su Yeon; Kim, Sung Soo; Kim, Iksoo


    We sequenced the entire mitochondrial genome (mitogenome) of two gelechioids, Mesophleps albilinella and Dichomeris ustalella, and compared their genome organization and sequence composition to those of available gelechioid mitogenomes for an enhanced understanding of Gelechioidea genomic characteristics. We compared all available lepidopteran mitogenome arrangements, including that of M. albilinella, which is unique in Gelechioidea, to comprehend the extensiveness and mechanisms of gene rearrangement in Lepidoptera. The genomes of M. albilinella and D. ustalella are 15,274 and 15,410 bp in size, respectively, with the typical sets of mitochondrial (mt) genes. The COI gene begins with CGA (arginine) in all sequenced gelechioids, including M. albilinella and D. ustalella, reinforcing the feature as a synapomorphic trait, at least in the Gelechioidea. Each 353- and 321-bp long A + T-rich region of M. albilinella and D. ustalella contains one (D. ustalella) or two (M. albilinella) tRNA-like structures. The M. albilinella mitogenome has a unique gene arrangement among the Gelechioidea: ARNESF (the underline signifies an inverted gene) at the ND3 and ND5 junction, as opposed to the ARNSEF that is found in ancestral insects. An extensive search of available lepidopteran mitogenomes, including that of M. albilinella, turned up six rearrangements that differ from those of ancestral insects. Most of the rearrangements can be explained by the tandem duplication-random loss model, but inversion, which requires recombination, is also found in two cases, including M. albilinella. Excluding the MIQ rearrangement at the A + T-rich region and ND2 junction, which is found in nearly all Ditrysia, most of the remaining rearrangements found in Lepidoptera appear to be independently derived in that they are automorphic at several taxonomic scales, although current mitogenomic data are limited, particularly for congeneric data.

  20. Trichomonas transmembrane cyclases result from massive gene duplication and concomitant development of pseudogenes.

    Directory of Open Access Journals (Sweden)

    Jike Cui


    Full Text Available Trichomonas vaginalis has an unusually large genome (approximately 160 Mb encoding approximately 60,000 proteins. With the goal of beginning to understand why some Trichomonas genes are present in so many copies, we characterized here a family of approximately 123 Trichomonas genes that encode transmembrane adenylyl cyclases (TMACs.The large family of TMACs genes is the result of recent duplications of a small set of ancestral genes that appear to be unique to trichomonads. Duplicated TMAC genes are not closely associated with repetitive elements, and duplications of flanking sequences are rare. However, there is evidence for TMAC gene replacements by homologous recombination. A high percentage of TMAC genes (approximately 46% are pseudogenes, as they contain stop codons and/or frame shifts, or the genes are truncated. Numerous stop codons present in the genome project G3 strain are not present in orthologous genes of two other Trichomonas strains (S1 and B7RC2. Each TMAC is composed of a series of N-terminal transmembrane helices and a single C-terminal cyclase domain that has adenylyl cyclase activity. Multiple TMAC genes are transcribed by Trichomonas cloned by limiting dilution.We conclude that one reason for the unusually large genome of Trichomonas is the presence of unstable families of genes such as those encoding TMACs that are undergoing massive gene duplication and concomitant development of pseudogenes.

  1. Local synteny and codon usage contribute to asymmetric sequence divergence of Saccharomyces cerevisiae gene duplicates

    Directory of Open Access Journals (Sweden)

    Bergthorsson Ulfar


    Full Text Available Abstract Background Duplicated genes frequently experience asymmetric rates of sequence evolution. Relaxed selective constraints and positive selection have both been invoked to explain the observation that one paralog within a gene-duplicate pair exhibits an accelerated rate of sequence evolution. In the majority of studies where asymmetric divergence has been established, there is no indication as to which gene copy, ancestral or derived, is evolving more rapidly. In this study we investigated the effect of local synteny (gene-neighborhood conservation and codon usage on the sequence evolution of gene duplicates in the S. cerevisiae genome. We further distinguish the gene duplicates into those that originated from a whole-genome duplication (WGD event (ohnologs versus small-scale duplications (SSD to determine if there exist any differences in their patterns of sequence evolution. Results For SSD pairs, the derived copy evolves faster than the ancestral copy. However, there is no relationship between rate asymmetry and synteny conservation (ancestral-like versus derived-like in ohnologs. mRNA abundance and optimal codon usage as measured by the CAI is lower in the derived SSD copies relative to ancestral paralogs. Moreover, in the case of ohnologs, the faster-evolving copy has lower CAI and lowered expression. Conclusions Together, these results suggest that relaxation of selection for codon usage and gene expression contribute to rate asymmetry in the evolution of duplicated genes and that in SSD pairs, the relaxation of selection stems from the loss of ancestral regulatory information in the derived copy.

  2. Partial duplication of the APBA2 gene in chromosome 15q13 corresponds to duplicon structures

    Directory of Open Access Journals (Sweden)

    Kesterson Robert A


    Full Text Available Abstract Background Chromosomal abnormalities affecting human chromosome 15q11-q13 underlie multiple genomic disorders caused by deletion, duplication and triplication of intervals in this region. These events are mediated by highly homologous segments of DNA, or duplicons, that facilitate mispairing and unequal cross-over in meiosis. The gene encoding an amyloid precursor protein-binding protein (APBA2 was previously mapped to the distal portion of the interval commonly deleted in Prader-Willi and Angelman syndromes and duplicated in cases of autism. Results We show that this gene actually maps to a more telomeric location and is partially duplicated within the broader region. Two highly homologous copies of an interval containing a large 5' exon and downstream sequence are located ~5 Mb distal to the intact locus. The duplicated copies, containing the first coding exon of APBA2, can be distinguished by single nucleotide sequence differences and are transcriptionally inactive. Adjacent to APBA2 maps a gene termed KIAA0574. The protein encoded by this gene is weakly homologous to a protein termed X123 that in turn maps adjacent to APBA1 on 9q21.12; APBA1 is highly homologous to APBA2 in the C-terminal region and is distinguished from APBA2 by the N-terminal region encoded by this duplicated exon. Conclusion The duplication of APBA2 sequences in this region adds to a complex picture of different low copy repeats present across this region and elsewhere on the chromosome.

  3. Gene Duplication, Population Genomics, and Species-Level Differentiation within a Tropical Mountain Shrub (United States)

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H.; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C.


    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. PMID:25223767

  4. Evolution dynamics of a model for gene duplication under adaptive conflict (United States)

    Ancliff, Mark; Park, Jeong-Man


    We present and solve the dynamics of a model for gene duplication showing escape from adaptive conflict. We use a Crow-Kimura quasispecies model of evolution where the fitness landscape is a function of Hamming distances from two reference sequences, which are assumed to optimize two different gene functions, to describe the dynamics of a mixed population of individuals with single and double copies of a pleiotropic gene. The evolution equations are solved through a spin coherent state path integral, and we find two phases: one is an escape from an adaptive conflict phase, where each copy of a duplicated gene evolves toward subfunctionalization, and the other is a duplication loss of function phase, where one copy maintains its pleiotropic form and the other copy undergoes neutral mutation. The phase is determined by a competition between the fitness benefits of subfunctionalization and the greater mutational load associated with maintaining two gene copies. In the escape phase, we find a dynamics of an initial population of single gene sequences only which escape adaptive conflict through gene duplication and find that there are two time regimes: until a time t* single gene sequences dominate, and after t* double gene sequences outgrow single gene sequences. The time t* is identified as the time necessary for subfunctionalization to evolve and spread throughout the double gene sequences, and we show that there is an optimum mutation rate which minimizes this time scale.

  5. Prevalence of chromosomal rearrangements involving non-ETS genes in prostate cancer

    DEFF Research Database (Denmark)

    Kluth, Martina; Galal, Rami; Krohn, Antje


    Prostate cancer is characterized by structural rearrangements, most frequently including translocations between androgen-dependent genes and members of the ETS family of transcription factor like TMPRSS2:ERG. In a recent whole genome sequencing study we identified 140 gene fusions that were unrel...

  6. A critical assessment of cross-species detection of gene duplicates using comparative genomic hybridization

    Directory of Open Access Journals (Sweden)

    Renn Suzy CP


    Full Text Available Abstract Background Comparison of genomic DNA among closely related strains or species is a powerful approach for identifying variation in evolutionary processes. One potent source of genomic variation is gene duplication, which is prevalent among individuals and species. Array comparative genomic hybridization (aCGH has been successfully utilized to detect this variation among lineages. Here, beyond the demonstration that gene duplicates among species can be quantified with aCGH, we consider the effect of sequence divergence on the ability to detect gene duplicates. Results Using the X chromosome genomic content difference between male D. melanogaster and female D. yakuba and D. simulans, we describe a decrease in the ability to accurately measure genomic content (copy number for orthologs that are only 90% identical. We demonstrate that genome characteristics (e.g. chromatin environment and non-orthologous sequence similarity can also affect the ability to accurately measure genomic content. We describe a normalization strategy and statistical criteria to be used for the identification of gene duplicates among any species group for which an array platform is available from a closely related species. Conclusions Array CGH can be used to effectively identify gene duplication and genome content; however, certain biases are present due to sequence divergence and other genome characteristics resulting from the divergence between lineages. Highly conserved gene duplicates will be more readily recovered by aCGH. Duplicates that have been retained for a selective advantage due to directional selection acting on many loci in one or both gene copies are likely to be under-represented. The results of this study should inform the interpretation of both previously published and future work that employs this powerful technique.

  7. Segmental duplication as one of the driving forces underlying the diversity of the human immunoglobulin heavy chain variable gene region

    Directory of Open Access Journals (Sweden)

    Gao Richeng


    , the divergence level within the gene segments is significantly different from that in the intergenic regions indicating that these regions may have been subject to different selection pressures and that the IGHV gene segments in this region are functionally important. Conclusions Non-reciprocal genetic rearrangements associated with large duplicate sequence blocks could substantially contribute to the IGHV region diversity. Since the resulting polymorphisms may affect the number, composition and organization of the gene segments in this region, it may have significant impact on the function of the IGHV gene segment repertoire, antibody diversity, and therefore, the immune system. Because one of the gene segments, 3-30 (1.9III, is associated with autoimmune diseases, it could be of diagnostic significance to learn about the variants in the haplotypes by using the multiplex haplotype analysis system used in the present study with DNA sequence tags specific for the variants of all gene segments in this region.

  8. Comparative Evolution of Duplicated Ddx3 Genes in Teleosts: Insights from Japanese Flounder, Paralichthys olivaceus. (United States)

    Wang, Zhongkai; Liu, Wei; Song, Huayu; Wang, Huizhen; Liu, Jinxiang; Zhao, Haitao; Du, Xinxin; Zhang, Quanqi


    Following the two rounds of whole-genome duplication that occurred during deuterostome evolution, a third genome duplication event occurred in the stem lineage of ray-finned fishes. This teleost-specific genome duplication is thought to be responsible for the biological diversification of ray-finned fishes. DEAD-box polypeptide 3 (DDX3) belongs to the DEAD-box RNA helicase family. Although their functions in humans have been well studied, limited information is available regarding their function in teleosts. In this study, two teleost Ddx3 genes were first identified in the transcriptome of Japanese flounder (Paralichthys olivaceus). We confirmed that the two genes originated from teleost-specific genome duplication through synteny and phylogenetic analysis. Additionally, comparative analysis of genome structure, molecular evolution rate, and expression pattern of the two genes in Japanese flounder revealed evidence of subfunctionalization of the duplicated Ddx3 genes in teleosts. Thus, the results of this study reveal novel insights into the evolution of the teleost Ddx3 genes and constitute important groundwork for further research on this gene family.

  9. Differential transcriptional modulation of duplicated fatty acid-binding protein genes by dietary fatty acids in zebrafish (Danio rerio: evidence for subfunctionalization or neofunctionalization of duplicated genes

    Directory of Open Access Journals (Sweden)

    Denovan-Wright Eileen M


    Full Text Available Abstract Background In the Duplication-Degeneration-Complementation (DDC model, subfunctionalization and neofunctionalization have been proposed as important processes driving the retention of duplicated genes in the genome. These processes are thought to occur by gain or loss of regulatory elements in the promoters of duplicated genes. We tested the DDC model by determining the transcriptional induction of fatty acid-binding proteins (Fabps genes by dietary fatty acids (FAs in zebrafish. We chose zebrafish for this study for two reasons: extensive bioinformatics resources are available for zebrafish at and zebrafish contains many duplicated genes owing to a whole genome duplication event that occurred early in the ray-finned fish lineage approximately 230-400 million years ago. Adult zebrafish were fed diets containing either fish oil (12% lipid, rich in highly unsaturated fatty acid, sunflower oil (12% lipid, rich in linoleic acid, linseed oil (12% lipid, rich in linolenic acid, or low fat (4% lipid, low fat diet for 10 weeks. FA profiles and the steady-state levels of fabp mRNA and heterogeneous nuclear RNA in intestine, liver, muscle and brain of zebrafish were determined. Result FA profiles assayed by gas chromatography differed in the intestine, brain, muscle and liver depending on diet. The steady-state level of mRNA for three sets of duplicated genes, fabp1a/fabp1b.1/fabp1b.2, fabp7a/fabp7b, and fabp11a/fabp11b, was determined by reverse transcription, quantitative polymerase chain reaction (RT-qPCR. In brain, the steady-state level of fabp7b mRNAs was induced in fish fed the linoleic acid-rich diet; in intestine, the transcript level of fabp1b.1 and fabp7b were elevated in fish fed the linolenic acid-rich diet; in liver, the level of fabp7a mRNAs was elevated in fish fed the low fat diet; and in muscle, the level of fabp7a and fabp11a mRNAs were elevated in fish fed the linolenic acid-rich or the low fat diets. In all cases

  10. Molecular evolution of the duplicated TFIIAγ genes in Oryzeae and its relatives

    Directory of Open Access Journals (Sweden)

    Sun Hong-Zheng


    Full Text Available Abstract Background Gene duplication provides raw genetic materials for evolutionary novelty and adaptation. The evolutionary fate of duplicated transcription factor genes is less studied although transcription factor gene plays important roles in many biological processes. TFIIAγ is a small subunit of TFIIA that is one of general transcription factors required by RNA polymerase II. Previous studies identified two TFIIAγ-like genes in rice genome and found that these genes either conferred resistance to rice bacterial blight or could be induced by pathogen invasion, raising the question as to their functional divergence and evolutionary fates after gene duplication. Results We reconstructed the evolutionary history of the TFIIAγ genes from main lineages of angiosperms and demonstrated that two TFIIAγ genes (TFIIAγ1 and TFIIAγ5 arose from a whole genome duplication that happened in the common ancestor of grasses. Likelihood-based analyses with branch, codon, and branch-site models showed no evidence of positive selection but a signature of relaxed selective constraint after the TFIIAγ duplication. In particular, we found that the nonsynonymous/synonymous rate ratio (ω = dN/dS of the TFIIAγ1 sequences was two times higher than that of TFIIAγ5 sequences, indicating highly asymmetric rates of protein evolution in rice tribe and its relatives, with an accelerated rate of TFIIAγ1 gene. Our expression data and EST database search further indicated that after whole genome duplication, the expression of TFIIAγ1 gene was significantly reduced while TFIIAγ5 remained constitutively expressed and maintained the ancestral role as a subunit of the TFIIA complex. Conclusion The evolutionary fate of TFIIAγ duplicates is not consistent with the neofunctionalization model that predicts that one of the duplicated genes acquires a new function because of positive Darwinian selection. Instead, we suggest that subfunctionalization might be involved in

  11. Dating and functional characterization of duplicated genes in the apple (Malus domestica Borkh. by analyzing EST data

    Directory of Open Access Journals (Sweden)

    Sanzol Javier


    Full Text Available Abstract Background Gene duplication is central to genome evolution. In plants, genes can be duplicated through small-scale events and large-scale duplications often involving polyploidy. The apple belongs to the subtribe Pyrinae (Rosaceae, a diverse lineage that originated via allopolyploidization. Both small-scale duplications and polyploidy may have been important mechanisms shaping the genome of this species. Results This study evaluates the gene duplication and polyploidy history of the apple by characterizing duplicated genes in this species using EST data. Overall, 68% of the apple genes were clustered into families with a mean copy-number of 4.6. Analysis of the age distribution of gene duplications supported a continuous mode of small-scale duplications, plus two episodes of large-scale duplicates of vastly different ages. The youngest was consistent with the polyploid origin of the Pyrinae 37-48 MYBP, whereas the older may be related to γ-triplication; an ancient hexapolyploidization previously characterized in the four sequenced eurosid genomes and basal to the eurosid-asterid divergence. Duplicated genes were studied for functional diversification with an emphasis on young paralogs; those originated during or after the formation of the Pyrinae lineage. Unequal assignment of single-copy genes and gene families to Gene Ontology categories suggested functional bias in the pattern of gene retention of paralogs. Young paralogs related to signal transduction, metabolism, and energy pathways have been preferentially retained. Non-random retention of duplicated genes seems to have mediated the expansion of gene families, some of which may have substantially increased their members after the origin of the Pyrinae. The joint analysis of over-duplicated functional categories and phylogenies, allowed evaluation of the role of both polyploidy and small-scale duplications during this process. Finally, gene expression analysis indicated that 82

  12. Rapid, Nonradioactive Detection of Clonal T-Cell Receptor Gene Rearrangements in Lymphoid Neoplasms (United States)

    Bourguin, Anne; Tung, Rosann; Galili, Naomi; Sklar, Jeffrey


    Southern blot hybridization analysis of clonal antigen receptor gene rearrangements has proved to be a valuable adjunct to conventional methods for diagnosing lymphoid neoplasia. However, Southern blot analysis suffers from a number of technical disadvantages, including the time necessary to obtain results, the use of radioactivity, and the susceptibility of the method to various artifacts. We have investigated an alternative approach for assessing the clonality of antigen receptor gene rearrangements in lymphoid tissue biopsy specimens. This approach involves the amplification of rearranged γ T-cell receptor genes by the polymerase chain reaction and analysis of the polymerase chain reaction products by denaturing gradient gel electrophoresis. By use of this approach, clonal rearrangements from neoplastic lymphocytes constituting as little as 0.1-1% of the total cells in the tissue are detected as discrete bands in the denaturing gel after the gel is stained with ethidium bromide and viewed under ultraviolet light. In contrast, polyclonal rearrangements from reactive lymphocytes appear as a diffuse smear along the length of the gel. Our findings suggest that polymerase chain reaction combined with denaturing gradient gel electrophoresis may offer a rapid, nonradioactive, and sensitive alternative to Southern blot analysis for the diagnostic evaluation of lymphoid tissue biopsy specimens.

  13. Molecular evolution accompanying functional divergence of duplicated genes along the plant starch biosynthesis pathway. (United States)

    Nougué, Odrade; Corbi, Jonathan; Ball, Steven G; Manicacci, Domenica; Tenaillon, Maud I


    Starch is the main source of carbon storage in the Archaeplastida. The starch biosynthesis pathway (sbp) emerged from cytosolic glycogen metabolism shortly after plastid endosymbiosis and was redirected to the plastid stroma during the green lineage divergence. The SBP is a complex network of genes, most of which are members of large multigene families. While some gene duplications occurred in the Archaeplastida ancestor, most were generated during the sbp redirection process, and the remaining few paralogs were generated through compartmentalization or tissue specialization during the evolution of the land plants. In the present study, we tested models of duplicated gene evolution in order to understand the evolutionary forces that have led to the development of SBP in angiosperms. We combined phylogenetic analyses and tests on the rates of evolution along branches emerging from major duplication events in six gene families encoding sbp enzymes. We found evidence of positive selection along branches following cytosolic or plastidial specialization in two starch phosphorylases and identified numerous residues that exhibited changes in volume, polarity or charge. Starch synthases, branching and debranching enzymes functional specializations were also accompanied by accelerated evolution. However, none of the sites targeted by selection corresponded to known functional domains, catalytic or regulatory. Interestingly, among the 13 duplications tested, 7 exhibited evidence of positive selection in both branches emerging from the duplication, 2 in only one branch, and 4 in none of the branches. The majority of duplications were followed by accelerated evolution targeting specific residues along both branches. This pattern was consistent with the optimization of the two sub-functions originally fulfilled by the ancestral gene before duplication. Our results thereby provide strong support to the so-called "Escape from Adaptive Conflict" (EAC) model. Because none of the

  14. Functional divergence of gene duplicates – a domain-centric view

    Directory of Open Access Journals (Sweden)

    Khaladkar Mugdha


    Full Text Available Abstract Background Gene duplicates have been shown to evolve at different rates. Here we further investigate the mechanism and functional underpinning of this phenomenon by assessing asymmetric evolution specifically within functional domains of gene duplicates. Results Based on duplicate genes in five teleost fishes resulting from a whole genome duplication event, we first show that a Fisher Exact test based approach to detect asymmetry is more sensitive than the previously used Likelihood Ratio test. Using our Fisher Exact test, we found that the evolutionary rate asymmetry in the overall protein is largely explained by the asymmetric evolution within specific protein domains. Moreover, among cases of asymmetrically evolving domains, for the gene copy containing a fast evolving domain, the non-synonymous substitutions often cluster within the fast evolving domain. We found that rare substitutions were preferred within asymmetrically evolving domains suggestive of functional divergence. While overall ~32 % of the domains tested were found to be evolving asymmetrically, certain protein domains such as the Tyrosine and Ser/Thr Kinase domains had a much greater prevalence of asymmetric evolution. Finally, based on the spatial expression of Zebra fish duplicate proteins during development, we found that protein pairs containing asymmetrically evolving domains had a greater divergence in gene expression as compared to the duplicate proteins that did not exhibit asymmetric evolution. Conclusions Taken together, our results suggest that the previously observed asymmetry in the overall duplicate protein evolution is largely due to divergence of specific domains of the protein, and coincides with divergence in spatial expression domains.

  15. Partial duplications of the ATRX gene cause the ATR-X syndrome. (United States)

    Thienpont, Bernard; de Ravel, Thomy; Van Esch, Hilde; Van Schoubroeck, Dominique; Moerman, Philippe; Vermeesch, Joris Robert; Fryns, Jean-Pierre; Froyen, Guy; Lacoste, Caroline; Badens, Catherine; Devriendt, Koen


    ATR-X syndrome is a rare syndromic X-linked mental retardation disorder. We report that some of the patients suspected of ATR-X carry large intragenic duplications in the ATRX gene, leading to an absence of ATRX mRNA and of the protein. These findings underscore the need for including quantitative analyses to mutation analysis of the ATRX gene.

  16. Compensatory Drift and the Evolutionary Dynamics of Dosage-Sensitive Duplicate Genes. (United States)

    Thompson, Ammon; Zakon, Harold H; Kirkpatrick, Mark


    Dosage-balance selection preserves functionally redundant duplicates (paralogs) at the optimum for their combined expression. Here we present a model of the dynamics of duplicate genes coevolving under dosage-balance selection. We call this the compensatory drift model. Results show that even when strong dosage-balance selection constrains total expression to the optimum, expression of each duplicate can diverge by drift from its original level. The rate of divergence slows as the strength of stabilizing selection, the size of the mutation effect, and/or the size of the population increases. We show that dosage-balance selection impedes neofunctionalization early after duplication but can later facilitate it. We fit this model to data from sodium channel duplicates in 10 families of teleost fish; these include two convergent lineages of electric fish in which one of the duplicates neofunctionalized. Using the model, we estimated the strength of dosage-balance selection for these genes. The results indicate that functionally redundant paralogs still may undergo radical functional changes after a prolonged period of compensatory drift.

  17. Genomic analysis reveals extensive gene duplication within the bovine TRB locus

    Directory of Open Access Journals (Sweden)

    Law Andy


    Full Text Available Abstract Background Diverse TR and IG repertoires are generated by V(DJ somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has lead to the generation of a third DJC cluster. Analyses of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically

  18. Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes. (United States)

    Singh, Param Priya; Arora, Jatin; Isambert, Hervé


    Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.

  19. Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes.

    Directory of Open Access Journals (Sweden)

    Param Priya Singh


    Full Text Available Whole genome duplications (WGD have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.

  20. On the Complexity of Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees. (United States)

    Kordi, Misagh; Bansal, Mukul


    Duplication-Transfer-Loss (DTL) reconciliation has emerged as a powerful technique for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation takes as input a gene family phylogeny and the corresponding species phylogeny, and reconciles the two by postulating speciation, gene duplication, horizontal gene transfer, and gene loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. However, gene trees are frequently non-binary. With such non-binary gene trees, the reconciliation problem seeks to find a binary resolution of the gene tree that minimizes the reconciliation cost. Given the prevalence of non-binary gene trees, many efficient algorithms have been developed for this problem in the context of the simpler Duplication-Loss (DL) reconciliation model. Yet, no efficient algorithms exist for DTL reconciliation with non-binary gene trees and the complexity of the problem remains unknown. In this work, we resolve this open question by showing that the problem is, in fact, NP-hard. Our reduction applies to both the dated and undated formulations of DTL reconciliation. By resolving this long-standing open problem, this work will spur the development of both exact and heuristic algorithms for this important problem.

  1. Limited pattern of TCR delta chain gene rearrangement on the RNA level in multiple sclerosis. (United States)

    Nowak, J; Januszkiewicz, D; Pernak, M; Hertmanowska, H; Nowicka-Kujawska, K; Rembowska, J; Lewandowski, K; Nowak, T; Wender, M


    Susceptibility to multiple sclerosis (MS) is most likely affected by a number of genes, including HLA and T-cell receptor (TCR) genes. T cells expressing gamma/delta receptors seem to contribute to autoagression in MS, as evidenced by their localization in the MS plaques in the brain. The aim of this study was to analyse the TCRdelta chain gene rearrangement at the RNA (cDNA) level and compare to the DNA pattern rearrangement. TCRdelta gene rearrangement was analysed in MS patients and healthy individuals with the use of primers specific for Vdelta1-6 and Jdelta1 genes (at the DNA level) and specific for Vdelta1-6 and Cdelta1 genes (at the cDNA level). The size of PCR products was analysed on agarose gel and by ALF-Express (Pharmacia). Additionally, the lymphocyte surface immunophenotype was studied with specific monoclonal antibodies. At the DNA level a restricted pattern of Vdelta3-Jdelta1 and Vdelta5-Jdelta1 was found only in MS patients. Contrary to DNA, mono-, oligoclonal RNA (cDNA) rearrangements were limited to Vdelta1-Cdelta1, Vdelta2-Cdelta1 and Vdelta3-Cdelta1 only in MS patients as well. Surface immunophenotype analysis revealed in MS a much higher frequency of activated gamma/delta T lymphocytes, i.e. expressing HLA-DR and CD25. An elevated level of CD56 positive cells in MS was recorded. Mono-oligoclonal pattern of TCRdelta gene rearrangement at the RNA level, along with increase in activated gamma/delta T cells, strongly argue for a significant role of gamma/delta T lymphocytes in the pathogenesis of MS.

  2. Evidence of duplicated Hox genes in the most recent common ancestor of extant scorpions. (United States)

    Sharma, Prashant P; Santiago, Marc A; González-Santillán, Edmundo; Monod, Lionel; Wheeler, Ward C


    Scorpions (order Scorpiones) are unusual among arthropods, both for the extreme heteronomy of their bauplan and for the high gene family turnover exhibited in their genomes. These phenomena appear to be correlated, as two scorpion species have been shown to possess nearly twice the number of Hox genes present in most arthropods. Segmentally offset anterior expression boundaries of a subset of Hox paralogs have been shown to correspond to transitions in segmental identities in the scorpion posterior tagmata, suggesting that posterior heteronomy in scorpions may have been achieved by neofunctionalization of Hox paralogs. However, both the first scorpion genome sequenced and the developmental genetic data are based on exemplars of Buthidae, one of 19 families of scorpions. It is therefore not known whether Hox paralogy is limited to Buthidae or widespread among scorpions. We surveyed 24 high throughput transcriptomes and the single whole genome available for scorpions, in order to test the prediction that Hox gene duplications are common to the order. We used gene tree parsimony to infer whether the paralogy was consistent with a duplication event in the scorpion common ancestor. Here we show that duplicated Hox genes in non-buthid scorpions occur in six of the ten Hox classes. Gene tree topologies and parsimony-based reconciliation of the gene trees are consistent with a duplication event in the most recent common ancestor of scorpions. These results suggest that a Hox paralogy, and by extension the model of posterior patterning established in a buthid, can be extended to non-Buthidae scorpions.

  3. Duplication and relocation of the functional DPY19L2 gene within low copy repeats

    Directory of Open Access Journals (Sweden)

    Cheung Joseph


    Full Text Available Abstract Background Low copy repeats (LCRs are thought to play an important role in recent gene evolution, especially when they facilitate gene duplications. Duplicate genes are fundamental to adaptive evolution, providing substrates for the development of new or shared gene functions. Moreover, silencing of duplicate genes can have an indirect effect on adaptive evolution by causing genomic relocation of functional genes. These changes are theorized to have been a major factor in speciation. Results Here we present a novel example showing functional gene relocation within a LCR. We characterize the genomic structure and gene content of eight related LCRs on human Chromosomes 7 and 12. Two members of a novel transmembrane gene family, DPY19L, were identified in these regions, along with six transcribed pseudogenes. One of these genes, DPY19L2, is found on Chromosome 12 and is not syntenic with its mouse orthologue. Instead, the human locus syntenic to mouse Dpy19l2 contains a pseudogene, DPY19L2P1. This indicates that the ancestral copy of this gene has been silenced, while the descendant copy has remained active. Thus, the functional copy of this gene has been relocated to a new genomic locus. We then describe the expansion and evolution of the DPY19L gene family from a single gene found in invertebrate animals. Ancient duplications have led to multiple homologues in different lineages, with three in fish, frogs and birds and four in mammals. Conclusion Our results show that the DPY19L family has expanded throughout the vertebrate lineage and has undergone recent primate-specific evolution within LCRs.

  4. Duplication and diversification of the hypoxia-inducible IGFBP-1 gene in zebrafish.

    Directory of Open Access Journals (Sweden)

    Hiroyasu Kamei

    Full Text Available BACKGROUND: Gene duplication is the primary force of new gene evolution. Deciphering whether a pair of duplicated genes has evolved divergent functions is often challenging. The zebrafish is uniquely positioned to provide insight into the process of functional gene evolution due to its amenability to genetic and experimental manipulation and because it possess a large number of duplicated genes. METHODOLOGY/PRINCIPAL FINDINGS: We report the identification and characterization of two hypoxia-inducible genes in zebrafish that are co-ortholgs of human IGF binding protein-1 (IGFBP-1. IGFBP-1 is a secreted protein that binds to IGF and modulates IGF actions in somatic growth, development, and aging. Like their human and mouse counterparts, in adult zebrafish igfbp-1a and igfbp-1b are exclusively expressed in the liver. During embryogenesis, the two genes are expressed in overlapping spatial domains but with distinct temporal patterns. While zebrafish IGFBP-1a mRNA was easily detected throughout embryogenesis, IGFBP-1b mRNA was detectable only in advanced stages. Hypoxia induces igfbp-1a expression in early embryogenesis, but induces the igfbp-1b expression later in embryogenesis. Both IGFBP-1a and -b are capable of IGF binding, but IGFBP-1b has much lower affinities for IGF-I and -II because of greater dissociation rates. Overexpression of IGFBP-1a and -1b in zebrafish embryos caused significant decreases in growth and developmental rates. When tested in cultured zebrafish embryonic cells, IGFBP-1a and -1b both inhibited IGF-1-induced cell proliferation but the activity of IGFBP-1b was significantly weaker. CONCLUSIONS/SIGNIFICANCE: These results indicate subfunction partitioning of the duplicated IGFBP-1 genes at the levels of gene expression, physiological regulation, protein structure, and biological actions. The duplicated IGFBP-1 may provide additional flexibility in fine-tuning IGF signaling activities under hypoxia and other catabolic

  5. Opposing phenotypes in mice with Smith-Magenis deletion and Potocki-Lupski duplication syndromes suggest gene dosage effects on fluid consumption behavior. (United States)

    Heck, Detlef H; Gu, Wenli; Cao, Ying; Qi, Shuhua; Lacaria, Melanie; Lupski, James R


    A quantitative long-term fluid consumption and fluid-licking assay was performed in two mouse models with either an ∼2 Mb genomic deletion, Df(11)17, or the reciprocal duplication copy number variation (CNV), Dp(11)17, analogous to the human genomic rearrangements causing either Smith-Magenis syndrome [SMS; OMIM #182290] or Potocki-Lupski syndrome [PTLS; OMIM #610883], respectively. Both mouse strains display distinct quantitative alterations in fluid consumption compared to their wild-type littermates; several of these changes are diametrically opposing between the two chromosome engineered mouse models. Mice with duplication versus deletion showed longer versus shorter intervals between visits to the waterspout, generated more versus less licks per visit and had higher versus lower variability in the number of licks per lick-burst as compared to their respective wild-type littermates. These findings suggest that copy number variation can affect long-term fluid consumption behavior in mice. Other behavioral differences were unique for either the duplication or deletion mutants; the deletion CNV resulted in increased variability of the licking rhythm, and the duplication CNV resulted in a significant slowing of the licking rhythm. Our findings document a readily quantitated complex behavioral response that can be directly and reciprocally influenced by a gene dosage effect.

  6. Gene duplication as a mechanism of genomic adaptation to a changing environment (United States)

    Kondrashov, Fyodor A.


    A subject of extensive study in evolutionary theory has been the issue of how neutral, redundant copies can be maintained in the genome for long periods of time. Concurrently, examples of adaptive gene duplications to various environmental conditions in different species have been described. At this point, it is too early to tell whether or not a substantial fraction of gene copies have initially achieved fixation by positive selection for increased dosage. Nevertheless, enough examples have accumulated in the literature that such a possibility should be considered. Here, I review the recent examples of adaptive gene duplications and make an attempt to draw generalizations on what types of genes may be particularly prone to be selected for under certain environmental conditions. The identification of copy-number variation in ecological field studies of species adapting to stressful or novel environmental conditions may improve our understanding of gene duplications as a mechanism of adaptation and its relevance to the long-term persistence of gene duplications. PMID:22977152

  7. [Analyses of the rearrangement of T-cell receptor- and immunoglobulin genes in the diagnosis of lymphoproliferative disorders]. (United States)

    Griesser, D H


    Rearrangements are developmentally regulated genetic recombinations in T and B cells which generate functional T cell receptor (TcR) and immunoglobulin genes, respectively. Different variable, sometimes diversity, and joining gene segments which are discontinuously spread out within their chromosomal location in germline configuration, are randomly assembled in individual lymphocytes. These rearrangements can be detected by Southern Blot analysis if more than 5% of a total lymphocyte population in a biopsy specimen carries the same clonal rearrangement. We analyzed DNA from 324 snap-frozen biopsy specimens from lympho-proliferative disorders. None of the 20 reactive lesions and four malignant myelomonocytic tumors had a clonal antigen receptor gene rearrangement. All 117 malignant B cell lymphomas of different subtypes and 95 of 97 malignant T cell lymphomas showed a clonal gene rearrangement. Only two angioimmunoblastic lymphadenopathy(AILD)-type T cell lymphomas did not have immune receptor gene rearrangements. They were morphologically indistinguishable from the other 47 T/AILD lymphomas with clonal rearrangement patterns. In most cases TcR beta and immunoglobulin heavy chain (IgH) gene probes were sufficient for lineage assignment of the clonal T or B lymphocyte population. In 18% of B lymphomas, however, a cross-lineage rearrangement of TcR beta genes, and in 20% of the T cell lymphomas a clonal IgH gene rearrangement was detected. After exclusion of centrocytic, large cell anaplastic lymphomas (LCAL) of B-type, and T/AILD lymphomas which are overrepresented in our study, only 10% of the remaining 147 T and B cell lymphomas had aberrant rearrangements. TcR rearrangements other than those of the beta chain genes were extremely rare in B cell lymphomas, as were Ig kappa rearrangements in T lymphomas. Only two T/AILD lymphomas had IgH and Ig kappa rearrangement in addition to their clonal T cell receptor gene rearrangements. Both samples likely contain a clonal B

  8. Functional characterization of duplicated Suppressor of Overexpression of Constans 1-like genes in petunia.

    Directory of Open Access Journals (Sweden)

    Jill C Preston

    Full Text Available Flowering time is strictly controlled by a combination of internal and external signals that match seed set with favorable environmental conditions. In the model plant species Arabidopsis thaliana (Brassicaceae, many of the genes underlying development and evolution of flowering have been discovered. However, much remains unknown about how conserved the flowering gene networks are in plants with different growth habits, gene duplication histories, and distributions. Here we functionally characterize three homologs of the flowering gene Suppressor Of Overexpression of Constans 1 (SOC1 in the short-lived perennial Petunia hybrida (petunia, Solanaceae. Similar to A. thaliana soc1 mutants, co-silencing of duplicated petunia SOC1-like genes results in late flowering. This phenotype is most severe when all three SOC1-like genes are silenced. Furthermore, expression levels of the SOC1-like genes Unshaven (UNS and Floral Binding Protein 21 (FBP21, but not FBP28, are positively correlated with developmental age. In contrast to A. thaliana, petunia SOC1-like gene expression did not increase with longer photoperiods, and FBP28 transcripts were actually more abundant under short days. Despite evidence of functional redundancy, differential spatio-temporal expression data suggest that SOC1-like genes might fine-tune petunia flowering in response to photoperiod and developmental stage. This likely resulted from modification of SOC1-like gene regulatory elements following recent duplication, and is a possible mechanism to ensure flowering under both inductive and non-inductive photoperiods.

  9. Early vertebrate chromosome duplications and the evolution of the neuropeptide Y receptor gene regions

    Directory of Open Access Journals (Sweden)

    Brenner Sydney


    Full Text Available Abstract Background One of the many gene families that expanded in early vertebrate evolution is the neuropeptide (NPY receptor family of G-protein coupled receptors. Earlier work by our lab suggested that several of the NPY receptor genes found in extant vertebrates resulted from two genome duplications before the origin of jawed vertebrates (gnathostomes and one additional genome duplication in the actinopterygian lineage, based on their location on chromosomes sharing several gene families. In this study we have investigated, in five vertebrate genomes, 45 gene families with members close to the NPY receptor genes in the compact genomes of the teleost fishes Tetraodon nigroviridis and Takifugu rubripes. These correspond to Homo sapiens chromosomes 4, 5, 8 and 10. Results Chromosome regions with conserved synteny were identified and confirmed by phylogenetic analyses in H. sapiens, M. musculus, D. rerio, T. rubripes and T. nigroviridis. 26 gene families, including the NPY receptor genes, (plus 3 described recently by other labs showed a tree topology consistent with duplications in early vertebrate evolution and in the actinopterygian lineage, thereby supporting expansion through block duplications. Eight gene families had complications that precluded analysis (such as short sequence length or variable number of repeated domains and another eight families did not support block duplications (because the paralogs in these families seem to have originated in another time window than the proposed genome duplication events. RT-PCR carried out with several tissues in T. rubripes revealed that all five NPY receptors were expressed in the brain and subtypes Y2, Y4 and Y8 were also expressed in peripheral organs. Conclusion We conclude that the phylogenetic analyses and chromosomal locations of these gene families support duplications of large blocks of genes or even entire chromosomes. Thus, these results are consistent with two early vertebrate

  10. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies. (United States)

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D


    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes-and that the butterfly proboscis is involved in digestive enzyme production.

  11. Immunoglobulin gene expression and regulation of rearrangement in kappa transgenic mice

    Energy Technology Data Exchange (ETDEWEB)

    Ritchie, K.A.


    Transgenic mice were produced by microinjection of the functionally rearranged immunoglobulin kappa gene from the myeloma MOPC-21 into the male pronucleus of fertilized mouse eggs, and implantation of the microinjected embryos into foster mothers. Mice that integrated the injected gene were detected by hybridizing tail DNA dots with radioactively labelled pBR322 plasmid DNA, which detects pBR322 sequences left as a tag on the microinjected DNA. Mice that integrated the injected gene (six males) were mated and the DNA, RNA and serum kappa chains of their offspring were analyzed. A rabbit anti-mouse kappa chain antiserum was also produced for use in detection of mouse kappa chains on protein blots. Hybridomas were produced from the spleen cells of these kappa transgenic mice to immortalize representative B cells and to investigate expression of the transgenic kappa gene, its effect on allelic exclusion, and its effect on the control of light chain gene rearrangement and expression. The results show that the microinjected DNA is integrated as concatamers in unique single or, rarely, two separate sites in the genome. The concatamers are composed of several copies (16 to 64) of injected DNA arranged in a head to tail fashion. The transgene is expressed into protein normally and in a tissue specific fashion. For the first time in these transgenic mice, all tissues contain a functionally rearranged and potentially expressible immunoglobulin gene. The transgene is expressed only in B cells and not in hepatocytes, for example. This indicates that rearrangement of immunoglobulin genes is necessary but not sufficient for the tissue specific expression of these genes by B cells.

  12. Zebrafish IGF genes: gene duplication, conservation and divergence, and novel roles in midline and notochord development.

    Directory of Open Access Journals (Sweden)

    Shuming Zou

    Full Text Available Insulin-like growth factors (IGFs are key regulators of development, growth, and longevity. In most vertebrate species including humans, there is one IGF-1 gene and one IGF-2 gene. Here we report the identification and functional characterization of 4 distinct IGF genes (termed as igf-1a, -1b, -2a, and -2b in zebrafish. These genes encode 4 structurally distinct and functional IGF peptides. IGF-1a and IGF-2a mRNAs were detected in multiple tissues in adult fish. IGF-1b mRNA was detected only in the gonad and IGF-2b mRNA only in the liver. Functional analysis showed that all 4 IGFs caused similar developmental defects but with different potencies. Many of these embryos had fully or partially duplicated notochords, suggesting that an excess of IGF signaling causes defects in the midline formation and an expansion of the notochord. IGF-2a, the most potent IGF, was analyzed in depth. IGF-2a expression caused defects in the midline formation and expansion of the notochord but it did not alter the anterior neural patterning. These results not only provide new insights into the functional conservation and divergence of the multiple igf genes but also reveal a novel role of IGF signaling in midline formation and notochord development in a vertebrate model.

  13. A duplicated PLP gene causing Pelizaeus-Merzbacher disease detected by comparative multiplex PCR

    Energy Technology Data Exchange (ETDEWEB)

    Inoue, K.; Sugiyama, N.; Kawanishi, C. [Yokohama City Univ., Yokohama (Japan)] [and others


    Pelizaeus-Merzbacher disease (PMD) is an X-linked dysmyelinating disorder caused by abnormalities in the proteolipid protein (PLP) gene, which is essential for oligodendrocyte differentiation and CNS myelin formation. Although linkage analysis has shown the homogeneity at the PLP locus in patients with PMD, exonic mutations in the PLP gene have been identified in only 10% - 25% of all cases, which suggests the presence of other genetic aberrations, including gene duplication. In this study, we examined five families with PMD not carrying exonic mutations in PLP gene, using comparative multiplex PCR (CM-PCR) as a semiquantitative assay of gene dosage. PLP gene duplications were identified in four families by CM-PCR and confirmed in three families by densitometric RFLP analysis. Because a homologous myelin protein gene, PMP22, is duplicated in the majority of patients with Charcot-Marie-Tooth 1A, PLP gene overdosage may be an important genetic abnormality in PMD and affect myelin formation. 38 ref., 5 figs., 2 tabs.

  14. Preferential duplication of intermodular hub genes: an evolutionary signature in eukaryotes genome networks.

    Directory of Open Access Journals (Sweden)

    Ricardo M Ferreira

    Full Text Available Whole genome protein-protein association networks are not random and their topological properties stem from genome evolution mechanisms. In fact, more connected, but less clustered proteins are related to genes that, in general, present more paralogs as compared to other genes, indicating frequent previous gene duplication episodes. On the other hand, genes related to conserved biological functions present few or no paralogs and yield proteins that are highly connected and clustered. These general network characteristics must have an evolutionary explanation. Considering data from STRING database, we present here experimental evidence that, more than not being scale free, protein degree distributions of organisms present an increased probability for high degree nodes. Furthermore, based on this experimental evidence, we propose a simulation model for genome evolution, where genes in a network are either acquired de novo using a preferential attachment rule, or duplicated with a probability that linearly grows with gene degree and decreases with its clustering coefficient. For the first time a model yields results that simultaneously describe different topological distributions. Also, this model correctly predicts that, to produce protein-protein association networks with number of links and number of nodes in the observed range for Eukaryotes, it is necessary 90% of gene duplication and 10% of de novo gene acquisition. This scenario implies a universal mechanism for genome evolution.

  15. A gene duplication led to specialized gamma-aminobutyrate and beta-alanine aminotransferase in yeast

    DEFF Research Database (Denmark)

    Andersen, Gorm; Andersen, Birgit; Dobritzsch, D.


    and related yeasts have two different genes/enzymes to apparently 'distinguish' between the two reactions in a single cell. It is likely that upon duplication similar to 200 million years ago, a specialized Uga1p evolved into a 'novel' transaminase enzyme with broader substrate specificity....

  16. Duplication and Divergence of Floral MADS-Box Genes in Grasses: Evidence for the Generation and Modification of Novel Regulators

    Institute of Scientific and Technical Information of China (English)

    Guixia Xu; Hongzhi Kong


    The process of flowering is controlled by a hierarchy of floral genes that act as flowering time genes, inflorescence/floral meristem identity genes, and/or floral organ-identity genes. The most important and well-characterized floral genes are those that belong to the MADS-box family of transcription factors. Compelling evidence suggests that floral MADS-box genes have experienced a few large-scale duplication events. In particular, the pre-core eudicot duplication events have been considered to correlate with the emergence and diversification of core eudicots. Duplication of floral MADS-box genes has also been documented in monocots, particularly in grasses, although a systematic study is lacking. In the present study, by conducting extensive phylogenetic analyses, we identified pre-Poaceae gene duplication events in each of the AP1, PI, AG, AGL11, AGL2/3/4, and AGL9gene lineages. Comparative genomic studies further indicated that some of these duplications actually resulted from the genome doubling event that occurred 66-70 million years ago (MYA). In addition, we found that after gene duplication, exonization (of intron sequences) and pseudoexonization (of exon sequences) have contributed to the divergence of duplicate genes in sequence structure and, possibly, gene function.

  17. Exact Algorithms for Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees. (United States)

    Kordi, Misagh; Bansal, Mukul S


    Duplication-Transfer-Loss (DTL) reconciliation is a powerful method for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation seeks to reconcile gene trees with species trees by postulating speciation, duplication, transfer, and loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. In practice, however, gene trees are often non-binary due to uncertainty in the gene tree topologies, and DTL reconciliation with non-binary gene trees is known to be NP-hard. In this paper, we present the first exact algorithms for DTL reconciliation with non-binary gene trees. Specifically, we (i) show that the DTL reconciliation problem for non-binary gene trees is fixed-parameter tractable in the maximum degree of the gene tree, (ii) present an exponential-time, but in-practice efficient, algorithm to track and enumerate all optimal binary resolutions of a non-binary input gene tree, and (iii) apply our algorithms to a large empirical data set of over 4700 gene trees from 100 species to study the impact of gene tree uncertainty on DTL-reconciliation and to demonstrate the applicability and utility of our algorithms. The new techniques and algorithms introduced in this paper will help biologists avoid incorrect evolutionary inferences caused by gene tree uncertainty.

  18. Lager yeasts possess dynamic genomes that undergo rearrangements and gene amplification in response to stress. (United States)

    James, Tharappel C; Usher, Jane; Campbell, Susan; Bond, Ursula


    A long-term goal of the brewing industry is to identify yeast strains with increased tolerance to the stresses experienced during the brewing process. We have characterised the genomes of a number of stress-tolerant mutants, derived from the lager yeast strain CMBS-33, that were selected for tolerance to high temperatures and to growth in high specific gravity wort. Our results indicate that the heat-tolerant strains have undergone a number of gross chromosomal rearrangements when compared to the parental strain. To determine if such rearrangements can spontaneously arise in response to exposure to stress conditions experienced during the brewing process, we examined the chromosome integrity of both the stress-tolerant strains and their parent during a single round of fermentation under a variety of environmental stresses. Our results show that the lager yeast genome shows tremendous plasticity during fermentation, especially when fermentations are carried out in high specific gravity wort and at higher than normal temperatures. Many localised regions of gene amplification were observed especially at the telomeres and at the rRNA gene locus on chromosome XII, and general chromosomal instability was evident. However, gross chromosomal rearrangements were not detected, indicating that continued selection in the stress conditions are required to obtain clonal isolates with stable rearrangements. Taken together, the data suggest that lager yeasts display a high degree of genomic plasticity and undergo genomic changes in response to environmental stress.

  19. Cytosine arabinoside-metabolizing enzyme genes are underexpressed in children with MLL gene-rearranged acute lymphoblastic leukemia

    Directory of Open Access Journals (Sweden)

    J.F. Mata


    Full Text Available Infant acute lymphoblastic leukemia (IALL is characterized by mixed lineage leukemia (MLL gene rearrangements, unique gene expression profiles, poor prognosis, and drug resistance. One exception is cytosine arabinoside (Ara-C to which IALL cells seem to be more sensitive. We quantified mRNA expression of Ara-C key enzymes in leukemic lymphoblasts from 64 Brazilian ALL children, 15 of them presenting MLL gene rearrangement, and correlated it with clinical and biological features. The diagnosis was based on morphological criteria and immunophenotyping using monoclonal antibodies. MLL gene rearrangements were detected by conventional cytogenetic analysis, RT-PCR and/or fluorescence in situ hybridization. The DCK and HENT1 expression levels were determined by real-time quantitative PCR using SYBR Green I. Relative quantification was made by the standard curve method. The results were analyzed by Mann-Whitney and Fisher exact tests. A P value of £0.05 was considered to be statistically significant. DCK and HENT1 expression levels were significantly lower in children with MLL gene-rearranged ALL compared to children with MLL germ line ALL (P = 0.0003 and 0.03, respectively. Our results differ from previous ones concerning HENT1 mRNA expression that observed a higher expression level in MLL gene-rearranged leukemias. In conclusion, the expression of the genes related to Ara-C metabolism was lower in MLL-positive children in the sample studied, suggesting the presence of population differences in the expression profile of these genes especially for HENT1.

  20. Species-specific duplications of NBS-encoding genes in Chinese chestnut (Castanea mollissima) (United States)

    Zhong, Yan; Li, Yingjun; Huang, Kaihui; Cheng, Zong-Ming


    The disease resistance (R) genes play an important role in protecting plants from infection by diverse pathogens in the environment. The nucleotide-binding site (NBS)-leucine-rich repeat (LRR) class of genes is one of the largest R gene families. Chinese chestnut (Castanea mollissima) is resistant to Chestnut Blight Disease, but relatively little is known about the resistance mechanism. We identified 519 NBS-encoding genes, including 374 NBS-LRR genes and 145 NBS-only genes. The majority of Ka/Ks were less than 1, suggesting the purifying selection operated during the evolutionary history of NBS-encoding genes. A minority (4/34) of Ka/Ks in non-TIR gene families were greater than 1, showing that some genes were under positive selection pressure. Furthermore, Ks peaked at a range of 0.4 to 0.5, indicating that ancient duplications arose during the evolution. The relationship between Ka/Ks and Ks indicated greater selective pressure on the newer and older genes with the critical value of Ks = 0.4–0.5. Notably, species-specific duplications were detected in NBS-encoding genes. In addition, the group of RPW8-NBS-encoding genes clustered together as an independent clade located at a relatively basal position in the phylogenetic tree. Many cis-acting elements related to plant defense responses were detected in promoters of NBS-encoding genes. PMID:26559332

  1. Duplication, divergence and persistence in the Phytochrome photoreceptor gene family of cottons (Gossypium spp.

    Directory of Open Access Journals (Sweden)

    Abdukarimov Abdusattor


    Full Text Available Abstract Background Phytochromes are a family of red/far-red photoreceptors that regulate a number of important developmental traits in cotton (Gossypium spp., including plant architecture, fiber development, and photoperiodic flowering. Little is known about the composition and evolution of the phytochrome gene family in diploid (G. herbaceum, G. raimondii or allotetraploid (G. hirsutum, G. barbadense cotton species. The objective of this study was to obtain a preliminary inventory and molecular-evolutionary characterization of the phytochrome gene family in cotton. Results We used comparative sequence resources to design low-degeneracy PCR primers that amplify genomic sequence tags (GSTs for members of the PHYA, PHYB/D, PHYC and PHYE gene sub-families from A- and D-genome diploid and AD-genome allotetraploid Gossypium species. We identified two paralogous PHYA genes (designated PHYA1 and PHYA2 in diploid cottons, the result of a Malvaceae-specific PHYA gene duplication that occurred approximately 14 million years ago (MYA, before the divergence of the A- and D-genome ancestors. We identified a single gene copy of PHYB, PHYC, and PHYE in diploid cottons. The allotetraploid genomes have largely retained the complete gene complements inherited from both of the diploid genome ancestors, with at least four PHYA genes and two genes encoding PHYB, PHYC and PHYE in the AD-genomes. We did not identify a PHYD gene in any cotton genomes examined. Conclusions Detailed sequence analysis suggests that phytochrome genes retained after duplication by segmental duplication and allopolyploidy appear to be evolving independently under a birth-and-death-process with strong purifying selection. Our study provides a preliminary phytochrome gene inventory that is necessary and sufficient for further characterization of the biological functions of each of the cotton phytochrome genes, and for the development of 'candidate gene' markers that are potentially useful for

  2. Insight into transcription factor gene duplication from Caenorhabditis elegans Promoterome-driven expression patterns

    Directory of Open Access Journals (Sweden)

    Vidal Marc


    Full Text Available Abstract Background The C. elegans Promoterome is a powerful resource for revealing the regulatory mechanisms by which transcription is controlled pan-genomically. Transcription factors will form the core of any systems biology model of genome control and therefore the promoter activity of Promoterome inserts for C. elegans transcription factor genes was examined, in vivo, with a reporter gene approach. Results Transgenic C. elegans strains were generated for 366 transcription factor promoter/gfp reporter gene fusions. GFP distributions were determined, and then summarized with reference to developmental stage and cell type. Reliability of these data was demonstrated by comparison to previously described gene product distributions. A detailed consideration of the results for one C. elegans transcription factor gene family, the Six family, comprising ceh-32, ceh-33, ceh-34 and unc-39 illustrates the value of these analyses. The high proportion of Promoterome reporter fusions that drove GFP expression, compared to previous studies, led to the hypothesis that transcription factor genes might be involved in local gene duplication events less frequently than other genes. Comparison of transcription factor genes of C. elegans and Caenorhabditis briggsae was therefore carried out and revealed very few examples of functional gene duplication since the divergence of these species for most, but not all, transcription factor gene families. Conclusion Examining reporter expression patterns for hundreds of promoters informs, and thereby improves, interpretation of this data type. Genes encoding transcription factors involved in intrinsic developmental control processes appear acutely sensitive to changes in gene dosage through local gene duplication, on an evolutionary time scale.

  3. Ancient gene duplication provided a key molecular step for anaerobic growth of Baker's yeast. (United States)

    Hayashi, Masaya; Schilke, Brenda; Marszalek, Jaroslaw; Williams, Barry; Craig, Elizabeth A


    Mitochondria are essential organelles required for a number of key cellular processes. As most mitochondrial proteins are nuclear encoded, their efficient translocation into the organelle is critical. Transport of proteins across the inner membrane is driven by a multicomponent, matrix-localized "import motor," which is based on the activity of the molecular chaperone Hsp70 and a J-protein cochaperone. In Saccharomyces cerevisiae, two paralogous J-proteins, Pam18 and Mdj2, can form the import motor. Both contain transmembrane and matrix domains, with Pam18 having an additional intermembrane space (IMS) domain. Evolutionary analyses revealed that the origin of the IMS domain of S. cerevisiae Pam18 coincides with a gene duplication event that generated the PAM18/MDJ2 gene pair. The duplication event and origin of the Pam18 IMS domain occurred at the relatively ancient divergence of the fungal subphylum Saccharomycotina. The timing of the duplication event also corresponds with a number of additional functional changes related to mitochondrial function and respiration. Physiological and genetic studies revealed that the IMS domain of Pam18 is required for efficient growth under anaerobic conditions, even though it is dispensable when oxygen is present. Thus, the gene duplication was beneficial for growth capacity under particular environmental conditions as well as diversification of the import motor components.

  4. A single enhancer regulating the differential expression of duplicated red-sensitive opsin genes in zebrafish.

    Directory of Open Access Journals (Sweden)

    Taro Tsujimura


    Full Text Available A fundamental step in the evolution of the visual system is the gene duplication of visual opsins and differentiation between the duplicates in absorption spectra and expression pattern in the retina. However, our understanding of the mechanism of expression differentiation is far behind that of spectral tuning of opsins. Zebrafish (Danio rerio have two red-sensitive cone opsin genes, LWS-1 and LWS-2. These genes are arrayed in a tail-to-head manner, in this order, and are both expressed in the long member of double cones (LDCs in the retina. Expression of the longer-wave sensitive LWS-1 occurs later in development and is thus confined to the peripheral, especially ventral-nasal region of the adult retina, whereas expression of LWS-2 occurs earlier and is confined to the central region of the adult retina, shifted slightly to the dorsal-temporal region. In this study, we employed a transgenic reporter assay using fluorescent proteins and P1-artificial chromosome (PAC clones encompassing the two genes and identified a 0.6-kb "LWS-activating region" (LAR upstream of LWS-1, which regulates expression of both genes. Under the 2.6-kb flanking upstream region containing the LAR, the expression pattern of LWS-1 was recapitulated by the fluorescent reporter. On the other hand, when LAR was directly conjugated to the LWS-2 upstream region, the reporter was expressed in the LDCs but also across the entire outer nuclear layer. Deletion of LAR from the PAC clones drastically lowered the reporter expression of the two genes. These results suggest that LAR regulates both LWS-1 and LWS-2 by enhancing their expression and that interaction of LAR with the promoters is competitive between the two genes in a developmentally restricted manner. Sharing a regulatory region between duplicated genes could be a general way to facilitate the expression differentiation in duplicated visual opsins.

  5. New Organelles by Gene Duplication in a Biophysical Model of Eukaryote Endomembrane Evolution


    Ramadas, Rohini; Thattai, Mukund


    Extant eukaryotic cells have a dynamic traffic network that consists of diverse membrane-bound organelles exchanging matter via vesicles. This endomembrane system arose and diversified during a period characterized by massive expansions of gene families involved in trafficking after the acquisition of a mitochondrial endosymbiont by a prokaryotic host cell >1.8 billion years ago. Here we investigate the mechanistic link between gene duplication and the emergence of new nonendosymbiotic organe...

  6. Cobalamin-Independent Methionine Synthase (MetE): A Face-to-Face Double Barrel that Evolved by Gene Duplication

    Energy Technology Data Exchange (ETDEWEB)

    Pejcha, Robert; Ludwig, Martha L. (Michigan)


    Cobalamin-independent methionine synthase (MetE) catalyzes the transfer of a methyl group from methyltetrahydrofolate to L-homocysteine (Hcy) without using an intermediate methyl carrier. Although MetE displays no detectable sequence homology with cobalamin-dependent methionine synthase (MetH), both enzymes require zinc for activation and binding of Hcy. Crystallographic analyses of MetE from T. maritima reveal an unusual dual-barrel structure in which the active site lies between the tops of the two ({beta}{alpha}){sub 8} barrels. The fold of the N-terminal barrel confirms that it has evolved from the C-terminal polypeptide by gene duplication; comparisons of the barrels provide an intriguing example of homologous domain evolution in which binding sites are obliterated. The C-terminal barrel incorporates the zinc ion that binds and activates Hcy. The zinc-binding site in MetE is distinguished from the (Cys){sub 3}Zn site in the related enzymes, MetH and betaine-homocysteine methyltransferase, by its position in the barrel and by the metal ligands, which are histidine, cysteine, glutamate, and cysteine in the resting form of MetE. Hcy associates at the face of the metal opposite glutamate, which moves away from the zinc in the binary E {center_dot} Hcy complex. The folate substrate is not intimately associated with the N-terminal barrel; instead, elements from both barrels contribute binding determinants in a binary complex in which the folate substrate is incorrectly oriented for methyl transfer. Atypical locations of the Hcy and folate sites in the C-terminal barrel presumably permit direct interaction of the substrates in a ternary complex. Structures of the binary substrate complexes imply that rearrangement of folate, perhaps accompanied by domain rearrangement, must occur before formation of a ternary complex that is competent for methyl transfer.

  7. Rearrangement and evolution of mitochondrial genomes in parrots. (United States)

    Eberhard, Jessica R; Wright, Timothy F


    Mitochondrial genome rearrangements that result in control region duplication have been described for a variety of birds, but the mechanisms leading to their appearance and maintenance remain unclear, and their effect on sequence evolution has not been explored. A recent survey of mitochondrial genomes in the Psittaciformes (parrots) found that control region duplications have arisen independently at least six times across the order. We analyzed complete mitochondrial genome sequences from 20 parrot species, including representatives of each lineage with control region duplications, to document the gene order changes and to examine effects of genome rearrangements on patterns of sequence evolution. The gene order previously reported for Amazona parrots was found for four of the six independently derived genome rearrangements, and a previously undescribed gene order was found in Prioniturus luconensis, representing a fifth clade with rearranged genomes; the gene order resulting from the remaining rearrangement event could not be confirmed. In all rearranged genomes, two copies of the control region are present and are very similar at the sequence level, while duplicates of the other genes involved in the rearrangement show signs of degeneration or have been lost altogether. We compared rates of sequence evolution in genomes with and without control region duplications and did not find a consistent acceleration or deceleration associated with the duplications. This could be due to the fact that most of the genome rearrangement events in parrots are ancient, and additionally, to an effect of body size on evolutionary rate that we found for mitochondrial but not nuclear sequences. Base composition analyses found that relative to other birds, parrots have unusually strong compositional asymmetry (AT- and GC-skew) in their coding sequences, especially at fourfold degenerate sites. Furthermore, we found higher AT skew in species with control region duplications. One

  8. CTDGFinder: A Novel Homology-Based Algorithm for Identifying Closely Spaced Clusters of Tandemly Duplicated Genes. (United States)

    Ortiz, Juan F; Rokas, Antonis


    Closely spaced clusters of tandemly duplicated genes (CTDGs) contribute to the diversity of many phenotypes, including chemosensation, snake venom, and animal body plans. CTDGs have traditionally been identified subjectively as genomic neighborhoods containing several gene duplicates in close proximity; however, CTDGs are often highly variable with respect to gene number, intergenic distance, and synteny. This lack of formal definition hampers the study of CTDG evolutionary dynamics and the discovery of novel CTDGs in the exponentially growing body of genomic data. To address this gap, we developed a novel homology-based algorithm, CTDGFinder, which formalizes and automates the identification of CTDGs by examining the physical distribution of individual members of families of duplicated genes across chromosomes. Application of CTDGFinder accurately identified CTDGs for many well-known gene clusters (e.g., Hox and beta-globin gene clusters) in the human, mouse and 20 other mammalian genomes. Differences between previously annotated gene clusters and our inferred CTDGs were due to the exclusion of nonhomologs that have historically been considered parts of specific gene clusters, the inclusion or absence of genes between the CTDGs and their corresponding gene clusters, and the splitting of certain gene clusters into distinct CTDGs. Examination of human genes showing tissue-specific enhancement of their expression by CTDGFinder identified members of several well-known gene clusters (e.g., cytochrome P450s and olfactory receptors) and revealed that they were unequally distributed across tissues. By formalizing and automating CTDG identification, CTDGFinder will facilitate understanding of CTDG evolutionary dynamics, their functional implications, and how they are associated with phenotypic diversity. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e

  9. Analysis of rearranged immunoglobulin genes indicating a process of clonal evolution in chronic lymphocytic leukaemia. (United States)

    Hakim, I; Rechavi, G; Brok-Simoni, F; Grossman, Z; Amariglio, N; Mandel, M; Ramot, B; Ben-Bassat, I; Katzir, N


    Chronic lymphocytic leukaemia (CLL) is known to be a stable monoclonal neoplasm. In contrast to early studies demonstrating no more than two hybridizing immunoglobulin heavy chain bands corresponding to the two expected alleles, we have demonstrated an unexpected multiband pattern when the HindIII-digested DNA samples from 38 CLL patients were analysed by Southern blot hybridization using JH and C mu gene probes. In order to characterize the genetic basis for the multiband pattern, we molecularly cloned the immunoglobulin heavy chain genes of one of the patients whose leukaemic DNA sample demonstrated three hybridizing JH bands and a loss of the germline band. The cloned rearranged immunoglobulin genes could be divided, based on the restriction mapping and the hybridization with the various probes, into two basic patterns representing two alleles. In one of the cloned rearranged immunoglobulin genes a secondary rearrangement occurred that resulted in the addition of 300 base-pair long sequence into the switch region, and the creation of a HindIII restriction site. The results of the study suggest that clonal evolution occurs in some CLL, and that many of these neoplasms are indeed oligoclonal due to the accumulation of secondary genetic changes.

  10. Molecular Subtyping of Primary Prostate Cancer Reveals Specific and Shared Target Genes of Different ETS Rearrangements

    Directory of Open Access Journals (Sweden)

    Paula Paulo


    Full Text Available This work aimed to evaluate whether ETS transcription factors frequently involved in rearrangements in prostate carcinomas (PCa, namely ERG and ETV1, regulate specific or shared target genes. We performed differential expression analysis on nine normal prostate tissues and 50 PCa enriched for different ETS rearrangements using exon-level expression microarrays, followed by in vitro validation using cell line models. We found specific deregulation of 57 genes in ERG-positive PCa and 15 genes in ETV1-positive PCa, whereas deregulation of 27 genes was shared in both tumor subtypes. We further showed that the expression of seven tumor-associated ERG target genes (PLA1A, CACNA1D, ATP8A2, HLA-DMB, PDE3B, TDRD1, and TMBIM1 and two tumor-associated ETV1 target genes (FKBP10 and GLYATL2 was significantly affected by specific ETS silencing in VCaP and LNCaP cell line models, respectively, whereas the expression of three candidate ERG and ETV1 shared targets (GRPR, KCNH8, and TMEM45B was significantly affected by silencing of either ETS. Interestingly, we demonstrate that the expression of TDRD1, the topmost overexpressed gene of our list of ERG-specific candidate targets, is inversely correlated with the methylation levels of a CpG island found at -66 bp of the transcription start site in PCa and that TDRD1 expression is regulated by direct binding of ERG to the CpG island in VCaP cells. We conclude that ETS transcription factors regulate specific and shared target genes and that TDRD1, FKBP10, and GRPR are promising therapeutic targets and can serve as diagnostic markers for molecular subtypes of PCa harboring specific fusion gene rearrangements.

  11. Genomics 4.0 : syntenic gene and genome duplication drives diversification of plant secondary metabolism and innate immunity in flowering plants : advanced pattern analytics in duplicate genomes

    NARCIS (Netherlands)

    Hofberger, J.A.


    Genomics 4.0 - Syntenic Gene and Genome Duplication Drives Diversification of Plant Secondary Metabolism and Innate Immunity in Flowering Plants   Johannes A. Hofberger1, 2, 3 1 Biosystematics Group, Wageningen University & Research Center, Droevendaalsesteeg 1, 6708 PB Wageningen, The Neth

  12. Higher primates, but not New World monkeys, have a duplicate set of enhancers flanking their apoC-I genes. (United States)

    Puppione, Donald L


    Previous studies have demonstrated that the apoC-I gene and its pseudogene on human chromosome 19 are flanked by a duplicate set of enhancers. Multienhancers, ME.1 and ME.2, are located upstream from the genes and the hepatic control region enhancers, HCR.1 and HCR.2, are located downstream. The duplication of the enhancers has been thought to have occurred when the apoC-I gene was duplicated during primate evolution. Currently, the only primate data are for the human enhancers. Examining the genome of other primates (great and lesser apes, Old and New World monkeys), it was possible to locate the duplicate set of enhancers in apes and Old World monkeys. However, only a single set was found in New World monkeys. These observations provide additional evidence that the apoC-I gene and the flanking enhancers underwent duplication after the divergence of Old and New World monkeys.

  13. Finding all sorting tandem duplication random loss operations

    DEFF Research Database (Denmark)

    Bernt, Matthias; Chen, Kuan Yu; Chen, Ming Chiang


    A tandem duplication random loss (TDRL) operation duplicates a contiguous segment of genes, followed by the random loss of one copy of each of the duplicated genes. Although the importance of this operation is founded by several recent biological studies, it has been investigated only rarely from...... a theoretical point of view. Of particular interest are sorting TDRLs which are TDRLs that, when applied to a permutation representing a genome, reduce the distance towards another given permutation. The identification of sorting genome rearrangement operations in general is a key ingredient of many algorithms...

  14. Comparative analysis of Phytophthora genes encoding secreted proteins reveals conserved synteny and lineage-specific gene duplications and deletions

    NARCIS (Netherlands)

    Jiang, R.H.Y.; Tyler, B.M.; Govers, F.


    Comparative analysis of two Phytophthora genomes revealed overall colinearity in four genomic regions consisting of a 1.5-Mb sequence of Phytophthora sojae and a 0.9-Mb sequence of R ramorum. In these regions with conserved synteny, the gene order is largely similar; however, genome rearrangements a

  15. Transcriptional rewiring of the sex determining dmrt1 gene duplicate by transposable elements.

    Directory of Open Access Journals (Sweden)

    Amaury Herpin


    Full Text Available Control and coordination of eukaryotic gene expression rely on transcriptional and posttranscriptional regulatory networks. Evolutionary innovations and adaptations often require rapid changes of such networks. It has long been hypothesized that transposable elements (TE might contribute to the rewiring of regulatory interactions. More recently it emerged that TEs might bring in ready-to-use transcription factor binding sites to create alterations to the promoters by which they were captured. A process where the gene regulatory architecture is of remarkable plasticity is sex determination. While the more downstream components of the sex determination cascades are evolutionary conserved, the master regulators can switch between groups of organisms even on the interspecies level or between populations. In the medaka fish (Oryzias latipes a duplicated copy of dmrt1, designated dmrt1bY or DMY, on the Y chromosome was shown to be the master regulator of male development, similar to Sry in mammals. We found that the dmrt1bY gene has acquired a new feedback downregulation of its expression. Additionally, the autosomal dmrt1a gene is also able to regulate transcription of its duplicated paralog by binding to a unique target Dmrt1 site nested within the dmrt1bY proximal promoter region. We could trace back this novel regulatory element to a highly conserved sequence within a new type of TE that inserted into the upstream region of dmrt1bY shortly after the duplication event. Our data provide functional evidence for a role of TEs in transcriptional network rewiring for sub- and/or neo-functionalization of duplicated genes. In the particular case of dmrt1bY, this contributed to create new hierarchies of sex-determining genes.

  16. Cheetahs have 4 serum amyloid a genes evolved through repeated duplication events. (United States)

    Chen, Lei; Une, Yumi; Higuchi, Keiichi; Mori, Masayuki


    Amyloid A (AA) amyloidosis is a leading cause of mortality in captive cheetahs (Acinonyx jubatus). We performed genome walking and PCR cloning and revealed that cheetahs have 4 SAA genes (provisionally named SAA1A, SAA1B, SAA3A, and SAA3B). In addition, we identified multiple nucleotide polymorphisms in the 4 SAA genes by screening 51 cheetahs. The polymorphisms defined 4, 7, 6, and 4 alleles for SAA1A, SAA3A, SAA1B, and SAA3B, respectively. Pedigree analysis of the inheritance of genotypes for the SAA genes revealed that specific combinations of alleles for the 4 SAA genes cosegregated as a unit (haplotype) in pedigrees, indicating that the 4 genes were linked on the same chromosome. Notably, cheetah SAA1A and SAA1B were highly homologous in their nucleotide sequences. Likewise, SAA3A and SAA3B genes were homologous. These observations suggested a model for the evolution of the 4 SAA genes in cheetahs in which duplication of an ancestral SAA gene first gave rise to SAA1 and SAA3. Subsequently, each gene duplicated one more time, uniquely making 4 genes in the cheetah genome. The monomorphism of the cheetah SAA1A protein might be one of the factors responsible for the high incidence of AA amyloidosis in this species.

  17. Analysis of VH gene rearrangement and somatic hypermutation in type 1 autoimmune pancreatitis. (United States)

    Okumura, Fumihiro; Sakuma, Hidenori; Nakazawa, Takahiro; Hayashi, Kazuki; Naitoh, Itaru; Miyabe, Katsuyuki; Yoshida, Michihiro; Yamashita, Hiroaki; Ohara, Hirotaka; Inagaki, Hiroshi; Joh, Takashi


    Type 1 autoimmune pancreatitis (AIP) is the pancreatic manifestation of systemic fibroinflammatory disease called immunoglobulin G4-associated systemic disease. Although this inflammatory process is considered to be a disease with an autoimmune mechanism, its pathogenesis still remains unclear. To clarify the characteristics of B cells infiltrating the lesion, we analyzed the immunoglobulin heavy chain variable region (VH) gene rearrangement and somatic hypermutation of invasive lymphoid cells in type 1 AIP (n= 3), in comparison with obstructive pancreatitis (n= 3) as a control. DNA was extracted from the affected inflammatory lesions. After PCR amplification of the rearranged VH gene, the clones were subcloned, and recombinant clones were randomly selected and sequenced. More than 60 clones per case were analyzed. Monoclonal VH rearrangement was not detected in any of the cases examined. There was no VH family or VH fragment specific to type 1 AIP and obstructive pancreatitis. However, the rate of unmutated VH fragments in type 1 AIP (17%) was higher than that in obstructive pancreatitis (5.1%) (P= 0.010). Our study suggests that an increased rate of unmutated or less mutated VH genes may be characteristic of type 1 AIP and might play a role in the development of this disease.

  18. Clonal rearrangements of immunoglobulin genes and progression to B cell lymphoma in cutaneous lymphoid hyperplasia. (United States)

    Wood, G S; Ngan, B Y; Tung, R; Hoffman, T E; Abel, E A; Hoppe, R T; Warnke, R A; Cleary, M L; Sklar, J


    Cutaneous lymphoid hyperplasia (CLH) is a disorder characterized by the development of one or more skin lesions containing dense lymphoid infiltrates that exhibit the histopathologic features of a benign, reactive process. Nevertheless, some cases have been associated with the subsequent development of clinically overt lymphomas. This suggests that monoclonal populations may exist in some cases of CLH and that these cases may represent a subset more likely to evolve into lymphoma. To determine if such a subset of CLH can be distinguished, Southern blot analysis of DNA was used to study the immunogenotypic features of lesions from 14 patients with clinical, histopathologic, and immunopathologic findings characteristic of CLH. Five cases exhibited detectable clonal rearrangements of immunoglobulin genes. Furthermore, one of these five cases evolved into overt diffuse large cell lymphoma of B cell lineage during a 2-year follow-up of recurrent disease at the original cutaneous site. The immunoglobulin gene rearrangements of this lymphoma were identical to those of the prior CLH lesion. There was no evidence of detectable t(14;18) chromosomal translocations or clonal rearrangements of the beta gene of the T cell receptor in any case. It was concluded that CLH can be divided into two subsets based on the presence or absence of a clonal B cell population, and that overt lymphoma can arise from the former subset and contain the same B cell clone identified in the pre-existent CLH lesion.

  19. Concomitant duplications of opioid peptide and receptor genes before the origin of jawed vertebrates.

    Directory of Open Access Journals (Sweden)

    Görel Sundström

    Full Text Available BACKGROUND: The opioid system is involved in reward and pain mechanisms and consists in mammals of four receptors and several peptides. The peptides are derived from four prepropeptide genes, PENK, PDYN, PNOC and POMC, encoding enkephalins, dynorphins, orphanin/nociceptin and beta-endorphin, respectively. Previously we have described how two rounds of genome doubling (2R before the origin of jawed vertebrates formed the receptor family. METHODOLOGY/PRINCIPAL FINDINGS: Opioid peptide gene family members were investigated using a combination of sequence-based phylogeny and chromosomal locations of the peptide genes in various vertebrates. Several adjacent gene families were investigated similarly. The results show that the ancestral peptide gene gave rise to two additional copies in the genome doublings. The fourth member was generated by a local gene duplication, as the genes encoding POMC and PNOC are located on the same chromosome in the chicken genome and all three teleost genomes that we have studied. A translocation has disrupted this synteny in mammals. The PDYN gene seems to have been lost in chicken, but not in zebra finch. Duplicates of some peptide genes have arisen in the teleost fishes. Within the prepropeptide precursors, peptides have been lost or gained in different lineages. CONCLUSIONS/SIGNIFICANCE: The ancestral peptide and receptor genes were located on the same chromosome and were thus duplicated concomitantly. However, subsequently genetic linkage has been lost. In conclusion, the system of opioid peptides and receptors was largely formed by the genome doublings that took place early in vertebrate evolution.

  20. Gains, losses and changes of function after gene duplication: study of the metallothionein family.

    Directory of Open Access Journals (Sweden)

    Ana Moleirinho

    Full Text Available Metallothioneins (MT are small proteins involved in heavy metal detoxification and protection against oxidative stress and cancer. The mammalian MT family originated through a series of duplication events which generated four major genes (MT1 to MT4. MT1 and MT2 encode for ubiquitous proteins, while MT3 and MT4 evolved to accomplish specific roles in brain and epithelium, respectively. Herein, phylogenetic, transcriptional and polymorphic analyses are carried out to expose gains, losses and diversification of functions that characterize the evolutionary history of the MT family. The phylogenetic analyses show that all four major genes originated through a single duplication event prior to the radiation of mammals. Further expansion of the MT1 gene has occurred in the primate lineage reaching in humans a total of 13 paralogs, five of which are pseudogenes. In humans, the reading frame of all five MT1 pseudogenes is reconstructed by sequence homology with a functional duplicate revealing that loss of invariant cysteines is the most frequent event accounting for pseudogeneisation. Expression analyses based on EST counts and RT-PCR experiments show that, as for MT1 and MT2, human MT3 is also ubiquitously expressed while MT4 transcripts are present in brain, testes, esophagus and mainly in thymus. Polymorphic variation reveals two deleterious mutations (Cys30Tyr and Arg31Trp in MT4 with frequencies reaching about 30% in African and Asian populations suggesting the gene is inactive in some individuals and physiological compensation for its loss must arise from a functional equivalent. Altogether our findings provide novel data on the evolution and diversification of MT gene duplicates, a valuable resource for understanding the vast set of biological processes in which these proteins are involved.

  1. The effect of functional compensation among duplicate genes can constrain their evolutionary divergence

    Indian Academy of Sciences (India)

    Joseph Esfandiar Hannon Bozorgmehr


    Gene duplicates have the inherent property of initially being functionally redundant. This means that they can compensate for the effect of deleterious variation occurring at one or more sister sites. Here, I present data bearing on evolutionary theory that illustrates the manner in which any functional adaptation in duplicate genes is markedly constrained because of the compensatory utility provided by a sustained genetic redundancy. Specifically, a two-locus epistatic model of paralogous genes was simulated to investigate the degree of purifying selection imposed, and whether this would serve to impede any possible biochemical innovation. Three population sizes were considered to see if, as expected, there was a significant difference in any selection for robustness. Interestingly, physical linkage between tandem duplicates was actually found to increase the probability of any neofunctionalization and the efficacy of selection, contrary to what is expected in the case of singleton genes. The results indicate that an evolutionary trade-off often exists between any functional change under either positive or relaxed selection and the need to compensate for failures due to degenerative mutations, thereby guaranteeing the reliability of protein production.

  2. Assessment and reconstruction of novel HSP90 genes: duplications, gains and losses in fungal and animal lineages.

    Directory of Open Access Journals (Sweden)

    Chrysoula N Pantzartzi

    Full Text Available Hsp90s, members of the Heat Shock Protein class, protect the structure and function of proteins and play a significant task in cellular homeostasis and signal transduction. In order to determine the number of hsp90 gene copies and encoded proteins in fungal and animal lineages and through that key duplication events that this family has undergone, we collected and evaluated Hsp90 protein sequences and corresponding Expressed Sequence Tags and analyzed available genomes from various taxa. We provide evidence for duplication events affecting either single species or wider taxonomic groups. With regard to Fungi, duplicated genes have been detected in several lineages. In invertebrates, we demonstrate key duplication events in certain clades of Arthropoda and Mollusca, and a possible gene loss event in a hymenopteran family. Finally, we infer that the duplication event responsible for the two (a and b isoforms in vertebrates occurred probably shortly after the split of Hyperoartia and Gnathostomata.

  3. The evolution of pepsinogen C genes in vertebrates: duplication, loss and functional diversification.

    Directory of Open Access Journals (Sweden)

    Luís Filipe Costa Castro

    Full Text Available BACKGROUND: Aspartic proteases comprise a large group of enzymes involved in peptide proteolysis. This collection includes prominent enzymes globally categorized as pepsins, which are derived from pepsinogen precursors. Pepsins are involved in gastric digestion, a hallmark of vertebrate physiology. An important member among the pepsinogens is pepsinogen C (Pgc. A particular aspect of Pgc is its apparent single copy status, which contrasts with the numerous gene copies found for example in pepsinogen A (Pga. Although gene sequences with similarity to Pgc have been described in some vertebrate groups, no exhaustive evolutionary framework has been considered so far. METHODOLOGY/PRINCIPAL FINDINGS: By combining phylogenetics and genomic analysis, we find an unexpected Pgc diversity in the vertebrate sub-phylum. We were able to reconstruct gene duplication timings relative to the divergence of major vertebrate clades. Before tetrapod divergence, a single Pgc gene tandemly expanded to produce two gene lineages (Pgbc and Pgc2. These have been differentially retained in various classes. Accordingly, we find Pgc2 in sauropsids, amphibians and marsupials, but not in eutherian mammals. Pgbc was retained in amphibians, but duplicated in the ancestor of amniotes giving rise to Pgb and Pgc1. The latter was retained in mammals and probably in reptiles and marsupials but not in birds. Pgb was kept in all of the amniote clade with independent episodes of loss in some mammalian species. Lineage specific expansions of Pgc2 and Pgbc have also occurred in marsupials and amphibians respectively. We find that teleost and tetrapod Pgc genes reside in distinct genomic regions hinting at a possible translocation. CONCLUSIONS: We conclude that the repertoire of Pgc genes is larger than previously reported, and that tandem duplications have modelled the history of Pgc genes. We hypothesize that gene expansion lead to functional divergence in tetrapods, coincident with the

  4. Duplication of 7q36.3 encompassing the Sonic Hedgehog (SHH) gene is associated with congenital muscular hypertrophy

    DEFF Research Database (Denmark)

    Kroeldrup, L; Kjaergaard, S; Kirchhoff, Eva Maria


    with muscular hypertrophy and mildly retarded psychomotor development. Array-CGH identified a small duplication of 7q36.3 including the Sonic Hedgehog (SHH) gene in both the aborted foetus and the live born male sib. Neither of the parents carried the 7q36.3 duplication. The consequences of overexpression...

  5. Diagnostic value of immunoglobulin κ light chain gene rearrangement analysis in B-cell lymphomas. (United States)

    Kokovic, Ira; Jezersek Novakovic, Barbara; Novakovic, Srdjan


    Analysis of the immunoglobulin κ light chain (IGK) gene is an alternative method for B-cell clonality assessment in the diagnosis of mature B-cell proliferations in which the detection of clonal immunoglobulin heavy chain (IGH) gene rearrangements fails. The aim of the present study was to evaluate the added value of standardized BIOMED-2 assay for the detection of clonal IGK gene rearrangements in the diagnostic setting of suspected B-cell lymphomas. With this purpose, 92 specimens from 80 patients with the final diagnosis of mature B-cell lymphoma (37 specimens), mature T-cell lymphoma (26 specimens) and reactive lymphoid proliferation (29 specimens) were analyzed for B-cell clonality. B-cell clonality analysis was performed using the BIOMED-2 IGH and IGK gene clonality assays. The determined sensitivity of the IGK assay was 67.6%, while the determined sensitivity of the IGH assay was 75.7%. The sensitivity of combined IGH+IGK assay was 81.1%. The determined specificity of the IGK assay was 96.2% in the group of T-cell lymphomas and 96.6% in the group of reactive lesions. The determined specificity of the IGH assay was 84.6% in the group of lymphomas and 86.2% in the group of reactive lesions. The comparison of GeneScan (GS) and heteroduplex pretreatment-polyacrylamide gel electrophoresis (HD-PAGE) methods for the analysis of IGK gene rearrangements showed a higher efficacy of GS analysis in a series of 27 B-cell lymphomas analyzed by both methods. In the present study, we demonstrated that by applying the combined IGH+IGK clonality assay the overall detection rate of B-cell clonality was increased by 5.4%. Thus, we confirmed the added value of the standardized BIOMED-2 IGK assay for assessment of B-cell clonality in suspected B-cell lymphomas with inconclusive clinical and cyto/histological diagnosis.

  6. The butterfly plant arms-race escalated by gene and genome duplications. (United States)

    Edger, Patrick P; Heidel-Fischer, Hanna M; Bekaert, Michaël; Rota, Jadranka; Glöckner, Gernot; Platts, Adrian E; Heckel, David G; Der, Joshua P; Wafula, Eric K; Tang, Michelle; Hofberger, Johannes A; Smithson, Ann; Hall, Jocelyn C; Blanchette, Matthieu; Bureau, Thomas E; Wright, Stephen I; dePamphilis, Claude W; Eric Schranz, M; Barker, Michael S; Conant, Gavin C; Wahlberg, Niklas; Vogel, Heiko; Pires, J Chris; Wheat, Christopher W


    Coevolutionary interactions are thought to have spurred the evolution of key innovations and driven the diversification of much of life on Earth. However, the genetic and evolutionary basis of the innovations that facilitate such interactions remains poorly understood. We examined the coevolutionary interactions between plants (Brassicales) and butterflies (Pieridae), and uncovered evidence for an escalating evolutionary arms-race. Although gradual changes in trait complexity appear to have been facilitated by allelic turnover, key innovations are associated with gene and genome duplications. Furthermore, we show that the origins of both chemical defenses and of molecular counter adaptations were associated with shifts in diversification rates during the arms-race. These findings provide an important connection between the origins of biodiversity, coevolution, and the role of gene and genome duplications as a substrate for novel traits.

  7. A rare case of plastid protein-coding gene duplication in the chloroplast genome of Euglena archaeoplastidiata (Euglenophyta). (United States)

    Bennett, Matthew S; Shiu, Shin-Han; Triemer, Richard E


    Gene duplication is an important evolutionary process that allows duplicate functions to diverge, or, in some cases, allows for new functional gains. However, in contrast to the nuclear genome, gene duplications within the chloroplast are extremely rare. Here, we present the chloroplast genome of the photosynthetic protist Euglena archaeoplastidiata. Upon annotation, it was found that the chloroplast genome contained a novel tandem direct duplication that encoded a portion of RuBisCO large subunit (rbcL) followed by a complete copy of ribosomal protein L32 (rpl32), as well as the associated intergenic sequences. Analyses of the duplicated rpl32 were inconclusive regarding selective pressures, although it was found that substitutions in the duplicated region, all non-synonymous, likely had a neutral functional effect. The duplicated region did not exhibit patterns consistent with previously described mechanisms for tandem direct duplications, and demonstrated an unknown mechanism of duplication. In addition, a comparison of this chloroplast genome to other previously characterized chloroplast genomes from the same family revealed characteristics that indicated E. archaeoplastidiata was probably more closely related to taxa in the genera Monomorphina, Cryptoglena, and Euglenaria than it was to other Euglena taxa. Taken together, the chloroplast genome of E. archaeoplastidiata demonstrated multiple characteristics unique to the euglenoid world, and has justified the longstanding curiosity regarding this enigmatic taxon.

  8. Evaluation of multiplex ligation-dependent probe amplification analysis versus multiplex polymerase chain reaction assays in the detection of dystrophin gene rearrangements in an Iranian population subset

    Directory of Open Access Journals (Sweden)

    Nayereh Nouri


    Full Text Available Background: The Duchenne muscular dystrophy (DMD gene is located in the short arm of the X chromosome (Xp21. It spans 2.4 Mb of the human genomic DNA and is composed of 79 exons. Mutations in the Dystrophin gene result in DMD and Becker muscular dystrophy. In this study, the efficiency of multiplex ligation-dependent probe amplification (MLPA over multiplex polymerase chain reaction (PCR assays in an Iranian population was investigated. Materials and Methods: Multiplex PCR assays and MLPA analysis were carried out in 74 patients affected with DMD. Results: Multiplex PCR detected deletions in 51% of the patients with DMD. MLPA analysis could determine all the deletions detected by the multiplex PCR. Additionally, MLPA was able to identify one more deletion and duplication in patients without detectable mutations by multiplex PCR. Moreover, MLPA precisely determined the exact size of the deletions. Conclusion: Although MLPA analysis is more sensitive for detection of deletions and duplications in the dystrophin gene, multiplex PCR might be used for the initial analysis of the boys affected with DMD in the Iranian population as it was able to detect 95% of the rearrangements in patients with DMD.

  9. Gene duplication of the human peptide YY gene (PYY) generated the pancreatic polypeptide gene (PPY) on chromosome 17q21.1

    Energy Technology Data Exchange (ETDEWEB)

    Hort, Y.; Shine, J.; Herzog, H. [Garvan Inst. of Medical Research, Sydney (Australia)


    Neuropeptide Y (NPY), peptide YY (PYY), and pancreatic polypeptide (PP) are structurally related but functionally diverse peptides, encoded by separate genes and expressed in different tissues. Although the human NPY gene has been mapped to chromosome 7, the authors demonstrate here that the genes for human PYY and PP (PPY) are localized only 10 kb apart from each another on chromosome 17q21.1. The high degree of homology between the members of this gene family, both in primary sequence and exon/intron structure, suggests that the NYP and the PYY genes arose from an initial gene duplication event, with a subsequent tandem duplication of the PYY gene being responsible for the creation of the PPY gene. A second weaker hybridization signal also found on chromosome 17q11 and results obtained by Southern blot analysis suggest that the entire PYY-PPY region has undergone a further duplication event. 27 refs., 5 figs.

  10. Gene duplications and losses among vertebrate deoxyribonucleoside kinases of the non-TK1 Family

    DEFF Research Database (Denmark)

    Mutahir, Zeeshan; Christiansen, Louise Slot; Clausen, Anders R.;


    of the dCK/dGK enzymes encoded by these genes. The two dCK enzymes in G. gallus have broader substrate specificity than their human or X. laevis counterparts. Additionally, the duplicated dCK enzyme in G. gallus might have become mitochondria. Based on our study we postulate that changing and adapting...... substrate specificities and subcellular localization are likely the drivers behind the evolution of vertebrate dNKs...

  11. Whole-Genome Duplications Spurred the Functional Diversification of the Globin Gene Superfamily in Vertebrates


    Hoffmann, Federico G.; Opazo, Juan C; Storz, Jay F.


    It has been hypothesized that two successive rounds of whole-genome duplication (WGD) in the stem lineage of vertebrates provided genetic raw materials for the evolutionary innovation of many vertebrate-specific features. However, it has seldom been possible to trace such innovations to specific functional differences between paralogous gene products that derive from a WGD event. Here, we report genomic evidence for a direct link between WGD and key physiological innovations in the vertebrate...

  12. Adaptations to endosymbiosis in a cnidarian-dinoflagellate association: differential gene expression and specific gene duplications.

    Directory of Open Access Journals (Sweden)

    Philippe Ganot


    Full Text Available Trophic endosymbiosis between anthozoans and photosynthetic dinoflagellates forms the key foundation of reef ecosystems. Dysfunction and collapse of symbiosis lead to bleaching (symbiont expulsion, which is responsible for the severe worldwide decline of coral reefs. Molecular signals are central to the stability of this partnership and are therefore closely related to coral health. To decipher inter-partner signaling, we developed genomic resources (cDNA library and microarrays from the symbiotic sea anemone Anemonia viridis. Here we describe differential expression between symbiotic (also called zooxanthellate anemones or aposymbiotic (also called bleached A. viridis specimens, using microarray hybridizations and qPCR experiments. We mapped, for the first time, transcript abundance separately in the epidermal cell layer and the gastrodermal cells that host photosynthetic symbionts. Transcriptomic profiles showed large inter-individual variability, indicating that aposymbiosis could be induced by different pathways. We defined a restricted subset of 39 common genes that are characteristic of the symbiotic or aposymbiotic states. We demonstrated that transcription of many genes belonging to this set is specifically enhanced in the symbiotic cells (gastroderm. A model is proposed where the aposymbiotic and therefore heterotrophic state triggers vesicular trafficking, whereas the symbiotic and therefore autotrophic state favors metabolic exchanges between host and symbiont. Several genetic pathways were investigated in more detail: i a key vitamin K-dependant process involved in the dinoflagellate-cnidarian recognition; ii two cnidarian tissue-specific carbonic anhydrases involved in the carbon transfer from the environment to the intracellular symbionts; iii host collagen synthesis, mostly supported by the symbiotic tissue. Further, we identified specific gene duplications and showed that the cnidarian-specific isoform was also up-regulated both

  13. Specific amplification by PCR of rearranged genomic variable regions of immunoglobulin genes from mouse hybridoma cells. (United States)

    Berdoz, J; Monath, T P; Kraehenbuhl, J P


    We have designed a novel strategy for the isolation of the rearranged genomic fragments encoding the L-VH-D-JH and L-V kappa/lambda-J kappa/lambda regions of mouse immunoglobulin genes. This strategy is based on the PCR amplification of genomic DNA from mouse hybridomas using multiple specific primers chosen in the 5'-untranslated region and in the intron downstream of the rearranged JH/J kappa/lambda sequences. Variable regions with intact coding sequences, including full-length leader peptides (L) can be obtained without previous DNA sequencing. Our strategy is based on a genomic template that produces fragments that do not need to be adapted for recombinant antibody expression, thus facilitating the generation of chimeric and isotype-switched immunoglobulins.

  14. Sub-functionalization to ovule development following duplication of a floral organ identity gene. (United States)

    Galimba, Kelsey D; Di Stilio, Verónica S


    Gene duplications result in paralogs that may be maintained due to the gain of novel functions (neo-functionalization) or the partitioning of ancestral function (sub-functionalization). Plant genomes are especially prone to duplication; paralogs are particularly widespread in the floral MADS box transcription factors that control organ identity through the ABC model of flower development. C class genes establish stamen and carpel identity and control floral meristem determinacy, and are largely conserved across the angiosperm phylogeny. Originally, an additional D class had been identified as controlling ovule identity; yet subsequent studies indicated that both C and D lineage genes more commonly control ovule development redundantly. The ranunculid Thalictrum thalictroides has two orthologs of the Arabidopsis thaliana C class gene AGAMOUS (AG), ThtAG1 and ThtAG2 (Thalictrum thalictroides AGAMOUS1/2). We previously showed that ThtAG1 exhibits typical C class function; here we examine the role of its paralog, ThtAG2. Our phylogenetic analysis shows that ThtAG2 falls within the C lineage, together with ThtAG1, and is consistent with previous findings of a Ranunculales-specific duplication in this clade. However, ThtAG2 is not expressed in stamens, but rather solely in carpels and ovules. This female-specific expression pattern is consistent with D lineage genes, and with other C lineage genes known to be involved in ovule identity. Given the divergent expression of ThtAG2, we tested the hypothesis that it has acquired ovule identity function. Molecular evolution analyses showed evidence of positive selection on ThtAG2-a pattern that supports divergence of function by sub-functionalization. Down-regulation of ThtAG2 by virus-induced gene silencing resulted in homeotic conversions of ovules into carpel-like structures. Taken together, our results suggest that, although ThtAG2 falls within the C lineage, it has diverged to acquire "D function" as an ovule identity gene

  15. Comparative genomic organization and tissue-specific transcription of the duplicated fabp7 and fabp10 genes in teleost fishes. (United States)

    Parmar, Manoj B; Wright, Jonathan M


    A whole-genome duplication (WGD) early in the teleost fish lineage makes fish ideal organisms to study the fate of duplicated genes and underlying evolutionary trajectories that have led to the retention of ohnologous gene duplicates in fish genomes. Here, we compare the genomic organization and tissue-specific transcription of the ohnologous fabp7 and fabp10 genes in medaka, three-spined stickleback, and spotted green pufferfish to the well-studied duplicated fabp7 and fabp10 genes of zebrafish. Teleost fabp7 and fabp10 genes contain four exons interrupted by three introns. Polypeptide sequences of Fabp7 and Fabp10 show the highest sequence identity and similarity with their orthologs from vertebrates. Orthology was evident as the ohnologous Fabp7 and Fabp10 polypeptides of teleost fishes each formed distinct clades and clustered together with their orthologs from other vertebrates in a phylogenetic tree. Furthermore, ohnologous teleost fabp7 and fabp10 genes exhibit conserved gene synteny with human FABP7 and chicken FABP10, respectively, which provides compelling evidence that the duplicated fabp7 and fabp10 genes of teleost fishes most likely arose from the well-documented WGD. The tissue-specific distribution of fabp7a, fabp7b, fabp10a, and fabp10b transcripts provides evidence of diverged spatial transcriptional regulation between ohnologous gene duplicates of fabp7 and fabp10 in teleost fishes.

  16. Evolution history of duplicated smad3 genes in teleost: insights from Japanese flounder, Paralichthys olivaceus

    Directory of Open Access Journals (Sweden)

    Xinxin Du


    Full Text Available Following the two rounds of whole-genome duplication (WGD during deuterosome evolution, a third genome duplication occurred in the ray-fined fish lineage and is considered to be responsible for the teleost-specific lineage diversification and regulation mechanisms. As a receptor-regulated SMAD (R-SMAD, the function of SMAD3 was widely studied in mammals. However, limited information of its role or putative paralogs is available in ray-finned fishes. In this study, two SMAD3 paralogs were first identified in the transcriptome and genome of Japanese flounder (Paralichthys olivaceus. We also explored SMAD3 duplication in other selected species. Following identification, genomic structure, phylogenetic reconstruction, and synteny analyses performed by MrBayes and online bioinformatic tools confirmed that smad3a/3b most likely originated from the teleost-specific WGD. Additionally, selection pressure analysis and expression pattern of the two genes performed by PAML and quantitative real-time PCR (qRT-PCR revealed evidence of subfunctionalization of the two SMAD3 paralogs in teleost. Our results indicate that two SMAD3 genes originate from teleost-specific WGD, remain transcriptionally active, and may have likely undergone subfunctionalization. This study provides novel insights to the evolution fates of smad3a/3b and draws attentions to future function analysis of SMAD3 gene family.

  17. Some novel intron positions in conserved Drosophila genes are caused by intron sliding or tandem duplication

    Directory of Open Access Journals (Sweden)

    Stadler Peter F


    Full Text Available Abstract Background Positions of spliceosomal introns are often conserved between remotely related genes. Introns that reside in non-conserved positions are either novel or remnants of frequent losses of introns in some evolutionary lineages. A recent gain of such introns is difficult to prove. However, introns verified as novel are needed to evaluate contemporary processes of intron gain. Results We identified 25 unambiguous cases of novel intron positions in 31 Drosophila genes that exhibit near intron pairs (NIPs. Here, a NIP consists of an ancient and a novel intron position that are separated by less than 32 nt. Within a single gene, such closely-spaced introns are very unlikely to have coexisted. In most cases, therefore, the ancient intron position must have disappeared in favour of the novel one. A survey for NIPs among 12 Drosophila genomes identifies intron sliding (migration as one of the more frequent causes of novel intron positions. Other novel introns seem to have been gained by regional tandem duplications of coding sequences containing a proto-splice site. Conclusions Recent intron gains sometimes appear to have arisen by duplication of exonic sequences and subsequent intronization of one of the copies. Intron migration and exon duplication together may account for a significant amount of novel intron positions in conserved coding sequences.

  18. Characterization of gene rearrangements resulted from genomic structural aberrations in human esophageal squamous cell carcinoma KYSE150 cells. (United States)

    Hao, Jia-Jie; Gong, Ting; Zhang, Yu; Shi, Zhi-Zhou; Xu, Xin; Dong, Jin-Tang; Zhan, Qi-Min; Fu, Song-Bin; Wang, Ming-Rong


    Chromosomal rearrangements and involved genes have been reported to play important roles in the development and progression of human malignancies. But the gene rearrangements in esophageal squamous cell carcinoma (ESCC) remain to be identified. In the present study, array-based comparative genomic hybridization (array-CGH) was performed on the ESCC cell line KYSE150. Eight disrupted genes were detected according to the obviously distinct unbalanced breakpoints. The splitting of these genes was validated by dual-color fluorescence in-situ hybridization (FISH). By using rapid amplification of cDNA ends (RACE), genome walking and sequencing analysis, we further identified gene disruptions and rearrangements. A fusion transcript DTL-1q42.2 was derived from an intrachromosomal rearrangement of chromosome 1. Highly amplified segments of DTL and PTPRD were self-rearranged. The sequences on either side of the junctions possess micro-homology with each other. FISH results indicated that the split DTL and PTPRD were also involved in comprising parts of the derivative chromosomes resulted from t(1q;9p;12p) and t(9;1;9). Further, we found that regions harboring DTL (1q32.3) and PTPRD (9p23) were also splitting in ESCC tumors. The data supplement significant information on the existing genetic background of KYSE150, which may be used as a model for studying these gene rearrangements.

  19. Gene conversion-like events in the diversification of human rearranged IGHV3-23*01 gene sequences

    Directory of Open Access Journals (Sweden)

    Bhargavi eDuvvuri


    Full Text Available Gene conversion (GCV as a mechanism of immunoglobulin diversification is well established in a few species. However, definitive evidence of GCV-like events in human immunoglobulin genes is scarce. GCV is mediated by activation-induced cytidine deaminase (AID. The lack of evidence of GCV in human rearranged immunoglobulin gene sequences is puzzling given the presence of highly similar germline donors and all the enzymatic machinery required for GCV. In this study, we undertook a computational analysis of rearranged IGHV3-23*01 gene sequences from common variable immunodeficiency (CVID patients and healthy individuals to survey ‘GCV-like’ activities. Our search identified strong evidence of GCV-like patterns. Germline VH sequences were identified as potential donors for clustered mutations in rearranged IGHV3-23*01 gene sequences. We identified minimum and maximum sequence identities between donor and recipient sequences that can serve as targets for GCV and our findings are consistent with those reported in literature. We observed that GCV-like tracts are flanked by activation-induced cytidine deaminase (AID hotspot motifs. Structural modeling of IGHV3-23*01 gene sequence revealed that hypermutable bases flanking GCV-like tracts, are in the single stranded DNA (ssDNA of stable stem-loop structures (SLSs. SsDNA is inherently fragile and also an optimal target for AID. We speculate that GCV could have been initiated by the targeting of hypermutable bases in ssDNA state in stable SLSs, plausibly by AID. We have observed that the frequency of GCV-like events is significantly higher in rearranged IGHV323-*01 sequences from healthy individuals compared to that of CVID patients. GCV, unlike SHM, can result in multiple base substitutions that can alter many amino acids. The extensive changes in antibody affinity by GCV-like events, as identified in this study would be instrumental in protecting humans against pathogens that diversify their genome by

  20. Diagnostic tool for the identification of MLL rearrangements including unknown partner genes (United States)

    Meyer, Claus; Schneider, Bjoern; Reichel, Martin; Angermueller, Sieglinde; Strehl, Sabine; Schnittger, Susanne; Schoch, Claudia; Jansen, Mieke W. J. C.; van Dongen, Jacques J.; Pieters, Rob; Haas, Oskar A.; Dingermann, Theo; Klingebiel, Thomas; Marschalek, Rolf


    Approximately 50 different chromosomal translocations of the human MLL gene are currently known and associated with high-risk acute leukemia. The large number of different MLL translocation partner genes makes a precise diagnosis a demanding task. After their cytogenetic identification, only the most common MLL translocations are investigated by RT-PCR analyses, whereas infrequent or unknown MLL translocations are excluded from further analyses. Therefore, we aimed at establishing a method that enables the detection of any MLL rearrangement by using genomic DNA isolated from patient biopsy material. This goal was achieved by establishing a universal long-distance inverse-PCR approach that allows the identification of any kind of MLL rearrangement if located within the breakpoint cluster region. This method was applied to biopsy material derived from 40 leukemia patients known to carry MLL abnormalities. Thirty-six patients carried known MLL fusions (34 with der(11) and 2 with reciprocal alleles), whereas 3 patients were found to carry novel MLL fusions to ACACA, SELB, and SMAP1, respectively. One patient carried a genomic fusion between MLL and TIRAP, resulting from an interstitial deletion. Because of this interstitial deletion, portions of the MLL and TIRAP genes were deleted, together with 123 genes located within the 13-Mbp interval between both chromosomal loci. Therefore, this previously undescribed diagnostic tool has been proven successful for analyzing any MLL rearrangement including previously unrecognized partner genes. Furthermore, the determined patient-specific fusion sequences are useful for minimal residual disease monitoring of MLL associated acute leukemias. PMID:15626757

  1. Sox genes in grass carp (Ctenopharyngodon idella with their implications for genome duplication and evolution

    Directory of Open Access Journals (Sweden)

    Tong Jingou


    Full Text Available Abstract The Sox gene family is found in a broad range of animal taxa and encodes important gene regulatory proteins involved in a variety of developmental processes. We have obtained clones representing the HMG boxes of twelve Sox genes from grass carp (Ctenopharyngodon idella, one of the four major domestic carps in China. The cloned Sox genes belong to group B1, B2 and C. Our analyses show that whereas the human genome contains a single copy of Sox4, Sox11 and Sox14, each of these genes has two co-orthologs in grass carp, and the duplication of Sox4 and Sox11 occurred before the divergence of grass carp and zebrafish, which support the "fish-specific whole-genome duplication" theory. An estimation for the origin of grass carp based on the molecular clock using Sox1, Sox3 and Sox11 genes as markers indicates that grass carp (subfamily Leuciscinae and zebrafish (subfamily Danioninae diverged approximately 60 million years ago. The potential uses of Sox genes as markers in revealing the evolutionary history of grass carp are discussed.


    Institute of Scientific and Technical Information of China (English)

    XIANG Zhi-fu; LU Yu-ying; LAI Yong-rong; CHEN Yan; LI Hui-yu; ZOU Ping


    Objective: To develop a sensitive method to detect minimal residual disease and to elucidate the significance of bcl-2 gene rearrangement in diagnosis and treatment of malignant lymphoma. Methods: Using polymerase chain reaction (PCR) to detect bcl-2 gene rearrangement and using serial dilution method to define the sensitivity of PCR. Results: In 9 different malignant lymphoma cell lines, Su-DHL-4 and Su-DHL-6 were shown bcl-2(MBR)/JH rearrangement, the sensitivity of PCR was 1:105. In 16 patients with follicular lymphoma, the peripheral blood and bone marrow were PCR positive in 4 cases both at initial diagnosis and after complete remission. Conclusion:Detection of bcl-2 gene rearrangement by PCR provides a sensitive and specific assay of minimal residual disease.It is helpful to improve staging of disease, prognosis and evaluation of the treatment results.

  3. Large genomic rearrangement of BRCA1 and BRCA2 genes in familial breast cancer patients in Korea. (United States)

    Cho, Ja Young; Cho, Dae-Yeon; Ahn, Sei Hyun; Choi, Su-Youn; Shin, Inkyung; Park, Hyun Gyu; Lee, Jong Won; Kim, Hee Jeong; Yu, Jong Han; Ko, Beom Seok; Ku, Bo Kyung; Son, Byung Ho


    We screened large genomic rearrangements of the BRCA1 and BRCA2 genes in Korean, familial breast cancer patients. Multiplex ligation-dependent probe amplification assay was used to identify BRCA1 and BRCA2 genomic rearrangements in 226 Korean familial breast cancer patients with risk factors for BRCA1 and BRCA2 mutations, who previously tested negative for point mutations in the two genes. We identified only one large deletion (c.4186-1593_4676-1465del) in BRCA1. No large rearrangements were found in BRCA2. Our result indicates that large genomic rearrangement in the BRCA1 and BRCA2 genes does not seem like a major determinant of breast cancer susceptibility in the Korean population. A large-scale study needs to validate our result in Korea.

  4. Divergent Evolutionary Patterns of NAC Transcription Factors Are Associated with Diversification and Gene Duplications in Angiosperm. (United States)

    Jin, Xiaoli; Ren, Jing; Nevo, Eviatar; Yin, Xuegui; Sun, Dongfa; Peng, Junhua


    NAC (NAM/ATAF/CUC) proteins constitute one of the biggest plant-specific transcription factor (TF) families and have crucial roles in diverse developmental programs during plant growth. Phylogenetic analyses have revealed both conserved and lineage-specific NAC subfamilies, among which various origins and distinct features were observed. It is reasonable to hypothesize that there should be divergent evolutionary patterns of NAC TFs both between dicots and monocots, and among NAC subfamilies. In this study, we compared the gene duplication and loss, evolutionary rate, and selective pattern among non-lineage specific NAC subfamilies, as well as those between dicots and monocots, through genome-wide analyses of sequence and functional data in six dicot and five grass lineages. The number of genes gained in the dicot lineages was much larger than that in the grass lineages, while fewer gene losses were observed in the grass than that in the dicots. We revealed (1) uneven constitution of Clusters of Orthologous Groups (COGs) and contrasting birth/death rates among subfamilies, and (2) two distinct evolutionary scenarios of NAC TFs between dicots and grasses. Our results demonstrated that relaxed selection, resulting from concerted gene duplications, may have permitted substitutions responsible for functional divergence of NAC genes into new lineages. The underlying mechanism of distinct evolutionary fates of NAC TFs shed lights on how evolutionary divergence contributes to differences in establishing NAC gene subfamilies and thus impacts the distinct features between dicots and grasses.

  5. Divergent Evolutionary Patterns of NAC Transcription Factors Are Associated with Diversification and Gene Duplications in Angiosperm

    Directory of Open Access Journals (Sweden)

    Xiaoli Jin


    Full Text Available NAC (NAM/ATAF/CUC proteins constitute one of the biggest plant-specific transcription factor (TF families and have crucial roles in diverse developmental programs during plant growth. Phylogenetic analyses have revealed both conserved and lineage-specific NAC subfamilies, among which various origins and distinct features were observed. It is reasonable to hypothesize that there should be divergent evolutionary patterns of NAC TFs both between dicots and monocots, and among NAC subfamilies. In this study, we compared the gene duplication and loss, evolutionary rate, and selective pattern among non-lineage specific NAC subfamilies, as well as those between dicots and monocots, through genome-wide analyses of sequence and functional data in six dicot and five grass lineages. The number of genes gained in the dicot lineages was much larger than that in the grass lineages, while fewer gene losses were observed in the grass than that in the dicots. We revealed (1 uneven constitution of Clusters of Orthologous Groups (COGs and contrasting birth/death rates among subfamilies, and (2 two distinct evolutionary scenarios of NAC TFs between dicots and grasses. Our results demonstrated that relaxed selection, resulting from concerted gene duplications, may have permitted substitutions responsible for functional divergence of NAC genes into new lineages. The underlying mechanism of distinct evolutionary fates of NAC TFs shed lights on how evolutionary divergence contributes to differences in establishing NAC gene subfamilies and thus impacts the distinct features between dicots and grasses.

  6. Automation of ALK gene rearrangement testing with fluorescence in situ hybridization (FISH): a feasibility study. (United States)

    Zwaenepoel, Karen; Merkle, Dennis; Cabillic, Florian; Berg, Erica; Belaud-Rotureau, Marc-Antoine; Grazioli, Vittorio; Herelle, Olga; Hummel, Michael; Le Calve, Michele; Lenze, Dido; Mende, Stefanie; Pauwels, Patrick; Quilichini, Benoit; Repetti, Elena


    In the past several years we have observed a significant increase in our understanding of molecular mechanisms that drive lung cancer. Specifically in the non-small cell lung cancer sub-types, ALK gene rearrangements represent a sub-group of tumors that are targetable by the tyrosine kinase inhibitor Crizotinib, resulting in significant reductions in tumor burden. Phase II and III clinical trials were performed using an ALK break-apart FISH probe kit, making FISH the gold standard for identifying ALK rearrangements in patients. FISH is often considered a labor and cost intensive molecular technique, and in this study we aimed to demonstrate feasibility for automation of ALK FISH testing, to improve laboratory workflow and ease of testing. This involved automation of the pre-treatment steps of the ALK assay using various protocols on the VP 2000 instrument, and facilitating automated scanning of the fluorescent FISH specimens for simplified enumeration on various backend scanning and analysis systems. The results indicated that ALK FISH can be automated. Significantly, both the Ikoniscope and BioView system of automated FISH scanning and analysis systems provided a robust analysis algorithm to define ALK rearrangements. In addition, the BioView system facilitated consultation of difficult cases via the internet.

  7. The role of human-specific gene duplications during brain development and evolution. (United States)

    Sassa, Takayuki


    One of the most fascinating questions in evolutionary biology is how traits unique to humans, such as their high cognitive abilities, erect bipedalism, and hairless skin, are encoded in the genome. Recent advances in genomics have begun to reveal differences between the genomes of the great apes. It has become evident that one of the many mutation types, segmental duplication, has drastically increased in the primate genomes, and most remarkably in the human genome. Genes contained in these segmental duplications have a tremendous potential to cause genetic innovation, probably accounting for the acquisition of human-specific traits. In this review, I begin with an overview of the genes, which have increased their copy number specifically in the human lineage, following its separation from the common ancestor with our closest living relative, the chimpanzee. Then, I introduce the recent experimental approaches, focusing on SRGAP2, which has been partially duplicated, to elucidate the role of SRGAP2 protein and its human-specific paralogs in human brain development and evolution.

  8. The Role of Cis-Regulatory Motifs and Genetical Control of Expression in the Divergence of Yeast Duplicate Genes

    National Research Council Canada - National Science Library

    Leach, Lindsey J; Zhang, Ze; Lu, Chenqi; Kearsey, Michael J; Luo, Zewei


    Expression divergence of duplicate genes is widely believed to be important for their retention and evolution of new function, although the mechanism that determines their expression divergence remains unclear...

  9. Evolution of paralogous genes: Reconstruction of genome rearrangements through comparison of multiple genomes within Staphylococcus aureus. (United States)

    Tsuru, Takeshi; Kawai, Mikihiko; Mizutani-Ui, Yoko; Uchiyama, Ikuo; Kobayashi, Ichizo


    Analysis of evolution of paralogous genes in a genome is central to our understanding of genome evolution. Comparison of closely related bacterial genomes, which has provided clues as to how genome sequences evolve under natural conditions, would help in such an analysis. With species Staphylococcus aureus, whole-genome sequences have been decoded for seven strains. We compared their DNA sequences to detect large genome polymorphisms and to deduce mechanisms of genome rearrangements that have formed each of them. We first compared strains N315 and Mu50, which make one of the most closely related strain pairs, at the single-nucleotide resolution to catalogue all the middle-sized (more than 10 bp) to large genome polymorphisms such as indels and substitutions. These polymorphisms include two paralogous gene sets, one in a tandem paralogue gene cluster for toxins in a genomic island and the other in a ribosomal RNA operon. We also focused on two other tandem paralogue gene clusters and type I restriction-modification (RM) genes on the genomic islands. Then we reconstructed rearrangement events responsible for these polymorphisms, in the paralogous genes and the others, with reference to the other five genomes. For the tandem paralogue gene clusters, we were able to infer sequences for homologous recombination generating the change in the repeat number. These sequences were conserved among the repeated paralogous units likely because of their functional importance. The sequence specificity (S) subunit of type I RM systems showed recombination, likely at the homology of a conserved region, between the two variable regions for sequence specificity. We also noticed novel alleles in the ribosomal RNA operons and suggested a role for illegitimate recombination in their formation. These results revealed importance of recombination involving long conserved sequence in the evolution of paralogous genes in the genome.

  10. Historical profiling of maize duplicate genes sheds light on the evolution of C4 photosynthesis in grasses. (United States)

    Chang, Yao-Ming; Chang, Chia-Lin; Li, Wen-Hsiung; Shih, Arthur Chun-Chieh


    C4 plants evolved from C3 plants through a series of complex evolutionary steps. On the basis of the evolution of key C4 enzyme genes, the evolution of C4 photosynthesis has been considered a story of gene/genome duplications and subsequent modifications of gene function. If whole-genome duplication has contributed to the evolution of C4 photosynthesis, other genes should have been duplicated together with these C4 genes. However, which genes were co-duplicated with C4 genes and whether they have also played a role in C4 evolution are largely unknown. In this study, we developed a simple method to characterize the historical profile of the paralogs of a gene by tracing back to the most recent common ancestor (MRCA) of the gene and its paralog(s) and then counting the number of paralogs at each MRCA. We clustered the genes into clusters with similar duplication profiles and inferred their functional enrichments. Applying our method to maize, a familiar C4 plant, we identified many genes that show similar duplication profiles with those of the key C4 enzyme genes and found that the functional preferences of the C4 gene clusters are not only similar to those identified by an experimental approach in a recent study but also highly consistent with the functions required for the C4 photosynthesis evolutionary model proposed by S.F. Sage. Some of these genes might have co-evolved with the key C4 enzyme genes to increase the strength of C4 photosynthesis. Moreover, our results suggested that most key C4 enzyme genes had different origins and have undergone a long evolutionary process before the emergence of C4 grasses (Andropogoneae), consistent with the conclusion proposed by previous authors. Copyright © 2012 Elsevier Inc. All rights reserved.

  11. Identification of genes that are essential to restrict genome duplication to once per cell division (United States)

    Vassilev, Alex; Lee, Chrissie Y.; Vassilev, Boris; Zhu, Wenge; Ormanoglu, Pinar; Martin, Scott E.; DePamphilis, Melvin L.


    Nuclear genome duplication is normally restricted to once per cell division, but aberrant events that allow excess DNA replication (EDR) promote genomic instability and aneuploidy, both of which are characteristics of cancer development. Here we provide the first comprehensive identification of genes that are essential to restrict genome duplication to once per cell division. An siRNA library of 21,584 human genes was screened for those that prevent EDR in cancer cells with undetectable chromosomal instability. Candidates were validated by testing multiple siRNAs and chemical inhibitors on both TP53+ and TP53- cells to reveal the relevance of this ubiquitous tumor suppressor to preventing EDR, and in the presence of an apoptosis inhibitor to reveal the full extent of EDR. The results revealed 42 genes that prevented either DNA re-replication or unscheduled endoreplication. All of them participate in one or more of eight cell cycle events. Seventeen of them have not been identified previously in this capacity. Remarkably, 14 of the 42 genes have been shown to prevent aneuploidy in mice. Moreover, suppressing a gene that prevents EDR increased the ability of the chemotherapeutic drug Paclitaxel to induce EDR, suggesting new opportunities for synthetic lethalities in the treatment of human cancers. PMID:27144335

  12. The maize auxotrophic mutant orange pericarp is defective in duplicate genes for tryptophan synthase beta. (United States)

    Wright, A D; Moehlenkamp, C A; Perrot, G H; Neuffer, M G; Cone, K C


    orange pericarp (orp) is a seedling lethal mutant of maize caused by mutations in the duplicate unlinked recessive loci orp1 and orp2. Mutant seedlings accumulate two tryptophan precursors, anthranilate and indole, suggesting a block in tryptophan biosynthesis. Results from feeding studies and enzyme assays indicate that the orp mutant is defective in tryptophan synthase beta activity. Thus, orp is one of only a few amino acid auxotrophic mutants to be characterized in plants. Two genes encoding tryptophan synthase beta were isolated from maize and sequenced. Both genes encode polypeptides with high homology to tryptophan synthase beta enzymes from other organisms. The cloned genes were mapped by restriction fragment length polymorphism analysis to approximately the same chromosomal locations as the genetically mapped factors orp1 and orp2. RNA analysis indicates that both genes are expressed in all tissues examined from normal plants. Together, the biochemical, genetic, and molecular data verify the identity of orp1 and orp2 as duplicate structural genes for the beta subunit of tryptophan synthase.

  13. Adaptive evolution after gene duplication in alpha-KT x 14 subfamily from Buthus martensii Karsch. (United States)

    Cao, Zhijian; Mao, Xin; Xu, Xiuling; Sheng, Jiqun; Dai, Chao; Wu, Yingliang; Luo, Feng; Sha, Yonggang; Jiang, Dahe; Li, Wenxin


    A series of isoforms of alpha-KT x 14 (short chain potassium channel scorpion toxins) were isolated from the venom of Buthus martensii Karsch by RACE and screening cDNA library methods. These isoforms adding BmKK1--3 and BmSKTx1--2 together shared high homology (more than 97%) with each other. The result of genomic sequence analysis showed that a length 79 bp intron is inserted Ala codes between the first and the second base at the 17th amino acid of signal peptide. The introns of these isoforms also share high homology with those of BmKK2 and BmSKT x 1 reported previously. Sequence analysis of many clones of cDNA and genomic DNA showed that a species population or individual polymorphism of alpha-KT x 14 genes took place in scorpion Buthus martensii Karsch and accelerated evolution played an important role in the forming process of alpha-KT x 14 scorpion toxins subfamily. The result of southern hybridization indicated that alpha-KT x 14 toxin genes existed in scorpion chromosome with multicopies. All findings maybe provided an important evidence for an extensive evolutionary process of the scorpion "pharmacological factory": at the early course of evolution, the ancestor toxic gene duplicated into a series of multicopy genes integrated at the different chromosome; at the late course of evolution, subsequent functional divergence of duplicate genes was generated by mutations, deletions and insertion.

  14. D-HPLC analysis of the entire FLT3 gene in MLL rearranged and hyperdiploid acute lymphoblastic leukemia. (United States)

    Stam, Ronald W; den Boer, Monique L; Schneider, Pauline; Meier, Marrit; Beverloo, H Berna; Pieters, Rob


    MLL rearranged and hyperdiploid acute lymphoblastic leukemia (ALL) are characterized by high-level FLT3 expression and constitutive FLT3 activation. As known activating FLT3 mutations are often absent in these patients, we screened the entire FLT3 coding sequence in MLL rearranged and hyperdiploid ALL cases for yet unidentified additional genetic alterations using denaturing D-HPLC. Both in MLL rearranged and hyperdiploid ALL we found that a small minority of samples, 7% and 10% respectively, carried genetic alterations. Although some of these alterations may induce FLT3 activation, the majority of these patients carry wild-type FLT3 genes.

  15. The mitochondrial genome of Iberobaenia (Coleoptera: Iberobaeniidae): first rearrangement of protein-coding genes in the beetles. (United States)

    Andujar, Carmelo; Arribas, Paula; Linard, Benjamin; Kundrata, Robin; Bocak, Ladislav; Vogler, Alfried P


    The complete mitochondrial genome of the recently discovered beetle family Iberobaeniidae is described and compared with known coleopteran mitogenomes. The mitochondrial sequence was obtained by shotgun metagenomic sequencing using the Illumina Miseq technology and resulted in an average coverage of 130 × and a minimum coverage of 35×. The mitochondrial genome of Iberobaeniidae includes 13 protein-coding genes, 2 rRNAs, 22 tRNAs genes, and 1 putative control region, and showed a unique rearrangement of protein-coding genes. This is the first rearrangement affecting the relative position of protein-coding and ribosomal genes reported for the order Coleoptera.

  16. A duplicated coxI gene is associated with cytoplasmic male sterility in an alloplasmic Brassica juncea line derived from somatic hybridization with Diplotaxis catholica

    Indian Academy of Sciences (India)

    Aruna Pathania; Rajesh Kumar; V. Dinesh Kumar; Ashutosh; K. K. Dwivedi; P. B. Kirti; P. Prakash; V. L. Chopra; S. R. Bhat


    A cytoplasmic male sterile (CMS) line of Brassica juncea was derived by repeated backcrossing of the somatic hybrid (Diplotaxis catholica + B. juncea) to B. juncea. The new CMS line is comparable to euplasmic lines for almost all characters, except for flowers which bear slender, needle-like anthers with aborted pollen. Detailed Southern analysis revealed two copies of coxI gene in the CMS line. One copy, coxI-1 is similar to the coxI gene of B. juncea, whereas the second copy, coxI-2 is present in a novel rearranged region. Northern analysis with eight mitochondrial gene probes showed altered transcript pattern only for the coxI gene. Two transcripts of 2.0 and 2.4 kb, respectively, were detected in the CMS line. The novel 2.4 kb transcript was present in floral bud tissue but absent in the leaf tissue. In plants where male sterility broke down under high temperature during the later part of the growing season, the 2.4 kb coxI transcript was absent, which suggested its association with the CMS. The two coxI genes from the CMS line showed two amino acid changes in the coding region. The novel coxI gene showed unique repeats in the 5′ region suggesting recombination of mitochondrial genomes of the two species. The possible role of the duplicated coxI gene in causing male sterility is discussed.

  17. Impact of duplicate gene copies on phylogenetic analysis and divergence time estimates in butterflies

    Directory of Open Access Journals (Sweden)

    Liswi Saif W


    Full Text Available Abstract Background The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Results Sequences from ultraviolet-sensitive (UVRh, blue-sensitive (BRh, and long-wavelength sensitive (LWRh opsins,EF-1α and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total. Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3–35.6 Mya was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7–13.6 Mya, and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5–26.1 Mya. Our family-level results are congruent with

  18. Whole genome sequencing of field isolates reveals a common duplication of the Duffy binding protein gene in Malagasy Plasmodium vivax strains.

    Directory of Open Access Journals (Sweden)

    Didier Menard


    Full Text Available BACKGROUND: Plasmodium vivax is the most prevalent human malaria parasite, causing serious public health problems in malaria-endemic countries. Until recently the Duffy-negative blood group phenotype was considered to confer resistance to vivax malaria for most African ethnicities. We and others have reported that P. vivax strains in African countries from Madagascar to Mauritania display capacity to cause clinical vivax malaria in Duffy-negative people. New insights must now explain Duffy-independent P. vivax invasion of human erythrocytes. METHODS/PRINCIPAL FINDINGS: Through recent whole genome sequencing we obtained ≥ 70× coverage of the P. vivax genome from five field-isolates, resulting in ≥ 93% of the Sal I reference sequenced at coverage greater than 20×. Combined with sequences from one additional Malagasy field isolate and from five monkey-adapted strains, we describe here identification of DNA sequence rearrangements in the P. vivax genome, including discovery of a duplication of the P. vivax Duffy binding protein (PvDBP gene. A survey of Malagasy patients infected with P. vivax showed that the PvDBP duplication was present in numerous locations in Madagascar and found in over 50% of infected patients evaluated. Extended geographic surveys showed that the PvDBP duplication was detected frequently in vivax patients living in East Africa and in some residents of non-African P. vivax-endemic countries. Additionally, the PvDBP duplication was observed in travelers seeking treatment of vivax malaria upon returning home. PvDBP duplication prevalence was highest in west-central Madagascar sites where the highest frequencies of P. vivax-infected, Duffy-negative people were reported. CONCLUSIONS/SIGNIFICANCE: The highly conserved nature of the sequence involved in the PvDBP duplication suggests that it has occurred in a recent evolutionary time frame. These data suggest that PvDBP, a merozoite surface protein involved in red cell adhesion

  19. Arrested rearrangement of TCR V[beta] genes in thymocytes from children with x-linked severe combined immunodeficiency disease

    Energy Technology Data Exchange (ETDEWEB)

    Sleasman, J.W.; Harville, T.O.; White, G.B.; Barrett, D.J. (Univ. of Florida College of Medicine, Gainsville, FL (United States)); George, J.F. (Univ. of Alabama, Birmingham, AL (United States)); Goodenow, M.M. (Univ. of Florida College of Medicine, Gainsville, FL (United States) Univ. of Alabama, Birmingham, AL (United States))


    Human X-linked severe combined immunodeficiency disease (SCID) is an immunodeficiency disorder in which T cell development is arrested in the thymic cortex. B lymphocytes in children with X-linked SCID seem to differentiate normally. X-linked SCID is associated with a mutation in the gene that encodes the IL-2R [gamma]-chain. Because TCR-[beta] gene recombination is a pivotal initial event in T lymphocyte onteogeny within the thymus, the authors hypothesized that a failure to express normal IL-2R[gamma] could lead to impaired TCR-[beta] gene recombination in early thymic development. PCR was used to determine the status of TCR-[beta] gene-segment rearrangements in thymic DNA that had been obtained from children with X-linked SCID. The initial step in TCR-[beta] gene rearrangement, that of D[beta] to J[beta] recombination, was readily detected in all thymus samples from children with X-linked SCID; in contrast, V[beta] to DJ[beta] gene rearrangements were undetectable in the same samples. Both D[beta] to J[beta] and V[beta] to DJ[beta] TCR genes were rearranged in the thymic tissues obtained from immunologically normal children. The authors conclude that TCR[beta]-chain gene rearrangement is arrested in children with X-linked SCID. The results suggest a causative relationship between the failure of TCR [beta]-chain gene arrangements to proceed beyond DJ[beta] rearrangements and the production of a nonfunctional IL-2R [gamma]-chain. 45 refs., 3 figs.

  20. The transformer genes in the fig wasp Ceratosolen solmsi provide new evidence for duplications independent of complementary sex determination. (United States)

    Jia, L-Y; Xiao, J-H; Xiong, T-L; Niu, L-M; Huang, D-W


    Transformer (tra) is the key gene that turns on the sex-determination cascade in Drosophila melanogaster and in some other insects. The honeybee Apis mellifera has two duplicates of tra, one of which (complementary sex determiner, csd) is the primary signal for complementary sex-determination (CSD), regulating the other duplicate (feminizer). Two tra duplicates have been found in some other hymenopteran species, resulting in the assumption that a single ancestral duplication of tra took place in the Hymenoptera. Here, we searched for tra homologues and pseudogenes in the Hymenoptera, focusing on five newly published hymenopteran genomes. We found three tra copies in the fig wasp Ceratosolen solmsi. Further evolutionary and expression analyses also showed that the two duplicates (Csoltra-B and Csoltra-C) are under positive selection, and have female-specific expression, suggesting possible sex-related functions. Moreover, Aculeata species exhibit many pseudogenes generated by lineage-specific duplications. We conclude that phylogenetic reconstruction and pseudogene screening provide novel evidence supporting the hypothesis of independent duplications rather an ancestral origin of multiple tra paralogues in the Hymenoptera. The case of C. solmsi is the first example of a non-CSD species with duplicated tra, contrary to the previous assumption that derived tra paralogues function as the CSD locus. © 2016 The Royal Entomological Society.

  1. Duplication and amplification of antibiotic resistance genes enable increased resistance in isolates of multidrug-resistant Salmonella Typhimurium (United States)

    During normal bacterial DNA replication, gene duplication and amplification (GDA) events occur randomly at a low frequency in the genome throughout a population. In the absence of selection, GDA events that increase the number of copies of a bacterial gene (or a set of genes) are lost. Antibiotic ...

  2. Distal Xq duplication and functional Xq disomy

    Directory of Open Access Journals (Sweden)

    Schluth-Bolard Caroline


    Full Text Available Abstract Distal Xq duplications refer to chromosomal disorders resulting from involvement of the long arm of the X chromosome (Xq. Clinical manifestations widely vary depending on the gender of the patient and on the gene content of the duplicated segment. Prevalence of Xq duplications remains unknown. About 40 cases of Xq28 functional disomy due to cytogenetically visible rearrangements, and about 50 cases of cryptic duplications encompassing the MECP2 gene have been reported. The most frequently reported distal duplications involve the Xq28 segment and yield a recognisable phenotype including distinctive facial features (premature closure of the fontanels or ridged metopic suture, broad face with full cheeks, epicanthal folds, large ears, small and open mouth, ear anomalies, pointed nose, abnormal palate and facial hypotonia, major axial hypotonia, severe developmental delay, severe feeding difficulties, abnormal genitalia and proneness to infections. Xq duplications may be caused either by an intrachromosomal duplication or an unbalanced X/Y or X/autosome translocation. In XY males, structural X disomy always results in functional disomy. In females, failure of X chromosome dosage compensation could result from a variety of mechanisms, including an unfavourable pattern of inactivation, a breakpoint separating an X segment from the X-inactivation centre in cis, or a small ring chromosome. The MECP2 gene in Xq28 is the most important dosage-sensitive gene responsible for the abnormal phenotype in duplications of distal Xq. Diagnosis is based on clinical features and is confirmed by CGH array techniques. Differential diagnoses include Prader-Willi syndrome and Alpha thalassaemia-mental retardation, X linked (ATR-X. The recurrence risk is significant if a structural rearrangement is present in one of the parent, the most frequent situation being that of an intrachromosomal duplication inherited from the mother. Prenatal diagnosis is performed by

  3. Gene Duplication and the Evolution of Plant MADS-box Transcription Factors

    Institute of Scientific and Technical Information of China (English)

    Chiara A. Airoldi; Brendan Davies


    Since the first MADS-box transcription factor genes were implicated in the establishment of floral organ identity in a couple of model plants,the size and scope of this gene family has begun to be appreciated in a much wider range of species.Over the course of millions of years the number of MADS-box genes in plants has increased to the point that the Arabidopsis genome contains more than 100.The understanding gained from studying the evolution,regulation and function of multiple MADS-box genes in an increasing set of species,makes this large plant transcription factor gene family an ideal subject to study the processes that lead to an increase in gene number and the selective birth,death and repurposing of its component members.Here we will use examples taken from the MADS-box gene family to review what is known about the factors that influence the loss and retention of genes duplicated in different ways and examine the varied fates of the retained genes and their associated biological outcomes.

  4. Multiple tandem duplication of the phenylalanine ammonia-lyase genes in Cucumis sativus L. (United States)

    Shang, Qing-Mao; Li, Liang; Dong, Chun-Juan


    Phenylalanine ammonia-lyase (PAL) is the first entry enzyme of the phenylpropanoid pathway, and therefore plays a key role in both plant development and stress defense. In many plants, PAL is encoded by a multi-gene family, and each member is differentially regulated in response to environmental stimuli. In the present study, we report that PAL in cucumber (Cucumis sativus L.) is encoded for by a family of seven genes (designated as CsPAL1-7). All seven CsPALs are arranged in tandem in two duplication blocks, which are located on chromosomes 4 and 6, respectively. The cDNA and protein sequences of the CsPALs share an overall high identity to each other. Homology modeling reveals similarities in their protein structures, besides several slight differences, implying the different activities in conversion of phenylalanine. Phylogenic analysis places CsPAL1-7 in a separate cluster rather than clustering with other plant PALs. Analyses of expression profiles in different cucumber tissues or in response to various stress or plant hormone treatments indicate that CsPAL1-7 play redundant, but divergent roles in cucumber development and stress response. This is consistent with our finding that CsPALs possess overlapping but different cis-elements in their promoter regions. Finally, several duplication events are discussed to explain the evolution of the cucumber PAL genes.

  5. Insights into the coupling of duplication events and macroevolution from an age profile of animal transmembrane gene families.

    Directory of Open Access Journals (Sweden)

    Guohui Ding


    Full Text Available The evolution of new gene families subsequent to gene duplication may be coupled to the fluctuation of population and environment variables. Based upon that, we presented a systematic analysis of the animal transmembrane gene duplication events on a macroevolutionary scale by integrating the palaeontology repository. The age of duplication events was calculated by maximum likelihood method, and the age distribution was estimated by density histogram and normal kernel density estimation. We showed that the density of the duplicates displays a positive correlation with the estimates of maximum number of cell types of common ancestors, and the oxidation events played a key role in the major transitions of this density trace. Next, we focused on the Phanerozoic phase, during which more macroevolution data are available. The pulse mass extinction timepoints coincide with the local peaks of the age distribution, suggesting that the transmembrane gene duplicates fixed frequently when the environment changed dramatically. Moreover, a 61-million-year cycle is the most possible cycle in this phase by spectral analysis, which is consistent with the cycles recently detected in biodiversity. Our data thus elucidate a strong coupling of duplication events and macroevolution; furthermore, our method also provides a new way to address these questions.

  6. Duplication and divergent evolution of the CHS and CHS-like genes in the chalcone synthase (CHS) superfamily

    Institute of Scientific and Technical Information of China (English)


    The enzymes of the CHS-superfamily are responsible for biosynthesis of a wide range of natural products in plants. They are important for flower pigmentation, protection against UV light and defense against phytopathogens. Many plants were found to contain multiple copies of CHS genes. This review summarizes the recent progress in the studies of the CHS-superfamily, focusing on the duplication and divergent evolution of the CHS and CHS-like genes. Comparative analyses of gene structure, expression patterns and catalytic properties revealed extensive differentiation in both regulation and function among duplicate CHS genes. It is also proposed that the CHS-like enzymes in the CHS-superfamily evolved from CHS at different times in various organisms. The CHS-superfamily thus offers a valuable model to study the rates and patterns of sequence divergence between duplicate genes.

  7. The hidden duplication past of the plant pathogen Phytophthora and its consequences for infection

    Directory of Open Access Journals (Sweden)

    Martens Cindy


    Full Text Available Abstract Background Oomycetes of the genus Phytophthora are pathogens that infect a wide range of plant species. For dicot hosts such as tomato, potato and soybean, Phytophthora is even the most important pathogen. Previous analyses of Phytophthora genomes uncovered many genes, large gene families and large genome sizes that can partially be explained by significant repeat expansion patterns. Results Analysis of the complete genomes of three different Phytophthora species, using a newly developed approach, unveiled a large number of small duplicated blocks, mainly consisting of two or three consecutive genes. Further analysis of these duplicated genes and comparison with the known gene and genome duplication history of ten other eukaryotes including parasites, algae, plants, fungi, vertebrates and invertebrates, suggests that the ancestor of P. infestans, P. sojae and P. ramorum most likely underwent a whole genome duplication (WGD. Genes that have survived in duplicate are mainly genes that are known to be preferentially retained following WGDs, but also genes important for pathogenicity and infection of the different hosts seem to have been retained in excess. As a result, the WGD might have contributed to the evolutionary and pathogenic success of Phytophthora. Conclusions The fact that we find many small blocks of duplicated genes indicates that the genomes of Phytophthora species have been heavily rearranged following the WGD. Most likely, the high repeat content in these genomes have played an important role in this rearrangement process. As a consequence, the paucity of retained larger duplicated blocks has greatly complicated previous attempts to detect remnants of a large-scale duplication event in Phytophthora. However, as we show here, our newly developed strategy to identify very small duplicated blocks might be a useful approach to uncover ancient polyploidy events, in particular for heavily rearranged genomes.

  8. Characterization of genes encoding poly(A polymerases in plants: evidence for duplication and functional specialization.

    Directory of Open Access Journals (Sweden)

    Lisa R Meeks

    Full Text Available BACKGROUND: Poly(A polymerase is a key enzyme in the machinery that mediates mRNA 3' end formation in eukaryotes. In plants, poly(A polymerases are encoded by modest gene families. To better understand this multiplicity of genes, poly(A polymerase-encoding genes from several other plants, as well as from Selaginella, Physcomitrella, and Chlamydomonas, were studied. METHODOLOGY/PRINCIPAL FINDINGS: Using bioinformatics tools, poly(A polymerase-encoding genes were identified in the genomes of eight species in the plant lineage. Whereas Chlamydomonas reinhardtii was found to possess a single poly(A polymerase gene, other species possessed between two and six possible poly(A polymerase genes. With the exception of four intron-lacking genes, all of the plant poly(A polymerase genes (but not the C. reinhardtii gene possessed almost identical intron positions within the poly(A polymerase coding sequences, suggesting that all plant poly(A polymerase genes derive from a single ancestral gene. The four Arabidopsis poly(A polymerase genes were found to be essential, based on genetic analysis of T-DNA insertion mutants. GFP fusion proteins containing three of the four Arabidopsis poly(A polymerases localized to the nucleus, while one such fusion protein was localized in the cytoplasm. The fact that this latter protein is largely pollen-specific suggests that it has important roles in male gametogenesis. CONCLUSIONS/SIGNIFICANCE: Our results indicate that poly(A polymerase genes have expanded from a single ancestral gene by a series of duplication events during the evolution of higher plants, and that individual members have undergone sorts of functional specialization so as to render them essential for plant growth and development. Perhaps the most interesting of the plant poly(A polymerases is a novel cytoplasmic poly(A polymerase that is expressed in pollen in Arabidopsis; this is reminiscent of spermatocyte-specific cytoplasmic poly(A polymerases in

  9. Extensive junctional diversity of rearranged human T cell receptor delta genes. (United States)

    Hata, S; Satyanarayana, K; Devlin, P; Band, H; McLean, J; Strominger, J L; Brenner, M B; Krangel, M S


    The human T cell receptor delta (TCR delta) gene encodes one component of the TCR gamma delta-CD3 complex found on subsets of peripheral blood and thymic T cells. Human TCR delta diversity was estimated by characterizing rearrangements in TCR gamma delta cell lines and determining the structures of complementary DNA clones representing functional and nonfunctional transcripts in these cell lines. One V delta segment and one J delta segment were identified in all functional transcripts, although a distinct J delta segment was identified in a truncated transcript. Further, one D delta element was identified, and evidence for the use of an additional D delta element was obtained. Thus human TCR delta genes appear to use a limited number of germline elements. However, the apparent use of two D delta elements in tandem coupled with imprecise joining and extensive incorporation of N nucleotides generates unprecedented variability in the junctional region.

  10. Molecular Characterization of Inflammatory Myofibroblastic Tumors With Frequent ALK and ROS1 Gene Fusions and Rare Novel RET Rearrangement

    NARCIS (Netherlands)

    Antonescu, Cristina R.; Suurmeijer, Albert J. H.; Zhang, Lei; Sung, Yun-Shao; Jungbluth, Achim A.; Travis, William D.; Al-Ahmadie, Hikmat; Fletcher, Christopher D. M.; Alaggio, Rita


    Approximately 50% of conventional inflammatory myofibroblastic tumors (IMTs) harbor ALK gene rearrangement and overexpress ALK. Recently, gene fusions involving other kinases have been implicated in the pathogenesis of IMT, including ROS1 and in 1 patient PDGFRB. However, it remains uncertain whethe

  11. Rearrangement of Rag-1 recombinase gene in DNA-repair deficient/immunodeficient ``wasted`` mice

    Energy Technology Data Exchange (ETDEWEB)

    Woloschak, G.E.; Weaver, P.; Churchill, M.; Chang-Liu, C-M. [Argonne National Lab., IL (United States); Libertin, C.R. [Loyola Univ., Maywood, IL (United States)


    Mice recessive for the autosomal gene ``wasted`` (wst) display a disease pattern which includes increased sensitivity to the killing effects of ionizing radiation, immunodeficiency, and neurologic dysfunction. The recent cloning and characterization of recombinase genes (Rag-l/Rag-2) expressed in lymphoid and possibly central nervous system tissues prompted us to examine expression of these genes in DNA repair-deficient/immunodeficient wasted mice. Our results revealed that in thymus tissue, a small Rag-I transcript (1.0 kb) was detected in wst/wst mice that was not evident in thymus from control mice. In wst/{sm_bullet} mice, a two-fold increase in Rag-1 mRNA was evident in thymus tissue. Rag-2 mRNA could only be detected in thymus tissue from wst/{sm_bullet} and not from wst/wst or parental control BCF, mice. Southern blots revealed a rearrangement or deletion within the Rag-1 gene of affected wasted mice that was not evident in known strain-specific parental or littermate controls. These results support the idea that the Rag-1 gene may map at or near the locus for the wasted mutation. In addition, they suggest the importance of recombinase function in normal immune and central nervous system development as well as the potential contribution of this gene family to the normal repair of radiation-induced DNA damage.

  12. Correlating Traits of Gene Retention, Sequence Divergence, Duplicability and Essentiality in Vertebrates, Arthropods, and Fungi (United States)

    Waterhouse, Robert M.; Zdobnov, Evgeny M.; Kriventseva, Evgenia V.


    Delineating ancestral gene relations among a large set of sequenced eukaryotic genomes allowed us to rigorously examine links between evolutionary and functional traits. We classified 86% of over 1.36 million protein-coding genes from 40 vertebrates, 23 arthropods, and 32 fungi into orthologous groups and linked over 90% of them to Gene Ontology or InterPro annotations. Quantifying properties of ortholog phyletic retention, copy-number variation, and sequence conservation, we examined correlations with gene essentiality and functional traits. More than half of vertebrate, arthropod, and fungal orthologs are universally present across each lineage. These universal orthologs are preferentially distributed in groups with almost all single-copy or all multicopy genes, and sequence evolution of the predominantly single-copy orthologous groups is markedly more constrained. Essential genes from representative model organisms, Mus musculus, Drosophila melanogaster, and Saccharomyces cerevisiae, are significantly enriched in universal orthologs within each lineage, and essential-gene-containing groups consistently exhibit greater sequence conservation than those without. This study of eukaryotic gene repertoire evolution identifies shared fundamental principles and highlights lineage-specific features, it also confirms that essential genes are highly retained and conclusively supports the “knockout-rate prediction” of stronger constraints on essential gene sequence evolution. However, the distinction between sequence conservation of single- versus multicopy orthologs is quantitatively more prominent than between orthologous groups with and without essential genes. The previously underappreciated difference in the tolerance of gene duplications and contrasting evolutionary modes of “single-copy control” versus “multicopy license” may reflect a major evolutionary mechanism that allows extended exploration of gene sequence space. PMID:21148284

  13. Neofunctionalization of a duplicate hatching enzyme gene during the evolution of teleost fishes. (United States)

    Sano, Kaori; Kawaguchi, Mari; Watanabe, Satoshi; Yasumasu, Shigeki


    Duplication and subsequent neofunctionalization of the teleostean hatching enzyme gene occurred in the common ancestor of Euteleostei and Otocephala, producing two genes belonging to different phylogenetic clades (clade I and II). In euteleosts, the clade I enzyme inherited the activity of the ancestral enzyme of swelling the egg envelope by cleavage of the N-terminal region of egg envelope proteins. The clade II enzyme gained two specific cleavage sites, N-ZPd and mid-ZPd but lost the ancestral activity. Thus, euteleostean clade II enzymes assumed a new function; solubilization of the egg envelope by the cooperative action with clade I enzyme. However, in Otocephala, the clade II gene was lost during evolution. Consequently, in a late group of Otocephala, only the clade I enzyme is present to swell the egg envelope. We evaluated the egg envelope digestion properties of clade I and II enzymes in Gonorynchiformes, an early diverging group of Otocephala, using milkfish, and compared their digestion with those of other fishes. Finally, we propose a hypothesis of the neofunctionalization process. The milkfish clade II enzyme cleaved N-ZPd but not mid-ZPd, and did not cause solubilization of the egg envelope. We conclude that neofunctionalization is incomplete in the otocephalan clade II enzymes. Comparison of clade I and clade II enzyme characteristics implies that the specificity of the clade II enzymes gradually changed during evolution after the duplication event, and that a change in substrate was required for the addition of the mid-ZPd site and loss of activity at the N-terminal region. We infer the process of neofunctionalization of the clade II enzyme after duplication of the gene. The ancestral clade II gene gained N-ZPd cleavage activity in the common ancestral lineage of the Euteleostei and Otocephala. Subsequently, acquisition of cleavage activity at the mid-ZPd site and loss of cleavage activity in the N-terminal region occurred during the evolution of

  14. Rearranged anaplastic lymphoma kinase (ALK) gene in adult-onset papillary thyroid cancer amongst atomic bomb survivors. (United States)

    Hamatani, Kiyohiro; Mukai, Mayumi; Takahashi, Keiko; Hayashi, Yuzo; Nakachi, Kei; Kusunoki, Yoichiro


    We previously noted that among atomic bomb survivors (ABS), the relative frequency of cases of adult papillary thyroid cancer (PTC) with chromosomal rearrangements (mainly RET/PTC) was significantly greater in those with relatively higher radiation exposure than those with lower radiation exposure. In contrast, the frequency of PTC cases with point mutations (mainly BRAF(V600E)) was significantly lower in patients with relatively higher radiation exposure than those with lower radiation exposure. We also found that among ABS, the frequency of PTC cases with no detectable gene alterations in RET, neurotrophic tyrosine kinase receptor 1 (NTRK1), BRAF, or RAS was significantly higher in patients with relatively higher radiation exposure than those with lower radiation exposure. However, in ABS with PTC, the relationship between the presence of the anaplastic lymphoma kinase (ALK) gene fused with other gene partners and radiation exposure has received little study. In this study, we tested the hypothesis that the relative frequency of rearranged ALK in ABS with PTC, and with no detectable gene alterations in RET, NTRK1, BRAF, or RAS, would be greater in those having relatively higher radiation exposures. The 105 subjects in the study were drawn from the Life Span Study cohort of ABS of Hiroshima and Nagasaki who were diagnosed with PTC between 1956 and 1993. Seventy-nine were exposed (>0 mGy), and 26 were not exposed to A-bomb radiation. In the 25 ABS with PTC, and with no detectable gene alterations in RET, NTRK1, BRAF, or RAS, we examined archival, formalin-fixed, paraffin-embedded PTC specimens for rearrangement of ALK using reverse transcription-polymerase chain reaction and 5' rapid amplification of cDNA ends (5' RACE). We found rearranged ALK in 10 of 19 radiation-exposed PTC cases, but none among 6 patients with PTC with no radiation exposure. In addition, solid/trabecular-like architecture in PTC was closely associated with ALK rearrangements, being observed in

  15. Gene duplication and adaptive evolution of digestive proteases in Drosophila arizonae female reproductive tracts.

    Directory of Open Access Journals (Sweden)

    Erin S Kelleher


    Full Text Available It frequently has been postulated that intersexual coevolution between the male ejaculate and the female reproductive tract is a driving force in the rapid evolution of reproductive proteins. The dearth of research on female tracts, however, presents a major obstacle to empirical tests of this hypothesis. Here, we employ a comparative EST approach to identify 241 candidate female reproductive proteins in Drosophila arizonae, a repleta group species in which physiological ejaculate-female coevolution has been documented. Thirty-one of these proteins exhibit elevated amino acid substitution rates, making them candidates for molecular coevolution with the male ejaculate. Strikingly, we also discovered 12 unique digestive proteases whose expression is specific to the D. arizonae lower female reproductive tract. These enzymes belong to classes most commonly found in the gastrointestinal tracts of a diverse array of organisms. We show that these proteases are associated with recent, lineage-specific gene duplications in the Drosophila repleta species group, and exhibit strong signatures of positive selection. Observation of adaptive evolution in several female reproductive tract proteins indicates they are active players in the evolution of reproductive tract interactions. Additionally, pervasive gene duplication, adaptive evolution, and rapid acquisition of a novel digestive function by the female reproductive tract points to a novel coevolutionary mechanism of ejaculate-female interaction.

  16. On the origin of protein synthesis factors: a gene duplication/fusion model. (United States)

    Cousineau, B; Leclerc, F; Cedergren, R


    Sequence similarity has given rise to the proposal that IF-2, EF-G, and EF-Tu are related through a common ancestor. We evaluate this proposition and whether the relationship can be extended to other factors of protein synthesis. Analysis of amino acid sequence similarity gives statistical support for an evolutionary affiliation among IF-1, IF-2, IF-3, EF-Tu, EF-Ts, and EF-G and suggests further that this association is a result of gene duplication/fusion events. In support of this mechanism, the three-dimensional structures of IF-3, EF-Tu, and EF-G display a predictable domain structure and overall conformational similarity. The model that we propose consists of three consecutives duplication/fusion events which would have taken place before the divergence of the three superkingdoms: eubacteria, archaea, and eukaryotes. The root of this protein superfamily tree would be an ancestor of the modern IF-1 gene sequence. The repeated fundamental motif of this protein superfamily is a small RNA binding domain composed of two alpha-helices packed along side of an antiparallel beta-sheet.

  17. New organelles by gene duplication in a biophysical model of eukaryote endomembrane evolution. (United States)

    Ramadas, Rohini; Thattai, Mukund


    Extant eukaryotic cells have a dynamic traffic network that consists of diverse membrane-bound organelles exchanging matter via vesicles. This endomembrane system arose and diversified during a period characterized by massive expansions of gene families involved in trafficking after the acquisition of a mitochondrial endosymbiont by a prokaryotic host cell >1.8 billion years ago. Here we investigate the mechanistic link between gene duplication and the emergence of new nonendosymbiotic organelles, using a minimal biophysical model of traffic. Our model incorporates membrane-bound compartments, coat proteins and adaptors that drive vesicles to bud and segregate cargo from source compartments, and SNARE proteins and associated factors that cause vesicles to fuse into specific destination compartments. In simulations, arbitrary numbers of compartments with heterogeneous initial compositions segregate into a few compositionally distinct subsets that we term organelles. The global structure of the traffic system (i.e., the number, composition, and connectivity of organelles) is determined completely by local molecular interactions. On evolutionary timescales, duplication of the budding and fusion machinery followed by loss of cross-interactions leads to the emergence of new organelles, with increased molecular specificity being necessary to maintain larger organellar repertoires. These results clarify potential modes of early eukaryotic evolution as well as more recent eukaryotic diversification. Copyright © 2013 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  18. Genomic rearrangements in trypanosomatids: an alternative to the "one gene" evolutionary hypotheses?

    Directory of Open Access Journals (Sweden)

    JC Dujardin


    Full Text Available Most molecular trees of trypanosomatids are based on point mutations within DNA sequences. In contrast, there are very few evolutionary studies considering DNA (re arrangement as genetic characters. Waiting for the completion of the various parasite genome projects, first information may already be obtained from chromosome size-polymorphism, using the appropriate algorithms for data processing. Three illustrative models are presented here. First, the case of Leishmania (Viannia braziliensis/L. (V. peruviana is described. Thanks to a fast evolution rate (due essentially to amplification/deletion of tandemly repeated genes, molecular karyotyping seems particularly appropriate for studying recent evolutionary divergence, including eco-geographical diversification. Secondly, karyotype evolution is considered at the level of whole genus Leishmania. Despite the fast chromosome evolution rate, there is qualitative congruence with MLEE- and RAPD-based evolutionary hypotheses. Significant differences may be observed between major lineages, likely corresponding to major and less frequent rearrangements (fusion/fission, translocation. Thirdly, comparison is made with Trypanosoma cruzi. Again congruence is observed with other hypotheses and major lineages are delineated by significant chromosome rearrangements. The level of karyotype polymorphism within that "species" is similar to the one observed in "genus" Leishmania. The relativity of the species concept among these two groups of parasites is discussed.

  19. Generation of antigenic diversity in Plasmodium falciparum by structured rearrangement of Var genes during mitosis.

    Directory of Open Access Journals (Sweden)

    Antoine Claessens


    Full Text Available The most polymorphic gene family in P. falciparum is the ∼60 var genes distributed across parasite chromosomes, both in the subtelomeres and in internal regions. They encode hypervariable surface proteins known as P. falciparum erythrocyte membrane protein 1 (PfEMP1 that are critical for pathogenesis and immune evasion in Plasmodium falciparum. How var gene sequence diversity is generated is not currently completely understood. To address this, we constructed large clone trees and performed whole genome sequence analysis to study the generation of novel var gene sequences in asexually replicating parasites. While single nucleotide polymorphisms (SNPs were scattered across the genome, structural variants (deletions, duplications, translocations were focused in and around var genes, with considerable variation in frequency between strains. Analysis of more than 100 recombination events involving var exon 1 revealed that the average nucleotide sequence identity of two recombining exons was only 63% (range: 52.7-72.4% yet the crossovers were error-free and occurred in such a way that the resulting sequence was in frame and domain architecture was preserved. Var exon 1, which encodes the immunologically exposed part of the protein, recombined in up to 0.2% of infected erythrocytes in vitro per life cycle. The high rate of var exon 1 recombination indicates that millions of new antigenic structures could potentially be generated each day in a single infected individual. We propose a model whereby var gene sequence polymorphism is mainly generated during the asexual part of the life cycle.

  20. The impact of gene duplication, insertion, deletion, lateral gene transfer and sequencing error on orthology inference: a simulation study. (United States)

    Dalquen, Daniel A; Altenhoff, Adrian M; Gonnet, Gaston H; Dessimoz, Christophe


    The identification of orthologous genes, a prerequisite for numerous analyses in comparative and functional genomics, is commonly performed computationally from protein sequences. Several previous studies have compared the accuracy of orthology inference methods, but simulated data has not typically been considered in cross-method assessment studies. Yet, while dependent on model assumptions, simulation-based benchmarking offers unique advantages: contrary to empirical data, all aspects of simulated data are known with certainty. Furthermore, the flexibility of simulation makes it possible to investigate performance factors in isolation of one another.Here, we use simulated data to dissect the performance of six methods for orthology inference available as standalone software packages (Inparanoid, OMA, OrthoInspector, OrthoMCL, QuartetS, SPIMAP) as well as two generic approaches (bidirectional best hit and reciprocal smallest distance). We investigate the impact of various evolutionary forces (gene duplication, insertion, deletion, and lateral gene transfer) and technological artefacts (ambiguous sequences) on orthology inference. We show that while gene duplication/loss and insertion/deletion are well handled by most methods (albeit for different trade-offs of precision and recall), lateral gene transfer disrupts all methods. As for ambiguous sequences, which might result from poor sequencing, assembly, or genome annotation, we show that they affect alignment score-based orthology methods more strongly than their distance-based counterparts.

  1. The impact of gene duplication, insertion, deletion, lateral gene transfer and sequencing error on orthology inference: a simulation study.

    Directory of Open Access Journals (Sweden)

    Daniel A Dalquen

    Full Text Available The identification of orthologous genes, a prerequisite for numerous analyses in comparative and functional genomics, is commonly performed computationally from protein sequences. Several previous studies have compared the accuracy of orthology inference methods, but simulated data has not typically been considered in cross-method assessment studies. Yet, while dependent on model assumptions, simulation-based benchmarking offers unique advantages: contrary to empirical data, all aspects of simulated data are known with certainty. Furthermore, the flexibility of simulation makes it possible to investigate performance factors in isolation of one another.Here, we use simulated data to dissect the performance of six methods for orthology inference available as standalone software packages (Inparanoid, OMA, OrthoInspector, OrthoMCL, QuartetS, SPIMAP as well as two generic approaches (bidirectional best hit and reciprocal smallest distance. We investigate the impact of various evolutionary forces (gene duplication, insertion, deletion, and lateral gene transfer and technological artefacts (ambiguous sequences on orthology inference. We show that while gene duplication/loss and insertion/deletion are well handled by most methods (albeit for different trade-offs of precision and recall, lateral gene transfer disrupts all methods. As for ambiguous sequences, which might result from poor sequencing, assembly, or genome annotation, we show that they affect alignment score-based orthology methods more strongly than their distance-based counterparts.

  2. Sgs1 and Exo1 suppress targeted chromosome duplication during ends-in and ends-out gene targeting. (United States)

    Štafa, Anamarija; Miklenić, Marina; Zunar, Bojan; Lisnić, Berislav; Symington, Lorraine S; Svetec, Ivan-Krešimir


    Gene targeting is extremely efficient in the yeast Saccharomyces cerevisiae. It is performed by transformation with a linear, non-replicative DNA fragment carrying a selectable marker and containing ends homologous to the particular locus in a genome. However, even in S. cerevisiae, transformation can result in unwanted (aberrant) integration events, the frequency and spectra of which are quite different for ends-out and ends-in transformation assays. It has been observed that gene replacement (ends-out gene targeting) can result in illegitimate integration, integration of the transforming DNA fragment next to the target sequence and duplication of a targeted chromosome. By contrast, plasmid integration (ends-in gene targeting) is often associated with multiple targeted integration events but illegitimate integration is extremely rare and a targeted chromosome duplication has not been reported. Here we systematically investigated the influence of design of the ends-out assay on the success of targeted genetic modification. We have determined transformation efficiency, fidelity of gene targeting and spectra of all aberrant events in several ends-out gene targeting assays designed to insert, delete or replace a particular sequence in the targeted region of the yeast genome. Furthermore, we have demonstrated for the first time that targeted chromosome duplications occur even during ends-in gene targeting. Most importantly, the whole chromosome duplication is POL32 dependent pointing to break-induced replication (BIR) as the underlying mechanism. Moreover, the occurrence of duplication of the targeted chromosome was strikingly increased in the exo1Δ sgs1Δ double mutant but not in the respective single mutants demonstrating that the Exo1 and Sgs1 proteins independently suppress whole chromosome duplication during gene targeting.

  3. Evolutionary history of c-myc in teleosts and characterization of the duplicated c-myca genes in goldfish embryos. (United States)

    Marandel, Lucie; Labbe, Catherine; Bobe, Julien; Le Bail, Pierre-Yves


    c-Myc plays an important role during embryogenesis in mammals, but little is known about its function during embryonic development in teleosts. In addition, the evolutionary history of c-myc gene in teleosts remains unclear, and depending on the species, a variable number of gene duplicates exist in teleosts. To gain new insight into c-myc genes in teleosts, the present study was designed to clarify the evolutionary history of c-myc gene(s) in teleosts and to subsequently characterize DNA methylation and early embryonic expression patterns in a cyprinid fish. Our results show that a duplication of c-myc gene occurred before or around the teleost radiation, as a result of the teleost-specific whole genome duplication giving rise to c-myca and c-mycb in teleosts and was followed by a loss of the c-mycb gene in the Gasterosteiforms and Tetraodontiforms. Our data also demonstrate that both c-myc genes previously identified in carp and goldfish are co-orthologs of the zebrafish c-myca. These results indicate the presence of additional c-myca duplication in Cyprininae. We were able to identify differences between the expression patterns of the two goldfish c-myca genes in oocytes and early embryos. These differences suggest a partial sub-functionalization of c-myca genes after duplication. Despite differences in transcription patterns, both of the c-myca genes displayed similar DNA methylation patterns during early development and in gametes. Together, our results clarify the evolutionary history of the c-myc gene in teleosts and provide new insight into the involvement of c-myc in early embryonic development in cyprinids. Copyright © 2011 Wiley Periodicals, Inc.

  4. A 380-kb Duplication in 7p22.3 Encompassing the LFNG Gene in a Boy with Asperger Syndrome

    NARCIS (Netherlands)

    Vulto-van Silfhout, A.T.; Brouwer, A.F. de; Leeuw, N. de; Obihara, C.C.; Brunner, H.G.; Vries, B.B. de


    De novo genomic aberrations are considered an important cause of autism spectrum disorders. We describe a de novo 380-kb gain in band p22.3 of chromosome 7 in a patient with Asperger syndrome. This duplicated region contains 9 genes including the LNFG gene that is an important regulator of NOTCH

  5. A 380-kb Duplication in 7p22.3 Encompassing the LFNG Gene in a Boy with Asperger Syndrome

    NARCIS (Netherlands)

    Vulto-van Silfhout, A.T.; Brouwer, A.F. de; Leeuw, N. de; Obihara, C.C.; Brunner, H.G.; Vries, B.B. de


    De novo genomic aberrations are considered an important cause of autism spectrum disorders. We describe a de novo 380-kb gain in band p22.3 of chromosome 7 in a patient with Asperger syndrome. This duplicated region contains 9 genes including the LNFG gene that is an important regulator of NOTCH sig

  6. A 380-kb Duplication in 7p22.3 Encompassing the LFNG Gene in a Boy with Asperger Syndrome

    NARCIS (Netherlands)

    Vulto-van Silfhout, A.T.; Brouwer, A.F. de; Leeuw, N. de; Obihara, C.C.; Brunner, H.G.; Vries, B.B. de


    De novo genomic aberrations are considered an important cause of autism spectrum disorders. We describe a de novo 380-kb gain in band p22.3 of chromosome 7 in a patient with Asperger syndrome. This duplicated region contains 9 genes including the LNFG gene that is an important regulator of NOTCH sig

  7. Duplication of the NPHP1 gene in patients with autism spectrum disorder and normal intellectual ability: a case series. (United States)

    Yasuda, Yuka; Hashimoto, Ryota; Fukai, Ryoko; Okamoto, Nobuhiko; Hiraki, Yoko; Yamamori, Hidenaga; Fujimoto, Michiko; Ohi, Kazutaka; Taniike, Masako; Mohri, Ikuko; Nakashima, Mitsuko; Tsurusaki, Yoshinori; Saitsu, Hirotomo; Matsumoto, Naomichi; Miyake, Noriko; Takeda, Masatoshi


    Autism spectrum disorder is a neurodevelopmental disorder characterized by impairments in social interactions, reduced verbal communication abilities, stereotyped repetitive behaviors, and restricted interests. It is a complex condition caused by genetic and environmental factors; the high heritability of this disorder supports the presence of a significant genetic contribution. Many studies have suggested that copy-number variants contribute to the etiology of autism spectrum disorder. Recently, copy-number variants of the nephronophthisis 1 gene have been reported in patients with autism spectrum disorder. To the best of our knowledge, only six autism spectrum disorder cases with duplications of the nephronophthisis 1 gene have been reported. These patients exhibited intellectual dysfunction, including verbal dysfunction in one patient, below-average verbal intellectual ability in one patient, and intellectual disability in four patients. In this study, we identified nephronophthisis 1 duplications in two unrelated Japanese patients with autism spectrum disorder using a high-resolution single-nucleotide polymorphism array. This report is the first to describe a nephronophthisis 1 duplication in an autism spectrum disorder patient with an average verbal intelligence quotient and an average performance intelligence quotient. However, the second autism spectrum disorder patient with a nephronophthisis 1 duplication had a below-average performance intelligence quotient. Neither patient exhibited physical dysfunction, motor developmental delay, or neurological abnormalities. This study supports the clinical observation of nephronophthisis 1 duplication in autism spectrum disorder cases and might contribute to our understanding of the clinical phenotype that arises from this duplication.

  8. Infectious bronchitis viruses with naturally occurring genomic rearrangement and gene deletion. (United States)

    Hewson, Kylie A; Ignjatovic, Jagoda; Browning, Glenn F; Devlin, Joanne M; Noormohammadi, Amir H


    Infectious bronchitis viruses (IBVs) are group III coronaviruses that infect poultry worldwide. Genetic variations, including whole-gene deletions, are key to IBV evolution. Australian subgroup 2 IBVs contain sequence insertions and multiple gene deletions that have resulted in a substantial genomic divergence from international IBVs. The genomic variations present in Australian IBVs were investigated and compared to those of another group III coronavirus, turkey coronavirus (TCoV). Open reading frames (ORFs) found throughout the genome of Australian IBVs were analogous in sequence and position to TCoV ORFs, except for ORF 4b, which appeared to be translocated to a different position in the subgroup 2 strains. Subgroup 2 strains were previously reported to lack genes 3a, 3b and 5a, with some also lacking 5b. Of these, however, genes 3b and 5b were found to be present but contained various mutations that may affect transcription. In this study, it was found that subgroup 2 IBVs have undergone a more substantial genomic rearrangements than previously thought.

  9. [Extramedullary onset of mixed phenotype acute leukemia with MLL gene rearrangement]. (United States)

    Kawashima, Ichiro; Shobu, Yuki; Yamamoto, Takeo; Hamanaka, Satoshi; Nozaki, Yumi; Nakajima, Kei; Mitsumori, Toru; Kirito, Keita


    Rearrangements of the mixed lineage leukemia MLL gene at chromosome 11q23 are common chromosomal abnormalities in human leukemia. MLL fused with numerous partner genes causes different leukemia phenotypes that depend on the function of partner genes. MLLT3-MLL is generated by translocation t(9;11), which primarily induces acute myeloid leukemia in humans, whereas MLLT3-MLL induces ALL or biphenotypic leukemia in mice. The microenvironment that surrounds leukemia cells plays a central role in this process. We report a patient with mixed phenotype acute leukemia with MLLT3-MLL. This patient, a 44-year-old woman, initially exhibited extramedullary leukemia with multiple tumors and subsequently developed bone marrow disease. The leukemia cells exhibited myeloid (CD13 and MPO) and B cell (CD19 and CD79a) phenotypes. Chromosomal analysis and RT-PCR assay revealed tumor cells with the MLLT3-MLL fusion gene. We treated this patient with a drug regimen for AML (Ara-C plus anthracycline), and complete remission was obtained. This report describes the fourth case of mixed phenotypic leukemia with extramedullary disease. The extramedullary circumstance may underlie the biphenotypic features of these patients.

  10. Gene duplications circumvent trade-offs in enzyme function: Insect adaptation to toxic host plants. (United States)

    Dalla, Safaa; Dobler, Susanne


    Herbivorous insects and their adaptations against plant toxins provide striking opportunities to investigate the genetic basis of traits involved in coevolutionary interactions. Target site insensitivity to cardenolides has evolved convergently across six orders of insects, involving identical substitutions in the Na,K-ATPase gene and repeated convergent gene duplications. The large milkweed bug, Oncopeltus fasciatus, has three copies of the Na,K-ATPase α-subunit gene that bear differing numbers of amino acid substitutions in the binding pocket for cardenolides. To analyze the effect of these substitutions on cardenolide resistance and to infer possible trade-offs in gene function, we expressed the cardenolide-sensitive Na,K-ATPase of Drosophila melanogaster in vitro and introduced four distinct combinations of substitutions observed in the three gene copies of O. fasciatus. With an increasing number of substitutions, the sensitivity of the Na,K-ATPase to a standard cardenolide decreased in a stepwise manner. At the same time, the enzyme's overall activity decreased significantly with increasing cardenolide resistance and only the least substituted mimic of the Na,K-ATPase α1C copy maintained activity similar to the wild-type enzyme. Our results suggest that the Na,K-ATPase copies in O. fasciatus have diverged in function, enabling specific adaptations to dietary cardenolides while maintaining the functionality of this critical ion carrier. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.

  11. Gene duplication and fragmentation in the zebra finch major histocompatibility complex

    Directory of Open Access Journals (Sweden)

    Burt David W


    Full Text Available Abstract Background Due to its high polymorphism and importance for disease resistance, the major histocompatibility complex (MHC has been an important focus of many vertebrate genome projects. Avian MHC organization is of particular interest because the chicken Gallus gallus, the avian species with the best characterized MHC, possesses a highly streamlined minimal essential MHC, which is linked to resistance against specific pathogens. It remains unclear the extent to which this organization describes the situation in other birds and whether it represents a derived or ancestral condition. The sequencing of the zebra finch Taeniopygia guttata genome, in combination with targeted bacterial artificial chromosome (BAC sequencing, has allowed us to characterize an MHC from a highly divergent and diverse avian lineage, the passerines. Results The zebra finch MHC exhibits a complex structure and history involving gene duplication and fragmentation. The zebra finch MHC includes multiple Class I and Class II genes, some of which appear to be pseudogenes, and spans a much more extensive genomic region than the chicken MHC, as evidenced by the presence of MHC genes on each of seven BACs spanning 739 kb. Cytogenetic (FISH evidence and the genome assembly itself place core MHC genes on as many as four chromosomes with TAP and Class I genes mapping to different chromosomes. MHC Class II regions are further characterized by high endogenous retroviral content. Lastly, we find strong evidence of selection acting on sites within passerine MHC Class I and Class II genes. Conclusion The zebra finch MHC differs markedly from that of the chicken, the only other bird species with a complete genome sequence. The apparent lack of synteny between TAP and the expressed MHC Class I locus is in fact reminiscent of a pattern seen in some mammalian lineages and may represent convergent evolution. Our analyses of the zebra finch MHC suggest a complex history involving

  12. The polyphenol oxidase gene family in land plants: Lineage-specific duplication and expansion

    Directory of Open Access Journals (Sweden)

    Tran Lan T


    Full Text Available Abstract Background Plant polyphenol oxidases (PPOs are enzymes that typically use molecular oxygen to oxidize ortho-diphenols to ortho-quinones. These commonly cause browning reactions following tissue damage, and may be important in plant defense. Some PPOs function as hydroxylases or in cross-linking reactions, but in most plants their physiological roles are not known. To better understand the importance of PPOs in the plant kingdom, we surveyed PPO gene families in 25 sequenced genomes from chlorophytes, bryophytes, lycophytes, and flowering plants. The PPO genes were then analyzed in silico for gene structure, phylogenetic relationships, and targeting signals. Results Many previously uncharacterized PPO genes were uncovered. The moss, Physcomitrella patens, contained 13 PPO genes and Selaginella moellendorffii (spike moss and Glycine max (soybean each had 11 genes. Populus trichocarpa (poplar contained a highly diversified gene family with 11 PPO genes, but several flowering plants had only a single PPO gene. By contrast, no PPO-like sequences were identified in several chlorophyte (green algae genomes or Arabidopsis (A. lyrata and A. thaliana. We found that many PPOs contained one or two introns often near the 3’ terminus. Furthermore, N-terminal amino acid sequence analysis using ChloroP and TargetP 1.1 predicted that several putative PPOs are synthesized via the secretory pathway, a unique finding as most PPOs are predicted to be chloroplast proteins. Phylogenetic reconstruction of these sequences revealed that large PPO gene repertoires in some species are mostly a consequence of independent bursts of gene duplication, while the lineage leading to Arabidopsis must have lost all PPO genes. Conclusion Our survey identified PPOs in gene families of varying sizes in all land plants except in the genus Arabidopsis. While we found variation in intron numbers and positions, overall PPO gene structure is congruent with the phylogenetic

  13. Gene duplication and fragment recombination drive functional diversification of a superfamily of cytoplasmic effectors in Phytophthora sojae. (United States)

    Shen, Danyu; Liu, Tingli; Ye, Wenwu; Liu, Li; Liu, Peihan; Wu, Yuren; Wang, Yuanchao; Dou, Daolong


    Phytophthora and other oomycetes secrete a large number of putative host cytoplasmic effectors with conserved FLAK motifs following signal peptides, termed crinkling and necrosis inducing proteins (CRN), or Crinkler. Here, we first investigated the evolutionary patterns and mechanisms of CRN effectors in Phytophthora sojae and compared them to two other Phytophthora species. The genes encoding CRN effectors could be divided into 45 orthologous gene groups (OGG), and most OGGs unequally distributed in the three species, in which each underwent large number of gene gains or losses, indicating that the CRN genes expanded after species evolution in Phytophthora and evolved through pathoadaptation. The 134 expanded genes in P. sojae encoded family proteins including 82 functional genes and expressed at higher levels while the other 68 genes encoding orphan proteins were less expressed and contained 50 pseudogenes. Furthermore, we demonstrated that most expanded genes underwent gene duplication or/and fragment recombination. Three different mechanisms that drove gene duplication or recombination were identified. Finally, the expanded CRN effectors exhibited varying pathogenic functions, including induction of programmed cell death (PCD) and suppression of PCD through PAMP-triggered immunity or/and effector-triggered immunity. Overall, these results suggest that gene duplication and fragment recombination may be two mechanisms that drive the expansion and neofunctionalization of the CRN family in P. sojae, which aids in understanding the roles of CRN effectors within each oomycete pathogen.

  14. Gene duplication and fragment recombination drive functional diversification of a superfamily of cytoplasmic effectors in Phytophthora sojae.

    Directory of Open Access Journals (Sweden)

    Danyu Shen

    Full Text Available Phytophthora and other oomycetes secrete a large number of putative host cytoplasmic effectors with conserved FLAK motifs following signal peptides, termed crinkling and necrosis inducing proteins (CRN, or Crinkler. Here, we first investigated the evolutionary patterns and mechanisms of CRN effectors in Phytophthora sojae and compared them to two other Phytophthora species. The genes encoding CRN effectors could be divided into 45 orthologous gene groups (OGG, and most OGGs unequally distributed in the three species, in which each underwent large number of gene gains or losses, indicating that the CRN genes expanded after species evolution in Phytophthora and evolved through pathoadaptation. The 134 expanded genes in P. sojae encoded family proteins including 82 functional genes and expressed at higher levels while the other 68 genes encoding orphan proteins were less expressed and contained 50 pseudogenes. Furthermore, we demonstrated that most expanded genes underwent gene duplication or/and fragment recombination. Three different mechanisms that drove gene duplication or recombination were identified. Finally, the expanded CRN effectors exhibited varying pathogenic functions, including induction of programmed cell death (PCD and suppression of PCD through PAMP-triggered immunity or/and effector-triggered immunity. Overall, these results suggest that gene duplication and fragment recombination may be two mechanisms that drive the expansion and neofunctionalization of the CRN family in P. sojae, which aids in understanding the roles of CRN effectors within each oomycete pathogen.

  15. Rearrangement and junctional-site sequence analyses of T-cell receptor gamma genes in intestinal intraepithelial lymphocytes from murine athymic chimeras. (United States)

    Whetsell, M; Mosley, R L; Whetsell, L; Schaefer, F V; Miller, K S; Klein, J R


    The molecular organization of rearranged T-cell receptor (TCR) gamma genes intraepithelial lymphocytes (IEL) was studied in athymic radiation chimeras and was compared with the organization of gamma gene rearrangements in IEL from thymus-bearing animals by polymerase chain reaction and by sequence analyses of DNA spanning the junction of the variable (V) and joining (J) genes. In both thymus-bearing mice and athymic chimeras, IEL V-J gamma-gene rearrangements occurred for V gamma 1.2, V gamma 2, and V gamma 5 but not for V gamma 3 or V gamma 4. Sequence analyses of cloned V-J polymerase chain reaction-amplified products indicated that in both thymus-bearing mice and athymic chimeras, rearrangement of V gamma 1.2 and V gamma 5 resulted in in-frame as well as out-of-frame genes, whereas nearly all V gamma 2 rearrangements were out of frame from either type of animal. V-segment nucleotide removal occurred in most V gamma 1.2, V gamma 2, and V gamma 5 rearrangements; J-segment nucleotide removal was common in V gamma 1.2 but not in V gamma 2 or V gamma 5 rearrangements. N-segment nucleotide insertions were present in V gamma 1.2, V gamma 2, and V gamma 5 IEL rearrangements in both thymus-bearing mice and athymic chimeras, resulting in a predominant in-frame sequence for V gamma 5 and a predominant out-of-frame sequence for V gamma 2 genes. These findings demonstrate that (i) TCR gamma-gene rearrangement occurs extrathymically in IEL, (ii) rearrangements of TCR gamma genes involve the same V gene regardless of thymus influence; and (iii) the thymus does not determine the degree to which functional or nonfunctional rearrangements occur in IEL.

  16. Rearrangement of RAG-1 recombinase gene in radiation-sensitive ``wasted`` mice

    Energy Technology Data Exchange (ETDEWEB)

    Woloschak, G.E. [Argonne National Lab., IL (United States)]|[Loyola Univ., Maywood, IL (United States); Libertin, C.R.; Weaver, P. [Loyola Univ., Maywood, IL (United States); Churchill, M.; Chang-Liu, C.M. [Argonne National Lab., IL (United States)


    Mice recessive for the autosomal gene ``wasted`` (wst) display a disease pattern which includes increased sensitivity to the killing effects of ionizing radiation, immunodeficiency, and neurologic dysfunction. The recent cloning and characterization of recombinase genes (RAG-1/RAG-2) expressed in lymphoid and possibly central nervous system tissues prompted us to examine expression of these genes in DNA repair-deficient/immunodeficient wasted mice. Our results revealed expression of RAG-1 mRNA in spinal cord (but not brain) of control mice; no expression of RAG-1 mRNA was detected in spinal cord or brain from wst/wst mice or their normal littermates (wst/{center_dot} mice). In thymus tissue, a small RAG-1 transcript (1.0 kb) was detected in wst/wst mice that was not evident in thymus from control mice. In wst/{center_dot} mice, a two-fold increase in RAG-1 MRNA was evident in thymus tissue. RAG-2 mRNA could only be detected in thymus tissue from wst/{center_dot} and not from wst/wst or parental control BCF{sub 1} mice. Southern blots revealed a rearrangement/deletion within the RAG-1 gene of affected wasted mice, not evident in known strain-specific parental or littermate controls. These results support the idea that the RAG-1 gene may map at or near the locus for the wasted mutation. In addition, they suggest the importance of recombinase function in normal immune and central nervous system development as well as the potential contribution of this gene family to the normal repair of radiation-induced DNA damage.

  17. Rearrangement of RAG-1 recombinase gene in DNA-repair deficient ``wasted`` mice

    Energy Technology Data Exchange (ETDEWEB)

    Woloschak, G.E.; Libertin, C.R.; Weaver, P. [Loyola Univ., Chicago, IL (United States); Churchill, M.; Chang-Liu, C.M. [Argonne National Lab., IL (United States)


    Mice recessive for the autosomal gene ``wasted`` wst display a disease pattern which includes increased sensitivity to the killing effects of ionizing radiation, immunodeficiency, and neurologic dysfunction. The recent cloning and characterization of recombinase genes (RAG-l/RAG-2) expressed in lymphoid and possibly central nervous system tissues prompted us to examine expression of these genes in DNA repair-deficient/immunodeficient wasted mice. Our results revealed expression of RAG-1 mRNA in spinal cord (but not brain) of control mice; no expression of RAG-1 mRNA was detected in spinal cord or brain from wst/wst mice or their normal littermates (wst/{center_dot}mice). In thymus tissue, a small RAG-1 transcript (1.0 kb) was detected in wst/wst mice that was not evident in thymus from control mice. In wst/{center_dot}mice, a two-fold increase in RAG-1 mRNA was evident in thymus tissue. RAG-2 mRNA could only be detected in thymus tissue from wst/{center_dot} and not from wst/wst or parental control BCF{sub 1} mice. Southern blots revealed a rearrangement/deletion within the RAG-1 gene of affected wasted mice, not evident in known strain-specific parental or littermate controls. These results support the idea that the RAG-1 gene may map at or near the locus for the wasted mutation. In addition, they suggest the importance of recombinase function in normal immune and central nervous system development as well as the potential contribution of this gene family to the normal repair of radiation-induced DNA damage.

  18. Lineage switch in relapse of acute leukemia with rearrangement of MLL gene (KMT2A. literature review and case reports

    Directory of Open Access Journals (Sweden)

    E. A. Zerkalenkova


    Full Text Available Lineage switch is a rare phenomenon in which a transition from lymphoid to myeloid was observed in relapse of acute leukemia, or vice versa. This paper presents the four clinical case reports of acute lymphoblastic leukemia with MLL gene rearrangement (KMT2A with myeloid phenotype in relapse.

  19. Gene duplication, loss and selection in the evolution of saxitoxin biosynthesis in alveolates. (United States)

    Murray, Shauna A; Diwan, Rutuja; Orr, Russell J S; Kohli, Gurjeet S; John, Uwe


    A group of marine dinoflagellates (Alveolata, Eukaryota), consisting of ∼10 species of the genus Alexandrium, Gymnodinium catenatum and Pyrodinium bahamense, produce the toxin saxitoxin and its analogues (STX), which can accumulate in shellfish, leading to ecosystem and human health impacts. The genes, sxt, putatively involved in STX biosynthesis, have recently been identified, however, the evolution of these genes within dinoflagellates is not clear. There are two reasons for this: uncertainty over the phylogeny of dinoflagellates; and that the sxt genes of many species of Alexandrium and other dinoflagellate genera are not known. Here, we determined the phylogeny of STX-producing and other dinoflagellates based on a concatenated eight-gene alignment. We determined the presence, diversity and phylogeny of sxtA, domains A1 and A4 and sxtG in 52 strains of Alexandrium, and a further 43 species of dinoflagellates and thirteen other alveolates. We confirmed the presence and high sequence conservation of sxtA, domain A4, in 40 strains (35 Alexandrium, 1 Pyrodinium, 4 Gymnodinium) of 8 species of STX-producing dinoflagellates, and absence from non-producing species. We found three paralogs of sxtA, domain A1, and a widespread distribution of sxtA1 in non-STX producing dinoflagellates, indicating duplication events in the evolution of this gene. One paralog, clade 2, of sxtA1 may be particularly related to STX biosynthesis. Similarly, sxtG appears to be generally restricted to STX-producing species, while three amidinotransferase gene paralogs were found in dinoflagellates. We investigated the role of positive (diversifying) selection following duplication in sxtA1 and sxtG, and found negative selection in clades of sxtG and sxtA1, clade 2, suggesting they were functionally constrained. Significant episodic diversifying selection was found in some strains in clade 3 of sxtA1, a clade that may not be involved in STX biosynthesis, indicating pressure for diversification

  20. Genome-wide analysis of homeobox genes from Mesobuthus martensii reveals Hox gene duplication in scorpions. (United States)

    Di, Zhiyong; Yu, Yao; Wu, Yingliang; Hao, Pei; He, Yawen; Zhao, Huabin; Li, Yixue; Zhao, Guoping; Li, Xuan; Li, Wenxin; Cao, Zhijian


    Homeobox genes belong to a large gene group, which encodes the famous DNA-binding homeodomain that plays a key role in development and cellular differentiation during embryogenesis in animals. Here, one hundred forty-nine homeobox genes were identified from the Asian scorpion, Mesobuthus martensii (Chelicerata: Arachnida: Scorpiones: Buthidae) based on our newly assembled genome sequence with approximately 248 × coverage. The identified homeobox genes were categorized into eight classes including 82 families: 67 ANTP class genes, 33 PRD genes, 11 LIM genes, five POU genes, six SINE genes, 14 TALE genes, five CUT genes, two ZF genes and six unclassified genes. Transcriptome data confirmed that more than half of the genes were expressed in adults. The homeobox gene diversity of the eight classes is similar to the previously analyzed Mandibulata arthropods. Interestingly, it is hypothesized that the scorpion M. martensii may have two Hox clusters. The first complete genome-wide analysis of homeobox genes in Chelicerata not only reveals the repertoire of scorpion, arachnid and chelicerate homeobox genes, but also shows some insights into the evolution of arthropod homeobox genes.

  1. An evolutionarily mobile antigen receptor variable region gene: doubly rearranging NAR-TcR genes in sharks. (United States)

    Criscitiello, Michael F; Saltis, Mark; Flajnik, Martin F


    Distinctive Ig and T cell receptor (TcR) chains define the two major lineages of vertebrate lymphocyte yet similarly recognize antigen with a single, membrane-distal variable (V) domain. Here we describe the first antigen receptor chain that employs two V domains, which are generated by separate VDJ gene rearrangement events. These molecules have specialized "supportive" TcRdeltaV domains membrane-proximal to domains with most similarity to IgNAR V. The ancestral NAR V gene encoding this domain is hypothesized to have recombined with the TRD locus in a cartilaginous fish ancestor >200 million years ago and encodes the first V domain shown to be used in both Igs and TcRs. Furthermore, these data support the view that gamma/delta TcRs have for long used structural conformations recognizing free antigen.

  2. Tubulin evolution in insects: gene duplication and subfunctionalization provide specialized isoforms in a functionally constrained gene family

    Directory of Open Access Journals (Sweden)

    Gadagkar Sudhindra R


    Full Text Available Abstract Background The completion of 19 insect genome sequencing projects spanning six insect orders provides the opportunity to investigate the evolution of important gene families, here tubulins. Tubulins are a family of eukaryotic structural genes that form microtubules, fundamental components of the cytoskeleton that mediate cell division, shape, motility, and intracellular trafficking. Previous in vivo studies in Drosophila find a stringent relationship between tubulin structure and function; small, biochemically similar changes in the major alpha 1 or testis-specific beta 2 tubulin protein render each unable to generate a motile spermtail axoneme. This has evolutionary implications, not a single non-synonymous substitution is found in beta 2 among 17 species of Drosophila and Hirtodrosophila flies spanning 60 Myr of evolution. This raises an important question, How do tubulins evolve while maintaining their function? To answer, we use molecular evolutionary analyses to characterize the evolution of insect tubulins. Results Sixty-six alpha tubulins and eighty-six beta tubulin gene copies were retrieved and subjected to molecular evolutionary analyses. Four ancient clades of alpha and beta tubulins are found in insects, a major isoform clade (alpha 1, beta 1 and three minor, tissue-specific clades (alpha 2-4, beta 2-4. Based on a Homarus americanus (lobster outgroup, these were generated through gene duplication events on major beta and alpha tubulin ancestors, followed by subfunctionalization in expression domain. Strong purifying selection acts on all tubulins, yet maximum pairwise amino acid distances between tubulin paralogs are large (0.464 substitutions/site beta tubulins, 0.707 alpha tubulins. Conversely orthologs, with the exception of reproductive tissue isoforms, show little sequence variation except in the last 15 carboxy terminus tail (CTT residues, which serve as sites for post-translational modifications (PTMs and interactions

  3. Intragenic rearrangements in X-linked intellectual deficiency: results of a-CGH in a series of 54 patients and identification of TRPC5 and KLHL15 as potential XLID genes. (United States)

    Mignon-Ravix, Cécile; Cacciagli, Pierre; Choucair, Nancy; Popovici, Cornel; Missirian, Chantal; Milh, Mathieu; Mégarbané, André; Busa, Tiffany; Julia, Sophie; Girard, Nadine; Badens, Catherine; Sigaudy, Sabine; Philip, Nicole; Villard, Laurent


    High-resolution array comparative genomic hybridization (a-CGH) enables the detection of intragenic rearrangements, such as single exon deletion or duplication. This approach can lead to the identification of new disease genes. We report on the analysis of 54 male patients presenting with intellectual deficiency (ID) and a family history suggesting X-linked (XL) inheritance or maternal skewed X-chromosome inactivation (XCI), using a home-made X-chromosome-specific microarray covering the whole human X-chromosome at high resolution. The majority of patients had whole genome array-CGH prior to the selection and we did not include large rearrangements such as MECP2 and FMR1 duplications. We identified four rearrangements considered as causative or potentially pathogenic, corresponding to a detection rate of 8%. Two CNVs affected known XLID genes and were therefore considered as causative (IL1RAPL1 and OPHN1 intragenic deletions). Two new CNVs were considered as potentially pathogenic as they affected interesting candidates for ID. The first CNV is a deletion of the first exon of the TRPC5 gene, encoding a cation channel implicated in dendrite growth and patterning, in a child presenting with ID and an autism spectrum disorder (ASD). The second CNV is a partial deletion of KLHL15, in a patient with severe ID, epilepsy, and anomalies of cortical development. In both cases, in spite of strong arguments for clinical relevance, we were not able at this stage to confirm pathogenicity of the mutations, and the causality of the variants identified in XLID remains to be confirmed.

  4. Resolution and reconciliation of non-binary gene trees with transfers, duplications and losses. (United States)

    Jacox, Edwin; Weller, Mathias; Tannier, Eric; Scornavacca, Celine


    Gene trees reconstructed from sequence alignments contain poorly supported branches when the phylogenetic signal in the sequences is insufficient to determine them all. When a species tree is available, the signal of gains and losses of genes can be used to correctly resolve the unsupported parts of the gene history. However finding a most parsimonious binary resolution of a non-binary tree obtained by contracting the unsupported branches is NP-hard if transfer events are considered as possible gene scale events, in addition to gene origination, duplication and loss. We propose an exact, parameterized algorithm to solve this problem in single-exponential time, where the parameter is the number of connected branches of the gene tree that show low support from the sequence alignment or, equivalently, the maximum number of children of any node of the gene tree once the low-support branches have been collapsed. This improves on the best known algorithm by an exponential factor. We propose a way to choose among optimal solutions based on the available information. We show the usability of this principle on several simulated and biological datasets. The results are comparable in quality to several other tested methods having similar goals, but our approach provides a lower running time and a guarantee that the produced solution is optimal. Our algorithm has been integrated into the ecceTERA phylogeny package, available at and which can be run online at . Supplementary data are available at Bioinformatics online.

  5. Effects of Gene Duplication, Positive Selection, and Shifts in Gene Expression on the Evolution of the Venom Gland Transcriptome in Widow Spiders. (United States)

    Haney, Robert A; Clarke, Thomas H; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y; Ayoub, Nadia A; Garb, Jessica E


    Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland-specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species.

  6. A survey of innovation through duplication in the reduced genomes of twelve parasites.

    Directory of Open Access Journals (Sweden)

    Jeremy D DeBarry

    Full Text Available We characterize the prevalence, distribution, divergence, and putative functions of detectable two-copy paralogs and segmental duplications in the Apicomplexa, a phylum of parasitic protists. Apicomplexans are mostly obligate intracellular parasites responsible for human and animal diseases (e.g. malaria and toxoplasmosis. Gene loss is a major force in the phylum. Genomes are small and protein-encoding gene repertoires are reduced. Despite this genomic streamlining, duplications and gene family amplifications are present. The potential for innovation introduced by duplications is of particular interest. We compared genomes of twelve apicomplexans across four lineages and used orthology and genome cartography to map distributions of duplications against genome architectures. Segmental duplications appear limited to five species. Where present, they correspond to regions enriched for multi-copy and species-specific genes, pointing toward roles in adaptation and innovation. We found a phylum-wide association of duplications with dynamic chromosome regions and syntenic breakpoints. Trends in the distribution of duplicated genes indicate that recent, species-specific duplicates are often tandem while most others have been dispersed by genome rearrangements. These trends show a relationship between genome architecture and gene duplication. Functional analysis reveals: proteases, which are vital to a parasitic lifecycle, to be prominent in putative recent duplications; a pair of paralogous genes in Toxoplasma gondii previously shown to produce the rate-limiting step in dopamine synthesis in mammalian cells, a possible link to the modification of host behavior; and phylum-wide differences in expression and subcellular localization, indicative of modes of divergence. We have uncovered trends in multiple modes of duplicate divergence including sequence, intron content, expression, subcellular localization, and functions of putative recent duplicates that

  7. Large inverted duplications in the human genome form via a fold-back mechanism.

    Directory of Open Access Journals (Sweden)

    Karen E Hermetz


    Full Text Available Inverted duplications are a common type of copy number variation (CNV in germline and somatic genomes. Large duplications that include many genes can lead to both neurodevelopmental phenotypes in children and gene amplifications in tumors. There are several models for inverted duplication formation, most of which include a dicentric chromosome intermediate followed by breakage-fusion-bridge (BFB cycles, but the mechanisms that give rise to the inverted dicentric chromosome in most inverted duplications remain unknown. Here we have combined high-resolution array CGH, custom sequence capture, next-generation sequencing, and long-range PCR to analyze the breakpoints of 50 nonrecurrent inverted duplications in patients with intellectual disability, autism, and congenital anomalies. For half of the rearrangements in our study, we sequenced at least one breakpoint junction. Sequence analysis of breakpoint junctions reveals a normal-copy disomic spacer between inverted and non-inverted copies of the duplication. Further, short inverted sequences are present at the boundary of the disomic spacer and the inverted duplication. These data support a mechanism of inverted duplication formation whereby a chromosome with a double-strand break intrastrand pairs with itself to form a "fold-back" intermediate that, after DNA replication, produces a dicentric inverted chromosome with a disomic spacer corresponding to the site of the fold-back loop. This process can lead to inverted duplications adjacent to terminal deletions, inverted duplications juxtaposed to translocations, and inverted duplication ring chromosomes.

  8. The opsin repertoire of Jenynsia onca: a new perspective on gene duplication and divergence in livebearers

    Directory of Open Access Journals (Sweden)

    Owens Gregory L


    Full Text Available Abstract Background Jenynsia onca, commonly known as the one sided livebearer, is a member of the family Anablepidae. The opsin gene repertoires of J. onca's close relatives, the four-eyed fish (Anableps anableps and the guppy (Poecilia reticulata, have been characterized and each found to include one unique LWS opsin. Currently, the relationship among LWS paralogs and orthologs in these species are unclear, making it difficult to test the hypotheses that link vision to morphology or life history traits. The phylogenetic signal appears to have been disrupted by gene conversion. Here we have sequenced the opsin genes of J. onca in order to resolve these relationships. Findings We identified nine visual opsins; LWS S180r, LWS S180, LWS P180, SWS1, SWS2A, SWS2B, RH1, RH2-1, and RH2-2. Key site analysis revealed only one unique haplotype, RH2-2, although this is unlikely to shift λmax significantly. LWS P180 was found to be a product of a gene conversion event with LWS S180, followed by convergence to a proline residue at the 180 site. Conclusion Jenynsia onca has at least 9 visual opsins: three LWS, one RH1, two RH2, one SWS1 and two SWS2. The presence of LWS P180 moves the location of the LWS P180-S180 tandem duplication event back to the base of the Poeciliidae-Anablepidae clade, expanding the number of species possessing this unusual blue shifted LWS opsin. The presence of the LWS P180 gene also confirms that gene conversion events have homogenized opsin paralogs in fish, just as they have in humans.

  9. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    Directory of Open Access Journals (Sweden)

    Yong Guo

    Full Text Available The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max. In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  10. Identification of coding exon 3 duplication in the BMPR1A gene in a patient with juvenile polyposis syndrome. (United States)

    Yamaguchi, Junya; Nagayama, Satoshi; Chino, Akiko; Sakata, Ai; Yamamoto, Noriko; Sato, Yuri; Ashihara, Yuumi; Kita, Mizuho; Nomura, Sachio; Ishikawa, Yuichi; Igarashi, Masahiro; Ueno, Masashi; Arai, Masami


    Juvenile polyposis syndrome is an autosomal dominant inherited disorder characterized by multiple juvenile polyps arising in the gastrointestinal tract and an increased risk of gastrointestinal cancers, specifically colon cancer. BMPR1A and SMAD4 germline mutations have been found in patients with juvenile polyposis syndrome. We identified a BMPR1A mutation, which involves a duplication of coding exon 3 (c.230+452_333+441dup1995), on multiple ligation dependent probe amplification in a patient with juvenile polyposis syndrome. The mutation causes a frameshift, producing a truncated protein (p.D112NfsX2). Therefore, the mutation is believed to be pathogenic. We also identified a duplication breakpoint in which Alu sequences are located. These results suggest that the duplication event resulted from recombination between Alu sequences. To our knowledge, partial duplication in the BMPR1A gene has not been reported previously. This is the first case report to document coding exon 3 duplication in the BMPR1A gene in a patient with juvenile polyposis syndrome.

  11. Inhibition of RORγT Skews TCRα Gene Rearrangement and Limits T Cell Repertoire Diversity

    Directory of Open Access Journals (Sweden)

    Yanxia Guo


    Full Text Available Recent studies have elucidated the molecular mechanism of RORγT transcriptional regulation of Th17 differentiation and function. RORγT was initially identified as a transcription factor required for thymopoiesis by maintaining survival of CD4+CD8+ (DP thymocytes. While RORγ antagonists are currently being developed to treat autoimmunity, it remains unclear how RORγT inhibition may impact thymocyte development. In this study, we show that in addition to regulating DP thymocytes survival, RORγT also controls genes that regulate thymocyte migration, proliferation, and T cell receptor (TCRα selection. Strikingly, pharmacological inhibition of RORγ skews TCRα gene rearrangement, limits T cell repertoire diversity, and inhibits development of autoimmune encephalomyelitis. Thus, targeting RORγT not only inhibits Th17 cell development and function but also fundamentally alters thymic-emigrant recognition of self and foreign antigens. The analysis of RORγ inhibitors has allowed us to gain a broader perspective of the diverse function of RORγT and its impact on T cell biology.

  12. The evolution and maintenance of Hox gene clusters in vertebrates and the teleost-specific genome duplication. (United States)

    Kuraku, Shigehiro; Meyer, Axel


    Hox genes are known to specify spatial identities along the anterior-posterior axis during embryogenesis. In vertebrates and most other deuterostomes, they are arranged in sets of uninterrupted clusters on chromosomes, and are in most cases expressed in a "colinear" fashion, in which genes closer to the 3-end of the Hox clusters are expressed earlier and more anteriorly and genes close to the 5-end of the clusters later and more posteriorly. In this review, we summarize the current understanding of how Hox gene clusters have been modified from basal lineages of deuterostomes to diverse taxa of vertebrates. Our parsimony reconstruction of Hox cluster architecture at various stages of vertebrate evolution highlights that the variation in Hox cluster structures among jawed vertebrates is mostly due to secondary lineage-specific gene losses and an additional genome duplication that occurred in the actinopterygian stem lineage, the teleost-specific genome duplication (TSGD).

  13. The role of gene duplication and unconstrained selective pressures in the melanopsin gene family evolution and vertebrate circadian rhythm regulation. (United States)

    Borges, Rui; Johnson, Warren E; O'Brien, Stephen J; Vasconcelos, Vitor; Antunes, Agostinho


    Melanopsin is a photosensitive cell protein involved in regulating circadian rhythms and other non-visual responses to light. The melanopsin gene family is represented by two paralogs, OPN4x and OPN4m, which originated through gene duplication early in the emergence of vertebrates. Here we studied the melanopsin gene family using an integrated gene/protein evolutionary approach, which revealed that the rhabdomeric urbilaterian ancestor had the same amino acid patterns (DRY motif and the Y and E conterions) as extant vertebrate species, suggesting that the mechanism for light detection and regulation is similar to rhabdomeric rhodopsins. Both OPN4m and OPN4x paralogs are found in vertebrate genomic paralogons, suggesting that they diverged following this duplication event about 600 million years ago, when the complex eye emerged in the vertebrate ancestor. Melanopsins generally evolved under negative selection (ω = 0.171) with some minor episodes of positive selection (proportion of sites = 25%) and functional divergence (θ(I) = 0.349 and θ(II) = 0.126). The OPN4m and OPN4x melanopsin paralogs show evidence of spectral divergence at sites likely involved in melanopsin light absorbance (200F, 273S and 276A). Also, following the teleost lineage-specific whole genome duplication (3R) that prompted the teleost fish radiation, type I divergence (θ(I) = 0.181) and positive selection (affecting 11% of sites) contributed to amino acid variability that we related with the photo-activation stability of melanopsin. The melanopsin intracellular regions had unexpectedly high variability in their coupling specificity of G-proteins and we propose that Gq/11 and Gi/o are the two G-proteins most-likely to mediate the melanopsin phototransduction pathway. The selection signatures were mainly observed on retinal-related sites and the third and second intracellular loops, demonstrating the physiological plasticity of the melanopsin protein group. Our results provide new insights on

  14. The role of gene duplication and unconstrained selective pressures in the melanopsin gene family evolution and vertebrate circadian rhythm regulation.

    Directory of Open Access Journals (Sweden)

    Rui Borges

    Full Text Available Melanopsin is a photosensitive cell protein involved in regulating circadian rhythms and other non-visual responses to light. The melanopsin gene family is represented by two paralogs, OPN4x and OPN4m, which originated through gene duplication early in the emergence of vertebrates. Here we studied the melanopsin gene family using an integrated gene/protein evolutionary approach, which revealed that the rhabdomeric urbilaterian ancestor had the same amino acid patterns (DRY motif and the Y and E conterions as extant vertebrate species, suggesting that the mechanism for light detection and regulation is similar to rhabdomeric rhodopsins. Both OPN4m and OPN4x paralogs are found in vertebrate genomic paralogons, suggesting that they diverged following this duplication event about 600 million years ago, when the complex eye emerged in the vertebrate ancestor. Melanopsins generally evolved under negative selection (ω = 0.171 with some minor episodes of positive selection (proportion of sites = 25% and functional divergence (θ(I = 0.349 and θ(II = 0.126. The OPN4m and OPN4x melanopsin paralogs show evidence of spectral divergence at sites likely involved in melanopsin light absorbance (200F, 273S and 276A. Also, following the teleost lineage-specific whole genome duplication (3R that prompted the teleost fish radiation, type I divergence (θ(I = 0.181 and positive selection (affecting 11% of sites contributed to amino acid variability that we related with the photo-activation stability of melanopsin. The melanopsin intracellular regions had unexpectedly high variability in their coupling specificity of G-proteins and we propose that Gq/11 and Gi/o are the two G-proteins most-likely to mediate the melanopsin phototransduction pathway. The selection signatures were mainly observed on retinal-related sites and the third and second intracellular loops, demonstrating the physiological plasticity of the melanopsin protein group. Our results provide new

  15. Plant Genome Duplication Database. (United States)

    Lee, Tae-Ho; Kim, Junah; Robertson, Jon S; Paterson, Andrew H


    Genome duplication, widespread in flowering plants, is a driving force in evolution. Genome alignments between/within genomes facilitate identification of homologous regions and individual genes to investigate evolutionary consequences of genome duplication. PGDD (the Plant Genome Duplication Database), a public web service database, provides intra- or interplant genome alignment information. At present, PGDD contains information for 47 plants whose genome sequences have been released. Here, we describe methods for identification and estimation of dates of genome duplication and speciation by functions of PGDD.The database is freely available at

  16. Molecular Characterization of Duplicate Cytosolic Phosphoglucose Isomerase Genes in Clarkia and Comparison to the Single Gene in Arabidopsis (United States)

    Thomas, B. R.; Ford, V. S.; Pichersky, E.; Gottlieb, L. D.


    The nucleotide sequence of PgiC1-a which encodes a cytosolic isozyme of phosphoglucose isomerase (PGIC; EC in Clarkia lewisii, a wildflower native to California, is described and compared to the previously published sequence of the duplicate PgiC2-a from the same genome. Both genes have the same structure of 23 exons and 22 introns located in identical positions, and they encode proteins of 569 amino acids. Exon and inferred protein sequences of the two genes are 96.4% and 97.2% identical, respectively. Intron sequences are 88.2% identical. The high nucleotide similarity of the two genes is consistent with previous genetic and biosystematic findings that suggest the duplication arose within Clarkia. A partial sequence of PgiC2-b was also obtained. It is 99.5% identical to PgiC2-a in exons and 99.7% in introns. The nucleotide sequence of the single PgiC from Arabidopsis thaliana was also determined for comparison to the Clarkia genes. The A. thaliana PgiC has 21 introns located at positions identical to those in Clarkia PgiC1 and PgiC2, but lacks the intron that divides Clarkia exons 21 and 22. The A. thaliana PGIC protein is shorter, with 560 amino acids, and differs by about 17% from the Clarkia PGICs. The PgiC in A. thaliana was mapped to a site 20 cM from restriction fragment length polymorphism marker 331 on chromosome 5. PMID:8293986

  17. Allelic Polymorphism, Gene Duplication and Balancing Selection of MHC Class IIB Genes in the Omei Treefrog (Rhacophorus omeimontis)

    Institute of Scientific and Technical Information of China (English)

    Li HUANG; Mian ZHAO; Zhenhua LUO; Hua WU


    The worldwide declines in amphibian populations have largely been caused by infectious fungi and bacteria. Given that vertebrate immunity against these extracellular pathogens is primarily functioned by the major histocompatibility complex (MHC) class II molecules, the characterization and the evolution of amphibian MHC class II genes have attracted increasing attention. The polymorphism of MHC class II genes was found to be correlated with susceptibility to fungal pathogens in many amphibian species, suggesting the importance of studies on MHC class II genes for amphibians. However, such studies on MHC class II gene evolution have rarely been conducted on amphibians in China. In this study, we chose Omei treefrog (Rhacophorus omeimontis), which lived moist environments easy for breeding bacteria, to study the polymorphism of its MHC class II genes and the underlying evolutionary mechanisms. We amplified the entire MHC class IIB exon 2 sequence in the R. omeimontis using newly designed primers. We detected 102 putative alleles in 146 individuals. The number of alleles per individual ranged from one to seven, indicating that there are at least four loci containing MHC class IIB genes in R. omeimontis. The allelic polymorphism estimated from the 102 alleles in R. omeimontis was not high compared to that estimated in other anuran species. No significant gene recombination was detected in the 102 MHC class IIB exon 2 sequences. In contrast, both gene duplication and balancing selection greatly contributed to the variability in MHC class IIB exon 2 sequences of R. omeimontis. This study lays the groundwork for the future researches to comprehensively analyze the evolution of amphibian MHC genes and to assess the role of MHC gene polymorphisms in resistance against extracellular pathogens for amphibians in China.

  18. Rearrangements of chicken immunoglobulin genes in lymphoid cells transformed by the avian retroviral oncogene v-rel. (United States)

    Chen, L; Lim, M Y; Bose, H; Bishop, J M


    The retroviral oncogene v-rel transforms poorly characterized lymphoid cells. We have explored the nature of these cells by analyzing the configuration and expression of immunoglobulin genes in chicken hemopoietic cells transformed by v-rel. None of the transformed cells expressed their immunoglobulin genes. The cells fell into three classes: class I cells have their immunoglobulin genes potentially in an embryonic configuration; class II and class III cells have lost one copy of the lambda light chain locus and have one copy of the heavy chain locus rearranged into a configuration that differs from what is found in mature B cells. In class II cells, the other heavy chain locus may be in embryonic configuration, whereas it is deleted in class III cells. The first of these classes may represent the earliest stage of the lymphoid lineage yet encountered among virus-transformed cells, whereas the second and third classes represent an apparently anomalous rearrangement whose origin remains unknown.

  19. Multiple independent origins of mitochondrial control region duplications in the order Psittaciformes

    DEFF Research Database (Denmark)

    Schirtzinger, Erin E.; Tavares, Erika S.; Gonzales, Lauren A.;


    Mitochondrial genomes are generally thought to be under selection for compactness, due to their small size, consistent gene content, and a lack of introns or intergenic spacers. As more animal mitochondrial genomes are fully sequenced, rearrangements and partial duplications are being identified ...

  20. Multiple independent origins of mitochondrial control region duplications in the order Psittaciformes

    DEFF Research Database (Denmark)

    Schirtzinger, Erin E.; Tavares, Erika S.; Gonzales, Lauren A.


    Mitochondrial genomes are generally thought to be under selection for compactness, due to their small size, consistent gene content, and a lack of introns or intergenic spacers. As more animal mitochondrial genomes are fully sequenced, rearrangements and partial duplications are being identified ...

  1. Whole-genome duplications spurred the functional diversification of the globin gene superfamily in vertebrates. (United States)

    Hoffmann, Federico G; Opazo, Juan C; Storz, Jay F


    It has been hypothesized that two successive rounds of whole-genome duplication (WGD) in the stem lineage of vertebrates provided genetic raw materials for the evolutionary innovation of many vertebrate-specific features. However, it has seldom been possible to trace such innovations to specific functional differences between paralogous gene products that derive from a WGD event. Here, we report genomic evidence for a direct link between WGD and key physiological innovations in the vertebrate oxygen transport system. Specifically, we demonstrate that key globin proteins that evolved specialized functions in different aspects of oxidative metabolism (hemoglobin, myoglobin, and cytoglobin) represent paralogous products of two WGD events in the vertebrate common ancestor. Analysis of conserved macrosynteny between the genomes of vertebrates and amphioxus (subphylum Cephalochordata) revealed that homologous chromosomal segments defined by myoglobin + globin-E, cytoglobin, and the α-globin gene cluster each descend from the same linkage group in the reconstructed proto-karyotype of the chordate common ancestor. The physiological division of labor between the oxygen transport function of hemoglobin and the oxygen storage function of myoglobin played a pivotal role in the evolution of aerobic energy metabolism, supporting the hypothesis that WGDs helped fuel key innovations in vertebrate evolution.

  2. Duplications of the neuropeptide receptor gene VIPR2 confer significant risk for schizophrenia.

    LENUS (Irish Health Repository)

    Vacic, Vladimir


    Rare copy number variants (CNVs) have a prominent role in the aetiology of schizophrenia and other neuropsychiatric disorders. Substantial risk for schizophrenia is conferred by large (>500-kilobase) CNVs at several loci, including microdeletions at 1q21.1 (ref. 2), 3q29 (ref. 3), 15q13.3 (ref. 2) and 22q11.2 (ref. 4) and microduplication at 16p11.2 (ref. 5). However, these CNVs collectively account for a small fraction (2-4%) of cases, and the relevant genes and neurobiological mechanisms are not well understood. Here we performed a large two-stage genome-wide scan of rare CNVs and report the significant association of copy number gains at chromosome 7q36.3 with schizophrenia. Microduplications with variable breakpoints occurred within a 362-kilobase region and were detected in 29 of 8,290 (0.35%) patients versus 2 of 7,431 (0.03%) controls in the combined sample. All duplications overlapped or were located within 89 kilobases upstream of the vasoactive intestinal peptide receptor gene VIPR2. VIPR2 transcription and cyclic-AMP signalling were significantly increased in cultured lymphocytes from patients with microduplications of 7q36.3. These findings implicate altered vasoactive intestinal peptide signalling in the pathogenesis of schizophrenia and indicate the VPAC2 receptor as a potential target for the development of new antipsychotic drugs.

  3. Mirror-image duplication of the primary axis and heart in Xenopus embryos by the overexpression of Msx-1 gene. (United States)

    Chen, Y; Solursh, M


    The Msx-1 gene (formerly known as Hox-7) is a member of a discrete subclass of homeobox-containing genes. Examination of the expression pattern of Msx-1 in murine and avian embryos suggests that this gene may be involved in the regionalization of the medio-lateral axis during earlier development. We have examined the possible functions of Xenopus Msx-1 during early Xenopus embryonic development by overexpression of the Msx-1 gene. Overexpression of Msx-1 causes a left-right mirror-image duplication of primary axial structures, including notochord, neural tube, somites, suckers, and foregut. The embryonic developing heart is also mirror-image duplicated, including looping directions and polarity. These results indicate that Msx-1 may be involved in the mesoderm formation as well as left-right patterning in the early Xenopus embryonic development.

  4. Evolution of Vertebrate Adam Genes; Duplication of Testicular Adams from Ancient Adam9/9-like Loci. (United States)

    Bahudhanapati, Harinath; Bhattacharya, Shashwati; Wei, Shuo


    Members of the disintegrin metalloproteinase (ADAM) family have important functions in regulating cell-cell and cell-matrix interactions as well as cell signaling. There are two major types of ADAMs: the somatic ADAMs (sADAMs) that have a significant presence in somatic tissues, and the testicular ADAMs (tADAMs) that are expressed predominantly in the testis. Genes encoding tADAMs can be further divided into two groups: group I (intronless) and group II (intron-containing). To date, tAdams have only been reported in placental mammals, and their evolutionary origin and relationship to sAdams remain largely unknown. Using phylogenetic and syntenic tools, we analyzed the Adam genes in various vertebrates ranging from fishes to placental mammals. Our analyses reveal duplication and loss of some sAdams in certain vertebrate species. In particular, there exists an Adam9-like gene in non-mammalian vertebrates but not mammals. We also identified putative group I and group II tAdams in all amniote species that have been examined. These tAdam homologues are more closely related to Adams 9 and 9-like than to other sAdams. In all amniote species examined, group II tAdams lie in close vicinity to Adam9 and hence likely arose from tandem duplication, whereas group I tAdams likely originated through retroposition because of their lack of introns. Clusters of multiple group I tAdams are also common, suggesting tandem duplication after retroposition. Therefore, Adam9/9-like and some of the derived tAdam loci are likely preferred targets for tandem duplication and/or retroposition. Consistent with this hypothesis, we identified a young retroposed gene that duplicated recently from Adam9 in the opossum. As a result of gene duplication, some tAdams were pseudogenized in certain species, whereas others acquired new expression patterns and functions. The rapid duplication of Adam genes has a major contribution to the diversity of ADAMs in various vertebrate species.

  5. Function of Partially Duplicated Human α7 Nicotinic Receptor Subunit CHRFAM7A Gene (United States)

    de Lucas-Cerrillo, Ana M.; Maldifassi, M. Constanza; Arnalich, Francisco; Renart, Jaime; Atienza, Gema; Serantes, Rocío; Cruces, Jesús; Sánchez-Pacheco, Aurora; Andrés-Mateos, Eva; Montiel, Carmen


    The neuronal α7 nicotinic receptor subunit gene (CHRNA7) is partially duplicated in the human genome forming a hybrid gene (CHRFAM7A) with the novel FAM7A gene. The hybrid gene transcript, dupα7, has been identified in brain, immune cells, and the HL-60 cell line, although its translation and function are still unknown. In this study, dupα7 cDNA has been cloned and expressed in GH4C1 cells and Xenopus oocytes to study the pattern and functional role of the expressed protein. Our results reveal that dupα7 transcript was natively translated in HL-60 cells and heterologously expressed in GH4C1 cells and oocytes. Injection of dupα7 mRNA into oocytes failed to generate functional receptors, but when co-injected with α7 mRNA at α7/dupα7 ratios of 5:1, 2:1, 1:1, 1:5, and 1:10, it reduced the nicotine-elicited α7 current generated in control oocytes (α7 alone) by 26, 53, 75, 93, and 94%, respectively. This effect is mainly due to a reduction in the number of functional α7 receptors reaching the oocyte membrane, as deduced from α-bungarotoxin binding and fluorescent confocal assays. Two additional findings open the possibility that the dominant negative effect of dupα7 on α7 receptor activity observed in vitro could be extrapolated to in vivo situations. (i) Compared with α7 mRNA, basal dupα7 mRNA levels are substantial in human cerebral cortex and higher in macrophages. (ii) dupα7 mRNA levels in macrophages are down-regulated by IL-1β, LPS, and nicotine. Thus, dupα7 could modulate α7 receptor-mediated synaptic transmission and cholinergic anti-inflammatory response. PMID:21047781

  6. Octocoral mitochondrial genomes provide insights into the phylogenetic history of gene order rearrangements, order reversals, and cnidarian phylogenetics. (United States)

    Figueroa, Diego F; Baco, Amy R


    We use full mitochondrial genomes to test the robustness of the phylogeny of the Octocorallia, to determine the evolutionary pathway for the five known mitochondrial gene rearrangements in octocorals, and to test the suitability of using mitochondrial genomes for higher taxonomic-level phylogenetic reconstructions. Our phylogeny supports three major divisions within the Octocorallia and show that Paragorgiidae is paraphyletic, with Sibogagorgia forming a sister branch to the Coralliidae. Furthermore, Sibogagorgia cauliflora has what is presumed to be the ancestral gene order in octocorals, but the presence of a pair of inverted repeat sequences suggest that this gene order was not conserved but rather evolved back to this apparent ancestral state. Based on this we recommend the resurrection of the family Sibogagorgiidae to fix the paraphyly of the Paragorgiidae. This is the first study to show that in the Octocorallia, mitochondrial gene orders have evolved back to an ancestral state after going through a gene rearrangement, with at least one of the gene orders evolving independently in different lineages. A number of studies have used gene boundaries to determine the type of mitochondrial gene arrangement present. However, our findings suggest that this method known as gene junction screening may miss evolutionary reversals. Additionally, substitution saturation analysis demonstrates that while whole mitochondrial genomes can be used effectively for phylogenetic analyses within Octocorallia, their utility at higher taxonomic levels within Cnidaria is inadequate. Therefore for phylogenetic reconstruction at taxonomic levels higher than subclass within the Cnidaria, nuclear genes will be required, even when whole mitochondrial genomes are available.

  7. The fate of tandemly duplicated genes assessed by the expression analysis of a group of Arabidopsis thaliana RING-H2 ubiquitin ligase genes of the ATL family. (United States)

    Aguilar-Hernández, Victor; Guzmán, Plinio


    Gene duplication events exert key functions on gene innovations during the evolution of the eukaryotic genomes. A large portion of the total gene content in plants arose from tandem duplications events, which often result in paralog genes with high sequence identity. Ubiquitin ligases or E3 enzymes are components of the ubiquitin proteasome system that function during the transfer of the ubiquitin molecule to the substrate. In plants, several E3s have expanded in their genomes as multigene families. To gain insight into the consequences of gene duplications on the expansion and diversification of E3s, we examined the evolutionary basis of a cluster of six genes, duplC-ATLs, which arose from segmental and tandem duplication events in Brassicaceae. The assessment of the expression suggested two patterns that are supported by lineage. While retention of expression domains was observed, an apparent absence or reduction of expression was also inferred. We found that two duplC-ATL genes underwent pseudogenization and that, in one case, gene expression is probably regained. Our findings provide insights into the evolution of gene families in plants, defining key events on the expansion of the Arabidopsis Tóxicos en Levadura family of E3 ligases.

  8. Genesis of the vertebrate FoxP subfamily member genes occurred during two ancestral whole genome duplication events. (United States)

    Song, Xiaowei; Tang, Yezhong; Wang, Yajun


    The vertebrate FoxP subfamily genes play important roles in the construction of essential functional modules involved in physiological and developmental processes. To explore the adaptive evolution of functional modules associated with the FoxP subfamily member genes, it is necessary to study the gene duplication process. We detected four member genes of the FoxP subfamily in sea lampreys (a representative species of jawless vertebrates) through genome screenings and phylogenetic analyses. Reliable paralogons (i.e. paralogous chromosome segments) have rarely been detected in scaffolds of FoxP subfamily member genes in sea lampreys due to the considerable existence of HTH_Tnp_Tc3_2 transposases. However, these transposases did not alter gene numbers of the FoxP subfamily in sea lampreys. The coincidence between the "1-4" gene duplication pattern of FoxP subfamily genes from invertebrates to vertebrates and two rounds of ancestral whole genome duplication (1R- and 2R-WGD) events reveal that the FoxP subfamily of vertebrates was quadruplicated in the 1R- and 2R-WGD events. Furthermore, we deduced that a synchronous gene duplication process occurred for the FoxP subfamily and for three linked gene families/subfamilies (i.e. MIT family, mGluR group III and PLXNA subfamily) in the 1R- and 2R-WGD events using phylogenetic analyses and mirror-dendrogram methods (i.e. algorithms to test protein-protein interactions). Specifically, the ancestor of FoxP1 and FoxP3 and the ancestor of FoxP2 and FoxP4 were generated in 1R-WGD event. In the subsequent 2R-WGD event, these two ancestral genes were changed into FoxP1, FoxP2, FoxP3 and FoxP4. The elucidation of these gene duplication processes shed light on the phylogenetic relationships between functional modules of the FoxP subfamily member genes.

  9. Postsplenectomy cytomegaloviral mononucleosis: marked lymphocytosis, TCRgamma gene rearrangements, and impaired IgM response. (United States)

    Han, Xiang Y; Lin, Pei; Amin, Hesham M; Ferrajoli, Alessandra


    People who have undergone splenectomy mount a poor IgM response to bacterial polysaccharide vaccines. Whether this defect is true during natural bacterial and viral infections is unknown. We present 2 cases of postsplenectomy cytomegalovirus (CMV)-induced mononucleosis with impaired IgM but normal to augmented IgG response. The cases presented initial diagnostic challenges owing to a prolonged course of infection, marked lymphocytosis (peak lymphocyte count, 27,900/microL [27.9 10(9)/L]), clonal T-cell proliferation with T-cell receptor g gene rearrangements, and remote history of splenectomy. However, the acute nature of the infections, serial determinations of the anti-CMV IgM and IgG, exclusion of other causes, and detection of CMV in the blood established the diagnosis and revealed the deranged antibody response. The infections resolved without specific treatment. These cases suggest that the spleen might be a primary site for specific anti-CMV IgM response.

  10. Diagnosis of mantle cell lymphoma and detection of bcl-1 gene rearrangement

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Seung Sook; Cho, Kyung Ja; Lee, Sun Joo [Korea Cancer Center Hospital, Seoul (Korea, Republic of)


    We reclassified a large series of non-Hogkin`s lymphoma diagnosed at Korea Cancer Center Hospital from 1991 to 1995, according to REAL classification, and compared the efficacy of immunohistochemical study for cyclin D1 protein and PCR for bcl-1 gene rearrangement to diagnose mantle cell lymphoma (MCL). By REAL classification, 7 %, diffuse large B-cell lymphoma was the most common type (51.8%) and was followed by peripheral T-cell lymphoma-unspecified (10%) and angiocentric lymphoma (7.5%). The most reliable histologic finding was mitosis to make a differential diagnosis. Mitoses of MCL were 17/10 HPF in average and all the cases showed more than 10/10 HPF. Immunophenotypic study alone cannot lead to a differential diagnosis between MCL and SLL, and the overexpression of cyclin D1 was the most important for diagnosis of MCL . Both immunohistochemistry for cyclin D1 and PCR for bcl-1 were specific for MCL and immunohistochemistry was more sensitive than PCR. Statistical analysis showed a different survival rate between MCL and the other low-grade B-cell lymphomas (SLL + MALT + LPL) and a difference between MCL and SLL. Immunohistochemical detection of cyclin D1 has a practical usefulness in making routine diagnosis of MCL. The initial accurate diagnosis of MCL will help clinicians make a proper management. (author). 27 refs., 6 tabs., 4 figs.

  11. Genomic disorders: molecular mechanisms for rearrangements and conveyed phenotypes.

    Directory of Open Access Journals (Sweden)

    James R Lupski


    Full Text Available Rearrangements of our genome can be responsible for inherited as well as sporadic traits. The analyses of chromosome breakpoints in the proximal short arm of Chromosome 17 (17p reveal nonallelic homologous recombination (NAHR as a major mechanism for recurrent rearrangements whereas nonhomologous end-joining (NHEJ can be responsible for many of the nonrecurrent rearrangements. Genome architectural features consisting of low-copy repeats (LCRs, or segmental duplications, can stimulate and mediate NAHR, and there are hotspots for the crossovers within the LCRs. Rearrangements introduce variation into our genome for selection to act upon and as such serve an evolutionary function analogous to base pair changes. Genomic rearrangements may cause Mendelian diseases, produce complex traits such as behaviors, or represent benign polymorphic changes. The mechanisms by which rearrangements convey phenotypes are diverse and include gene dosage, gene interruption, generation of a fusion gene, position effects, unmasking of recessive coding region mutations (single nucleotide polymorphisms, SNPs, in coding DNA or other functional SNPs, and perhaps by effects on transvection.

  12. Closely linked H2B genes in the marine copepod, Tigriopus californicus indicate a recent gene duplication or gene conversion event. (United States)

    Brown, D; Cook, A; Wagner, M; Wells, D


    Two nonallelic histone gene clusters were characterized in the marine copepod, Tigriopus californicus. The DNA sequence of one of the clusters reveals six genes in the contiguous arrangement of H2B, H1, H3, H4, H2B and H2A. The order of genes within the second cluster is H3, H4, H2B and H2A. There is no evidence for the presence of an H1 gene in this cluster. Comparison of the three copepod H2B genes reveals a high degree of similarity between the 5' upstream regions and between the amino terminal halves of the two H2B genes found within the same cluster. From these data we infer that gene duplication and/or gene conversion events occurred within this cluster in the recent past.

  13. Duplication of the dystroglycan gene in most branches of teleost fish

    Directory of Open Access Journals (Sweden)

    Giardina Bruno


    Full Text Available Abstract Background The dystroglycan (DG complex is a major non-integrin cell adhesion system whose multiple biological roles involve, among others, skeletal muscle stability, embryonic development and synapse maturation. DG is composed of two subunits: α-DG, extracellular and highly glycosylated, and the transmembrane β-DG, linking the cytoskeleton to the surrounding basement membrane in a wide variety of tissues. A single copy of the DG gene (DAG1 has been identified so far in humans and other mammals, encoding for a precursor protein which is post-translationally cleaved to liberate the two DG subunits. Similarly, D. rerio (zebrafish seems to have a single copy of DAG1, whose removal was shown to cause a severe dystrophic phenotype in adult animals, although it is known that during evolution, due to a whole genome duplication (WGD event, many teleost fish acquired multiple copies of several genes (paralogues. Results Data mining of pufferfish (T. nigroviridis and T. rubripes and other teleost fish (O. latipes and G. aculeatus available nucleotide sequences revealed the presence of two functional paralogous DG sequences. RT-PCR analysis proved that both the DG sequences are transcribed in T. nigroviridis. One of the two DG sequences harbours an additional mini-intronic sequence, 137 bp long, interrupting the uncomplicated exon-intron-exon pattern displayed by DAG1 in mammals and D. rerio. A similar scenario emerged also in D. labrax (sea bass, from whose genome we have cloned and sequenced a new DG sequence that also harbours a shorter additional intronic sequence of 116 bp. Western blot analysis confirmed the presence of DG protein products in all the species analysed including two teleost Antarctic species (T. bernacchii and C. hamatus. Conclusion Our evolutionary analysis has shown that the whole-genome duplication event in the Class Actinopterygii (ray-finned fish involved also DAG1. We unravelled new important molecular genetic details

  14. Biological Consequences of Ancient Gene Acquisition and Duplication in the Large Genome of Candidatus Solibacter usitatus Ellin6076

    Energy Technology Data Exchange (ETDEWEB)

    Challacombe, Jean F [ORNL; Eichorst, Stephanie A [Los Alamos National Laboratory (LANL); Hauser, Loren John [ORNL; Land, Miriam L [ORNL; Xie, Gary [Los Alamos National Laboratory (LANL); Kuske, Cheryl R [Los Alamos National Laboratory (LANL)


    Members of the bacterial phylum Acidobacteria are widespread in soils and sediments worldwide, and are abundant in many soils. Acidobacteria are challenging to culture in vitro, and many basic features of their biology and functional roles in the soil have not been determined. Candidatus Solibacter usitatus strain Ellin6076 has a 9.9 Mb genome that is approximately 2 5 times as large as the other sequenced Acidobacteria genomes. Bacterial genome sizes typically range from 0.5 to 10 Mb and are influenced by gene duplication, horizontal gene transfer, gene loss and other evolutionary processes. Our comparative genome analyses indicate that the Ellin6076 large genome has arisen by horizontal gene transfer via ancient bacteriophage and/or plasmid-mediated transduction, and widespread small-scale gene duplications, resulting in an increased number of paralogs. Low amino acid sequence identities among functional group members, and lack of conserved gene order and orientation in regions containing similar groups of paralogs, suggest that most of the paralogs are not the result of recent duplication events. The genome sizes of additional cultured Acidobacteria strains were estimated using pulsed-field gel electrophoresis to determine the prevalence of the large genome trait within the phylum. Members of subdivision 3 had larger genomes than those of subdivision 1, but none were as large as the Ellin6076 genome. The large genome of Ellin6076 may not be typical of the phylum, and encodes traits that could provide a selective metabolic, defensive and regulatory advantage in the soil environment.

  15. Biological consequences of ancient gene acquisition and duplication in the large genome soil bacterium, ""solibacter usitatus"" strain Ellin6076

    Energy Technology Data Exchange (ETDEWEB)

    Challacombe, Jean F [Los Alamos National Laboratory; Eichorst, Stephanie A [Los Alamos National Laboratory; Xie, Gary [Los Alamos National Laboratory; Kuske, Cheryl R [Los Alamos National Laboratory; Hauser, Loren [ORNL; Land, Miriam [ORNL


    Bacterial genome sizes range from ca. 0.5 to 10Mb and are influenced by gene duplication, horizontal gene transfer, gene loss and other evolutionary processes. Sequenced genomes of strains in the phylum Acidobacteria revealed that 'Solibacter usistatus' strain Ellin6076 harbors a 9.9 Mb genome. This large genome appears to have arisen by horizontal gene transfer via ancient bacteriophage and plasmid-mediated transduction, as well as widespread small-scale gene duplications. This has resulted in an increased number of paralogs that are potentially ecologically important (ecoparalogs). Low amino acid sequence identities among functional group members and lack of conserved gene order and orientation in the regions containing similar groups of paralogs suggest that most of the paralogs were not the result of recent duplication events. The genome sizes of cultured subdivision 1 and 3 strains in the phylum Acidobacteria were estimated using pulsed-field gel electrophoresis to determine the prevalence of the large genome trait within the phylum. Members of subdivision 1 were estimated to have smaller genome sizes ranging from ca. 2.0 to 4.8 Mb, whereas members of subdivision 3 had slightly larger genomes, from ca. 5.8 to 9.9 Mb. It is hypothesized that the large genome of strain Ellin6076 encodes traits that provide a selective metabolic, defensive and regulatory advantage in the variable soil environment.

  16. A large duplication in the gene for lysyl hydroxylase accounts for the type VI variant of Ehlers-Danlos syndrome in two siblings

    Energy Technology Data Exchange (ETDEWEB)

    Hautala, T.; Heikkinen, J.; Kivirikko, K.I.; Myllylae, R. (Univ. of Oulu (Finland))


    Ehlers-Danlos syndrome is a deterogeneous disorder characterized by joint hypermobility, skin hyperextensibility, fragility, and other sign of connective tissue involvement. In addition to these, the type VI variant of the disease has some special characteristics such as kyphoscoliosis and ocular abnormalities. The biochemical abnormality in most patients with this autosomal recessively inherited type IV variant is a deficiency in the activity of lysyl hydroxylase (EC 1.14,11.4), the enzyme catalyzing the formation of hydroxylysine in collagens and other proteins with collagen-like amino acid sequences. The type VI variant of Ehlers-Danlos syndrome was first identified in two sisters with a reduced amount of lysyl hydroxylase activity in their skin fibroblasts (S.R. Pinnell, S.M. Krane, J.E. Kenzora, and M.J. Glimcher (1972) N. Engl. J. Med. 286; 1013-1020). Our recent molecular cloning of lysyl hydroxylase has now made it possible to study the mutations leading to the deficiency in lysyl dydroxylase activity in these cells. Our data indicate that the mRNA for lysyl hydroxylase produced in the affected cells is about 4 kb in size, whereas it is 3.2 kb in the control cells. The sequencing of the cDNA for lysyl hydroxylase from the affected cells revealed an apparently homozygous duplication rearrangement of nucleotides 1176 to 1955, corresponding to amino acids 326 to 585 in the normal sequence. From Southern blotting data, the duplicated area in the gene equals about 6-9 kb and corresponds to seven exons. 35 refs., 4 figs.

  17. Voltage-gated sodium channel gene repertoire of lampreys: gene duplications, tissue-specific expression and discovery of a long-lost gene. (United States)

    Zakon, Harold H; Li, Weiming; Pillai, Nisha E; Tohari, Sumanty; Shingate, Prashant; Ren, Jianfeng; Venkatesh, Byrappa


    Studies of the voltage-gated sodium (Nav) channels of extant gnathostomes have made it possible to deduce that ancestral gnathostomes possessed four voltage-gated sodium channel genes derived from a single ancestral chordate gene following two rounds of genome duplication early in vertebrates. We investigated the Nav gene family in two species of lampreys (the Japanese lamprey Lethenteron japonicum and sea lamprey Petromyzon marinus) (jawless vertebrates-agnatha) and compared them with those of basal vertebrates to better understand the origin of Nav genes in vertebrates. We noted six Nav genes in both lamprey species, but orthology with gnathostome (jawed vertebrate) channels was inconclusive. Surprisingly, the Nav2 gene, ubiquitously found in invertebrates and believed to have been lost in vertebrates, is present in lampreys, elephant shark (Callorhinchus milii) and coelacanth (Latimeria chalumnae). Despite repeated duplication of the Nav1 family in vertebrates, Nav2 is only in single copy in those vertebrates in which it is retained, and was independently lost in ray-finned fishes and tetrapods. Of the other five Nav channel genes, most were expressed in brain, one in brain and heart, and one exclusively in skeletal muscle. Invertebrates do not express Nav channel genes in muscle. Thus, early in the vertebrate lineage Nav channels began to diversify and different genes began to express in heart and muscle. © 2017 The Author(s).

  18. Deep sequencing of the murine Igh repertoire reveals complex regulation of non-random V gene rearrangement frequencies (United States)

    Choi, Nancy M.; Loguercio, Salvatore; Verma-Gaur, Jiyoti; Degner, Stephanie C.; Torkamani, Ali; Su, Andrew I.; Oltz, Eugene M.; Artyomov, Maxim; Feeney, Ann J.


    A diverse antibody repertoire is formed through the rearrangement of V, D, and J segments at the immunoglobulin heavy chain (Igh) loci. The C57BL/6 murine Igh locus has over 100 functional VH gene segments that can recombine to a rearranged DJH. While the non-random usage of VH genes is well documented, it is not clear what elements determine recombination frequency. To answer this question we conducted deep sequencing of 5′-RACE products of the Igh repertoire in pro-B cells, amplified in an unbiased manner. ChIP-seq results for several histone modifications and RNA polymerase II binding, RNA-seq for sense and antisense non-coding germline transcripts, and proximity to CTCF and Rad21 sites were compared to the usage of individual V genes. Computational analyses assessed the relative importance of these various accessibility elements. These elements divide the Igh locus into four epigenetically and transcriptionally distinct domains, and our computational analyses reveal different regulatory mechanisms for each region. Proximal V genes are relatively devoid of active histone marks and non-coding RNA in general, but having a CTCF site near their RSS is critical, suggesting that being positioned near the base of the chromatin loops is important for rearrangement. In contrast, distal V genes have higher levels of histone marks and non-coding RNA, which may compensate for their poorer RSSs and for being distant from CTCF sites. Thus, the Igh locus has evolved a complex system for the regulation of V(D)J rearrangement that is different for each of the four domains that comprise this locus. PMID:23898036

  19. Detection of immunoglobulin IGH gene rearrangements on formalin-fixed, paraffin embedded tissue in lymphoid malignancies. (United States)

    Moharrami, G; Ghorbian, S; Seifi, M; Estiar, M A; Fakhrjoo, A; Sakhinia, M; Sakhinia, E


    Human lymphomas are aggressive malignant diseases, which can be categorized based on their B and T cell lineage. B-cell lymphomas form around 90% of the total lymphoma cases, the remnants of malignancies arise from the T cell branch. Lymphomas are mostly characterized as clonal proliferations of specific tumor cells. The detection of malignant lymphomas are extensively investigated by their morphological features, immunohistochemistry and flowcytometric immunophenotyping, but in some of cases remained unknown. The BIOMED-2 protocols were used to determine the clonality of IGH gene rearrangements in patients with lymphoma. PCR amplification was performed on FFPE of 50 patients with B-cell lymphoma, which consisted of 11 cases with HLs, 25 cases of B-NHLs and 14 cases of B-LPD (lymphoproliferative disorders) that diagnosed as unclassifiable lymphoma. The rate of positive clonality was detected in 96% (24/25) of B-NHLs, whereas in 4% (1/25) of cases clonality was showed in a polyclonal pattern. In B-HLs, 82% (9/11) of cases showed clonality and 18% (2/11) of the cases showed polyclonality. The rate of positive clonality observed in 64.3% (9/14) of cases with B-LPD and 35.7% (5/14) of cases clonality was not detected in any of immunoglobulin gene family (FR1, FR2, FR3). In groups with DLBCL, clonality was detected in 95% (19/20) of the cases. In patients diagnosed with FL and MALTs 100% cases showed clonality for complete IGH. Our study revealed that EuroClonality BIOMED-2 protocols could be considered as a valuable and reliable method for clonality detection, especially in IGH analysis.

  20. North Carolina macular dystrophy (MCDR1) caused by a novel tandem duplication of the PRDM13 gene (United States)

    Sullivan, Lori S.; Wheaton, Dianna K.; Locke, Kirsten G.; Jones, Kaylie D.; Koboldt, Daniel C.; Fulton, Robert S.; Wilson, Richard K.; Blanton, Susan H.; Birch, David G.; Daiger, Stephen P.


    Purpose To identify the underlying cause of disease in a large family with North Carolina macular dystrophy (NCMD). Methods A large four-generation family (RFS355) with an autosomal dominant form of NCMD was ascertained. Family members underwent comprehensive visual function evaluations. Blood or saliva from six affected family members and three unaffected spouses was collected and DNA tested for linkage to the MCDR1 locus on chromosome 6q12. Three affected family members and two unaffected spouses underwent whole exome sequencing (WES) and subsequently, custom capture of the linkage region followed by next-generation sequencing (NGS). Standard PCR and dideoxy sequencing were used to further characterize the mutation. Results Of the 12 eyes examined in six affected individuals, all but two had Gass grade 3 macular degeneration features. Large central excavation of the retinal and choroid layers, referred to as a macular caldera, was seen in an age-independent manner in the grade 3 eyes. The calderas are unique to affected individuals with MCDR1. Genome-wide linkage mapping and haplotype analysis of markers from the chromosome 6q region were consistent with linkage to the MCDR1 locus. Whole exome sequencing and custom-capture NGS failed to reveal any rare coding variants segregating with the phenotype. Analysis of the custom-capture NGS sequencing data for copy number variants uncovered a tandem duplication of approximately 60 kb on chromosome 6q. This region contains two genes, CCNC and PRDM13. The duplication creates a partial copy of CCNC and a complete copy of PRDM13. The duplication was found in all affected members of the family and is not present in any unaffected members. The duplication was not seen in 200 ethnically matched normal chromosomes. Conclusions The cause of disease in the original family with MCDR1 and several others has been recently reported to be dysregulation of the PRDM13 gene, caused by either single base substitutions in a DNase 1

  1. Clonal diversity of Ig and T-cell receptor gene rearrangements in childhood B-precursor acute lymphoblastic leukaemia. (United States)

    Stankovic, T; Weston, V; McConville, C M; Green, E; Powell, J E; Mann, J R; Darbyshire, P J; Taylor, A M


    The majority of paediatric B precursor acute lymphoblastic leukaemias in children are derived from a single transformed haematopoietic cell with complete or partial VDJ recombination within the immunoglobulin heavy chain gene. A high frequency of patients also show rearrangements within TCRdelta and TCRgamma loci and in up to 40% of children there is an excess of immune system gene rearrangements compared with the number of identified alleles of immune system genes, suggesting the presence of multiple leukaemic subclones -clonal diversity. It has been observed by us and other investigators that in individual patients the pattern of immune system gene rearrangements often changes between presentation and relapse. In order to explore the possibility that clonal diversity plays a biological role during disease progression we optimised methods for subclone detection and analysed the prognostic significance of clonal diversity among 75 children with B precursor-ALL. Our results suggest that clonal diversity plays a role in disease progression as patients with oligoclonal disease showed a significantly shorter disease free survival than patients with monoclonal disease. This trend was of particular importance in the 'standard risk' group of ALL where aggressive disease could not be recognised by other means. In addition, generation of independent subclones from an early, non-rearranged tumour progenitor appears to be a common feature among leukaemias with aggressive clinical behaviour. We speculate on the type of genetic factors which may participate both in the generation of subclones and also in wider genomic instability and which are likely to be required for the aggressive clinical phenotype in children with ALL.

  2. Ancestral gene duplication enabled the evolution of multifunctional cellulases in stick insects (Phasmatodea). (United States)

    Shelomi, Matan; Heckel, David G; Pauchet, Yannick


    The Phasmatodea (stick insects) have multiple, endogenous, highly expressed copies of glycoside hydrolase family 9 (GH9) genes. The purpose for retaining so many was unknown. We cloned and expressed the enzymes in transfected insect cell lines, and tested the individual proteins against different plant cell wall component poly- and oligosaccharides. Nearly all isolated enzymes were active against carboxymethylcellulose, however most could also degrade glucomannan, and some also either xylan or xyloglucan. The latter two enzyme groups were each monophyletic, suggesting the evolution of these novel substrate specificities in an early ancestor of the order. Such enzymes are highly unusual for Metazoa, for which no xyloglucanases had been reported. Phasmatodea gut extracts could degrade multiple plant cell wall components fully into sugar monomers, suggesting that enzymatic breakdown of plant cell walls by the entire Phasmatodea digestome may contribute to the Phasmatodea nutritional budget. The duplication and neofunctionalization of GH9s in the ancestral Phasmatodea may have enabled them to specialize as folivores and diverge from their omnivorous ancestors. The structural changes enabling these unprecedented activities in the cellulases require further study.

  3. Probing the evolution of biological nitrogen fixation by examining phylogenetic relationships of nitrogen fixation genes related by gene duplication (United States)

    Peters, J.; Boyd, E. S.; Hamilton, T.


    Mounting evidence indicates the presence of a near complete biological nitrogen cycle in redox stratified oceans during the late Archean to early Proterozoic (~2.5 to 2.0 Ga). It has been suggested that the iron (Fe)-only or vanadium (V)-dependent alternative forms of nitrogenase rather than molybdenum (Mo)-dependent form was responsible for dinitrogen (N2) fixation during this time because oceans were depleted in Mo and rich in Fe. However, the only extant nitrogen fixing organisms that harbor alternative nitrogenases also harbor a Mo-dependent nitrogenase. Furthermore, our recent global gene expression analysis revealed that the alternative enzymes rely on genes encoding biosynthetic machinery to assemble active enzymes that are associated with the Mo-dependent nitrogenase. In our recent work we conducted an in-depth phylogenetic analysis of the proteins required for molybdenum (Mo)-nitrogenase that arose from gene fusion and duplication, expanding on previous analyses of single gene loci and multiple gene loci. The results of this analysis are highly suggestive that Mo-nitrogenase is unlikely to have been associated with the last universal common ancestor (LUCA). Rather, the oldest extant organisms harboring Mo-nitrogenase can be traced to hydrogenotrophic methanogens with acquisition in the bacterial domain via lateral gene transfer involving an anaerobic member of the Firmicutes. An origin and ensuing proliferation of Mo-nitrogenase under anoxic conditions would likely have occurred in an environment where anaerobic methanogens and Firmicutes coexisted and where Mo was at least episodically available, such as in a redox stratified Proterozoic ocean basin. In more recent work we have examined the hypothesis that the alternative forms predate the Mo-dependent nitrogenase by examining the phylogenetic relationships of the genetically distinct structural proteins of the Fe-only, V-, and Mo-nitrogenase that are required for activity. As a result, a clear and

  4. New insights into the nutritional regulation of gluconeogenesis in carnivorous rainbow trout (Oncorhynchus mykiss): a gene duplication trail. (United States)

    Marandel, Lucie; Seiliez, Iban; Véron, Vincent; Skiba-Cassy, Sandrine; Panserat, Stéphane


    The rainbow trout (Oncorhynchus mykiss) is considered to be a strictly carnivorous fish species that is metabolically adapted for high catabolism of proteins and low utilization of dietary carbohydrates. This species consequently has a "glucose-intolerant" phenotype manifested by persistent hyperglycemia when fed a high-carbohydrate diet. Gluconeogenesis in adult fish is also poorly, if ever, regulated by carbohydrates, suggesting that this metabolic pathway is involved in this specific phenotype. In this study, we hypothesized that the fate of duplicated genes after the salmonid-specific 4th whole genome duplication (Ss4R) may have led to adaptive innovation and that their study might provide new elements to enhance our understanding of gluconeogenesis and poor dietary carbohydrate use in this species. Our evolutionary analysis of gluconeogenic genes revealed that pck1, pck2, fbp1a, and g6pca were retained as singletons after Ss4r, while g6pcb1, g6pcb2, and fbp1b ohnolog pairs were maintained. For all genes, duplication may have led to sub- or neofunctionalization. Expression profiles suggest that the gluconeogenesis pathway remained active in trout fed a no-carbohydrate diet. When trout were fed a high-carbohydrate diet (30%), most of the gluconeogenic genes were non- or downregulated, except for g6pbc2 ohnologs, whose RNA levels were surprisingly increased. This study demonstrates that Ss4R in trout involved adaptive innovation via gene duplication and via the outcome of the resulting ohnologs. Indeed, maintenance of ohnologous g6pcb2 pair may contribute in a significant way to the glucose-intolerant phenotype of trout and may partially explain its poor use of dietary carbohydrates.

  5. Characterization of Alu and recombination-associated motifs mediating a large homozygous SPG7 gene rearrangement causing hereditary spastic paraplegia. (United States)

    López, Eva; Casasnovas, Carlos; Giménez, Javier; Matilla-Dueñas, Antoni; Sánchez, Ivelisse; Volpini, Víctor


    Spastic paraplegia type 7 (SPG7) is one of the most common forms of autosomal recessive hereditary spastic paraplegia (AR-HSP). Although over 77 different mutations have been identified in SPG7 patients, only 9 gross deletions have been reported with only a few of them being fully characterized. Here, we present a detailed description of a large homozygous intragenic SPG7 gene rearrangement involving a 5144-base pair (bp) genomic loss (c. 1450-446_1779 + 746 delinsAAAGTGCT) encompassing exons 11 to 13, identified in a Spanish AR-HSP family. Analysis of the deletion junction sequences revealed that the 5' breakpoint of this SPG7 gene deletion was located within highly homologous Alu sequences where the 3' breakpoint appears to be flanked by the core crossover hotspot instigator (chi)-like sequence (GCTGG). Furthermore, an 8-bp (AAAGTTGCT) conserved sequence at the breakpoint junction was identified, suggesting that the most likely mechanism for the occurrence of this rearrangement is by Alu microhomology and chi-like recombination-associated motif-mediated multiple exon deletion. Our results are consistent with non-allelic homologous recombination and non-homologous end joining in deletion mutagenesis for the generation of rearrangements. This study provides more evidence associating repeated elements as a genetic mechanism underlying neurodegenerative disorders, highlighting their importance in human diseases.

  6. Reconciling gene and genome duplication events: using multiple nuclear gene families to infer the phylogeny of the aquatic plant family Pontederiaceae. (United States)

    Ness, Rob W; Graham, Sean W; Barrett, Spencer C H


    Most plant phylogenetic inference has used DNA sequence data from the plastid genome. This genome represents a single genealogical sample with no recombination among genes, potentially limiting the resolution of evolutionary relationships in some contexts. In contrast, nuclear DNA is inherently more difficult to employ for phylogeny reconstruction because major mutational events in the genome, including polyploidization, gene duplication, and gene extinction can result in homologous gene copies that are difficult to identify as orthologs or paralogs. Gene tree parsimony (GTP) can be used to infer the rooted species tree by fitting gene genealogies to species trees while simultaneously minimizing the estimated number of duplications needed to reconcile conflicts among them. Here, we use GTP for five nuclear gene families and a previously published plastid data set to reconstruct the phylogenetic backbone of the aquatic plant family Pontederiaceae. Plastid-based phylogenetic studies strongly supported extensive paraphyly of Eichhornia (one of the four major genera) but also depicted considerable ambiguity concerning the true root placement for the family. Our results indicate that species trees inferred from the nuclear genes (alone and in combination with the plastid data) are highly congruent with gene trees inferred from plastid data alone. Consideration of optimal and suboptimal gene tree reconciliations place the root of the family at (or near) a branch leading to the rare and locally restricted E. meyeri. We also explore methods to incorporate uncertainty in individual gene trees during reconciliation by considering their individual bootstrap profiles and relate inferred excesses of gene duplication events on individual branches to whole-genome duplication events inferred for the same branches. Our study improves understanding of the phylogenetic history of Pontederiaceae and also demonstrates the utility of GTP for phylogenetic analysis.

  7. Evolution of CONSTANS Regulation and Function after Gene Duplication Produced a Photoperiodic Flowering Switch in the Brassicaceae. (United States)

    Simon, Samson; Rühl, Mark; de Montaigu, Amaury; Wötzel, Stefan; Coupland, George


    Environmental control of flowering allows plant reproduction to occur under optimal conditions and facilitates adaptation to different locations. At high latitude, flowering of many plants is controlled by seasonal changes in day length. The photoperiodic flowering pathway confers this response in the Brassicaceae, which colonized temperate latitudes after divergence from the Cleomaceae, their subtropical sister family. The CONSTANS (CO) transcription factor of Arabidopsis thaliana, a member of the Brassicaceae, is central to the photoperiodic flowering response and shows characteristic patterns of transcription required for day-length sensing. CO is believed to be widely conserved among flowering plants; however, we show that it arose after gene duplication at the root of the Brassicaceae followed by divergence of transcriptional regulation and protein function. CO has two close homologs, CONSTANS-LIKE1 (COL1) and COL2, which are related to CO by tandem duplication and whole-genome duplication, respectively. The single CO homolog present in the Cleomaceae shows transcriptional and functional features similar to those of COL1 and COL2, suggesting that these were ancestral. We detect cis-regulatory and codon changes characteristic of CO and use transgenic assays to demonstrate their significance in the day-length-dependent activation of the CO target gene FLOWERING LOCUS T. Thus, the function of CO as a potent photoperiodic flowering switch evolved in the Brassicaceae after gene duplication. The origin of CO may have contributed to the range expansion of the Brassicaceae and suggests that in other families CO genes involved in photoperiodic flowering arose by convergent evolution.

  8. Chaperonin genes on the rise: new divergent classes and intense duplication in human and other vertebrate genomes

    Directory of Open Access Journals (Sweden)

    Macario Alberto JL


    Full Text Available Abstract Background Chaperonin proteins are well known for the critical role they play in protein folding and in disease. However, the recent identification of three diverged chaperonin paralogs associated with the human Bardet-Biedl and McKusick-Kaufman Syndromes (BBS and MKKS, respectively indicates that the eukaryotic chaperonin-gene family is larger and more differentiated than previously thought. The availability of complete genome sequences makes possible a definitive characterization of the complete set of chaperonin sequences in human and other species. Results We identified fifty-four chaperonin-like sequences in the human genome and similar numbers in the genomes of the model organisms mouse and rat. In mammal genomes we identified, besides the well-known CCT chaperonin genes and the three genes associated with the MKKS and BBS pathological conditions, a newly-defined class of chaperonin genes named CCT8L, represented in human by the two sequences CCT8L1 and CCT8L2. Comparative analyses from several vertebrate genomes established the monophyletic origin of chaperonin-like MKKS and BBS genes from the CCT8 lineage. The CCT8L gene originated from a later duplication also in the CCT8 lineage at the onset of mammal evolution and duplicated in primate genomes. The functionality of CCT8L genes in different species was confirmed by evolutionary analyses and in human by expression data. Detailed sequence analysis and structural predictions of MKKS, BBS and CCT8L proteins strongly suggested that they conserve a typical chaperonin-like core structure but that they are unlikely to form a CCT-like oligomeric complex. The characterization of many newly-discovered chaperonin pseudogenes uncovered the intense duplication activity of eukaryotic chaperonin genes. Conclusions In vertebrates, chaperonin genes, driven by intense duplication processes, have diversified into multiple classes and functionalities that extend beyond their well-known protein

  9. Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706) (United States)

    Krsticevic, Flavia J.; Arce, Débora P.; Ezpeleta, Joaquín; Tapia, Elizabeth


    In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues. PMID:27565886

  10. Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706

    Directory of Open Access Journals (Sweden)

    Flavia J. Krsticevic


    Full Text Available In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706 genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues.

  11. An ancient history of gene duplications, fusions and losses in the evolution of APOBEC3 mutators in mammals

    Directory of Open Access Journals (Sweden)

    Münk Carsten


    Full Text Available Abstract Background The APOBEC3 (A3 genes play a key role in innate antiviral defense in mammals by introducing directed mutations in the DNA. The human genome encodes for seven A3 genes, with multiple splice alternatives. Different A3 proteins display different substrate specificity, but the very basic question on how discerning self from non-self still remains unresolved. Further, the expression of A3 activity/ies shapes the way both viral and host genomes evolve. Results We present here a detailed temporal analysis of the origin and expansion of the A3 repertoire in mammals. Our data support an evolutionary scenario where the genome of the mammalian ancestor encoded for at least one ancestral A3 gene, and where the genome of the ancestor of placental mammals (and possibly of the ancestor of all mammals already encoded for an A3Z1-A3Z2-A3Z3 arrangement. Duplication events of the A3 genes have occurred independently in different lineages: humans, cats and horses. In all of them, gene duplication has resulted in changes in enzyme activity and/or substrate specificity, in a paradigmatic example of convergent adaptive evolution at the genomic level. Finally, our results show that evolutionary rates for the three A3Z1, A3Z2 and A3Z3 motifs have significantly decreased in the last 100 Mya. The analysis constitutes a textbook example of the evolution of a gene locus by duplication and sub/neofunctionalization in the context of virus-host arms race. Conclusions Our results provide a time framework for identifying ancestral and derived genomic arrangements in the APOBEC loci, and to date the expansion of this gene family for different lineages through time, as a response to changes in viral/retroviral/retrotransposon pressure.

  12. Rapid genome reshaping by multiple-gene loss after whole-genome duplication in teleost fish suggested by mathematical modeling. (United States)

    Inoue, Jun; Sato, Yukuto; Sinclair, Robert; Tsukamoto, Katsumi; Nishida, Mutsumi


    Whole-genome duplication (WGD) is believed to be a significant source of major evolutionary innovation. Redundant genes resulting from WGD are thought to be lost or acquire new functions. However, the rates of gene loss and thus temporal process of genome reshaping after WGD remain unclear. The WGD shared by all teleost fish, one-half of all jawed vertebrates, was more recent than the two ancient WGDs that occurred before the origin of jawed vertebrates, and thus lends itself to analysis of gene loss and genome reshaping. Using a newly developed orthology identification pipeline, we inferred the post-teleost-specific WGD evolutionary histories of 6,892 protein-coding genes from nine phylogenetically representative teleost genomes on a time-calibrated tree. We found that rapid gene loss did occur in the first 60 My, with a loss of more than 70-80% of duplicated genes, and produced similar genomic gene arrangements within teleosts in that relatively short time. Mathematical modeling suggests that rapid gene loss occurred mainly by events involving simultaneous loss of multiple genes. We found that the subsequent 250 My were characterized by slow and steady loss of individual genes. Our pipeline also identified about 1,100 shared single-copy genes that are inferred to have become singletons before the divergence of clupeocephalan teleosts. Therefore, our comparative genome analysis suggests that rapid gene loss just after the WGD reshaped teleost genomes before the major divergence, and provides a useful set of marker genes for future phylogenetic analysis.

  13. Efficient inversions and duplications of mammalian regulatory DNA elements and gene clusters by CRISPR/Cas9 (United States)

    Li, Jinhuan; Shou, Jia; Guo, Ya; Tang, Yuanxiao; Wu, Yonghu; Jia, Zhilian; Zhai, Yanan; Chen, Zhifeng; Xu, Quan; Wu, Qiang


    The human genome contains millions of DNA regulatory elements and a large number of gene clusters, most of which have not been tested experimentally. The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated nuclease 9 (Cas9) programed with a synthetic single-guide RNA (sgRNA) emerges as a method for genome editing in virtually any organisms. Here we report that targeted DNA fragment inversions and duplications could easily be achieved in human and mouse genomes by CRISPR with two sgRNAs. Specifically, we found that, in cultured human cells and mice, efficient precise inversions of DNA fragments ranging in size from a few tens of bp to hundreds of kb could be generated. In addition, DNA fragment duplications and deletions could also be generated by CRISPR through trans-allelic recombination between the Cas9-induced double-strand breaks (DSBs) on two homologous chromosomes (chromatids). Moreover, junctions of combinatorial inversions and duplications of the protocadherin (Pcdh) gene clusters induced by Cas9 with four sgRNAs could be detected. In mice, we obtained founders with alleles of precise inversions, duplications, and deletions of DNA fragments of variable sizes by CRISPR. Interestingly, we found that very efficient inversions were mediated by microhomology-mediated end joining (MMEJ) through short inverted repeats. We showed for the first time that DNA fragment inversions could be transmitted through germlines in mice. Finally, we applied this CRISPR method to a regulatory element of the Pcdhα cluster and found a new role in the regulation of members of the Pcdhγ cluster. This simple and efficient method should be useful in manipulating mammalian genomes to study millions of regulatory DNA elements as well as vast numbers of gene clusters. PMID:25757625

  14. Expression, subcellular localization, and cis-regulatory structure of duplicated phytoene synthase genes in melon (Cucumis melo L.). (United States)

    Qin, Xiaoqiong; Coku, Ardian; Inoue, Kentaro; Tian, Li


    Carotenoids perform many critical functions in plants, animals, and humans. It is therefore important to understand carotenoid biosynthesis and its regulation in plants. Phytoene synthase (PSY) catalyzes the first committed and rate-limiting step in carotenoid biosynthesis. While PSY is present as a single copy gene in Arabidopsis, duplicated PSY genes have been identified in many economically important monocot and dicot crops. CmPSY1 was previously identified from melon (Cucumis melo L.), but was not functionally characterized. We isolated a second PSY gene, CmPSY2, from melon in this work. CmPSY2 possesses a unique intron/exon structure that has not been observed in other plant PSYs. Both CmPSY1 and CmPSY2 are functional in vitro, but exhibit distinct expression patterns in different melon tissues and during fruit development, suggesting differential regulation of the duplicated melon PSY genes. In vitro chloroplast import assays verified the plastidic localization of CmPSY1 and CmPSY2 despite the lack of an obvious plastid target peptide in CmPSY2. Promoter motif analysis of the duplicated melon and tomato PSY genes and the Arabidopsis PSY revealed distinctive cis-regulatory structures of melon PSYs and identified gibberellin-responsive motifs in all PSYs except for SlPSY1, which has not been reported previously. Overall, these data provide new insights into the evolutionary history of plant PSY genes and the regulation of PSY expression by developmental and environmental signals that may involve different regulatory networks.

  15. Gene fusions and gene duplications: relevance to genomic annotation and functional analysis

    Directory of Open Access Journals (Sweden)

    Riley Monica


    Full Text Available Abstract Background Escherichia coli a model organism provides information for annotation of other genomes. Our analysis of its genome has shown that proteins encoded by fused genes need special attention. Such composite (multimodular proteins consist of two or more components (modules encoding distinct functions. Multimodular proteins have been found to complicate both annotation and generation of sequence similar groups. Previous work overstated the number of multimodular proteins in E. coli. This work corrects the identification of modules by including sequence information from proteins in 50 sequenced microbial genomes. Results Multimodular E. coli K-12 proteins were identified from sequence similarities between their component modules and non-fused proteins in 50 genomes and from the literature. We found 109 multimodular proteins in E. coli containing either two or three modules. Most modules had standalone sequence relatives in other genomes. The separated modules together with all the single (un-fused proteins constitute the sum of all unimodular proteins of E. coli. Pairwise sequence relationships among all E. coli unimodular proteins generated 490 sequence similar, paralogous groups. Groups ranged in size from 92 to 2 members and had varying degrees of relatedness among their members. Some E. coli enzyme groups were compared to homologs in other bacterial genomes. Conclusion The deleterious effects of multimodular proteins on annotation and on the formation of groups of paralogs are emphasized. To improve annotation results, all multimodular proteins in an organism should be detected and when known each function should be connected with its location in the sequence of the protein. When transferring functions by sequence similarity, alignment locations must be noted, particularly when alignments cover only part of the sequences, in order to enable transfer of the correct function. Separating multimodular proteins into module units makes

  16. Mechanisms for Nonrecurrent Genomic Rearrangements Associated with CMT1A or HNPP: Rare CNVs as a Cause for Missing Heritability

    NARCIS (Netherlands)

    F. Zhang; P. Seeman; P. Liu; M.A.J. Weterman; C. Gonzaga-Jauregui; C.F. Towne; S.D. Batish; E. de Vriendt; P. de Jonghe; B. Rautenstrauss; K.H. Krause; M. Khajavi; J. Posadka; A. Vandenberghe; F. Palau; L. van Maldergem; F. Baas; V. Timmerman; J.R. Lupski


    Genomic rearrangements involving the peripheral myelin protein gene (PMP22) in human chromosome 17p12 are associated with neuropathy: duplications cause Charcot-Marie-Tooth disease type IA (CMT1A), whereas deletions lead to hereditary neuropathy with liability to pressure palsies (HNPP). Our previou

  17. Duplication and Loss of Function of Genes Encoding RNA Polymerase III Subunit C4 Causes Hybrid Incompatibility in Rice

    Directory of Open Access Journals (Sweden)

    Giao Ngoc Nguyen


    Full Text Available Reproductive barriers are commonly observed in both animals and plants, in which they maintain species integrity and contribute to speciation. This report shows that a combination of loss-of-function alleles at two duplicated loci, DUPLICATED GAMETOPHYTIC STERILITY 1 (DGS1 on chromosome 4 and DGS2 on chromosome 7, causes pollen sterility in hybrid progeny derived from an interspecific cross between cultivated rice, Oryza sativa, and an Asian annual wild rice, O. nivara. Male gametes carrying the DGS1 allele from O. nivara (DGS1-nivaras and the DGS2 allele from O. sativa (DGS2-T65s were sterile, but female gametes carrying the same genotype were fertile. We isolated the causal gene, which encodes a protein homologous to DNA-dependent RNA polymerase (RNAP III subunit C4 (RPC4. RPC4 facilitates the transcription of 5S rRNAs and tRNAs. The loss-of-function alleles at DGS1-nivaras and DGS2-T65s were caused by weak or nonexpression of RPC4 and an absence of RPC4, respectively. Phylogenetic analysis demonstrated that gene duplication of RPC4 at DGS1 and DGS2 was a recent event that occurred after divergence of the ancestral population of Oryza from other Poaceae or during diversification of AA-genome species.

  18. Increased RPA1 gene dosage affects genomic stability potentially contributing to 17p13.3 duplication syndrome.

    Directory of Open Access Journals (Sweden)

    Emily Outwin


    Full Text Available A novel microduplication syndrome involving various-sized contiguous duplications in 17p13.3 has recently been described, suggesting that increased copy number of genes in 17p13.3, particularly PAFAH1B1, is associated with clinical features including facial dysmorphism, developmental delay, and autism spectrum disorder. We have previously shown that patient-derived cell lines from individuals with haploinsufficiency of RPA1, a gene within 17p13.3, exhibit an impaired ATR-dependent DNA damage response (DDR. Here, we show that cell lines from patients with duplications specifically incorporating RPA1 exhibit a different although characteristic spectrum of DDR defects including abnormal S phase distribution, attenuated DNA double strand break (DSB-induced RAD51 chromatin retention, elevated genomic instability, and increased sensitivity to DNA damaging agents. Using controlled conditional over-expression of RPA1 in a human model cell system, we also see attenuated DSB-induced RAD51 chromatin retention. Furthermore, we find that transient over-expression of RPA1 can impact on homologous recombination (HR pathways following DSB formation, favouring engagement in aberrant forms of recombination and repair. Our data identifies unanticipated defects in the DDR associated with duplications in 17p13.3 in humans involving modest RPA1 over-expression.

  19. Gene mutations and genomic rearrangements in the mouse as a result of transposon mobilization from chromosomal concatemers.

    Directory of Open Access Journals (Sweden)

    Aron M Geurts


    Full Text Available Previous studies of the Sleeping Beauty (SB transposon system, as an insertional mutagen in the germline of mice, have used reverse genetic approaches. These studies have led to its proposed use for regional saturation mutagenesis by taking a forward-genetic approach. Thus, we used the SB system to mutate a region of mouse Chromosome 11 in a forward-genetic screen for recessive lethal and viable phenotypes. This work represents the first reported use of an insertional mutagen in a phenotype-driven approach. The phenotype-driven approach was successful in both recovering visible and behavioral mutants, including dominant limb and recessive behavioral phenotypes, and allowing for the rapid identification of candidate gene disruptions. In addition, a high frequency of recessive lethal mutations arose as a result of genomic rearrangements near the site of transposition, resulting from transposon mobilization. The results suggest that the SB system could be used in a forward-genetic approach to recover interesting phenotypes, but that local chromosomal rearrangements should be anticipated in conjunction with single-copy, local transposon insertions in chromosomes. Additionally, these mice may serve as a model for chromosome rearrangements caused by transposable elements during the evolution of vertebrate genomes.

  20. Assignment of genes encoding metallothioneins I and II to Chinese hamster chromosomes 3. Evidence for the role of chromosome rearrangement in gene amplification

    Energy Technology Data Exchange (ETDEWEB)

    Stallings, R.L.; Munk, A.C.; Longmire, J.L.; Hildebrand, C.E.; Crawford, B.D.


    Cadmium resistant (Cd/sup r/) variants with coordinately amplified metallothionein I and II (MTI and MTII) genes have been derived from both Chinese hamster ovary and near-euploid Chinese hamster cell lines. Cytogenetic analyses of Cd/sup r/ variants consistently revealed breakage and rearrangement involving chromosome 3p. In situ hybridization with Chinese hamster MT-encoding cDNA probe localized amplified MT gene sequences near the translocation breakpoint involving chromosome 3p. These observations suggested that both functionally related, isometallothionein loci are linked on Chinese hamster chromosome 3. Southern blot analyses of DNAs isolated from a panel of Chinese hamster x mouse somatic cell hybrids which segregate hamster chromosomes confirmed that both MTI and MTII are located on chromosome 3. The authors speculate that rearrangement of chromosome 3p could be causally involved with the amplification of MT genes in Cd/sup r/ hamster cell lines. 34 references, 3 figures, 1 table.

  1. Exons I and VII of the gene (Ker10) encoding human keratin 10 undergo structural rearrangements within repeats. (United States)

    Tkachenko, A V; Buchman, V L; Bliskovsky, V V; Shvets YuP; Kisselev, L L


    A genomic fragment containing the K51 gene previously isolated from a rat genomic library by hybridization with the v-mos probe in nonstringent conditions [Chumakov et al., Dokl. Akad. Nauk SSSR 290 (1986) 1252-1254], resembles a human keratin type-I-encoding gene [Shvets et al., Mol. Biol. 24 (1990) 663-677]. This genomic clone, K51, has been used as a probe to search for related human genes. A recombinant clone, HK51, with a 1.5-kb insert, was isolated from a human embryonic skin cDNA library, and its nucleotide (nt) sequence was determined. Analysis has shown that the cloned cDNA encodes human keratin 10 (Ker10). All presently known nt sequences of the human Ker10-encoding gene (Ker10) are not identical. Differences are concentrated in the 5'-end of the first exon and in the middle of the seventh exon within repeats. In spite of structural rearrangements in two of eight exons, the reading frame and position of the stop codon are preserved. The genetic rearrangements cause changes in hydrophobicity profiles of the N and C termini of Ker10. It was also noticed that insertion of one nt leads to the formation of an unusual 3'-end of the transcript.

  2. Gene duplication and an accelerated evolutionary rate in 11S globulin genes are associated with higher protein synthesis in dicots as compared to monocots

    Directory of Open Access Journals (Sweden)

    Li Chun


    Full Text Available Abstract Background Seed storage proteins are a major source of dietary protein, and the content of such proteins determines both the quantity and quality of crop yield. Significantly, examination of the protein content in the seeds of crop plants shows a distinct difference between monocots and dicots. Thus, it is expected that there are different evolutionary patterns in the genes underlying protein synthesis in the seeds of these two groups of plants. Results Gene duplication, evolutionary rate and positive selection of a major gene family of seed storage proteins (the 11S globulin genes, were compared in dicots and monocots. The results, obtained from five species in each group, show more gene duplications, a higher evolutionary rate and positive selections of this gene family in dicots, which are rich in 11S globulins, but not in the monocots. Conclusion Our findings provide evidence to support the suggestion that gene duplication and an accelerated evolutionary rate may be associated with higher protein synthesis in dicots as compared to monocots.

  3. Heterogeneous expression pattern of tandem duplicated sHsps genes during fruit ripening in two tomato species (United States)

    Arce, DP; Krsticevic, FJ; Ezpeleta, J.; Ponce, SD; Pratta, GR; Tapia, E.


    The small heat shock proteins (sHSPs) have been found to play a critical role in physiological stress conditions in protecting proteins from irreversible aggregation. To characterize the gene expression profile of four sHsps with a tandem gene structure arrangement in the domesticated Solanum lycopersicum (Heinz 1706) genome and its wild close relative Solanum pimpinellifolium (LA1589), differential gene expression analysis using RNA-Seq was conducted in three ripening stages in both cultivars fruits. Gene promoter analysis was performed to explain the heterogeneous pattern of gene expression found for these tandem duplicated sHsps. In silico analysis results contribute to refocus wet experiment analysis in tomato sHsp family proteins.

  4. Rearranged anaplastic lymphoma kinase (ALK) gene found for the first time in adult-onset papillary thyroid cancer cases among atomic bomb survivors

    Energy Technology Data Exchange (ETDEWEB)

    Hamatani, K.; Mukai, M.; Takahashi, K.; Nakachi, K.; Kusunoki, Y. [Radiobiology/Molecular Epidemiology, Radiation Effects Research Foundation, Hiroshima (Japan); Hayashi, Y. [Geriatric Health Service Facility Hidamari, Hiroshima (Japan)


    Full text of the publication follows: Thyroid cancer is one of the malignancies most strongly associated with ionizing radiation in humans. Epidemiology studies of atomic bomb (A-bomb) survivors have indicated that excess relative risk of papillary thyroid cancer per Gy was remarkably high in the survivors. We therefore aim to clarify mechanisms linking A-bomb radiation exposure and development of papillary thyroid cancer. Toward this end, we intend to clarify characteristics of gene alterations occurring in radiation-associated adult-onset papillary thyroid cancer from the Life Span Study cohort of A-bomb survivors. We have thus far found that with increased radiation dose, papillary thyroid cancer cases with chromosomal rearrangements (mainly RET/PTC rearrangements) significantly increased and papillary thyroid cancer cases with point mutations (mainly BRAF-V600E) significantly decreased. Papillary thyroid cancer cases with non-detected gene alterations that carried no mutations in RET, NTRK1, BRAF or RAS genes tended to increase with increased radiation dose. In addition, we found that relative frequency of these papillary thyroid cancer cases significantly decreased with time elapsed since exposure. Through analysis of papillary thyroid cancer cases with non-detected gene alterations, we recently discovered a new type of rearrangement for the first time in papillary thyroid cancer, i.e., rearranged anaplastic lymphoma kinase (ALK) gene, although identification of any partner gene(s) is needed. Specifically, rearrangement of ALK was found in 10 of 19 exposed papillary thyroid cancer cases with non-detected gene alterations but not in any of the six non-exposed papillary thyroid cancer cases. Furthermore, papillary thyroid cancer with ALK rearrangement was frequently found in the cases with high radiation dose or with short time elapsed since A-bomb exposure. These results suggest that chromosomal rearrangement, typically of RET and ALK, may play an important

  5. Evolutionary dynamics of a mitochondrial rearrangement "hot spot" in the Hymenoptera. (United States)

    Dowton, M; Austin, A D


    The arrangement of tRNA genes at the junction of the cytochrome oxidase II and ATPase 8 genes was examined across a broad range of Hymenoptera. Seven distinct arrangements of tRNA genes were identified among a group of wasps that have diverged over the last 180 Myr (suborder Apocrita); many of the rearrangements represent evolutionarily independent events. Approximately equal proportions of local rearrangements, inversions, and translocations were observed, in contrast to vertebrate mitochondria, in which local rearrangements predominate. Surprisingly, homoplasy was evident among certain types of rearrangement; a reversal of the plesiomorphic gene order has arisen on three separate occasions in the Insecta, while the tRNA(H) gene has been translocated to this locus on two separate occasions. Phylogenetic analysis indicates that this gene translocation is real and is not an artifactual translocation resulting from the duplication of a resident tRNA gene followed by mutation of the anticodon. The nature of the intergenic sequences surrounding this region does not indicate that it should be especially prone to rearrangement; it does not generally have the tandem or inverted repeats that might facilitate this plasticity. Intriguingly, these findings are consistent with the view that during the evolution of the Hymenoptera, rearrangements increased at the same time that the rate of point mutations and compositional bias also increased. This association may direct investigations into mitochondrial genome plasticity in other invertebrate lineages.

  6. Duplication of the IGFBP-2 gene in teleost fish: protein structure and functionality conservation and gene expression divergence.

    Directory of Open Access Journals (Sweden)

    Jianfeng Zhou

    growth and development primarily by binding to and inhibiting IGF actions in vivo. The duplicated IGFBP-2 genes may provide additional flexibility in the regulation of IGF activities.

  7. Frequent lambda light chain gene rearrangement and expression in a Ly-1 B lymphoma with a productive kappa chain allele.


    Hardy, R R; Dangl, J L; Hayakawa, K.; Jager, G.; Herzenberg, L.A.


    We describe here a murine Ly-1-bearing pre-B-cell tumor that, when induced for kappa light chain expression with bacterial lipopolysaccharide, also gives rise spontaneously to a few percent of cells expressing surface lambda light chains. These lambda-positive cells have undergone DNA rearrangements involving either V lambda 1 or V lambda 2 genes. Nearly all clones of lambda-bearing cells express mu and lambda on their surface (but not kappa). However, all these lambda-positive clones continu...

  8. Opossum carboxylesterases: sequences, phylogeny and evidence for CES gene duplication events predating the marsupial-eutherian common ancestor

    Directory of Open Access Journals (Sweden)

    Chan Jeannie


    Full Text Available Abstract Background Carboxylesterases (CES perform diverse metabolic roles in mammalian organisms in the detoxification of a broad range of drugs and xenobiotics and may also serve in specific roles in lipid, cholesterol, pheromone and lung surfactant metabolism. Five CES families have been reported in mammals with human CES1 and CES2 the most extensively studied. Here we describe the genetics, expression and phylogeny of CES isozymes in the opossum and report on the sequences and locations of CES1, CES2 and CES6 'like' genes within two gene clusters on chromosome one. We also discuss the likely sequence of gene duplication events generating multiple CES genes during vertebrate evolution. Results We report a cDNA sequence for an opossum CES and present evidence for CES1 and CES2 like genes expressed in opossum liver and intestine and for distinct gene locations of five opossum CES genes,CES1, CES2.1, CES2.2, CES2.3 and CES6, on chromosome 1. Phylogenetic and sequence alignment studies compared the predicted amino acid sequences for opossum CES with those for human, mouse, chicken, frog, salmon and Drosophila CES gene products. Phylogenetic analyses produced congruent phylogenetic trees depicting a rapid early diversification into at least five distinct CES gene family clusters: CES2, CES1, CES7, CES3, and CES6. Molecular divergence estimates based on a Bayesian relaxed clock approach revealed an origin for the five mammalian CES gene families between 328–378 MYA. Conclusion The deduced amino acid sequence for an opossum cDNA was consistent with its identity as a mammalian CES2 gene product (designated CES2.1. Distinct gene locations for opossum CES1 (1: 446,222,550–446,274,850, three CES2 genes (1: 677,773,395–677,927,030 and a CES6 gene (1: 677,585,520–677,730,419 were observed on chromosome 1. Opossum CES1 and multiple CES2 genes were expressed in liver and intestine. Amino acid sequences for opossum CES1 and three CES2 gene products

  9. ssb gene duplication restores the viability of ΔholC and ΔholD Escherichia coli mutants.

    Directory of Open Access Journals (Sweden)

    Stéphane Duigou


    Full Text Available The HolC-HolD (χψ complex is part of the DNA polymerase III holoenzyme (Pol III HE clamp-loader. Several lines of evidence indicate that both leading- and lagging-strand synthesis are affected in the absence of this complex. The Escherichia coli ΔholD mutant grows poorly and suppressor mutations that restore growth appear spontaneously. Here we show that duplication of the ssb gene, encoding the single-stranded DNA binding protein (SSB, restores ΔholD mutant growth at all temperatures on both minimal and rich medium. RecFOR-dependent SOS induction, previously shown to occur in the ΔholD mutant, is unaffected by ssb gene duplication, suggesting that lagging-strand synthesis remains perturbed. The C-terminal SSB disordered tail, which interacts with several E. coli repair, recombination and replication proteins, must be intact in both copies of the gene in order to restore normal growth. This suggests that SSB-mediated ΔholD suppression involves interaction with one or more partner proteins. ssb gene duplication also suppresses ΔholC single mutant and ΔholC ΔholD double mutant growth defects, indicating that it bypasses the need for the entire χψ complex. We propose that doubling the amount of SSB stabilizes HolCD-less Pol III HE DNA binding through interactions between SSB and a replisome component, possibly DnaE. Given that SSB binds DNA in vitro via different binding modes depending on experimental conditions, including SSB protein concentration and SSB interactions with partner proteins, our results support the idea that controlling the balance between SSB binding modes is critical for DNA Pol III HE stability in vivo, with important implications for DNA replication and genome stability.

  10. Pseudoscorpion mitochondria show rearranged genes and genome-wide reductions of RNA gene sizes and inferred structures, yet typical nucleotide composition bias

    Directory of Open Access Journals (Sweden)

    Ovchinnikov Sergey


    Full Text Available Abstract Background Pseudoscorpions are chelicerates and have historically been viewed as being most closely related to solifuges, harvestmen, and scorpions. No mitochondrial genomes of pseudoscorpions have been published, but the mitochondrial genomes of some lineages of Chelicerata possess unusual features, including short rRNA genes and tRNA genes that lack sequence to encode arms of the canonical cloverleaf-shaped tRNA. Additionally, some chelicerates possess an atypical guanine-thymine nucleotide bias on the major coding strand of their mitochondrial genomes. Results We sequenced the mitochondrial genomes of two divergent taxa from the chelicerate order Pseudoscorpiones. We find that these genomes possess unusually short tRNA genes that do not encode cloverleaf-shaped tRNA structures. Indeed, in one genome, all 22 tRNA genes lack sequence to encode canonical cloverleaf structures. We also find that the large ribosomal RNA genes are substantially shorter than those of most arthropods. We inferred secondary structures of the LSU rRNAs from both pseudoscorpions, and find that they have lost multiple helices. Based on comparisons with the crystal structure of the bacterial ribosome, two of these helices were likely contact points with tRNA T-arms or D-arms as they pass through the ribosome during protein synthesis. The mitochondrial gene arrangements of both pseudoscorpions differ from the ancestral chelicerate gene arrangement. One genome is rearranged with respect to the location of protein-coding genes, the small rRNA gene, and at least 8 tRNA genes. The other genome contains 6 tRNA genes in novel locations. Most chelicerates with rearranged mitochondrial genes show a genome-wide reversal of the CA nucleotide bias typical for arthropods on their major coding strand, and instead possess a GT bias. Yet despite their extensive rearrangement, these pseudoscorpion mitochondrial genomes possess a CA bias on the major coding strand. Phylogenetic

  11. The vertebrate makorin ubiquitin ligase gene family has been shaped by large-scale duplication and retroposition from an ancestral gonad-specific, maternal-effect gene

    Directory of Open Access Journals (Sweden)

    Volff Jean-Nicolas


    Full Text Available Abstract Background Members of the makorin (mkrn gene family encode RING/C3H zinc finger proteins with U3 ubiquitin ligase activity. Although these proteins have been described in a variety of eukaryotes such as plants, fungi, invertebrates and vertebrates including human, almost nothing is known about their structural and functional evolution. Results Via partial sequencing of a testis cDNA library from the poeciliid fish Xiphophorus maculatus, we have identified a new member of the makorin gene family, that we called mkrn4. In addition to the already described mkrn1 and mkrn2, mkrn4 is the third example of a makorin gene present in both tetrapods and ray-finned fish. However, this gene was not detected in mouse and rat, suggesting its loss in the lineage leading to rodent murids. Mkrn2 and mkrn4 are located in large ancient duplicated regions in tetrapod and fish genomes, suggesting the possible involvement of ancestral vertebrate-specific genome duplication in the formation of these genes. Intriguingly, many mkrn1 and mkrn2 intronless retrocopies have been detected in mammals but not in other vertebrates, most of them corresponding to pseudogenes. The nature and number of zinc fingers were found to be conserved in Mkrn1 and Mkrn2 but much more variable in Mkrn4, with lineage-specific differences. RT-qPCR analysis demonstrated a highly gonad-biased expression pattern for makorin genes in medaka and zebrafish (ray-finned fishes and amphibians, but a strong relaxation of this specificity in birds and mammals. All three mkrn genes were maternally expressed before zygotic genome activation in both medaka and zebrafish early embryos. Conclusion Our analysis demonstrates that the makorin gene family has evolved through large-scale duplication and subsequent lineage-specific retroposition-mediated duplications in vertebrates. From the three major vertebrate mkrn genes, mkrn4 shows the highest evolutionary dynamics, with lineage-specific loss of zinc

  12. Positive selection in the adhesion domain of Mus sperm Adam genes through gene duplications and function-driven gene complex formations. (United States)

    Grayson, Phil; Civetta, Alberto


    Sperm and testes-expressed Adam genes have been shown to undergo bouts of positive selection in mammals. Despite the pervasiveness of positive selection signals, it is unclear what has driven such selective bouts. The fact that only sperm surface Adam genes show signals of positive selection within their adhesion domain has led to speculation that selection might be driven by species-specific adaptations to fertilization or sperm competition. Alternatively, duplications and neofunctionalization of Adam sperm surface genes, particularly as it is now understood in rodents, might have contributed to an acceleration of evolutionary rates and possibly adaptive diversification. Here we sequenced and conducted tests of selection within the adhesion domain of sixteen known sperm-surface Adam genes among five species of the Mus genus. We find evidence of positive selection associated with all six Adam genes known to interact to form functional complexes on Mus sperm. A subset of these complex-forming sperm genes also displayed accelerated branch evolution with Adam5 evolving under positive selection. In contrast to our previous findings in primates, selective bouts within Mus sperm Adams showed no associations to proxies of sperm competition. Expanded phylogenetic analysis including sequence data from other placental mammals allowed us to uncover ancient and recent episodes of adaptive evolution. The prevailing signals of rapid divergence and positive selection detected within the adhesion domain of interacting sperm Adams is driven by duplications and potential neofunctionalizations that are in some cases ancient (Adams 2, 3 and 5) or more recent (Adams 1b, 4b and 6).

  13. Regional rearrangements in chromosome 15q21 cause formation of cryptic promoters for the CYP19 (aromatase) gene. (United States)

    Demura, Masashi; Martin, Regina M; Shozu, Makio; Sebastian, Siby; Takayama, Kazuto; Hsu, Wei-Tong; Schultz, Roger A; Neely, Kirk; Bryant, Michael; Mendonca, Berenice B; Hanaki, Keiichi; Kanzaki, Susumu; Rhoads, David B; Misra, Madhusmita; Bulun, Serdar E


    Production of appropriate quantities of estrogen in various tissues is essential for human physiology. A single gene (CYP19), regulated via tissue-specific promoters, encodes the enzyme aromatase, which catalyzes the key step in estrogen biosynthesis. Aromatase excess syndrome is inherited as autosomal dominant and characterized by high systemic estrogen levels, short stature, prepubertal gynecomastia and testicular failure in males, and premature breast development and uterine pathology in females. The underlying genetic mechanism is poorly understood. Here, we characterize five distinct heterozygous rearrangements responsible for aromatase excess syndrome in three unrelated families and two individuals (nine patients). The constitutively active promoter of one of five ubiquitously expressed genes located within the 11.2 Mb region telomeric to the CYP19 gene in chromosome 15q21 cryptically upregulated aromatase expression in several tissues. Four distinct inversions reversed the transcriptional direction of the promoter of a gene (CGNL1, TMOD3, MAPK6 or TLN2), placing it upstream of the CYP19 coding region in the opposite strand, whereas a deletion moved the promoter of a fifth gene (DMXL2), normally transcribed from the same strand, closer to CYP19. The proximal breakpoints of inversions were located 17-185 kb upstream of the CYP19 coding region. Sequences at the breakpoints suggested that the inversions were caused by intrachromosomal nonhomologous recombination. Splicing the untranslated exon downstream of each promoter onto the identical junction upstream of the translation initiation site created CYP19 mRNA encoding functional aromatase protein. Taken together, small rearrangements may create cryptic promoters that direct inappropriate transcription of CYP19 or other critical genes.

  14. Diversity of V delta-J delta gene rearrangement in peripheral blood lymphocytes and intrathecal IgG synthesis in multiple sclerosis. (United States)

    Michałowska-Wender, G; Nowak, J; Losy, J; Januszkiewicz, D; Wender, M


    The object of the study is a comparison of intrathecal IgG synthesis and gamma/delta TCR genes rearrangement in multiple sclerosis. The subgroup of 13 cases with intrathecal IgG synthesis and positive oligoclonal bands was compared with 8 cases with IgG index below 0.75 and with undetectable oligoclonal bands. TCR gene rearrangement was studied in peripheral blood lymphocytes by PCR analysis. In majority of cases of the first group the V delta-J delta junctional repertoire was restricted as evidenced by oligoclonal rearrangement. Monoclonal pattern of rearrangement was also established in some cases concerning V delta 1-J delta 1 and V delta 5-J delta 1. In all cases with one exception, demonstrating IgG index < 0.75 and with negative oligoclonal bands in CSF the oligo- or polyclonal pattern of V delta-J delta gene rearrangement was noticed. It is therefore suggested that subset T and B lymphocytes may undergo clonal expansion in MS as evidenced by restricted pattern of V delta-J delta rearrangement and intrathecal oligoclonal IgG synthesis, respectively. Oligoclonal expansion at certain B and T cells may occur due to stimulation by an antigen related to MS pathogen.

  15. A new resource for characterizing X-linked genes in Drosophila melanogaster: systematic coverage and subdivision of the X chromosome with nested, Y-linked duplications. (United States)

    Cook, R Kimberley; Deal, Megan E; Deal, Jennifer A; Garton, Russell D; Brown, C Adam; Ward, Megan E; Andrade, Rachel S; Spana, Eric P; Kaufman, Thomas C; Cook, Kevin R


    Interchromosomal duplications are especially important for the study of X-linked genes. Males inheriting a mutation in a vital X-linked gene cannot survive unless there is a wild-type copy of the gene duplicated elsewhere in the genome. Rescuing the lethality of an X-linked mutation with a duplication allows the mutation to be used experimentally in complementation tests and other genetic crosses and it maps the mutated gene to a defined chromosomal region. Duplications can also be used to screen for dosage-dependent enhancers and suppressors of mutant phenotypes as a way to identify genes involved in the same biological process. We describe an ongoing project in Drosophila melanogaster to generate comprehensive coverage and extensive breakpoint subdivision of the X chromosome with megabase-scale X segments borne on Y chromosomes. The in vivo method involves the creation of X inversions on attached-XY chromosomes by FLP-FRT site-specific recombination technology followed by irradiation to induce large internal X deletions. The resulting chromosomes consist of the X tip, a medial X segment placed near the tip by an inversion, and a full Y. A nested set of medial duplicated segments is derived from each inversion precursor. We have constructed a set of inversions on attached-XY chromosomes that enable us to isolate nested duplicated segments from all X regions. To date, our screens have provided a minimum of 78% X coverage with duplication breakpoints spaced a median of nine genes apart. These duplication chromosomes will be valuable resources for rescuing and mapping X-linked mutations and identifying dosage-dependent modifiers of mutant phenotypes.

  16. Clinicopathology, immunophenotype, T cell receptor gene rearrangement, Epstein-Barr virus status and p53 gene mutation of cutaneous extranodal NK/T-cell lymphoma, nasal-type

    Institute of Scientific and Technical Information of China (English)

    WANG Ting-ting; XU Chen; LIU Shan-ling; KAN Bei; RAN Yu-ping; LIU Wei-ping; LI Gan-di


    Background Extranodal natural killer/T-cell (NK/T cell) lymphoma,nasal-type,is a rare lymphoma.Skin is the second most common site of involvement after the nasal cavity/nasalpharynx.The aim of this study was to investigate the clinicopathologic features,immunophenotype,T cell receptor (TCR) gene rearrangement,the association with Epstein-Barr virus (EBV) infection and p53 gene mutations of the lymphoma.Methods The clinicopathologic analysis,immunohistochemistry,in situ hybridization for EBER1/2,TCR gene rearrangement by polymerase chain reaction (PCR),mutations of p53 gene analyzed by PCR and sequence analysis were employed in this study.Results In the 19 cases,the tumor primarily involved the dermis and subcutaneous layer.Immunohistochemical staining showed that most of the cases expressed CD45RO,CD56,CD3ε,TIA-1 and GrB.Three cases were positive for CD3 and two cases were positive for CD30.Monoclonal TCRY gene rearrangement was found in 7 of 18 cases.The positive rate of EBER1/2 was 100%.No p53 gene mutation was detected on the exon 4-9 in the 18 cases.Fifteen cases showed Pro (proline)/Arg (arginine) single nucleotide polymorphisms (SNPs) on the exon 4 at codon 72.The expression of p53 protein was 72% (13/18) immunohistochemically.Conclusions Cutaneous NK/T-cell lymphoma is a rare but highly aggressive lymphoma with poor prognosis.No p53 gene mutation was detected on the exon 4-9,and Pro/Arg SNPs on p53 codon 72 were detected in the cutaneous NK/T-cell lymphoma.The overexpression of p53 protein may not be the result of p53 gene mutation.

  17. Enzymatic, expression and structural divergences among carboxyl O-methyltransferases after gene duplication and speciation in Nicotiana. (United States)

    Hippauf, Frank; Michalsky, Elke; Huang, Ruiqi; Preissner, Robert; Barkman, Todd J; Piechulla, Birgit


    Methyl salicylate and methyl benzoate have important roles in a variety of processes including pollinator attraction and plant defence. These compounds are synthesized by salicylic acid, benzoic acid and benzoic acid/salicylic acid carboxyl methyltransferases (SAMT, BAMT and BSMT) which are members of the SABATH gene family. Both SAMT and BSMT were isolated from Nicotiana suaveolens, Nicotiana alata, and Nicotiana sylvestris allowing us to discern levels of enzyme divergence resulting from gene duplication in addition to species divergence. Phylogenetic analyses showed that Nicotiana SAMTs and BSMTs evolved in separate clades and the latter can be differentiated into the BSMT1 and the newly established BSMT2 branch. Although SAMT and BSMT orthologs showed minimal change coincident with species divergences, substantial evolutionary change of enzyme activity and expression patterns occurred following gene duplication. After duplication, the BSMT enzymes evolved higher preference for benzoic acid (BA) than salicylic acid (SA) whereas SAMTs maintained ancestral enzymatic preference for SA over BA. Expression patterns are largely complementary in that BSMT transcripts primarily accumulate in flowers, leaves and stems whereas SAMT is expressed mostly in roots. A novel enzyme, nicotinic acid carboxyl methyltransferase (NAMT), which displays a high degree of activity with nicotinic acid was discovered to have evolved in N. gossei from an ancestral BSMT. Furthermore a SAM-dependent synthesis of methyl anthranilate via BSMT2 is reported and contrasts with alternative biosynthetic routes previously proposed. While BSMT in flowers is clearly involved in methyl benzoate synthesis to attract pollinators, its function in other organs and tissues remains obscure.

  18. Similarity of DMD gene deletion and duplication in the Chinese patients compared to global populations

    Directory of Open Access Journals (Sweden)

    Yan Ming


    Full Text Available Abstract Background DNA deletion and duplication were determined as the major mutation underlying Duchenne muscular dystrophy (DMD and Becker muscular dystrophy (BMD. Method Applying multiplex ligation-dependent probe amplification (MLPA, we have analyzed 179 unrelated DMD/BMD subjects from northern China. Results Seventy-three percent of the subjects were found having a deletion (66.25% or duplication (6.25%. Exons 51–52 were detected as the most common fragment deleted in single-exon deletion, and the region of exons 45–50 was the most common exons deleted in multi-exon deletions. About 90% of DMD/BMD cases carry a small size deletion that involves 10 exons or less, 26.67% of which carry a single-exon deletion. Most of the smaller deletions resulted in an out-of-frame mutation. The most common exons deleted were determined to be between exon 48 and exon 52, with exon 50 was the model allele. Verifying single-exon deletion, one sample with a deletion of exon 53 that was initially observed from MLPA showed that there was a single base deletion that abolished the ligation site in MLPA. Confirmation of single-exon deletion is recommended to exclude single base deletion or mutation at the MLPA ligation site. Conclusion The frequency of deletion and duplication in northern China is similar to global ethnic populations.

  19. Detection of clonal B cells in microdissected reactive lymphoproliferations: possible diagnostic pitfalls in PCR analysis of immunoglobulin heavy chain gene rearrangement

    DEFF Research Database (Denmark)

    Zhou, X.G.; Sandvej, K.; Gregersen, Niels


    Aims-To evaluate the specificity of standard and fluorescence based (GENESCAN) polymerase chain reaction (PCR) immunoglobulin heavy chain (IgH) gene rearrangement analysis in complete and microdissected paraffin wax embedded sections from lymphoid proliferations. Methods-PCR IgH gene rearrangement...... showed reproducible bands on gel analysis and satisfied accepted criteria for monoclonality. Use of high resolution gels with GENESCAN analysis improved sensitivity and band definition; however, three samples still appeared to be monoclonal. Conclusions These results confirm that PCR based IgH gene...

  20. Becker Muscular Dystrophy (BMD) caused by duplication of exons 3-6 of the dystrophin gene presenting as dilated cardiomyopathy

    Energy Technology Data Exchange (ETDEWEB)

    Tsai, A.C.; Allingham-Hawkins, D.J.; Becker, L. [Univ. of Toronto, Ontario (Canada)] [and others


    X-linked dilated cardiomyopathy (XLCM) is a progressive myocardial disease presenting with congestive heart failure in teenage males without clinical signs of skeletal myopathy. Tight linkage of XLCM to the DMD locus has been demonstrated; it has been suggested that, at least in some families, XLCM is a {open_quotes}dystrophinopathy.{close_quotes} We report a 14-year-old boy who presented with acute heart failure due to dilated cardiomyopathy. He had no history of muscle weakness, but physical examination revealed pseudohypertrophy of the calf muscles. He subsequently received a heart transplantation. Family history was negative. Serum CK level at the time of diagnosis was 10,416. Myocardial biopsy showed no evidence of carditis. Dystrophin staining of cardiac and skeletal muscle with anti-sera to COOH and NH{sub 2}termini showed a patchy distribution of positivity suggestive of Becker muscular dystrophy. Analysis of 18 of the 79 dystrophin exons detected a duplication that included exons 3-6. The proband`s mother has an elevated serum CK and was confirmed to be a carrier of the same duplication. A mutation in the muscle promotor region of the dystrophin gene has been implicated in the etiology of SLCM. However, Towbin et al. (1991) argued that other 5{prime} mutations in the dystrophin gene could cause selective cardiomyopathy. The findings in our patient support the latter hypothesis. This suggests that there are multiple regions in the dystrophin gene which, when disrupted, can cause isolated dilated cardiomyopathy.

  1. Flexibility and symmetry of prokaryotic genome rearrangement reveal lineage-associated core-gene-defined genome organizational frameworks. (United States)

    Kang, Yu; Gu, Chaohao; Yuan, Lina; Wang, Yue; Zhu, Yanmin; Li, Xinna; Luo, Qibin; Xiao, Jingfa; Jiang, Daquan; Qian, Minping; Ahmed Khan, Aftab; Chen, Fei; Zhang, Zhang; Yu, Jun


    The prokaryotic pangenome partitions genes into core and dispensable genes. The order of core genes, albeit assumed to be stable under selection in general, is frequently interrupted by horizontal gene transfer and rearrangement, but how a core-gene-defined genome maintains its stability or flexibility remains to be investigated. Based on data from 30 species, including 425 genomes from six phyla, we grouped core genes into syntenic blocks in the context of a pangenome according to their stability across multiple isolates. A subset of the core genes, often species specific and lineage associated, formed a core-gene-defined genome organizational framework (cGOF). Such cGOFs are either single segmental (one-third of the species analyzed) or multisegmental (the rest). Multisegment cGOFs were further classified into symmetric or asymmetric according to segment orientations toward the origin-terminus axis. The cGOFs in Gram-positive species are exclusively symmetric and often reversible in orientation, as opposed to those of the Gram-negative bacteria, which are all asymmetric and irreversible. Meanwhile, all species showing strong strand-biased gene distribution contain symmetric cGOFs and often specific DnaE (α subunit of DNA polymerase III) isoforms. Furthermore, functional evaluations revealed that cGOF genes are hub associated with regard to cellular activities, and the stability of cGOF provides efficient indexes for scaffold orientation as demonstrated by assembling virtual and empirical genome drafts. cGOFs show species specificity, and the symmetry of multisegmental cGOFs is conserved among taxa and constrained by DNA polymerase-centric strand-biased gene distribution. The definition of species-specific cGOFs provides powerful guidance for genome assembly and other structure-based analysis. Prokaryotic genomes are frequently interrupted by horizontal gene transfer (HGT) and rearrangement. To know whether there is a set of genes not only conserved in position

  2. Antagonistic roles for KNOX1 and KNOX2 genes in patterning the land plant body plan following an ancient gene duplication. (United States)

    Furumizu, Chihiro; Alvarez, John Paul; Sakakibara, Keiko; Bowman, John L


    Neofunctionalization following gene duplication is thought to be one of the key drivers in generating evolutionary novelty. A gene duplication in a common ancestor of land plants produced two classes of KNOTTED-like TALE homeobox genes, class I (KNOX1) and class II (KNOX2). KNOX1 genes are linked to tissue proliferation and maintenance of meristematic potentials of flowering plant and moss sporophytes, and modulation of KNOX1 activity is implicated in contributing to leaf shape diversity of flowering plants. While KNOX2 function has been shown to repress the gametophytic (haploid) developmental program during moss sporophyte (diploid) development, little is known about KNOX2 function in flowering plants, hindering syntheses regarding the relationship between two classes of KNOX genes in the context of land plant evolution. Arabidopsis plants harboring loss-of-function KNOX2 alleles exhibit impaired differentiation of all aerial organs and have highly complex leaves, phenocopying gain-of-function KNOX1 alleles. Conversely, gain-of-function KNOX2 alleles in conjunction with a presumptive heterodimeric BELL TALE homeobox partner suppressed SAM activity in Arabidopsis and reduced leaf complexity in the Arabidopsis relative Cardamine hirsuta, reminiscent of loss-of-function KNOX1 alleles. Little evidence was found indicative of epistasis or mutual repression between KNOX1 and KNOX2 genes. KNOX proteins heterodimerize with BELL TALE homeobox proteins to form functional complexes, and contrary to earlier reports based on in vitro and heterologous expression, we find high selectivity between KNOX and BELL partners in vivo. Thus, KNOX2 genes confer opposing activities rather than redundant roles with KNOX1 genes, and together they act to direct the development of all above-ground organs of the Arabidopsis sporophyte. We infer that following the KNOX1/KNOX2 gene duplication in an ancestor of land plants, neofunctionalization led to evolution of antagonistic biochemical

  3. TCR gene segments from at least one third of V alpha subfamilies rearrange at the delta locus. (United States)

    Genevée, C; Chung, V; Diu, A; Hercend, T; Triebel, F


    Using PCR and an experimentally validated V alpha subfamily-specific oligonucleotide panel (V alpha 1-w29), we have investigated whether the TCR delta chain may increase its combinatorial diversity by using V genes considered as alpha chain-specific. We show that at least 10 distinct human V alpha segments rearrange at the J delta locus, leading to scrambling of the two V gene repertoires. Fifty-five per cent of the V alpha/J delta transcripts characterized here were in frame. The 17 V alpha/C delta chains analysed included an extended CDR3 region with up to 18 aa encoded by the junctional region. In addition, a new J delta segment (J delta 4) has been characterized. Together, these findings demonstrate that combinatorial diversity in the human delta locus is larger than previously thought.

  4. Selection shaped the evolution of mouse androgen-binding protein (ABP) function and promoted the duplication of Abp genes. (United States)

    Karn, Robert C; Laukaitis, Christina M


    In the present article, we summarize two aspects of our work on mouse ABP (androgen-binding protein): (i) the sexual selection function producing incipient reinforcement on the European house mouse hybrid zone, and (ii) the mechanism behind the dramatic expansion of the Abp gene region in the mouse genome. Selection unifies these two components, although the ways in which selection has acted differ. At the functional level, strong positive selection has acted on key sites on the surface of one face of the ABP dimer, possibly to influence binding to a receptor. A different kind of selection has apparently driven the recent and rapid expansion of the gene region, probably by increasing the amount of Abp transcript, in one or both of two ways. We have shown previously that groups of Abp genes behave as LCRs (low-copy repeats), duplicating as relatively large blocks of genes by NAHR (non-allelic homologous recombination). The second type of selection involves the close link between the accumulation of L1 elements and the expansion of the Abp gene family by NAHR. It is probably predicated on an initial selection for increased transcription of existing Abp genes and/or an increase in Abp gene number providing more transcriptional sites. Either or both could increase initial transcript production, a quantitative change similar to increasing the volume of a radio transmission. In closing, we also provide a note on Abp gene nomenclature.

  5. Laser-based microdissection of single cells from tissue sections and PCR analysis of rearranged immunoglobulin genes from isolated normal and malignant human B cells. (United States)

    Küppers, Ralf; Schneider, Markus; Hansmann, Martin-Leo


    Normal and malignant B cells carry rearranged immunoglobulin (Ig) variable region genes, which due to their practically limitless diversity represent ideal clonal markers for these cells. We describe here an approach to isolate single cells from frozen tissue sections by microdissection using a laser-based method. From the isolated cells rearranged IgH and Igκ genes are amplified in a semi-nested PCR approach, using a collection of V gene family-specific primers recognizing nearly all V gene segments together with primers for the J gene segments. By sequence analysis of V genes from distinct cells, the clonal relationship of the B lineage cells can unequivocally be determined and related to the histological distribution of the cells. The approach is also useful to determine V, D, and J gene usage. Moreover, the presence and pattern of somatic Ig V gene mutations give valuable insight into the stage of differentiation of the B cells.

  6. Gene Duplication of the zebrafish kit ligand and partitioning of melanocyte development functions to kit ligand a.

    Directory of Open Access Journals (Sweden)

    Keith A Hultman


    Full Text Available The retention of particular genes after the whole genome duplication in zebrafish has given insights into how genes may evolve through partitioning of ancestral functions. We examine the partitioning of expression patterns and functions of two zebrafish kit ligands, kit ligand a (kitla and kit ligand b (kitlb, and discuss their possible coevolution with the duplicated zebrafish kit receptors (kita and kitb. In situ hybridizations show that kitla mRNA is expressed in the trunk adjacent to the notochord in the middle of each somite during stages of melanocyte migration and later expressed in the skin, when the receptor is required for melanocyte survival. kitla is also expressed in other regions complementary to kita receptor expression, including the pineal gland, tail bud, and ear. In contrast, kitlb mRNA is expressed in brain ventricles, ear, and cardinal vein plexus, in regions generally not complementary to either zebrafish kit receptor ortholog. However, like kitla, kitlb is expressed in the skin during stages consistent with melanocyte survival. Thus, it appears that kita and kitla have maintained congruent expression patterns, while kitb and kitlb have evolved divergent expression patterns. We demonstrate the interaction of kita and kitla by morpholino knockdown analysis. kitla morphants, but not kitlb morphants, phenocopy the null allele of kita, with defects for both melanocyte migration and survival. Furthermore, kitla morpholino, but not kitlb morpholino, interacts genetically with a sensitized allele of kita, confirming that kitla is the functional ligand to kita. Last, we examine kitla overexpression in embryos, which results in hyperpigmentation caused by an increase in the number and size of melanocytes. This hyperpigmentation is dependent on kita function. We conclude that following genome duplication, kita and kitla have maintained their receptor-ligand relationship, coevolved complementary expression patterns, and that


    Institute of Scientific and Technical Information of China (English)

    仇一华; 陈诗书


    Using Southern blot, Northern blot and Quick blot methods, we examined the rearrangement and expression of TCR βgene in four early differentiation stage cell lines from human hemopoietic system, namely HL-60, Jurkat, Daudi and Raji cells as well as lymphocytes from 17 acute lymphocytic leukemia (ALL) patients. The results showed. Ⅰ) Rearrangement of TCR βgene was seen in Jurkat cells. A germline pattern was observed in HL-60, Daudi and Raji cells. 2) Eight of 9 patients with T-ALL had cells with rearranged TCR βgene. But two of 3 patients with B-ALL and three of 5 patients with nonT, nonB-ALL also had cells with rearranged TCR βgene. 3) A 1.3 kb full-length transcript and a 1.0 kb truncated transcript were detected in Jurkat cells by probing with 32P-TCR βcDNA. But some leukemic B cells also expressed an incompleted transcript. 4) TCR βmRNA was detected in six of 8 patients with T-ALL, four of 5 patients with nonT, nonB-ALL and one of 3 patients with B-ALL. But the level of expression was quite differ ent. The dual-rearrangement and the abnormal expression may give us a new clue for researching leukemogenesis.

  8. Expanding the clinical spectrum of the 16p11.2 chromosomal rearrangements: three patients with syringomyelia. (United States)

    Schaaf, Christian P; Goin-Kochel, Robin P; Nowell, Kerri P; Hunter, Jill V; Aleck, Kirk A; Cox, Sarah; Patel, Ankita; Bacino, Carlos A; Shinawi, Marwan


    16p11.2 rearrangements are associated with developmental delay, cognitive impairment, autism spectrum disorder, behavioral problems (especially attention-deficit hyperactivity disorder), seizures, obesity, dysmorphic features, and abnormal head size. In addition, congenital anomalies and abnormal brain findings were frequently observed in patients with these rearrangements. We identified and performed a detailed microarray, phenotypic, and radiological characterization of three new patients with 16p11.2 rearrangements: two deletion patients and one patient with the reciprocal duplication. All patients have a heterozygous loss (deletion) or gain (duplication) corresponding to chromosomal coordinates (chr16: 29 528 190-30 107 184) with a minimal size of 579 kb. The deletion patients had language delay and learning disabilities and one met criteria for pervasive developmental disorder not otherwise specified. The duplication patient received a diagnosis of autism and had academic deficits and behavioral problems. The patients with deletion had long cervicothoracic syringomyelia and the duplication patient had long thoracolumbar syringomyelia. The syringomyelia in one patient with deletion was associated with Chiari malformation. Our findings highlight the broad spectrum of clinical and neurological manifestations in patients with 16p11.2 rearrangements. Our observation suggests that genes (or a single gene) within the implicated interval have significant roles in the pathogenesis of syringomyelia. A more comprehensive and systematic research is warranted to study the frequency and spectrum of malformations in the central nervous system in these patients.

  9. Rearrangement of Rag-1 recombinase gene in DNA-repair deficient/immunodeficient wasted'' mice

    Energy Technology Data Exchange (ETDEWEB)

    Woloschak, G.E.; Weaver, P.; Churchill, M.; Chang-Liu, C-M. (Argonne National Lab., IL (United States)); Libertin, C.R. (Loyola Univ., Maywood, IL (United States))


    Mice recessive for the autosomal gene wasted'' (wst) display a disease pattern which includes increased sensitivity to the killing effects of ionizing radiation, immunodeficiency, and neurologic dysfunction. The recent cloning and characterization of recombinase genes (Rag-l/Rag-2) expressed in lymphoid and possibly central nervous system tissues prompted us to examine expression of these genes in DNA repair-deficient/immunodeficient wasted mice. Our results revealed that in thymus tissue, a small Rag-I transcript (1.0 kb) was detected in wst/wst mice that was not evident in thymus from control mice. In wst/[sm bullet] mice, a two-fold increase in Rag-1 mRNA was evident in thymus tissue. Rag-2 mRNA could only be detected in thymus tissue from wst/[sm bullet] and not from wst/wst or parental control BCF, mice. Southern blots revealed a rearrangement or deletion within the Rag-1 gene of affected wasted mice that was not evident in known strain-specific parental or littermate controls. These results support the idea that the Rag-1 gene may map at or near the locus for the wasted mutation. In addition, they suggest the importance of recombinase function in normal immune and central nervous system development as well as the potential contribution of this gene family to the normal repair of radiation-induced DNA damage.

  10. Phenotypic Consequences In vivo and In vitro of Rearranging the P Gene of RABV HEP-Flury (United States)

    Mei, Mingzhu; Long, Teng; Zhang, Qiong; Zhao, Jing; Tian, Qin; Peng, Jiaojiao; Luo, Jun; Wang, Yifei; Lin, Yingyi; Guo, Xiaofeng


    Phosphoprotein (P) of the Rabies virus (RABV) is critically required for viral replication and pathogenicity. Here we manipulated infectious cDNA clones of the RABV HEP-Flury to translocate the P gene from its wild-type position 2 to 1, 3, or 4 in gene order, using an approach which left the viral nucleotide sequence unaltered. The recovered viruses were evaluated for the levels of gene expression, growth kinetics in cell culture, lethality in suckling mice and protection of mice. The results showed that viral replication was affected by the absolute value of N protein which was regulated by P protein. Viral lethality in suckling mice was consistent with the ratio of P mRNA in one complete transcription. The protection of mice induced by viruses was related to the antibody titer 5 weeks post-infection which might be regulated by G protein. However, the ability to induce cell apoptosis and viral spread were not only related to the viral replication but also to the ratio of related gene which affected by the gene position. These findings might not only improve the understanding of phenotype of RABV and P gene rearrangement, but also help rabies vaccine candidate construction. PMID:28217116

  11. [A case of lambda-expressing pulmonary MALT lymphoma with dual clonal rearrangements of kappa and lambda immunoglobulin light chain gene]. (United States)

    Oh, Hye Ryong; Lee, Mi Ja; Park, Geon; Moon, Dae Soo; Park, Young Jin; Jang, Sook Jin


    A 70-yr-old woman was hospitalized with a history of dry cough. Bronchial endoscopy and transbronchial lung biopsy were performed. However, the findings of histopathology and immunohistochemistry were not sufficient to decide whether the lesion was benign or malignant, because of the presence of crush artifacts in the biopsy specimens. We performed B-cell clonality studies using BIOMED-2 multiplex PCR (InVivoScribe Technologies, USA) to detect clonal rearrangements in the immunoglobulin gene. The results of multiplex PCR showed clonal rearrangements of both kappa and lambda immunoglobulin light chain genes. The findings of immunochemistry revealed that the lesion expressed lambda light chain, but not kappa light chain. Based on the clinical, pathologic, and molecular findings, this case was diagnosed as pulmonary MALT lymphoma. We report the first case in Korea of lambda-expressing MALT lymphoma that is shown to have dual clonal rearrangements of kappa and lambda immunoglobulin light chain gene by multiplex PCR.

  12. Collateral damage: Spread of repeat-induced point mutation from a duplicated DNA sequence into an adjoining single-copy gene in Neurospora crassa

    Indian Academy of Sciences (India)

    Meenal Vyas; Durgadas P Kasbekar


    Repeat-induced point mutation (RIP) is an unusual genome defense mechanism that was discovered in Neurospora crassa. RIP occurs during a sexual cross and induces numerous G : C to A : T mutations in duplicated DNA sequences and also methylates many of the remaining cytosine residues. We measured the susceptibility of the erg-3 gene, present in single copy, to the spread of RIP from duplications of adjoining sequences. Genomic segments of defined length (1, 1.5 or 2 kb) and located at defined distances (0, 0.5, 1 or 2 kb) upstream or downstream of the erg-3 open reading frame (ORF) were amplified by polymerase chain reaction (PCR), and the duplications were created by transformation of the amplified DNA. Crosses were made with the duplication strains and the frequency of erg-3 mutant progeny provided a measure of the spread of RIP from the duplicated segments into the erg-3 gene. Our results suggest that ordinarily RIP-spread does not occur. However, occasionally the mechanism that confines RIP to the duplicated segment seems to fail (frequency 0.1–0.8%) and then RIP can spread across as much as 1 kb of unduplicated DNA. Additionally, the bacterial hph gene appeared to be very susceptible to the spread of RIP-associated cytosine methylation.

  13. Interlocus gene conversion explains at least 2.7% of single nucleotide variants in human segmental duplications. (United States)

    Dumont, Beth L


    Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped overall patterns of segregating variation in multi-copy regions of the human genome remains unknown. One expected manifestation of IGC in population genomic data is the presence of one-to-one paralogous SNPs that segregate identical alleles. Here, I use SNP genotype calls from the low-coverage phase 3 release of the 1000 Genomes Project to identify 15,790 parallel, shared SNPs in duplicated regions of the human genome. My approach for identifying these sites accounts for the potential redundancy of short read mapping in multi-copy genomic regions, thereby effectively eliminating false positive SNP calls arising from paralogous sequence variation. I demonstrate that independent mutation events to identical nucleotides at paralogous sites are not a significant source of shared polymorphisms in the human genome, consistent with the interpretation that these sites are the outcome of historical IGC events. These putative signals of IGC are enriched in genomic contexts previously associated with non-allelic homologous recombination, including clear signals in gene families that form tandem intra-chromosomal clusters. Taken together, my analyses implicate IGC, not point mutation, as the mechanism generating at least 2.7% of single nucleotide variants in duplicated regions of the human genome.

  14. Duplications and positive selection drive the evolution of parasitism associated gene families in the nematode Strongyloides papillosus. (United States)

    Baskaran, Praveen; Jaleta, Tegegn G; Streit, Adrian; Rödelsperger, Christian


    Gene duplication is one major mechanism playing a role in the evolution of phenotypic complexity and in the generation of novel traits. By comparing parasitic and nonparasitic nematodes, a recent study found that the evolution of parasitism in Strongyloididae is associated with a large expansion in the Astacin and CAP gene families.To gain novel insights into the developmental processes in the sheep parasite Strongyloides papillosus, we sequenced transcriptomes of different developmental stages and sexes. Overall, we found that the majority of genes are developmentally regulated and have one-to-one orthologs in the diverged S. ratti genome. Together with the finding of similar expression profiles between S. papillosus and S. ratti, these results indicate a strong evolutionary constraint acting against change at sequence and expression levels. However, the comparison between parasitic and free-living females demonstrates a quite divergent pattern that is mostly due to the previously mentioned expansion in the Astacin and CAP gene families. More detailed phylogenetic analysis of both gene families shows that most members date back to single expansion events early in the Strongyloides lineage and have undergone subfunctionalization resulting in clusters that are highly expressed either in infective larvae or in parasitic females. Finally, we found increased evidence for positive selection in both gene families relative to the genome-wide expectation.In summary, our study reveals first insights into the developmental transcriptomes of S. papillosus and provides a detailed analysis of sequence and expression evolution in parasitism associated gene families.

  15. Four unrelated patients with Lubs X-linked mental retardation syndrome and different Xq28 duplications. (United States)

    Bartsch, Oliver; Gebauer, Konstanze; Lechno, Stanislav; van Esch, Hilde; Froyen, Guy; Bonin, Michael; Seidel, Jörg; Thamm-Mücke, Barbara; Horn, Denise; Klopocki, Eva; Hertzberg, Christoph; Zechner, Ulrich; Haaf, Thomas


    The Lubs X-linked mental retardation syndrome (MRXSL) is caused by small interstitial duplications at distal Xq28 including the MECP2 gene. Here we report on four novel male patients with MRXSL and different Xq28 duplications delineated by microarray-based chromosome analysis. All mothers were healthy carriers of the duplications. Consistent with an earlier report [Bauters et al. (2008); Genome Res 18: 847-858], the distal breakpoints of all four Xq28 duplications were located in regions containing low-copy repeats (LCRs; J, K, and L groups), which may facilitate chromosome breakage and reunion events. The proximal breakpoint regions did not contain known LCRs. Interestingly, we identified apparent recurrent breakage sites in the proximal and distal breakpoint regions. Two of the four patients displayed more complex rearrangements. Patient 2 was endowed with a quadruplicated segment and a small triplication within the duplication, whereas patient 3 displayed two triplicated segments within the duplication, supporting that the Fork Stalling and Template Switching (FoSTeS) model may explain a subset of the structural rearrangements in Xq28. Clinically, muscular hypertonia and contractures of large joints may present a major problem in children with MRXSL. Because injection of botulinum toxin (BT-A; Botox) proved to be extremely helpful for patient 1, we recommend consideration of Botox treatment in other patients with MRXSL and severe joint contractures.

  16. [RET/PTC Gene Rearrangements in the Sporadic and Radiogenic Thyroid Tumors: Molecular Genetics, Radiobiology and Molecular Epidemiology]. (United States)

    Ushenkova, L N; Koterov, A N; Biryukov, A P


    A review of molecular genetic, radiobiological and molecular epidemiological studies of gene (chromosome) rearrangements RET/PTC in the cells of the thyroid gland as well as the laws in relation to radiation exposure in vitro, in vivo and human populations identified with them are submitted. The data on the c-RET gene and its chimeric constructs with the gene-donors (RET/PTC rearrangements) are considered. The information about the history of the RET/PTC discovery, their types, carcinogenic potential and specificity both to tumor and non-tumor thyroid disease especially for papillary thyroid carcinoma are provided. The data (seven studies) on the induction of RET/PTC after irradiation of tumor and normal thyroid cells in vitro and mice are reviewed. The mechanisms of RET/PTC induction may be associated with DNA double strand breaks and oxidative stress. Some information (three publications) about the possibility of RET/PTC induction by low doses of radiation with low LET (to 0.1 Gy) is given and it is concluded that their potential evidentiary is generally weak. The achievements in the molecular epidemiology of RET/PTC frequency for exposed and unexposed cohorts are stated. At the same time it is noted that, despite the vast array. of data accumulated from 30 countries of the world and more than 20 years of research, the formed provisions are weakly confirmed statistically and have no base corresponding to the canons of evidence-based medicine. The possibility of use of the RET/PTC presence or their frequencies as markers of the papillary thyroid carcinomas and, specifically, their radiogenic forms, is considered. In the first case the answer may be positive, while in the second, the situation is characterized by uncertainty. Based to the above mentioned we came to a conclusion about the need of a pooled or meta-analysis of the totality of the published data.

  17. Epigenetic and 3-dimensional regulation of V(D)J rearrangement of immunoglobulin genes. (United States)

    Degner-Leisso, Stephanie C; Feeney, Ann J


    V(D)J recombination is a crucial component of the adaptive immune response, allowing for the production of a diverse antigen receptor repertoire (Ig and TCR). This review will focus on how epigenetic regulation and 3-dimensional (3D) interactions may control V(D)J recombination at Ig loci. The interplay between transcription factors and post-translational modifications at the Igh, Igκ, and Igλ loci will be highlighted. Furthermore, we propose that the spatial organization and epigenetic boundaries of each Ig loci before and during V(D)J recombination may be influenced in part by the CTCF/cohesin complex. Taken together, the many epigenetic and 3D layers of control ensure that Ig loci are only rearranged at appropriate stages of B cell development.

  18. [Diffuse large B-cell lymphoma with concomitant c-MYC and BCL6 gene rearrangements with primary skin involvement: A case report and a review of literature]. (United States)

    Gabeeva, N G; Koroleva, D A; Belyaeva, A V; Chernova, N G; Kuzmina, L A; Sudarikov, A B; Obukhova, T N; Kovrigina, A M; Zvonkov, E E; Savchenko, V G


    Double-hit lymphoma (DHL) is a rare aggressive B-cell lymphoma with concomitant c-MYC, BCL2 or BCL6 gene rearrangements, which is characterized by the high frequency of extranodal lesions and by resistance to chemotherapy. The median survival does not exceed 18 months in patients with this disease. The majority of DHL is represented by с-MYC/BCL2 cases. The combination of c-MYC/BCL6 occurs rarely (5-8%). The paper describes a case of DHL with concomitant c-MYC and BCL6 gene rearrangements, which mimics diffuse large B-cell lymphoma, leg-type.

  19. Deletion/duplication mutation screening of TP53 gene in patients with transitional cell carcinoma of urinary bladder using multiplex ligation-dependent probe amplification. (United States)

    Bazrafshani, Mohammad Reza R; Nowshadi, Pouriaali A; Shirian, Sadegh; Daneshbod, Yahya; Nabipour, Fatemeh; Mokhtari, Maral; Hosseini, Fatemehsadat; Dehghan, Somayeh; Saeedzadeh, Abolfazl; Mosayebi, Ziba


    Bladder cancer is a molecular disease driven by the accumulation of genetic, epigenetic, and environmental factors. The aim of this study was to detect the deletions/duplication mutations in TP53 gene exons using multiplex ligation-dependent probe amplification (MLPA) method in the patients with transitional cell carcinoma (TCC). The achieved formalin-fixed paraffin-embedded tissues from 60 patients with TCC of bladder were screened for exonal deletions or duplications of every 12 TP53 gene exons using MLPA. The pathological sections were examined by three pathologists and categorized according to the WHO scoring guideline as 18 (30%) grade I, 22 (37%) grade II, 13 (22%) grade III, and 7 (11%) grade IV cases of TCC. None mutation changes of TP53 gene were detected in 24 (40%) of the patients. Furthermore, mutation changes including, 15 (25%) deletion, 17 (28%) duplication, and 4 (7%) both deletion and duplication cases were observed among 60 samples. From 12 exons of TP53 gene, exon 1 was more subjected to exonal deletion. Deletion of exon 1 of TP53 gene has occurred in 11 (35.4%) patients with TCC. In general, most mutations of TP53, either deletion or duplication, were found in exon 1, which was statistically significant. In addition, no relation between the TCC tumor grade and any type of mutation were observed in this research. MLPA is a simple and efficient method to analyze genomic deletions and duplications of all 12 exons of TP53 gene. The finding of this report that most of the mutations of TP53 occur in exon 1 is in contrast to that of the other reports suggesting that exons 5-8 are the most (frequently) mutated exons of TP53 gene. The mutations of exon 1 of TP53 gene may play an important role in the tumorogenesis of TCC.

  20. Modeling protein network evolution under genome duplication and domain shuffling

    Directory of Open Access Journals (Sweden)

    Isambert Hervé


    Full Text Available Abstract Background Successive whole genome duplications have recently been firmly established in all major eukaryote kingdoms. Such exponential evolutionary processes must have largely contributed to shape the topology of protein-protein interaction (PPI networks by outweighing, in particular, all time-linear network growths modeled so far. Results We propose and solve a mathematical model of PPI network evolution under successive genome duplications. This demonstrates, from first principles, that evolutionary conservation and scale-free topology are intrinsically linked properties of PPI networks and emerge from i prevailing exponential network dynamics under duplication and ii asymmetric divergence of gene duplicates. While required, we argue that this asymmetric divergence arises, in fact, spontaneously at the level of protein-binding sites. This supports a refined model of PPI network evolution in terms of protein domains under exponential and asymmetric duplication/divergence dynamics, with multidomain proteins underlying the combinatorial formation of protein complexes. Genome duplication then provides a powerful source of PPI network innovation by promoting local rearrangements of multidomain proteins on a genome wide scale. Yet, we show that the overall conservation and topology of PPI networks are robust to extensive domain shuffling of multidomain proteins as well as to finer details of protein interaction and evolution. Finally, large scale features of direct and indirect PPI networks of S. cerevisiae are well reproduced numerically with only two adjusted parameters of clear biological significance (i.e. network effective growth rate and average number of protein-binding domains per protein. Conclusion This study demonstrates the statistical consequences of genome duplication and domain shuffling on the conservation and topology of PPI networks over a broad evolutionary scale across eukaryote kingdoms. In particular, scale

  1. Evidence of neofunctionalization after the duplication of the highly conserved Polycomb group gene Caf1-55 in the obscura group of Drosophila. (United States)

    Calvo-Martín, Juan M; Papaceit, Montserrat; Segarra, Carmen


    Drosophila CAF1-55 protein is a subunit of the Polycomb repressive complex PRC2 and other protein complexes. It is a multifunctional and evolutionarily conserved protein that participates in nucleosome assembly and remodelling, as well as in the epigenetic regulation of a large set of target genes. Here, we describe and analyze the duplication of Caf1-55 in the obscura group of Drosophila. Paralogs exhibited a strong asymmetry in evolutionary rates, which suggests that they have evolved according to a neofunctionalization process. During this process, the ancestral copy has been kept under steady purifying selection to retain the ancestral function and the derived copy (Caf1-55dup) that originated via a DNA-mediated duplication event ~18 Mya, has been under clear episodic selection. Different maximum likelihood approaches confirmed the action of positive selection, in contrast to relaxed selection, on Caf1-55dup after the duplication. This adaptive process has also taken place more recently during the divergence of D. subobscura and D. guanche. The possible association of this duplication with a previously detected acceleration in the evolutionary rate of three CAF1-55 partners in PRC2 complexes is discussed. Finally, the timing and functional consequences of the Caf1-55 duplication is compared to other duplications of Polycomb genes.

  2. The nuclear OXPHOS genes in insecta: a common evolutionary origin, a common cis-regulatory motif, a common destiny for gene duplicates

    Directory of Open Access Journals (Sweden)

    Pesole Graziano


    Full Text Available Abstract Background When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG has played a key role in the evolution of the insect OXPHOS genes; it is constantly conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich gene, and suggest that changes in the pattern of expression of the gene facilitate the fixation of duplications in the genome and the development of novel genetic functions.

  3. Imperfect conformation of experimental and epidemiological data for frequency of RET/РТС gene rearrangements in papillary thyroid carcinoma for the Chernobyl accident

    Directory of Open Access Journals (Sweden)

    Ushenkova L.N.


    Full Text Available In an overview and analytical study of the epidemiological data on the frequency of RET/РТС gene rearrangements in sporadic and radiogenic (patients after radiotherapy, residents of contaminated after the Chernobyl disaster areas, victims after the atomic bombings, etc. carcinomas of the thyroid gland were examined. In general, the observed epidemiological laws were confirmed in radiobiology experiments by irradiation of different cultures of thyroid cells and ex vivo with the exception of Chernobyl cohorts. Induction of RET/РТС gene rearrangements by 131l exposure in children carcinomas of Chernobyl residents in mice did not observe too. It is concluded that the situation with the frequency of RET/РТС rearrangements in thyroid carcinoma in Chernobyl cohorts once again confirms the multifactorial nature of the induction and development of these tumors with a contribution of radiation and non-radiation factors (iodine deficiency and different stresses.

  4. Relationships among the cyclostome braconid (Hymenoptera: Braconidae) subfamilies inferred from a mitochondrial tRNA gene rearrangement. (United States)

    Dowton, M


    The arrangement of mitochondrial tRNA genes for lysine (K) and aspartate (D) from the junction of the cytochrome oxidase II and ATPase 8 genes was determined in a range of hymenopteran taxa. This indicated that the ancestral arrangement for the order is 'KD', as found in the Diptera (represented by Drosophila and Anopheles) and basal Orthoptera. Most Hymenoptera that evolved after the appearance of parasitism also have the 'KD' arrangement, including noncyclostome braconids. However, most cyclostome braconids have either a 'DK' or a 'DHK' arrangement (where 'H' refers to the tRNA gene for Histidine). In both cases, the aspartate tRNA gene is encoded on the mitochondrial N-strand, rather than the J-strand as is usually the case. This rearrangement identified a monophyletic group not previously recognized, consisting of Rogadinae + Braconinae + Gnamptodontinae + Histeromerinae + Rhyssalinae + Betylobraconinae + Opiinae + Alysiinae. Only one cyclostome subfamily (Doryctinae) retained the 'KD' arrangement, suggesting this to be the most basal of the cyclostome subfamilies, consistent with ectoparasitism being plesiomorphic for the cyclostomes. However, the Aphidiinae also retained the 'KD' arrangement, leaving unresolved the issue of whether they should be included within the cyclostomes.

  5. Construction and characterization of single-chain variable fragment antibody library derived from germline rearranged immunoglobulin variable genes.

    Directory of Open Access Journals (Sweden)

    Man Cheng

    Full Text Available Antibody repertoires for library construction are conventionally harvested from mRNAs of immune cells. To examine whether germline rearranged immunoglobulin (Ig variable region genes could be used as source of antibody repertoire, an immunized phage-displayed scFv library was prepared using splenocytic genomic DNA as template. In addition, a novel frame-shifting PCR (fsPCR step was introduced to rescue stop codon and to enhance diversity of the complementarity-determining region 3 (CDR3. The germline scFv library was initially characterized against the hapten antigen phenyloxazolone (phOx. Sequence analysis of the phOx-selective scFvs indicated that the CDRs consisted of novel as well as conserved motifs. In order to illustrate that the diversity of CDR3 was increased by the fsPCR step, a second scFv library was constructed using a single scFv clone L3G7C as a template. Despite showing similar binding characteristics towards phOx, the scFv clones that were obtained from the L3G7C-derived antibody library gave a lower non-specific binding than that of the parental L3G7C clone. To determine whether germline library represented the endogenous immune status, specific scFv clones for nucleocapsid (N protein of SARS-associated coronavirus (SCoV were obtained both from naïve and immunized germline scFv libraries. Both libraries yielded specific anti-N scFvs that exhibited similar binding characteristics towards recombinant N protein, except the immunized library gave a larger number of specific anti-N scFv, and clones with identical nucleotide sequences were found. In conclusion, highly diversified antibody library can be efficiently constructed using germline rearranged immunoglobulin variable genes as source of antibody repertoires and fsPCR to diversify the CDR3.

  6. Rearrangement and expression of beta-T-cell receptor and immunoglobulin genes in established Ph1 chronic myelogenous leukemia cell lines. (United States)

    Berenson, J; Koeffler, H P


    We have determined the arrangement and expression of immunoglobulin (Ig) and beta-T-cell receptor (TCR) genes in six established Philadelphia chromosome-positive (Ph1) chronic myelogenous leukemia (CML) cell lines, and correlated these results with their phenotypic characteristics. Three cell lines with nonlymphoid characteristics, EM2, EM3, and K562, did not demonstrate rearrangement or expression of Ig or beta-TCR genes. A new cell line, MB, with a mature B-cell phenotype recently established in our laboratory, contained light and heavy chain immunoglobulin gene rearrangements and expressed mature Ig RNA. In a cell line with an early lymphoid phenotype, BV173, this analysis showed rearrangement of Ig heavy chain and beta-TCR genes, unrearranged Ig light chain DNA, and expression of only an immature beta-TCR transcript. This line provides evidence for T-cell lineage involvement in Ph1 CML. One cell line without markers of any cell type, KCL-22, demonstrated rearranged, unexpressed Ig heavy chain genes, suggesting these cells are at the very earliest stages of lymphoid differentiation. These lines should provide valuable tools to dissect the molecular biology of differentiation in CML and in early lymphocytes.

  7. Unusual patterns of immunoglobulin gene rearrangement and expression during human B cell ontogeny: human B cells can simultaneously express cell surface kappa and lambda light chains



    Immunoglobulin gene rearrangement during mammalian B cell development generally follows an ordered progression, beginning with heavy (H) chain genes and proceeding through kappa and lambda light (L) chain genes. To determine whether the predicted kappa-->lambda hierarchy was occurring in vitro, we generated Epstein-Barr virus-transformed cell lines from cultures undergoing human pre-B cell differentiation. A total of 143 cell lines were established. 24 expressed cell surface mu/lambda by flow...

  8. Duplication at Xq13.3-q21.1 with syndromic intellectual disability, a probable role for the ATRX gene. (United States)

    Martínez, Francisco; Roselló, Mónica; Mayo, Sonia; Monfort, Sandra; Oltra, Silvestre; Orellana, Carmen


    Here we report on two unrelated male patients with syndromic intellectual disability (ID) due to duplication at Xq13.3-q21.1, a region of about 6 Mb and 25 genes. Among these, the most outstanding is ATRX, the causative gene of X-linked alpha-thalassemia/mental retardation. ATRX belongs to the growing list of genes implied in chromatin remodeling causing ID. Many these genes, such as MECP2, are dose-sensitive so that not only deletions and point mutations, but also duplications cause ID. Both patients have severe ID, absent expressive speech, early hypotonia, behavior problems (hyperactivity, repetitive self-stimulatory behavior), postnatal growth deficiency, microcephaly, micrognathia, cryptorchidism, low-set, posteriorly angulated ears, and downslanting palpebral fissures. These findings are also usually present among patients with loss-of-function mutations of the ATRX gene. Completely skewed X inactivation was observed in the only informative carrier mother, a constant finding among female carriers of inactivating point mutations of this gene. Participation of other duplicated genes cannot be excluded; nevertheless we propose that the increased dosage of ATRX is the major pathogenic mechanism of this X-linked disorder, a syndrome reminiscent of MECP2 duplication.

  9. Ancient Duplications and Expression Divergence in the Globin Gene Superfamily of Vertebrates: Insights from the Elephant Shark Genome and Transcriptome. (United States)

    Opazo, Juan C; Lee, Alison P; Hoffmann, Federico G; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F


    Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about

  10. Translocations used to generate chromosome segment duplications in Neurospora can disrupt genes and create novel open reading frames

    Indian Academy of Sciences (India)

    Parmit K Singh; Srividhya V Iyer; T Naga Sowjanya; B Kranthi Raj; Durgadas P Kasbekar


    In Neurospora crassa, crosses between normal sequence strains and strains bearing some translocations can yield progeny bearing a duplication (Dp) of the translocated chromosome segment. Here, 30 breakpoint junction sequences of 12 Dp-generating translocations were determined. The breakpoints disrupted 13 genes (including predicted genes), and created 10 novel open reading frames. Insertion of sequences from LG III into LG I as translocation T(UK818) disrupts the eat-3 gene, which is the ortholog of the Podospora anserine gene ami1. Since ami1-homozygous Podospora crosses were reported to increase the frequency of repeat-induced point mutation (RIP), we performed crosses homozygous for a deficiency in eat-3 to test for a corresponding increase in RIP frequency. However, our results suggested that, unlike in Podospora, the eat-3 gene might be essential for ascus development in Neurospora. Duplication–heterozygous crosses are generally barren in Neurospora; however, by using molecular probes developed in this study, we could identify Dp segregants from two different translocation–heterozygous crosses, and using these we found that the barren phenotype of at least some duplication–heterozygous crosses was incompletely penetrant.

  11. Autopolyploidy genome duplication preserves other ancient genome duplications in Atlantic salmon (Salmo salar) (United States)

    Davidson, William S.


    Salmonids (e.g. Atlantic salmon, Pacific salmon, and trouts) have a long legacy of genome duplication. In addition to three ancient genome duplications that all teleosts are thought to share, salmonids have had one additional genome duplication. We explored a methodology for untangling these duplications from each other to better understand them in Atlantic salmon. In this methodology, homeologous regions (paralogous/duplicated genomic regions originating from a whole genome duplication) from the most recent genome duplication were assumed to have duplicated genes at greater density and have greater sequence similarity. This assumption was used to differentiate duplicated gene pairs in Atlantic salmon that are either from the most recent genome duplication or from earlier duplications. From a comparison with multiple vertebrate species, it is clear that Atlantic salmon have retained more duplicated genes from ancient genome duplications than other vertebrates--often at higher density in the genome and containing fewer synonymous mutations. It may be that polysomic inheritance is the mechanism responsible for maintaining ancient gene duplicates in salmonids. Polysomic inheritance (when multiple chromosomes pair during meiosis) is thought to be relatively common in salmonids compared to other vertebrate species. These findings illuminate how genome duplications may not only increase the number of duplicated genes, but may also be involved in the maintenance of them from previous genome duplications as well. PMID:28241055

  12. "Tandem duplication-random loss" is not a real feature of oyster mitochondrial genomes

    Directory of Open Access Journals (Sweden)

    Zhang Guofan


    Full Text Available Abstract Duplications and rearrangements of coding genes are major themes in the evolution of mitochondrial genomes, bearing important consequences in the function of mitochondria and the fitness of organisms. Yu et al. (BMC Genomics 2008, 9:477 reported the complete mt genome sequence of the oyster Crassostrea hongkongensis (16,475 bp and found that a DNA segment containing four tRNA genes (trnK1, trnC, trnQ1 and trnN, a duplicated (rrnS and a split rRNA gene (rrnL5' was absent compared with that of two other Crassostrea species. It was suggested that the absence was a novel case of "tandem duplication-random loss" with evolutionary significance. We independently sequenced the complete mt genome of three C. hongkongensis individuals, all of which were 18,622 bp and contained the segment that was missing in Yu et al.'s sequence. Further, we designed primers, verified sequences and demonstrated that the sequence loss in Yu et al.'s study was an artifact caused by placing primers in a duplicated region. The duplication and split of ribosomal RNA genes are unique for Crassostrea oysters and not lost in C. hongkongensis. Our study highlights the need for caution when amplifying and sequencing through duplicated regions of the genome.

  13. The complete mitochondrial genome of Pseudocellus pearsei (Chelicerata: Ricinulei and a comparison of mitochondrial gene rearrangements in Arachnida

    Directory of Open Access Journals (Sweden)

    Braband Anke


    Full Text Available Abstract Background Mitochondrial genomes are widely utilized for phylogenetic and population genetic analyses among animals. In addition to sequence data the mitochondrial gene order and RNA secondary structure data are used in phylogenetic analyses. Arachnid phylogeny is still highly debated and there is a lack of sufficient sequence data for many taxa. Ricinulei (hooded tickspiders are a morphologically distinct clade of arachnids with uncertain phylogenetic affinities. Results The first complete mitochondrial DNA genome of a member of the Ricinulei, Pseudocellus pearsei (Arachnida: Ricinulei was sequenced using a PCR-based approach. The mitochondrial genome is a typical circular duplex DNA molecule with a size of 15,099 bp, showing the complete set of genes usually present in bilaterian mitochondrial genomes. Five tRNA genes (trnW, trnY, trnN, trnL(CUN, trnV show different relative positions compared to other Chelicerata (e.g. Limulus polyphemus, Ixodes spp.. We propose that two events led to this derived gene order: (1 a tandem duplication followed by random deletion and (2 an independent translocation of trnN. Most of the inferred tRNA secondary structures show the common cloverleaf pattern except tRNA-Glu where the TψC-arm is missing. In phylogenetic analyses (maximum likelihood, maximum parsimony, Bayesian inference using concatenated amino acid and nucleotide sequences of protein-coding genes the basal relationships of arachnid orders remain unresolved. Conclusion Phylogenetic analyses (ML, MP, BI of arachnid mitochondrial genomes fail to resolve interordinal relationships of Arachnida and remain in a preliminary stage because there is still a lack of mitogenomic data from important taxa such as Opiliones and Pseudoscorpiones. Gene order varies considerably within Arachnida – only eight out of 23 species have retained the putative arthropod ground pattern. Some gene order changes are valuable characters in phylogenetic analysis of

  14. Spontaneous recurrent mutations and a complex rearrangement in the MECP2 gene in the light of current models of mutagenesis. (United States)

    Todorov, Tihomir; Todorova, Albena; Motoescu, Cristina; Dimova, Petia; Iancu, Daniela; Craiu, Dana; Stoian, Daniela; Barbarii, Ligia; Bojinova, Veneta; Mitev, Vanyo


    Mutations in the methyl-CpG-binding protein 2 (MECP2) gene are associated with Rett syndrome (RTT). The MECP2 gene has some unique characteristics: (1) it is mainly affected by de novo mutations, due to recurrent independent mutational events in a defined "hot spot" regions or positions; (2) complex mutational events along a single allele are frequently found in this gene; (3) most mutations arise on paternal X chromosome. The recurrent point mutations involve mainly CpG dinucleotides, where C>T transitions are explained by methylation-mediated deamination. The complex mutational events might be explained by the genomic architecture of the region involving the MECP2 gene. The finding that most spontaneous mutations arise on paternal X-chromosome supports the higher contribution of replication-mediated mechanism of mutagenesis. We present 9 types of mutations in the MECP2 gene, detected in a group of 22 Bulgarian and 6 Romanian classical RTT patients. Thirteen patients were clarified on molecular level (46.4%). The point mutations in our sample account for 61.5%. One intraexonic deletion was detected in the present study (7.7%). One novel insertion c.321_322insGAAG, p.(Lys107_Leu108insGluAlafs2*) was found (7.7%). Large deletions and complex mutations account for 23%. A novel complex mutational event c.[584_624del41insTT; 638delTinsCA] was detected in a Romanian patient. We discuss different types of the MECP2 mutations detected in our sample in the light of the possible mechanisms of mutagenesis. Complex gene rearrangements involving a combination of deletions and insertions have always been most difficult to detect, to specify precisely and hence to explain in terms of their underlying mutational mechanisms.

  15. Direct assessment of junctional diversity in rearranged T cell receptor β chain encoding genes by combined heteroduplex and single strand conformation polymorphism (SSCP) analysis

    NARCIS (Netherlands)

    Offermans, M.T.C.; Struyk, L.; Geus, B. de; Breedveld, F.C.; Elsen, P.J. van den; Rozing, J.


    In order to define the extent of T cell heterogeneity and clonality, unique DNA sequences in the junctional region in rearranged T cell receptor (TcR) genes can be studied. For this purpose we have adapted a non-denaturing nucleic acid gel electrophoresis procedure to detect TcR junctional diversity

  16. Comparative investigations of T cell receptor gamma gene rearrangements in frozen and formalin-fixed paraffin wax-embedded tissues by capillary electrophoresis

    DEFF Research Database (Denmark)

    Christensen, M; Funder, A D; Bendix, K


    AIM: To compare clonal T cell receptor gamma (TCRgamma) gene rearrangements in frozen and formalin-fixed paraffin wax-embedded (FFPE) tissue, using capillary electrophoresis for use in diagnostics, as T cell lymphomas may be difficult to diagnose by conventional methods. METHODS: The DNA for PCR...

  17. Comparison of different polymerase chain reaction-based approaches for clonality assessment of immunoglobulin heavy-chain gene rearrangements in B-cell neoplasia

    NARCIS (Netherlands)

    Derksen, P W; Langerak, A W; Kerkhof, E; Wolvers-Tettero, I L; Boor, P P; Mulder, A H; Vrints, L W; Coebergh, J W; van Krieken, J H; Schuuring, E; Kluin, P M; van Dongen, J J


    Several frequently applied polymerase chain reaction strategies for analysis of immunoglobulin heavy-chain gene rearrangements were compared by analyzing 70 B-cell lymphoproliferative disorders and 24 reactive lymphoid lesions. Southern blot analysis was used as the "gold standard" for clonality ass

  18. Systematic characterisation of disease associated balanced chromosome rearrangements by FISH: cytogenetically and genetically anchored YACs identify microdeletions and candidate regions for mental retardation genes

    DEFF Research Database (Denmark)

    Wirth, J; Nothwang, H G; van der Maarel, S


    Disease associated balanced chromosome rearrangements (DBCRs) have been instrumental in the isolation of many disease genes. To facilitate the molecular cytogenetic characterisation of DBCRs, we have generated a set of >1200 non-chimeric, cytogenetically and genetically anchored CEPH YACs, on ave...

  19. Direct assessment of junctional diversity in rearranged T cell receptor β chain encoding genes by combined heteroduplex and single strand conformation polymorphism (SSCP) analysis

    NARCIS (Netherlands)

    Offermans, M.T.C.; Struyk, L.; Geus, B. de; Breedveld, F.C.; Elsen, P.J. van den; Rozing, J.


    In order to define the extent of T cell heterogeneity and clonality, unique DNA sequences in the junctional region in rearranged T cell receptor (TcR) genes can be studied. For this purpose we have adapted a non-denaturing nucleic acid gel electrophoresis procedure to detect TcR junctional

  20. Different expression patterns of duplicated PHANTASTICA-like genes in Lotus japonicus suggest their divergent functions during compound leaf development

    Institute of Scientific and Technical Information of China (English)

    Jiang Hong LUO; Jun YAN; Lin WENG; Jun YANG; Zhong ZHAO; Jiang Hua CHEN; Xiao He HU; Da LUO


    Recent studies on leaf development demonstrate that the mechanism on the adaxial-abaxial polarity pattern formation could be well conserved among the far-related species, in which PHANTASTICA (PAHN)-like genes play important roles. In this study, we explored the conservation and diversity on functions of PHAN-like genes during the compound leaf development in Lotusjaponicus, a papilionoid legume. Two PHAN-like genes in L. japonicus, LjPHANa and LjPHANb,were found to originate from a gene duplication event and displayed different expression patterns during compound leaf development. Two mutants, reduced leaflets1 (rel1) and reduced leaflets3 (rel3), which exhibited decreased adaxial identity of leaflets and reduced leaflet initiation, were identified and investigated. The expression patterns of both LjPHANs in rel mutants were altered and correlated with abnormalities of compound leaves. Our data suggest that LjPHANa and LjPHANb play important but divergent roles in regulating adaxial-abaxial polarity of compound leaves in L. japonicus.

  1. Transposon Ac/Ds-induced chromosomal rearrangements at the rice OsRLG5 locus. (United States)

    Xuan, Yuan Hu; Piao, Hai Long; Je, Byoung Il; Park, Soon Ju; Park, Su Hyun; Huang, Jin; Zhang, Jian Bo; Peterson, Thomas; Han, Chang-deok


    Previous studies have shown that pairs of closely-linked Ac/Ds transposable elements can induce various chromosomal rearrangements in plant genomes. To study chromosomal rearrangements in rice, we isolated a line (OsRLG5-161) that contains two inversely-oriented Ds insertions in OsRLG5 (Oryza sativa Receptor like kinase Gene 5). Among approximately 300 plants regenerated from OsRLG5-161 heterozygous seeds, 107 contained rearrangements including deletions, duplications and inversions of various sizes. Most rearrangements were induced by previously identified alternative transposition mechanism. Furthermore, we also detected a new class of rearrangements that contain juxtaposed inversions and deletions on the same chromosome. We propose that these novel alleles were generated by a previously unreported type of alternative transposition reactions involving the 5' and 3' termini of two inversely-oriented Ds elements located on the same chromatid. Finally, 11% of rearrangements contained inversions resulting from homologous recombination between the two inverted Ds elements in OsRLG5-161. The high frequency inheritance and great variety of rearrangements obtained suggests that the rice regeneration system results in a burst of transposition activity and a relaxation of the controls which normally limit the transposition competence of individual Ds termini. Together, these results demonstrate a greatly enlarged potential of the Ac/Ds system for plant chromosome engineering. © The Author(s) 2011. Published by Oxford University Press.

  2. [Pooled Analysis of RET/PTC Gene Rearrangement Rate in Sporadic and Radiogenic Thyroid Papillary Carcinoma]. (United States)

    Ushenkova, L N; Koterov, A N; Biryukov, A P


    The database of publications on molecular epidemiology of RET/PTC rearrangements in sporadic and radiogenic thyroid papillary carcinoma has been formed (197 sources at the end of 2014; coverage of 100%). Based on this database a pooled analysis of data on the rates of RET/PTC1, RET/PTC3 and RET/PTC in total was conducted. Statistical approach involves a simple pooling, as well as calculations on the models of random and fixed effects. Since almost all the strata were characterized by heterogeneity, simple pooling and random effect models were adequate. Calculations using both models led to almost identical results. For rates of RET/PTC1, RET/PTC3 and RET/PTC in total with respect to formed carcinoma striations the following values (pooling, in %) were obtained: sporadic, total--13.2; 8.9; 21.2; sporadic, adults--13.3; 9.9; 21.1; sporadic, children--22.4; 17.5; 44.5; radiogenic, total--20.9; 20.3; 40.4; radiotherapy (exposure in childhood)--31.1; 11.8; 42.5; children affected after the Chernobyl accident--19.9; 23.6; 46.1; radiological incidents (exposure in adulthood)--19.9; 7.7; 18.4. Statistically proven is the reliability of differences of carcinoma indicators for children compared with adults (both sporadic and radiogenic tumors) and for radiogenic cancer compared with sporadic. The greatest increase in rate after irradiation was found for RET/PTC1, previously characterized in vitro as one of radiogenic types of RET/PTC.

  3. Tandem inversion duplication within F8 Intron 1 associated with mild haemophilia A. (United States)

    Lannoy, N; Bandelier, C; Grisart, B; Reginster, M; Ronge-Collard, E; Vikkula, M; Hermans, C


    In approximately 90% of mild haemophilia A (HA) patients, a missense mutation can be identified using complete gene sequencing. In this study, multiplex ligation-dependent probe amplification analysis was performed as a second step in 10 French-speaking Belgian with mild HA presenting no detectable causal mutation by complete sequencing of the factor VIII (FVIII) (F8) gene's 26 exons and its 1.2 kb of contiguous promoter sequence. This gene dosage technique enabled the detection of exon 1 duplications of F8 in three apparently unrelated subjects. Using array-comparative genomic hybridization, breakpoint analysis delimited the duplication extent to 210 kb in the F8 intron 1 and VBP1 gene intragenic position. We postulated that the rearrangement responsible for this duplication, never before reported, could be attributed to a symmetrical tandem inversion duplication, resulting in a large 233 kb rearrangement of F8 intron 1. This rearranged intron should lead to the production of a small number of normal mRNA transcripts in relation to the mild HA phenotype. Our analysis of the entire F8 mRNA from index case 1, particularly the segment containing exons 1-9, revealed normal amplification and sequencing. Reduced plasma FVIII antigen levels caused by cross-reacting material is associated with a quantitative deficiency of plasma FVIII. Male patients were unresponsive to desmopressin (1-deamino-8-D-arginine vasopressin). All patients displayed identical F8 haplotypes, despite not being related, which suggests a possible founder effect caused by a 210 kb duplication involving F8 exon 1.

  4. Detection of monoclonal immunoglobulin heavy chain gene rearrangement (FR3 in Thai malignant lymphoma by High Resolution Melting curve analysis

    Directory of Open Access Journals (Sweden)

    Pongpruttipan Tawatchai


    Full Text Available Abstract Malignant lymphoma, especially non-Hodgkin lymphoma, is one of the most common hematologic malignancies in Thailand. The diagnosis of malignant lymphoma is often problematic, especially in early stages of the disease. Detection of antigen receptor gene rearrangement including T cell receptor (TCR and immunoglobulin heavy chain (IgH by polymerase chain reaction followed by heteroduplex has currently become standard whereas fluorescent fragment analysis (GeneScan has been used for confirmation test. In this study, three techniques had been compared: thermocycler polymerase chain reaction (PCR followed by heteroduplex and polyacrylamide gel electrophoresis, GeneScan analysis, and real time PCR with High Resolution Melting curve analysis (HRM. The comparison was carried out with DNA extracted from paraffin embedded tissues diagnosed as B- cell non-Hodgkin lymphoma. Specific PCR primers sequences for IgH gene variable region 3, including fluorescence labeled IgH primers were used and results were compared with HRM. In conclusion, the detection IgH gene rearrangement by HRM in the LightCycler System showed potential for distinguishing monoclonality from polyclonality in B-cell non-Hodgkin lymphoma. Introduction Malignant lymphoma, especially non-Hodgkin lymphoma, is one of the most common hematologic malignancies in Thailand. The incidence rate as reported by Ministry of Public Health is 3.1 per 100,000 population in female whereas the rate in male is 4.5 per 100,000 population 1. At Siriraj Hospital, the new cases diagnosed as malignant lymphoma were 214.6 cases/year 2. The diagnosis of malignant lymphoma is often problematic, especially in early stages of the disease. Therefore, detection of antigen receptor gene rearrangement including T cell receptor (TCR and immunoglobulin heavy chain (IgH by polymerase chain reaction (PCR assay has recently become a standard laboratory test for discrimination of reactive from malignant clonal

  5. A partial MECP2 duplication in a mildly affected adult male: a putative role for the 3' untranslated region in the MECP2 duplication phenotype

    Directory of Open Access Journals (Sweden)

    Hanchard Neil A


    Full Text Available Abstract Background Duplications of the X-linked MECP2 gene are associated with moderate to severe intellectual disability, epilepsy, and neuropsychiatric illness in males, while triplications are associated with a more severe phenotype. Most carrier females show complete skewing of X-inactivation in peripheral blood and an apparent susceptibility to specific personality traits or neuropsychiatric symptoms. Methods We describe the clinical phenotype of a pedigree segregating a duplication of MECP2 found on clinical array comparative genomic hybridization. The position, size, and extent of the duplication were delineated in peripheral blood samples from affected individuals using multiplex ligation-dependent probe amplification and fluorescence in situ hybridization, as well as targeted high-resolution oligonucleotide microarray analysis and long-range PCR. The molecular consequences of the rearrangement were studied in lymphoblast cell lines using quantitative real-time PCR, reverse transcriptase PCR, and western blot analysis. Results We observed a partial MECP2 duplication in an adult male with epilepsy and mild neurocognitive impairment who was able to function independently; this phenotype has not previously been reported among males harboring gains in MECP2 copy number. The same duplication was inherited by this individual’s daughter who was also affected with neurocognitive impairment and epilepsy and carried an additional copy-number variant. The duplicated segment involved all four exons of MECP2, but excluded almost the entire 3' untranslated region (UTR, and the genomic rearrangement resulted in a MECP2-TEX28 fusion gene mRNA transcript. Increased expression of MECP2 and the resulting fusion gene were both confirmed; however, western blot analysis of lysates from lymphoblast cells demonstrated increased MeCP2 protein without evidence of a stable fusion gene protein product. Conclusion The observations of a mildly affected adult male

  6. Antagonistic roles for KNOX1 and KNOX2 genes in patterning the land plant body plan following an ancient gene duplication.

    Directory of Open Access Journals (Sweden)

    Chihiro Furumizu


    Full Text Available Neofunctionalization following gene duplication is thought to be one of the key drivers in generating evolutionary novelty. A gene duplication in a common ancestor of land plants produced two classes of KNOTTED-like TALE homeobox genes, class I (KNOX1 and class II (KNOX2. KNOX1 genes are linked to tissue proliferation and maintenance of meristematic potentials of flowering plant and moss sporophytes, and modulation of KNOX1 activity is implicated in contributing to leaf shape diversity of flowering plants. While KNOX2 function has been shown to repress the gametophytic (haploid developmental program during moss sporophyte (diploid development, little is known about KNOX2 function in flowering plants, hindering syntheses regarding the relationship between two classes of KNOX genes in the context of land plant evolution. Arabidopsis plants harboring loss-of-function KNOX2 alleles exhibit impaired differentiation of all aerial organs and have highly complex leaves, phenocopying gain-of-function KNOX1 alleles. Conversely, gain-of-function KNOX2 alleles in conjunction with a presumptive heterodimeric BELL TALE homeobox partner suppressed SAM activity in Arabidopsis and reduced leaf complexity in the Arabidopsis relative Cardamine hirsuta, reminiscent of loss-of-function KNOX1 alleles. Little evidence was found indicative of epistasis or mutual repression between KNOX1 and KNOX2 genes. KNOX proteins heterodimerize with BELL TALE homeobox proteins to form functional complexes, and contrary to earlier reports based on in vitro and heterologous expression, we find high selectivity between KNOX and BELL partners in vivo. Thus, KNOX2 genes confer opposing activities rather than redundant roles with KNOX1 genes, and together they act to direct the development of all above-ground organs of the Arabidopsis sporophyte. We infer that following the KNOX1/KNOX2 gene duplication in an ancestor of land plants, neofunctionalization led to evolution of antagonistic

  7. New organelles by gene duplication in a biophysical model of eukaryote endomembrane evolution

    National Research Council Canada - National Science Library

    Ramadas, Rohini; Thattai, Mukund


    .... This endomembrane system arose and diversified during a period characterized by massive expansions of gene families involved in trafficking after the acquisition of a mitochondrial endosymbiont...

  8. Independent and Parallel Evolution of New Genes by Gene Duplication in Two Origins of C4 Photosynthesis Provides New Insight into the Mechanism of Phloem Loading in C4 Species. (United States)

    Emms, David M; Covshoff, Sarah; Hibberd, Julian M; Kelly, Steven


    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes is enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Furthermore, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species. C4 photosynthesis, gene duplication, gene families, parallel evolution. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  9. Evolution of C2H2-zinc finger genes and subfamilies in mammals: Species-specific duplication and loss of clusters, genes and effector domains

    Directory of Open Access Journals (Sweden)

    Aubry Muriel


    Full Text Available Abstract Background C2H2 zinc finger genes (C2H2-ZNF constitute the largest class of transcription factors in humans and one of the largest gene families in mammals. Often arranged in clusters in the genome, these genes are thought to have undergone a massive expansion in vertebrates, primarily by tandem duplication. However, this view is based on limited datasets restricted to a single chromosome or a specific subset of genes belonging to the large KRAB domain-containing C2H2-ZNF subfamily. Results Here, we present the first comprehensive study of the evolution of the C2H2-ZNF family in mammals. We assembled the complete repertoire of human C2H2-ZNF genes (718 in total, about 70% of which are organized into 81 clusters across all chromosomes. Based on an analysis of their N-terminal effector domains, we identified two new C2H2-ZNF subfamilies encoding genes with a SET or a HOMEO domain. We searched for the syntenic counterparts of the human clusters in other mammals for which complete gene data are available: chimpanzee, mouse, rat and dog. Cross-species comparisons show a large variation in the numbers of C2H2-ZNF genes within homologous mammalian clusters, suggesting differential patterns of evolution. Phylogenetic analysis of selected clusters reveals that the disparity in C2H2-ZNF gene repertoires across mammals not only originates from differential gene duplication but also from gene loss. Further, we discovered variations among orthologs in the number of zinc finger motifs and association of the effector domains, the latter often undergoing sequence degeneration. Combined with phylogenetic studies, physical maps and an analysis of the exon-intron organization of genes from the SCAN and KRAB domains-containing subfamilies, this result suggests that the SCAN subfamily emerged first, followed by the SCAN-KRAB and finally by the KRAB subfamily. Conclusion Our results are in agreement with the "birth and death hypothesis" for the evolution of

  10. Actin evolution in ciliates (Protist, Alveolata) is characterized by high diversity and three duplication events. (United States)

    Yi, Zhenzhen; Huang, Lijuan; Yang, Ran; Lin, Xiaofeng; Song, Weibo


    Ciliates possess two distinct nuclear genomes and unique genomic features, including highly fragmented chromosomes and extensive chromosomal rearrangements. Recent transcriptomic surveys have revealed that ciliates have several multi-copy genes providing an ideal template to study gene family evolution. Nonetheless, this process remains little studied in ciliated protozoa and consequently, the evolutionary patterns that govern it are not well understood. In this study, we focused on obtaining fine-scale information relative to ciliate species divergence for the first time. A total of 230 actin gene sequences were derived from this study, among which 217 were from four closely related Pseudokeronopsis species and 13 from other hypotrichous ciliates. Our investigation shows that: (1) At least three duplication events occurred in ciliates: diversification of three actin genes (Actin I, II, III) happened after the divergence of ciliate classes but before that of subclasses. And several recent and genus-specific duplications were followed within Actin I (Sterkiella, Oxytricha, Uroleptus, etc.), Actin II (Sterkiella), respectively. (2) Within the genus Pseudokeronopsis, Actin I gene duplication events happened after P. carnea and P. erythrina diverged. In contrast, in the morphologically similar species P. flava and P. rubra, the duplication event preceded diversification of the two species. The Actin II gene duplication events preceded divergence of the genus Pseudokeronopsis. (3) Phylogenetic analyses revealed that actin is suitable for resolving ciliate classes, but may not be used to infer lower taxon relationships.

  11. T-cell receptor gene rearrangement analysis: cutaneous T cell lymphoma, peripheral T cell lymphoma, and premalignant and benign cutaneous lymphoproliferative disorders. (United States)

    Zelickson, B D; Peters, M S; Muller, S A; Thibodeau, S N; Lust, J A; Quam, L M; Pittelkow, M R


    T-cell receptor gene rearrangement analysis is a useful technique to detect clonality and determine lineage of lymphoid neoplasms. We examined 103 patients with mycosis fungoides, Sézary syndrome, peripheral T cell lymphoma, potentially malignant lymphoproliferative disorders including pre-Sézary syndrome, large plaque parapsoriasis, lymphomatoid papulosis and follicular mucinosis, and various benign inflammatory infiltrates. A clonal rearrangement was detected in skin samples in 20 of 24 patients with mycosis fungoides and in peripheral blood samples in 19 of 21 patients with Sézary syndrome. A clonal population was also detected in seven of eight cases classified as peripheral T cell lymphoma. The potentially malignant dermatoses tended to have clonal rearrangement, with the exception of large plaque parapsoriasis, and further follow-up is needed to correlate clonality with the disease course. These studies demonstrate the value of molecular genetics as an adjunct to morphology in the examination of patients with cutaneous lymphoproliferative disease.

  12. Internal tandem duplications in the Flt3-gene in human acute myeloid leukemia

    NARCIS (Netherlands)

    W.J.C. Rombouts


    textabstractIn the process of hematopoietic development errors may occur, resulting in the aber¬rant activation of (proto-)oncogenes and inactivation of tumor-suppressor genes. This aberrant gene expression may finally result in leukemia, a neoplastic disorder in which immature hematopoietic cells a

  13. Phylogenetic relationships among Perissodactyla: secretoglobin 1A1 gene duplication and triplication in the Equidae family. (United States)

    Côté, Olivier; Viel, Laurent; Bienzle, Dorothee


    Secretoglobin family 1A member 1 (SCGB 1A1) is a small anti-inflammatory and immunomodulatory protein that is abundantly secreted in airway surface fluids. We recently reported the existence of three distinct SCGB1A1 genes in the domestic horse genome as opposed to the single gene copy consensus present in other mammals. The origin of SCGB1A1 gene triplication and the evolutionary relationship of the three genes amongst Equidae family members are unknown. For this study, SCGB1A1 genomic data were collected from various Equus individuals including E. caballus, E. przewalskii, E. asinus, E. grevyi, and E. quagga. Three SCGB1A1 genes in E. przewalskii, two SCGB1A1 genes in E. asinus, and a single SCGB1A1 gene in E. grevyi and E. quagga were identified. Sequence analysis revealed that the non-synonymous nucleotide substitutions between the different equid genes coded for 17 amino acid changes. Most of these changes localized to the SCGB 1A1 central cavity that binds hydrophobic ligands, suggesting that this area of SCGB 1A1 evolved to accommodate diverse molecular interactions. Three-dimensional modeling of the proteins revealed that the size of the SCGB 1A1 central cavity is larger than that of SCGB 1A1A. Altogether, these findings suggest that evolution of the SCGB1A1 gene may parallel the separation of caballine and non-caballine species amongst Equidae, and may indicate an expansion of function for SCGB1A1 gene products. Copyright © 2013 Elsevier Inc. All rights reserved.

  14. Digging deeper: new gene order rearrangements and distinct patterns of codons usage in mitochondrial genomes among shrimps from the Axiidea, Gebiidea and Caridea (Crustacea: Decapoda

    Directory of Open Access Journals (Sweden)

    Mun Hua Tan


    Full Text Available Background Whole mitochondrial DNA is being increasingly utilized for comparative genomic and phylogenetic studies at deep and shallow evolutionary levels for a range of taxonomic groups. Although mitogenome sequences are deposited at an increasing rate into public databases, their taxonomic representation is unequal across major taxonomic groups. In the case of decapod crustaceans, several infraorders, including Axiidea (ghost shrimps, sponge shrimps, and mud lobsters and Caridea (true shrimps are still under-represented, limiting comprehensive phylogenetic studies that utilize mitogenomic information. Methods Sequence reads from partial genome scans were generated using the Illumina MiSeq platform and mitogenome sequences were assembled from these low coverage reads. In addition to examining phylogenetic relationships within the three infraorders, Axiidea, Gebiidea, and Caridea, we also investigated the diversity and frequency of codon usage bias and mitogenome gene order rearrangements. Results We present new mitogenome sequences for five shrimp species from Australia that includes two ghost shrimps, Callianassa ceramica and Trypaea australiensis, along with three caridean shrimps, Macrobrachium bullatum, Alpheus lobidens, and Caridina cf. nilotica. Strong differences in codon usage were discovered among the three infraorders and significant gene order rearrangements were observed. While the gene order rearrangements are congruent with the inferred phylogenetic relationships and consistent with taxonomic classification, they are unevenly distributed within and among the three infraorders. Discussion Our findings suggest potential for mitogenome rearrangements to be useful phylogenetic markers for decapod crustaceans and at the same time raise important questions concerning the drivers of mitogenome evolution in different decapod crustacean lineages.

  15. Specific duplication and dorsoventrally asymmetric expression patterns of Cycloidea-like genes in zygomorphic species of Ranunculaceae.

    Directory of Open Access Journals (Sweden)

    Florian Jabbour

    Full Text Available Floral bilateral symmetry (zygomorphy has evolved several times independently in angiosperms from radially symmetrical (actinomorphic ancestral states. Homologs of the Antirrhinum majus Cycloidea gene (Cyc have been shown to control floral symmetry in diverse groups in core eudicots. In the basal eudicot family Ranunculaceae, there is a single evolutionary transition from actinomorphy to zygomorphy in the stem lineage of the tribe Delphinieae. We characterized Cyc homologs in 18 genera of Ranunculaceae, including the four genera of Delphinieae, in a sampling that represents the floral morphological diversity of this tribe, and reconstructed the evolutionary history of this gene family in Ranunculaceae. Within each of the two RanaCyL (Ranunculaceae Cycloidea-like lineages previously identified, an additional duplication possibly predating the emergence of the Delphinieae was found, resulting in up to four gene copies in zygomorphic species. Expression analyses indicate that the RanaCyL paralogs are expressed early in floral buds and that the duration of their expression varies between species and paralog class. At most one RanaCyL paralog was expressed during the late stages of floral development in the actinomorphic species studied whereas all paralogs from the zygomorphic species were expressed, composing a species-specific identity code for perianth organs. The contrasted asymmetric patterns of expression observed in the two zygomorphic species is discussed in relation to their distinct perianth architecture.

  16. Multiple independent origins of mitochondrial control region duplications in the order Psittaciformes (United States)

    Schirtzinger, Erin E.; Tavares, Erika S.; Gonzales, Lauren A.; Eberhard, Jessica R.; Miyaki, Cristina Y.; Sanchez, Juan J.; Hernandez, Alexis; Müeller, Heinrich; Graves, Gary R.; Fleischer, Robert C.; Wright, Timothy F.


    Mitochondrial genomes are generally thought to be under selection for compactness, due to their small size, consistent gene content, and a lack of introns or intergenic spacers. As more animal mitochondrial genomes are fully sequenced, rearrangements and partial duplications are being identified with increasing frequency, particularly in birds (Class Aves). In this study, we investigate the evolutionary history of mitochondrial control region states within the avian order Psittaciformes (parrots and cockatoos). To this aim, we reconstructed a comprehensive multi-locus phylogeny of parrots, used PCR of three diagnostic fragments to classify the mitochondrial control region state as single or duplicated, and mapped these states onto the phylogeny. We further sequenced 44 selected species to validate these inferences of control region state. Ancestral state reconstruction using a range of weighting schemes identified six independent origins of mitochondrial control region duplications within Psittaciformes. Analysis of sequence data showed that varying levels of mitochondrial gene and tRNA homology and degradation were present within a given clade exhibiting duplications. Levels of divergence between control regions within an individual varied from 0–10.9% with the differences occurring mainly between 51 and 225 nucleotides 3′ of the goose hairpin in domain I. Further investigations into the fates of duplicated mitochondrial genes, the potential costs and benefits of having a second control region, and the complex relationship between evolutionary rates, selection, and time since duplication are needed to fully explain these patterns in the mitochondrial genome. PMID:22543055

  17. A split and rearranged nuclear gene encoding the iron-sulfur subunit of mitochondrial succinate dehydrogenase in Euglenozoa

    Directory of Open Access Journals (Sweden)

    Gray Michael W


    Full Text Available Abstract Background Analyses based on phylogenetic and ultrastructural data have suggested that euglenids (such as Euglena gracilis, trypanosomatids and diplonemids are members of a monophyletic lineage termed Euglenozoa. However, many uncertainties are associated with phylogenetic reconstructions for ancient and rapidly evolving groups; thus, rare genomic characters become increasingly important in reinforcing inferred phylogenetic relationships. Findings We discovered that the iron-sulfur subunit (SdhB of mitochondrial succinate dehydrogenase is encoded by a split and rearranged nuclear gene in Euglena gracilis and trypanosomatids, an example of a rare genomic character. The two subgenic modules are transcribed independently and the resulting mRNAs appear to be independently translated, with the two protein products imported into mitochondria, based on the presence of predicted mitochondrial targeting peptides. Although the inferred protein sequences are in general very divergent from those of other organisms, all of the required iron-sulfur cluster-coordinating residues are present. Moreover, the discontinuity in the euglenozoan SdhB sequence occurs between the two domains of a typical, covalently continuous SdhB, consistent with the inference that the euglenozoan 'half' proteins are functional. Conclusion The discovery of this unique molecular marker provides evidence for the monophyly of Euglenozoa that is independent of evolutionary models. Our results pose questions about the origin and timing of this novel gene arrangement and the structure and function of euglenozoan SdhB.

  18. Prevalence of Gene Rearrangements in Mexican Children with Acute Lymphoblastic Leukemia: A Population Study—Report from the Mexican Interinstitutional Group for the Identification of the Causes of Childhood Leukemia (United States)

    Bekker-Méndez, Vilma Carolina; Miranda-Peralta, Enrique; Núñez-Enríquez, Juan Carlos; Olarte-Carrillo, Irma; Guerra-Castillo, Francisco Xavier; Pompa-Mera, Ericka Nelly; Ocaña-Mondragón, Alicia; Bernáldez-Ríos, Roberto; Medina-Sanson, Aurora; Jiménez-Hernández, Elva; Amador-Sánchez, Raquel; Peñaloza-González, José Gabriel; de Diego Flores-Chapa, José; Fajardo-Gutiérrez, Arturo; Flores-Lujano, Janet; Rodríguez-Zepeda, María del Carmen; Dorantes-Acosta, Elisa María; Bolea-Murga, Victoria; Núñez-Villegas, Nancy; Velázquez-Aviña, Martha Margarita; Torres-Nava, José Refugio; Reyes-Zepeda, Nancy Carolina; González-Bonilla, Cesar; Mejía-Aranguré, Juan Manuel


    Mexico has one of the highest incidences of childhood leukemia worldwide and significantly higher mortality rates for this disease compared with other countries. One possible cause is the high prevalence of gene rearrangements associated with the etiology or with a poor prognosis of childhood acute lymphoblastic leukemia (ALL). The aims of this multicenter study were to determine the prevalence of the four most common gene rearrangements [ETV6-RUNX1, TCF3-PBX1, BCR-ABL1, and MLL rearrangements] and to explore their relationship with mortality rates during the first year of treatment in ALL children from Mexico City. Patients were recruited from eight public hospitals during 2010–2012. A total of 282 bone marrow samples were obtained at each child's diagnosis for screening by conventional and multiplex reverse transcription polymerase chain reaction to determine the gene rearrangements. Gene rearrangements were detected in 50 (17.7%) patients. ETV6-RUNX1 was detected in 21 (7.4%) patients, TCF3-PBX1 in 20 (7.1%) patients, BCR-ABL1 in 5 (1.8%) patients, and MLL rearrangements in 4 (1.4%) patients. The earliest deaths occurred at months 1, 2, and 3 after diagnosis in patients with MLL, ETV6-RUNX1, and BCR-ABL1 gene rearrangements, respectively. Gene rearrangements could be related to the aggressiveness of leukemia observed in Mexican children. PMID:25692130

  19. Prevalence of Gene Rearrangements in Mexican Children with Acute Lymphoblastic Leukemia: A Population Study—Report from the Mexican Interinstitutional Group for the Identification of the Causes of Childhood Leukemia

    Directory of Open Access Journals (Sweden)

    Vilma Carolina Bekker-Méndez


    Full Text Available Mexico has one of the highest incidences of childhood leukemia worldwide and significantly higher mortality rates for this disease compared with other countries. One possible cause is the high prevalence of gene rearrangements associated with the etiology or with a poor prognosis of childhood acute lymphoblastic leukemia (ALL. The aims of this multicenter study were to determine the prevalence of the four most common gene rearrangements [ETV6-RUNX1, TCF3-PBX1, BCR-ABL1, and MLL rearrangements] and to explore their relationship with mortality rates during the first year of treatment in ALL children from Mexico City. Patients were recruited from eight public hospitals during 2010–2012. A total of 282 bone marrow samples were obtained at each child’s diagnosis for screening by conventional and multiplex reverse transcription polymerase chain reaction to determine the gene rearrangements. Gene rearrangements were detected in 50 (17.7% patients. ETV6-RUNX1 was detected in 21 (7.4% patients, TCF3-PBX1 in 20 (7.1% patients, BCR-ABL1 in 5 (1.8% patients, and MLL rearrangements in 4 (1.4% patients. The earliest deaths occurred at months 1, 2, and 3 after diagnosis in patients with MLL, ETV6-RUNX1, and BCR-ABL1 gene rearrangements, respectively. Gene rearrangements could be related to the aggressiveness of leukemia observed in Mexican children.

  20. Rearrangements of MYC gene facilitate risk stratification in diffuse large B-cell lymphoma patients treated with rituximab-CHOP

    DEFF Research Database (Denmark)

    Tzankov, Alexandar; Xu-Monette, Zijun Y; Gerhard, Marc;


    In order to address the debatable prognostic role of MYC rearrangements in diffuse large B-cell lymphoma patients treated with rituximab, cyclophosphamide, hydroxydaunorubicin, vincristine, and prednisone, we evaluated MYC rearrangements by fluorescence in situ hybridization in 563 cases using br...

  1. Identification of functional toxin/immunity genes linked to contact-dependent growth inhibition (CDI and rearrangement hotspot (Rhs systems.

    Directory of Open Access Journals (Sweden)

    Stephen J Poole


    Full Text Available Bacterial contact-dependent growth inhibition (CDI is mediated by the CdiA/CdiB family of two-partner secretion proteins. Each CdiA protein exhibits a distinct growth inhibition activity, which resides in the polymorphic C-terminal region (CdiA-CT. CDI(+ cells also express unique CdiI immunity proteins that specifically block the activity of cognate CdiA-CT, thereby protecting the cell from autoinhibition. Here we show that many CDI systems contain multiple cdiA gene fragments that encode CdiA-CT sequences. These "orphan" cdiA-CT genes are almost always associated with downstream cdiI genes to form cdiA-CT/cdiI modules. Comparative genome analyses suggest that cdiA-CT/cdiI modules are mobile and exchanged between the CDI systems of different bacteria. In many instances, orphan cdiA-CT/cdiI modules are fused to full-length cdiA genes in other bacterial species. Examination of cdiA-CT/cdiI modules from Escherichia coli EC93, E. coli EC869, and Dickeya dadantii 3937 confirmed that these genes encode functional toxin/immunity pairs. Moreover, the orphan module from EC93 was functional in cell-mediated CDI when fused to the N-terminal portion of the EC93 CdiA protein. Bioinformatic analyses revealed that the genetic organization of CDI systems shares features with rhs (rearrangement hotspot loci. Rhs proteins also contain polymorphic C-terminal regions (Rhs-CTs, some of which