WorldWideScience

Sample records for duplicate gene copies

  1. Accelerated evolution after gene duplication: a time-dependent process affecting just one copy.

    Science.gov (United States)

    Pegueroles, Cinta; Laurie, Steve; Albà, M Mar

    2013-08-01

    Gene duplication is widely regarded as a major mechanism modeling genome evolution and function. However, the mechanisms that drive the evolution of the two, initially redundant, gene copies are still ill defined. Many gene duplicates experience evolutionary rate acceleration, but the relative contribution of positive selection and random drift to the retention and subsequent evolution of gene duplicates, and for how long the molecular clock may be distorted by these processes, remains unclear. Focusing on rodent genes that duplicated before and after the mouse and rat split, we find significantly increased sequence divergence after duplication in only one of the copies, which in nearly all cases corresponds to the novel daughter copy, independent of the mechanism of duplication. We observe that the evolutionary rate of the accelerated copy, measured as the ratio of nonsynonymous to synonymous substitutions, is on average 5-fold higher in the period spanning 4-12 My after the duplication than it was before the duplication. This increase can be explained, at least in part, by the action of positive selection according to the results of the maximum likelihood-based branch-site test. Subsequently, the rate decelerates until purifying selection completely returns to preduplication levels. Reversion to the original rates has already been accomplished 40.5 My after the duplication event, corresponding to a genetic distance of about 0.28 synonymous substitutions per site. Differences in tissue gene expression patterns parallel those of substitution rates, reinforcing the role of neofunctionalization in explaining the evolution of young gene duplicates.

  2. Duplication and relocation of the functional DPY19L2 gene within low copy repeats

    Directory of Open Access Journals (Sweden)

    Cheung Joseph

    2006-03-01

    Full Text Available Abstract Background Low copy repeats (LCRs are thought to play an important role in recent gene evolution, especially when they facilitate gene duplications. Duplicate genes are fundamental to adaptive evolution, providing substrates for the development of new or shared gene functions. Moreover, silencing of duplicate genes can have an indirect effect on adaptive evolution by causing genomic relocation of functional genes. These changes are theorized to have been a major factor in speciation. Results Here we present a novel example showing functional gene relocation within a LCR. We characterize the genomic structure and gene content of eight related LCRs on human Chromosomes 7 and 12. Two members of a novel transmembrane gene family, DPY19L, were identified in these regions, along with six transcribed pseudogenes. One of these genes, DPY19L2, is found on Chromosome 12 and is not syntenic with its mouse orthologue. Instead, the human locus syntenic to mouse Dpy19l2 contains a pseudogene, DPY19L2P1. This indicates that the ancestral copy of this gene has been silenced, while the descendant copy has remained active. Thus, the functional copy of this gene has been relocated to a new genomic locus. We then describe the expansion and evolution of the DPY19L gene family from a single gene found in invertebrate animals. Ancient duplications have led to multiple homologues in different lineages, with three in fish, frogs and birds and four in mammals. Conclusion Our results show that the DPY19L family has expanded throughout the vertebrate lineage and has undergone recent primate-specific evolution within LCRs.

  3. Impact of duplicate gene copies on phylogenetic analysis and divergence time estimates in butterflies

    Directory of Open Access Journals (Sweden)

    Liswi Saif W

    2009-05-01

    Full Text Available Abstract Background The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Results Sequences from ultraviolet-sensitive (UVRh, blue-sensitive (BRh, and long-wavelength sensitive (LWRh opsins,EF-1α and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total. Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3–35.6 Mya was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7–13.6 Mya, and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5–26.1 Mya. Our family-level results are congruent with

  4. Tandem duplication and copy number polymorphism of the SRY gene in patients with sex chromosome anomalies and males exposed to natural background radiation.

    Science.gov (United States)

    Premi, Sanjay; Srivastava, Jyoti; Chandy, Sebastian Padinjarel; Ahmad, Jamal; Ali, Sher

    2006-02-01

    Mutations in the SRY gene encompassing the HMG box have been well characterized in gonadal dysgenesis, male infertility and other types of sex chromosome related anomalies (SCRA). However, no information is available on copy number status of this gene under such abnormal conditions. Employing 'Taqman Probe Assay' specific to the SRY gene, we screened 16 DNA samples from patients with SCRA and 36 samples from males exposed to high levels of natural background radiation (HNBR). Patients with SCRA showed 2-16 copies of the SRY gene of which, one, Oxen (49, XYYYY) had eight copies with sequences different from one another. Of the 36 HNBR samples, 12 had one copy whereas 24 harboured 2-8 copies of the SRY gene. A HNBR male 33F had one normal and one mutated copy of this gene. Analysis of 25 DNA samples from blood and semen of normal males showed only one copy of this gene. Despite multiple copies in affected males, fluorescence in-situ hybridization (FISH) with SRY probe detected a single signal on the Y chromosome in HNBR males suggesting its possible localized tandem duplication. Copy number status of the other Y-linked loci is envisaged to augment DNA diagnostics facilitating genetic counselling to affected patients.

  5. Decomposition of Parallel Copies with Duplication

    Directory of Open Access Journals (Sweden)

    G. N. Purohit

    2012-05-01

    Full Text Available SSA form is becoming more popular in the context of JIT compilation since it allows the compiler to perform important optimizations like common sub-expression elimination or constant propagation without the drawbacks of keeping huge data structures in memory or requiring a lot of computing power. The recent approach of SSA-based register allocation performs SSA elimination after register allocation. F. Bouchez et al. proposed parallel copy motion to prevent the splitting of edges when going out of colored SSA by moving the code that should be assigned to the edges to a more convenient place. Duplications in parallel copies pose some problems when moving them. In this paper an approach has been developed to decompose parallel copies so that duplications can be handled separately and parallel copies can be easily moved away without duplication. A simple and elegant application is moving duplicated copies out of critical edges. This is often beneficial compared to the alternative splitting the edge.

  6. Collateral damage: Spread of repeat-induced point mutation from a duplicated DNA sequence into an adjoining single-copy gene in Neurospora crassa

    Indian Academy of Sciences (India)

    Meenal Vyas; Durgadas P Kasbekar

    2005-02-01

    Repeat-induced point mutation (RIP) is an unusual genome defense mechanism that was discovered in Neurospora crassa. RIP occurs during a sexual cross and induces numerous G : C to A : T mutations in duplicated DNA sequences and also methylates many of the remaining cytosine residues. We measured the susceptibility of the erg-3 gene, present in single copy, to the spread of RIP from duplications of adjoining sequences. Genomic segments of defined length (1, 1.5 or 2 kb) and located at defined distances (0, 0.5, 1 or 2 kb) upstream or downstream of the erg-3 open reading frame (ORF) were amplified by polymerase chain reaction (PCR), and the duplications were created by transformation of the amplified DNA. Crosses were made with the duplication strains and the frequency of erg-3 mutant progeny provided a measure of the spread of RIP from the duplicated segments into the erg-3 gene. Our results suggest that ordinarily RIP-spread does not occur. However, occasionally the mechanism that confines RIP to the duplicated segment seems to fail (frequency 0.1–0.8%) and then RIP can spread across as much as 1 kb of unduplicated DNA. Additionally, the bacterial hph gene appeared to be very susceptible to the spread of RIP-associated cytosine methylation.

  7. MARCH5 gene is duplicated in rainbow trout, but only fish-specific gene copy is up-regulated after VHSV infection.

    Science.gov (United States)

    Rebl, Alexander; Köbis, Judith M; Fischer, Uwe; Takizawa, Fumio; Verleih, Marieke; Wimmers, Klaus; Goldammer, Tom

    2011-12-01

    Ubiquitination regulates the activity, stability, and localization of a wide variety of proteins. Several mammalian MARCH ubiquitin E3 ligase proteins have been suggested to control cell surface immunoreceptors. The mitochondrial protein MARCH5 is a positive regulator of Toll-like receptor 7-mediated NF-κB activation in mammals. In the present study, duplicated MARCH5-like cDNA sequences were isolated from rainbow trout (Oncorhynchus mykiss) comprising open reading frames of 882 bp (MARCH5A) and 885 bp (MARCH5B), respectively. Trout MARCH5A and MARCH5B-encoding sequences share only 65% sequence identity. Phylogenetic analyses including an additionally isolated MARCH5-like sequence from whitefish (Coregonus maraena) suggest that teleosts possess an additional MARCH5 gene copy resulting from a fish-specific whole genome duplication. Coding sequences of MARCH5A and MARCH5B genes from trout are distributed over six exons. Hypothetical MARCH5 proteins from trout comprise four transmembrane helices and a single motif similar to a RING variant domain (RINGv) including eight highly conserved cysteine and histidine residues. A 'reverse-northern blot' analysis revealed furthermore a MARCH5B Δexon5 transcript variant. Both MARCH5 genes from trout show a strain-, tissue- and cell-specific expression profile indicating different functional roles. Fish-specific MARCH5A gene for instance might be involved in defense mechanisms, since in vivo-challenge with the viral pathogen VHSV caused a significant 1.7-fold elevated copy number of the respective gene in gills four days after infection, whereas MARCH5B transcript level did not increase.

  8. Increased copy number for methylated maternal 15q duplications leads to changes in gene and protein expression in human cortical samples

    Directory of Open Access Journals (Sweden)

    Scoles Haley A

    2011-12-01

    Full Text Available Abstract Background Duplication of chromosome 15q11-q13 (dup15q accounts for approximately 3% of autism cases. Chromosome 15q11-q13 contains imprinted genes necessary for normal mammalian neurodevelopment controlled by a differentially methylated imprinting center (imprinting center of the Prader-Willi locus, PWS-IC. Maternal dup15q occurs as both interstitial duplications and isodicentric chromosome 15. Overexpression of the maternally expressed gene UBE3A is predicted to be the primary cause of the autistic features associated with dup15q. Previous analysis of two postmortem dup15q frontal cortical samples showed heterogeneity between the two cases, with one showing levels of the GABAA receptor genes, UBE3A and SNRPN in a manner not predicted by copy number or parental imprint. Methods Postmortem human brain tissue (Brodmann area 19, extrastriate visual cortex was obtained from 8 dup15q, 10 idiopathic autism and 21 typical control tissue samples. Quantitative PCR was used to confirm duplication status. Quantitative RT-PCR and Western blot analyses were performed to measure 15q11-q13 transcript and protein levels, respectively. Methylation-sensitive high-resolution melting-curve analysis was performed on brain genomic DNA to identify the maternal:paternal ratio of methylation at PWS-IC. Results Dup15q brain samples showed a higher level of PWS-IC methylation than control or autism samples, indicating that dup15q was maternal in origin. UBE3A transcript and protein levels were significantly higher than control and autism in dup15q, as expected, although levels were variable and lower than expected based on copy number in some samples. In contrast, this increase in copy number did not result in consistently increased GABRB3 transcript or protein levels for dup15q samples. Furthermore, SNRPN was expected to be unchanged in expression in dup15q because it is expressed from the single unmethylated paternal allele, yet SNRPN levels were significantly

  9. Diagnosing Smith-Magenis syndrome and duplication 17p11.2 syndrome by RAI1 gene copy number variation using quantitative real-time PCR.

    Science.gov (United States)

    Truong, Hoa T; Solaymani-Kohal, Sara; Baker, Kevin R; Girirajan, Santhosh; Williams, Stephen R; Vlangos, Christopher N; Smith, Ann C M; Bunyan, David J; Roffey, Paul E; Blanchard, Christopher L; Elsea, Sarah H

    2008-03-01

    Smith-Magenis syndrome (SMS) and duplication 17p11.2 (dup17p11.2) syndrome are multiple congenital anomalies/mental retardation disorders resulting from either a deletion or duplication of the 17p11.2 region, respectively. The retinoic acid induced 1 (RAI1) gene is the causative gene for SMS and is included in the 17p11.2 region of dup17p11.2 syndrome. Currently SMS and dup17p11.2 syndrome are diagnosed using a combination of clinically recognized phenotypes and molecular cytogenetic analyses such as fluorescent in situ hybridization (FISH). However, these methods have proven to be highly expensive, time consuming, and dependent upon the low resolving capabilities of the assay. To address the need for improved diagnostic methods for SMS and dup17p11.2 syndrome, we designed a quantitative real-time PCR (Q-PCR) assay that measures RAI1 copy number using the comparative C(t) method, DeltaDeltaC(t). We tested our assay with samples blinded to their previous SMS or dup17p11.2 syndrome status. In all cases, we were able to determine RAI1 copy number status and render a correct diagnosis accordingly. We validated these results by both FISH and multiplex ligation-dependent probe amplification (MLPA). We conclude that Q-PCR is an accurate, reproducible, low-cost, and reliable assay that can be employed for routine use in SMS and dup17p11.2 diagnosis.

  10. Diverged Copies of the Seed Regulatory Opaque-2 Gene by a Segmental Duplication in the Progenitor Genome of Rice,Sorghum,and Maize

    Institute of Scientific and Technical Information of China (English)

    Jian-Hong Xu; Joachim Messing

    2008-01-01

    Comparative analyses of the sequence of entire genomes have shown that gene duplications,chromosomal segmental duplications.or even whole genome duplications(WGD)have played prominent roles in the evolution of many eukaryotic species.Here,we used the ancient duplication of a well known transcription factor in maize,encoded by the Opaque-2(02)IOCUS,to examine the generaI features of divergences of chromosomaI segmentaI duplications in a lineagespecific manner.We took advantage of contiguous chromosomal sequence information in rice(Oryza sativa,Nipponbare).sorghum(Sorghum bicoloc Btx623),and maize(Zea mays,B73)that were aligned by conserved gene order(synteny).This analysis showed that the maize O2 locus is contained within a 1.25 million base-pair(Mb)segment on chromosome 7.which was duplicated≈56 million years ago(mya)before the split of rice and maize 50 mya.The duplicated region on chromosome 1 is only half the size and contains the maize OHP gene.which does not restore the o2 mutation although it encodes a protein with the same DNA and protein binding properties in endosperm.The segmental duplication iS not only found in rice,but also in sorghum,which split from maize 11.9 mya.A detailed analysis of the duplicated regions provided examples for complex rearrangements including deletions.duplications,conversions,inversions,and translocations.Furthermore,the rice and sorghum genomes appeared to be more stable than the maize genome,probably because maize underwent allotetraploidization and then diploidization.

  11. Inferring angiosperm phylogeny from EST data with widespread gene duplication

    OpenAIRE

    Sanderson, Michael J.; McMahon, Michelle M.

    2007-01-01

    Background Most studies inferring species phylogenies use sequences from single copy genes or sets of orthologs culled from gene families. For taxa such as plants, with very high levels of gene duplication in their nuclear genomes, this has limited the exploitation of nuclear sequences for phylogenetic studies, such as those available in large EST libraries. One rarely used method of inference, gene tree parsimony, can infer species trees from gene families undergoing duplication and loss, bu...

  12. The Phenotypic Plasticity of Duplicated Genes in Saccharomyces cerevisiae and the Origin of Adaptations

    Directory of Open Access Journals (Sweden)

    Florian Mattenberger

    2017-01-01

    Full Text Available Gene and genome duplication are the major sources of biological innovations in plants and animals. Functional and transcriptional divergence between the copies after gene duplication has been considered the main driver of innovations . However, here we show that increased phenotypic plasticity after duplication plays a more major role than thought before in the origin of adaptations. We perform an exhaustive analysis of the transcriptional alterations of duplicated genes in the unicellular eukaryote Saccharomyces cerevisiae when challenged with five different environmental stresses. Analysis of the transcriptomes of yeast shows that gene duplication increases the transcriptional response to environmental changes, with duplicated genes exhibiting signatures of adaptive transcriptional patterns in response to stress. The mechanism of duplication matters, with whole-genome duplicates being more transcriptionally altered than small-scale duplicates. The predominant transcriptional pattern follows the classic theory of evolution by gene duplication; with one gene copy remaining unaltered under stress, while its sister copy presents large transcriptional plasticity and a prominent role in adaptation. Moreover, we find additional transcriptional profiles that are suggestive of neo- and subfunctionalization of duplicate gene copies. These patterns are strongly correlated with the functional dependencies and sequence divergence profiles of gene copies. We show that, unlike singletons, duplicates respond more specifically to stress, supporting the role of natural selection in the transcriptional plasticity of duplicates. Our results reveal the underlying transcriptional complexity of duplicated genes and its role in the origin of adaptations.

  13. Analysis of Duplicate Genes in Soybean

    Institute of Scientific and Technical Information of China (English)

    C.M. Cai; K.J. Van; M.Y. Kim; S.H. Lee

    2007-01-01

    @@ Gene duplication is a major determinant of the size and gene complement of eukaryotic genomes (Lockton and Gaut, 2005). There are a number of different ways in which duplicate genes can arise (Sankoff, 2001), but the most spectacular method of gene duplication may be whole genome duplication via polyploidization.

  14. Molecular trajectories leading to the alternative fates of duplicate genes.

    Directory of Open Access Journals (Sweden)

    Michael Marotta

    Full Text Available Gene duplication generates extra gene copies in which mutations can accumulate without risking the function of pre-existing genes. Such mutations modify duplicates and contribute to evolutionary novelties. However, the vast majority of duplicates appear to be short-lived and experience duplicate silencing within a few million years. Little is known about the molecular mechanisms leading to these alternative fates. Here we delineate differing molecular trajectories of a relatively recent duplication event between humans and chimpanzees by investigating molecular properties of a single duplicate: DNA sequences, gene expression and promoter activities. The inverted duplication of the Glutathione S-transferase Theta 2 (GSTT2 gene had occurred at least 7 million years ago in the common ancestor of African great apes and is preserved in chimpanzees (Pan troglodytes, whereas a deletion polymorphism is prevalent in humans. The alternative fates are associated with expression divergence between these species, and reduced expression in humans is regulated by silencing mutations that have been propagated between duplicates by gene conversion. In contrast, selective constraint preserved duplicate divergence in chimpanzees. The difference in evolutionary processes left a unique DNA footprint in which dying duplicates are significantly more similar to each other (99.4% than preserved ones. Such molecular trajectories could provide insights for the mechanisms underlying duplicate life and death in extant genomes.

  15. Gene duplication as a major force in evolution

    Indian Academy of Sciences (India)

    Santoshkumar Magadum; Urbi Banerjee; Priyadharshini Murugan; Doddabhimappa Gangapur; Rajasekar Ravikesavan

    2013-04-01

    Gene duplication is an important mechanism for acquiring new genes and creating genetic novelty in organisms. Many new gene functions have evolved through gene duplication and it has contributed tremendously to the evolution of developmental programmes in various organisms. Gene duplication can result from unequal crossing over, retroposition or chromosomal (or genome) duplication. Understanding the mechanisms that generate duplicate gene copies and the subsequent dynamics among gene duplicates is vital because these investigations shed light on localized and genomewide aspects of evolutionary forces shaping intra-specific and inter-specific genome contents, evolutionary relationships, and interactions. Based on whole-genome analysis of Arabidopsis thaliana, there is compelling evidence that angiosperms underwent two whole-genome duplication events early during their evolutionary history. Recent studies have shown that these events were crucial for creation of many important developmental and regulatory genes found in extant angiosperm genomes. Recent studies also provide strong indications that even yeast (Saccharomyces cerevisiae), with its compact genome, is in fact an ancient tetraploid. Gene duplication can provide new genetic material for mutation, drift and selection to act upon, the result of which is specialized or new gene functions. Without gene duplication the plasticity of a genome or species in adapting to changing environments would be severely limited. Whether a duplicate is retained depends upon its function, its mode of duplication, (i.e. whether it was duplicated during a whole-genome duplication event), the species in which it occurs, and its expression rate. The exaptation of preexisting secondary functions is an important feature in gene evolution, just as it is in morphological evolution.

  16. FUNCTIONAL SPECIALIZATION OF DUPLICATED FLAVONOID BIOSYNTHESIS GENES IN WHEAT

    Directory of Open Access Journals (Sweden)

    Khlestkina E.

    2012-08-01

    Full Text Available Gene duplication followed by subfunctionalization and neofunctionalization is of a great evolutionary importance. In plant genomes, duplicated genes may result from either polyploidization (homoeologous genes or segmental chromosome duplications (paralogous genes. In allohexaploid wheat Triticum aestivum L. (2n=6x=42, genome BBAADD, both homoeologous and paralogous copies were found for the regulatory gene Myc encoding MYC-like transcriptional factor in the biosynthesis of flavonoid pigments, anthocyanins, and for the structural gene F3h encoding one of the key enzymes of flavonoid biosynthesis, flavanone 3-hydroxylase. From the 5 copies (3 homoeologous and 2 paralogous of the Myc gene found in T. aestivum, only one plays a regulatory role in anthocyanin biosynthesis, interacting complementary with another transcriptional factor (MYB-like to confer purple pigmentation of grain pericarp in wheat. The role and functionality of the other 4 copies of the Myc gene remain unknown. From the 4 functional copies of the F3h gene in T. aestivum, three homoeologues have similar function. They are expressed in wheat organs colored with anthocyanins or in the endosperm, participating there in biosynthesis of uncolored flavonoid substances. The fourth copy (the B-genomic paralogue is transcribed neither in wheat organs colored with anthocyanins nor in seeds, however, it’s expression has been noticed in roots of aluminium-stressed plants, where the three homoeologous copies are not active. Functional diversification of the duplicated flavonoid biosynthesis genes in wheat may be a reason for maintenance of the duplicated copies and preventing them from pseudogenization.The study was supported by RFBR (11-04-92707. We also thank Ms. Galina Generalova for technical assistance.

  17. Benchmarking Transcriptome Quantification Methods for Duplicated Genes in Xenopus laevis.

    Science.gov (United States)

    Kwon, Taejoon

    2015-01-01

    Xenopus is an important model organism for the study of genome duplication in vertebrates. With the full genome sequence of diploid Xenopus tropicalis available, and that of allotetraploid X. laevis close to being finished, we will be able to expand our understanding of how duplicated genes have evolved. One of the key features in the study of the functional consequence of gene duplication is how their expression patterns vary across different conditions, and RNA-seq seems to have enough resolution to discriminate the expression of highly similar duplicated genes. However, most of the current RNA-seq analysis methods were not designed to study samples with duplicate genes such as in X. laevis. Here, various computational methods to quantify gene expression in RNA-seq data were evaluated, using 2 independent X. laevis egg RNA-seq datasets and 2 reference databases for duplicated genes. The fact that RNA-seq can measure expression levels of similar duplicated genes was confirmed, but long paired-end reads are more informative than short single-end reads to discriminate duplicated genes. Also, it was found that bowtie, one of the most popular mappers in RNA-seq analysis, reports significantly smaller numbers of unique hits according to a mapping quality score compared to other mappers tested (BWA, GSNAP, STAR). Calculated from unique hits based on a mapping quality score, both expression levels and the expression ratio of duplicated genes can be estimated consistently among biological replicates, demonstrating that this method can successfully discriminate the expression of each copy of a duplicated gene pair. This comprehensive evaluation will be a useful guideline for studying gene expression of organisms with genome duplication using RNA-seq in the future.

  18. Genomic evidence for adaptation by gene duplication.

    Science.gov (United States)

    Qian, Wenfeng; Zhang, Jianzhi

    2014-08-01

    Gene duplication is widely believed to facilitate adaptation, but unambiguous evidence for this hypothesis has been found in only a small number of cases. Although gene duplication may increase the fitness of the involved organisms by doubling gene dosage or neofunctionalization, it may also result in a simple division of ancestral functions into daughter genes, which need not promote adaptation. Hence, the general validity of the adaptation by gene duplication hypothesis remains uncertain. Indeed, a genome-scale experiment found similar fitness effects of deleting pairs of duplicate genes and deleting individual singleton genes from the yeast genome, leading to the conclusion that duplication rarely results in adaptation. Here we contend that the above comparison is unfair because of a known duplication bias among genes with different fitness contributions. To rectify this problem, we compare homologous genes from the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe. We discover that simultaneously deleting a duplicate gene pair in S. cerevisiae reduces fitness significantly more than deleting their singleton counterpart in S. pombe, revealing post-duplication adaptation. The duplicates-singleton difference in fitness effect is not attributable to a potential increase in gene dose after duplication, suggesting that the adaptation is owing to neofunctionalization, which we find to be explicable by acquisitions of binary protein-protein interactions rather than gene expression changes. These results provide genomic evidence for the role of gene duplication in organismal adaptation and are important for understanding the genetic mechanisms of evolutionary innovation.

  19. Duplicability of self-interacting human genes.

    LENUS (Irish Health Repository)

    Pérez-Bercoff, Asa

    2010-01-01

    BACKGROUND: There is increasing interest in the evolution of protein-protein interactions because this should ultimately be informative of the patterns of evolution of new protein functions within the cell. One model proposes that the evolution of new protein-protein interactions and protein complexes proceeds through the duplication of self-interacting genes. This model is supported by data from yeast. We examined the relationship between gene duplication and self-interaction in the human genome. RESULTS: We investigated the patterns of self-interaction and duplication among 34808 interactions encoded by 8881 human genes, and show that self-interacting proteins are encoded by genes with higher duplicability than genes whose proteins lack this type of interaction. We show that this result is robust against the system used to define duplicate genes. Finally we compared the presence of self-interactions amongst proteins whose genes have duplicated either through whole-genome duplication (WGD) or small-scale duplication (SSD), and show that the former tend to have more interactions in general. After controlling for age differences between the two sets of duplicates this result can be explained by the time since the gene duplication. CONCLUSIONS: Genes encoding self-interacting proteins tend to have higher duplicability than proteins lacking self-interactions. Moreover these duplicate genes have more often arisen through whole-genome rather than small-scale duplication. Finally, self-interacting WGD genes tend to have more interaction partners in general in the PIN, which can be explained by their overall greater age. This work adds to our growing knowledge of the importance of contextual factors in gene duplicability.

  20. The genomic architecture of segmental duplications and associated copy number variants in dogs.

    Science.gov (United States)

    Nicholas, Thomas J; Cheng, Ze; Ventura, Mario; Mealey, Katrina; Eichler, Evan E; Akey, Joshua M

    2009-03-01

    Structural variation is an important and abundant source of genetic and phenotypic variation. Here we describe the first systematic and genome-wide analysis of segmental duplications and associated copy number variants (CNVs) in the modern domesticated dog, Canis familiaris, which exhibits considerable morphological, physiological, and behavioral variation. Through computational analyses of the publicly available canine reference sequence, we estimate that segmental duplications comprise approximately 4.21% of the canine genome. Segmental duplications overlap 841 genes and are significantly enriched for specific biological functions such as immunity and defense and KRAB box transcription factors. We designed high-density tiling arrays spanning all predicted segmental duplications and performed aCGH in a panel of 17 breeds and a gray wolf. In total, we identified 3583 CNVs, approximately 68% of which were found in two or more samples that map to 678 unique regions. CNVs span 429 genes that are involved in a wide variety of biological processes such as olfaction, immunity, and gene regulation. Our results provide insight into mechanisms of canine genome evolution and generate a valuable resource for future evolutionary and phenotypic studies.

  1. Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene

    Directory of Open Access Journals (Sweden)

    Shomron Noam

    2007-11-01

    Full Text Available Abstract Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.

  2. Special Issue: Gene Conversion in Duplicated Genes

    Directory of Open Access Journals (Sweden)

    Hideki Innan

    2011-06-01

    Full Text Available Gene conversion is an outcome of recombination, causing non-reciprocal transfer of a DNA fragment. Several decades later than the discovery of crossing over, gene conversion was first recognized in fungi when non-Mendelian allelic distortion was observed. Gene conversion occurs when a double-strand break is repaired by using homologous sequences in the genome. In meiosis, there is a strong preference to use the orthologous region (allelic gene conversion, which causes non-Mendelian allelic distortion, but paralogous or duplicated regions can also be used for the repair (inter-locus gene conversion, also referred to as non-allelic and ectopic gene conversion. The focus of this special issue is the latter, interlocus gene conversion; the rate is lower than allelic gene conversion but it has more impact on phenotype because more drastic changes in DNA sequence are involved.

  3. The roles of whole-genome and small-scale duplications in the functional specialization of Saccharomyces cerevisiae genes.

    Directory of Open Access Journals (Sweden)

    Mario A Fares

    Full Text Available Researchers have long been enthralled with the idea that gene duplication can generate novel functions, crediting this process with great evolutionary importance. Empirical data shows that whole-genome duplications (WGDs are more likely to be retained than small-scale duplications (SSDs, though their relative contribution to the functional fate of duplicates remains unexplored. Using the map of genetic interactions and the re-sequencing of 27 Saccharomyces cerevisiae genomes evolving for 2,200 generations we show that SSD-duplicates lead to neo-functionalization while WGD-duplicates partition ancestral functions. This conclusion is supported by: (a SSD-duplicates establish more genetic interactions than singletons and WGD-duplicates; (b SSD-duplicates copies share more interaction-partners than WGD-duplicates copies; (c WGD-duplicates interaction partners are more functionally related than SSD-duplicates partners; (d SSD-duplicates gene copies are more functionally divergent from one another, while keeping more overlapping functions, and diverge in their sub-cellular locations more than WGD-duplicates copies; and (e SSD-duplicates complement their functions to a greater extent than WGD-duplicates. We propose a novel model that uncovers the complexity of evolution after gene duplication.

  4. The Roles of Whole-Genome and Small-Scale Duplications in the Functional Specialization of Saccharomyces cerevisiae Genes

    Science.gov (United States)

    Fares, Mario A.; Keane, Orla M.; Toft, Christina; Carretero-Paulet, Lorenzo; Jones, Gary W.

    2013-01-01

    Researchers have long been enthralled with the idea that gene duplication can generate novel functions, crediting this process with great evolutionary importance. Empirical data shows that whole-genome duplications (WGDs) are more likely to be retained than small-scale duplications (SSDs), though their relative contribution to the functional fate of duplicates remains unexplored. Using the map of genetic interactions and the re-sequencing of 27 Saccharomyces cerevisiae genomes evolving for 2,200 generations we show that SSD-duplicates lead to neo-functionalization while WGD-duplicates partition ancestral functions. This conclusion is supported by: (a) SSD-duplicates establish more genetic interactions than singletons and WGD-duplicates; (b) SSD-duplicates copies share more interaction-partners than WGD-duplicates copies; (c) WGD-duplicates interaction partners are more functionally related than SSD-duplicates partners; (d) SSD-duplicates gene copies are more functionally divergent from one another, while keeping more overlapping functions, and diverge in their sub-cellular locations more than WGD-duplicates copies; and (e) SSD-duplicates complement their functions to a greater extent than WGD–duplicates. We propose a novel model that uncovers the complexity of evolution after gene duplication. PMID:23300483

  5. Restriction and Recruitment—Gene Duplication and the Origin and Evolution of Snake Venom Toxins

    OpenAIRE

    Hargreaves, Adam D; Swain, Martin T.; Matthew J. Hegarty; Logan, Darren W; Mulley, John F

    2014-01-01

    Snake venom has been hypothesized to have originated and diversified through a process that involves duplication of genes encoding body proteins with subsequent recruitment of the copy to the venom gland, where natural selection acts to develop or increase toxicity. However, gene duplication is known to be a rare event in vertebrate genomes, and the recruitment of duplicated genes to a novel expression domain (neofunctionalization) is an even rarer process that requires the evolution of novel...

  6. Adaptive evolution of genes duplicated from the Drosophila pseudoobscura neo-X chromosome.

    Science.gov (United States)

    Meisel, Richard P; Hilldorfer, Benedict B; Koch, Jessica L; Lockton, Steven; Schaeffer, Stephen W

    2010-08-01

    Drosophila X chromosomes are disproportionate sources of duplicated genes, and these duplications are usually the result of retrotransposition of X-linked genes to the autosomes. The excess duplication is thought to be driven by natural selection for two reasons: X chromosomes are inactivated during spermatogenesis, and the derived copies of retroposed duplications tend to be testis expressed. Therefore, autosomal derived copies of retroposed genes provide a mechanism for their X-linked paralogs to "escape" X inactivation. Once these duplications have fixed, they may then be selected for male-specific functions. Throughout the evolution of the Drosophila genus, autosomes have fused with X chromosomes along multiple lineages giving rise to neo-X chromosomes. There has also been excess duplication from the two independent neo-X chromosomes that have been examined--one that occurred prior to the common ancestor of the willistoni species group and another that occurred along the lineage leading to Drosophila pseudoobscura. To determine what role natural selection plays in the evolution of genes duplicated from the D. pseudoobscura neo-X chromosome, we analyzed DNA sequence divergence between paralogs, polymorphism within each copy, and the expression profiles of these duplicated genes. We found that the derived copies of all duplicated genes have elevated nonsynonymous polymorphism, suggesting that they are under relaxed selective constraints. The derived copies also tend to have testis- or male-biased expression profiles regardless of their chromosome of origin. Genes duplicated from the neo-X chromosome appear to be under less constraints than those duplicated from other chromosome arms. We also find more evidence for historical adaptive evolution in genes duplicated from the neo-X chromosome, suggesting that they are under a unique selection regime in which elevated nonsynonymous polymorphism provides a large reservoir of functional variants, some of which are fixed

  7. Evolution of the duplicated intracellular lipid-binding protein genes of teleost fishes.

    Science.gov (United States)

    Venkatachalam, Ananda B; Parmar, Manoj B; Wright, Jonathan M

    2017-08-01

    Increasing organismal complexity during the evolution of life has been attributed to the duplication of genes and entire genomes. More recently, theoretical models have been proposed that postulate the fate of duplicated genes, among them the duplication-degeneration-complementation (DDC) model. In the DDC model, the common fate of a duplicated gene is lost from the genome owing to nonfunctionalization. Duplicated genes are retained in the genome either by subfunctionalization, where the functions of the ancestral gene are sub-divided between the sister duplicate genes, or by neofunctionalization, where one of the duplicate genes acquires a new function. Both processes occur either by loss or gain of regulatory elements in the promoters of duplicated genes. Here, we review the genomic organization, evolution, and transcriptional regulation of the multigene family of intracellular lipid-binding protein (iLBP) genes from teleost fishes. Teleost fishes possess many copies of iLBP genes owing to a whole genome duplication (WGD) early in the teleost fish radiation. Moreover, the retention of duplicated iLBP genes is substantially higher than the retention of all other genes duplicated in the teleost genome. The fatty acid-binding protein genes, a subfamily of the iLBP multigene family in zebrafish, are differentially regulated by peroxisome proliferator-activated receptor (PPAR) isoforms, which may account for the retention of iLBP genes in the zebrafish genome by the process of subfunctionalization of cis-acting regulatory elements in iLBP gene promoters.

  8. [Duplication of DNA--a mechanism for the development of new functionality of genes].

    Science.gov (United States)

    Maślanka, Roman; Zadrąg-Tęcza, Renata

    2015-01-01

    The amplification of DNA is considered as a mechanism for rapid evolution of organisms. Duplication can be especially advantageous in the case of changing environmental conditions. Whole genome duplication maintains the proper balance between gene expression. This seems to be the main reason why WGD is more favorable than duplication of the fragments of DNA. The polyploidy status disappear as a result of the loss of the majority of duplicated genes. The preservation of duplicated genes is associated with the development of their new functions. Polyploidization is often noted for plants. However due to sequencing technique, the duplications episodes are more frequently reports also for the other systematic taxa, including animals. The occurrence of ancient genome duplication is also considered for yeast Saccharomyces cerevisiae. The existence of two active copies of ribosomal protein genes can be a confirmation of this process. Development of the fermentation process might be one of the probable causes of the yeast genome duplication.

  9. Yeast genome duplication was followed by asynchronous differentiation of duplicated genes

    DEFF Research Database (Denmark)

    Langkjær, Rikke Breinhold; Cliften, P.F.; Johnston, M.

    2003-01-01

    Gene redundancy has been observed in yeast, plant and human genomes, and is thought to be a consequence of whole-genome duplications(1-3). Baker's yeast, Saccharomyces cerevisiae, contains several hundred duplicated genes(1). Duplication(s) could have occurred before or after a given speciation. ...

  10. Inferring angiosperm phylogeny from EST data with widespread gene duplication.

    Science.gov (United States)

    Sanderson, Michael J; McMahon, Michelle M

    2007-02-08

    Most studies inferring species phylogenies use sequences from single copy genes or sets of orthologs culled from gene families. For taxa such as plants, with very high levels of gene duplication in their nuclear genomes, this has limited the exploitation of nuclear sequences for phylogenetic studies, such as those available in large EST libraries. One rarely used method of inference, gene tree parsimony, can infer species trees from gene families undergoing duplication and loss, but its performance has not been evaluated at a phylogenomic scale for EST data in plants. A gene tree parsimony analysis based on EST data was undertaken for six angiosperm model species and Pinus, an outgroup. Although a large fraction of the tentative consensus sequences obtained from the TIGR database of ESTs was assembled into homologous clusters too small to be phylogenetically informative, some 557 clusters contained promising levels of information. Based on maximum likelihood estimates of the gene trees obtained from these clusters, gene tree parsimony correctly inferred the accepted species tree with strong statistical support. A slight variant of this species tree was obtained when maximum parsimony was used to infer the individual gene trees instead. Despite the complexity of the EST data and the relatively small fraction eventually used in inferring a species tree, the gene tree parsimony method performed well in the face of very high apparent rates of duplication.

  11. Copy number gain at Xp22.31 includes complex duplication rearrangements and recurrent triplications.

    Science.gov (United States)

    Liu, Pengfei; Erez, Ayelet; Nagamani, Sandesh C Sreenath; Bi, Weimin; Carvalho, Claudia M B; Simmons, Alexandra D; Wiszniewska, Joanna; Fang, Ping; Eng, Patricia A; Cooper, M Lance; Sutton, V Reid; Roeder, Elizabeth R; Bodensteiner, John B; Delgado, Mauricio R; Prakash, Siddharth K; Belmont, John W; Stankiewicz, Pawel; Berg, Jonathan S; Shinawi, Marwan; Patel, Ankita; Cheung, Sau Wai; Lupski, James R

    2011-05-15

    Genomic instability is a feature of the human Xp22.31 region wherein deletions are associated with X-linked ichthyosis, mental retardation and attention deficit hyperactivity disorder. A putative homologous recombination hotspot motif is enriched in low copy repeats that mediate recurrent deletion at this locus. To date, few efforts have focused on copy number gain at Xp22.31. However, clinical testing revealed a high incidence of duplication of Xp22.31 in subjects ascertained and referred with neurobehavioral phenotypes. We systematically studied 61 unrelated subjects with rearrangements revealing gain in copy number, using multiple molecular assays. We detected not only the anticipated recurrent and simple nonrecurrent duplications, but also unexpectedly identified recurrent triplications and other complex rearrangements. Breakpoint analyses enabled us to surmise the mechanisms for many of these rearrangements. The clinical significance of the recurrent duplications and triplications were assessed using different approaches. We cannot find any evidence to support pathogenicity of the Xp22.31 duplication. However, our data suggest that the Xp22.31 duplication may serve as a risk factor for abnormal phenotypes. Our findings highlight the need for more robust Xp22.31 triplication detection in that such further gain may be more penetrant than the duplications. Our findings reveal the distribution of different mechanisms for genomic duplication rearrangements at a given locus, and provide insights into aspects of strand exchange events between paralogous sequences in the human genome.

  12. Evolution dynamics of a model for gene duplication under adaptive conflict

    Science.gov (United States)

    Ancliff, Mark; Park, Jeong-Man

    2014-06-01

    We present and solve the dynamics of a model for gene duplication showing escape from adaptive conflict. We use a Crow-Kimura quasispecies model of evolution where the fitness landscape is a function of Hamming distances from two reference sequences, which are assumed to optimize two different gene functions, to describe the dynamics of a mixed population of individuals with single and double copies of a pleiotropic gene. The evolution equations are solved through a spin coherent state path integral, and we find two phases: one is an escape from an adaptive conflict phase, where each copy of a duplicated gene evolves toward subfunctionalization, and the other is a duplication loss of function phase, where one copy maintains its pleiotropic form and the other copy undergoes neutral mutation. The phase is determined by a competition between the fitness benefits of subfunctionalization and the greater mutational load associated with maintaining two gene copies. In the escape phase, we find a dynamics of an initial population of single gene sequences only which escape adaptive conflict through gene duplication and find that there are two time regimes: until a time t* single gene sequences dominate, and after t* double gene sequences outgrow single gene sequences. The time t* is identified as the time necessary for subfunctionalization to evolve and spread throughout the double gene sequences, and we show that there is an optimum mutation rate which minimizes this time scale.

  13. Familial Lymphoproliferative Malignancies and Tandem Duplication of NF1 Gene

    Directory of Open Access Journals (Sweden)

    Gustavo Fernandes

    2014-01-01

    Full Text Available Background. Neurofibromatosis type 1 is a genetic disorder caused by loss-of-function mutations in a tumor suppressor gene (NF1 which codifies the protein neurofibromin. The frequent genetic alterations that modify neurofibromin function are deletions and insertions. Duplications are rare and phenotype in patients bearing duplication of NF1 gene is thought to be restricted to developmental abnormalities, with no reference to cancer susceptibility in these patients. We evaluated a patient who presented with few clinical signs of neurofibromatosis type 1 and a conspicuous personal and familiar history of different types of cancer, especially lymphoproliferative malignancies. The coding region of the NF-1 gene was analyzed by real-time polymerase chain reaction and direct sequencing. Multiplex ligation-dependent probe amplification was performed to detect the number of mutant copies. The NF1 gene analysis showed the following alterations: mosaic duplication of NF1, TRAF4, and MYO1D. Fluorescence in situ hybridization using probes (RP5-1002G3 and RP5-92689 flanking NF1 gene in 17q11.2 and CEP17 for 17q11.11.1 was performed. There were three signals (RP5-1002G3conRP5-92689 in the interphases analyzed and two signals (RP5-1002G3conRP5-92689 in 93% of cells. These findings show a tandem duplication of 17q11.2. Conclusion. The case suggests the possibility that NF1 gene duplication may be associated with a phenotype characterized by lymphoproliferative disorders.

  14. Local synteny and codon usage contribute to asymmetric sequence divergence of Saccharomyces cerevisiae gene duplicates

    Directory of Open Access Journals (Sweden)

    Bergthorsson Ulfar

    2011-09-01

    Full Text Available Abstract Background Duplicated genes frequently experience asymmetric rates of sequence evolution. Relaxed selective constraints and positive selection have both been invoked to explain the observation that one paralog within a gene-duplicate pair exhibits an accelerated rate of sequence evolution. In the majority of studies where asymmetric divergence has been established, there is no indication as to which gene copy, ancestral or derived, is evolving more rapidly. In this study we investigated the effect of local synteny (gene-neighborhood conservation and codon usage on the sequence evolution of gene duplicates in the S. cerevisiae genome. We further distinguish the gene duplicates into those that originated from a whole-genome duplication (WGD event (ohnologs versus small-scale duplications (SSD to determine if there exist any differences in their patterns of sequence evolution. Results For SSD pairs, the derived copy evolves faster than the ancestral copy. However, there is no relationship between rate asymmetry and synteny conservation (ancestral-like versus derived-like in ohnologs. mRNA abundance and optimal codon usage as measured by the CAI is lower in the derived SSD copies relative to ancestral paralogs. Moreover, in the case of ohnologs, the faster-evolving copy has lower CAI and lowered expression. Conclusions Together, these results suggest that relaxation of selection for codon usage and gene expression contribute to rate asymmetry in the evolution of duplicated genes and that in SSD pairs, the relaxation of selection stems from the loss of ancestral regulatory information in the derived copy.

  15. Copy number expansion of the STX17 duplication in melanoma tissue from Grey horses

    Directory of Open Access Journals (Sweden)

    Sundström Elisabeth

    2012-08-01

    Full Text Available Abstract Background Greying with age in horses is an autosomal dominant trait, associated with loss of hair pigmentation, melanoma and vitiligo-like depigmentation. We recently identified a 4.6 kb duplication in STX17 to be associated with the phenotype. The aims of this study were to investigate if the duplication in Grey horses shows copy number variation and to exclude that any other polymorphism is uniquely associated with the Grey mutation. Results We found little evidence for copy number expansion of the duplicated sequence in blood DNA from Grey horses. In contrast, clear evidence for copy number expansions was indicated in five out of eight tested melanoma tissues or melanoma cell lines. A tendency of a higher copy number in aggressive tumours was also found. Massively parallel resequencing of the ~350 kb Grey haplotype did not reveal any additional mutations perfectly associated with the phenotype, confirming the duplication as the true causative mutation. We identified three SNP alleles that were present in a subset of Grey haplotypes within the 350 kb region that shows complete linkage disequilibrium with the causative mutation. Thus, these three nucleotide substitutions must have occurred subsequent to the duplication, consistent with our interpretation that the Grey mutation arose more than 2,000 years before present. Conclusions These results suggest that the mutation acts as a melanoma-driving regulatory element. The elucidation of the mechanistic features of the duplication will be of considerable interest for the characterization of these horse melanomas as well as for the field of human melanoma research.

  16. Global analysis of human duplicated genes reveals the relative importance of whole-genome duplicates originated in the early vertebrate evolution.

    Science.gov (United States)

    Acharya, Debarun; Ghosh, Tapash C

    2016-01-22

    Gene duplication is a genetic mutation that creates functionally redundant gene copies that are initially relieved from selective pressures and may adapt themselves to new functions with time. The levels of gene duplication may vary from small-scale duplication (SSD) to whole genome duplication (WGD). Studies with yeast revealed ample differences between these duplicates: Yeast WGD pairs were functionally more similar, less divergent in subcellular localization and contained a lesser proportion of essential genes. In this study, we explored the differences in evolutionary genomic properties of human SSD and WGD genes, with the identifiable human duplicates coming from the two rounds of whole genome duplication occurred early in vertebrate evolution. We observed that these two groups of duplicates were also dissimilar in terms of their evolutionary and genomic properties. But interestingly, this is not like the same observed in yeast. The human WGDs were found to be functionally less similar, diverge more in subcellular level and contain a higher proportion of essential genes than the SSDs, all of which are opposite from yeast. Additionally, we explored that human WGDs were more divergent in their gene expression profile, have higher multifunctionality and are more often associated with disease, and are evolutionarily more conserved than human SSDs. Our study suggests that human WGD duplicates are more divergent and entails the adaptation of WGDs to novel and important functions that consequently lead to their evolutionary conservation in the course of evolution.

  17. Partial duplication of the APBA2 gene in chromosome 15q13 corresponds to duplicon structures

    Directory of Open Access Journals (Sweden)

    Kesterson Robert A

    2003-04-01

    Full Text Available Abstract Background Chromosomal abnormalities affecting human chromosome 15q11-q13 underlie multiple genomic disorders caused by deletion, duplication and triplication of intervals in this region. These events are mediated by highly homologous segments of DNA, or duplicons, that facilitate mispairing and unequal cross-over in meiosis. The gene encoding an amyloid precursor protein-binding protein (APBA2 was previously mapped to the distal portion of the interval commonly deleted in Prader-Willi and Angelman syndromes and duplicated in cases of autism. Results We show that this gene actually maps to a more telomeric location and is partially duplicated within the broader region. Two highly homologous copies of an interval containing a large 5' exon and downstream sequence are located ~5 Mb distal to the intact locus. The duplicated copies, containing the first coding exon of APBA2, can be distinguished by single nucleotide sequence differences and are transcriptionally inactive. Adjacent to APBA2 maps a gene termed KIAA0574. The protein encoded by this gene is weakly homologous to a protein termed X123 that in turn maps adjacent to APBA1 on 9q21.12; APBA1 is highly homologous to APBA2 in the C-terminal region and is distinguished from APBA2 by the N-terminal region encoded by this duplicated exon. Conclusion The duplication of APBA2 sequences in this region adds to a complex picture of different low copy repeats present across this region and elsewhere on the chromosome.

  18. Dynamics of gene duplication in the genomes of chlorophyll d-producing cyanobacteria: implications for the ecological niche.

    Science.gov (United States)

    Miller, Scott R; Wood, A Michelle; Blankenship, Robert E; Kim, Maria; Ferriera, Steven

    2011-01-01

    Gene duplication may be an important mechanism for the evolution of new functions and for the adaptive modulation of gene expression via dosage effects. Here, we analyzed the fate of gene duplicates for two strains of a novel group of cyanobacteria (genus Acaryochloris) that produces the far-red light absorbing chlorophyll d as its main photosynthetic pigment. The genomes of both strains contain an unusually high number of gene duplicates for bacteria. As has been observed for eukaryotic genomes, we find that the demography of gene duplicates can be well modeled by a birth-death process. Most duplicated Acaryochloris genes are of comparatively recent origin, are strain-specific, and tend to be located on different genetic elements. Analyses of selection on duplicates of different divergence classes suggest that a minority of paralogs exhibit near neutral evolutionary dynamics immediately following duplication but that most duplicate pairs (including those which have been retained for long periods) are under strong purifying selection against amino acid change. The likelihood of duplicate retention varied among gene functional classes, and the pronounced differences between strains in the pool of retained recent duplicates likely reflects differences in the nutrient status and other characteristics of their respective environments. We conclude that most duplicates are quickly purged from Acaryochloris genomes and that those which are retained likely make important contributions to organism ecology by conferring fitness benefits via gene dosage effects. The mechanism of enhanced duplication may involve homologous recombination between genetic elements mediated by paralogous copies of recA.

  19. Gene duplication, modularity and adaptation in the evolution of the aflatoxin gene cluster

    Directory of Open Access Journals (Sweden)

    Jakobek Judy L

    2007-07-01

    Full Text Available Abstract Background The biosynthesis of aflatoxin (AF involves over 20 enzymatic reactions in a complex polyketide pathway that converts acetate and malonate to the intermediates sterigmatocystin (ST and O-methylsterigmatocystin (OMST, the respective penultimate and ultimate precursors of AF. Although these precursors are chemically and structurally very similar, their accumulation differs at the species level for Aspergilli. Notable examples are A. nidulans that synthesizes only ST, A. flavus that makes predominantly AF, and A. parasiticus that generally produces either AF or OMST. Whether these differences are important in the evolutionary/ecological processes of species adaptation and diversification is unknown. Equally unknown are the specific genomic mechanisms responsible for ordering and clustering of genes in the AF pathway of Aspergillus. Results To elucidate the mechanisms that have driven formation of these clusters, we performed systematic searches of aflatoxin cluster homologs across five Aspergillus genomes. We found a high level of gene duplication and identified seven modules consisting of highly correlated gene pairs (aflA/aflB, aflR/aflS, aflX/aflY, aflF/aflE, aflT/aflQ, aflC/aflW, and aflG/aflL. With the exception of A. nomius, contrasts of mean Ka/Ks values across all cluster genes showed significant differences in selective pressure between section Flavi and non-section Flavi species. A. nomius mean Ka/Ks values were more similar to partial clusters in A. fumigatus and A. terreus. Overall, mean Ka/Ks values were significantly higher for section Flavi than for non-section Flavi species. Conclusion Our results implicate several genomic mechanisms in the evolution of ST, OMST and AF cluster genes. Gene modules may arise from duplications of a single gene, whereby the function of the pre-duplication gene is retained in the copy (aflF/aflE or the copies may partition the ancestral function (aflA/aflB. In some gene modules, the

  20. Effect of Duplicate Genes on Mouse Genetic Robustness: An Update

    Directory of Open Access Journals (Sweden)

    Zhixi Su

    2014-01-01

    Full Text Available In contrast to S. cerevisiae and C. elegans, analyses based on the current knockout (KO mouse phenotypes led to the conclusion that duplicate genes had almost no role in mouse genetic robustness. It has been suggested that the bias of mouse KO database toward ancient duplicates may possibly cause this knockout duplicate puzzle, that is, a very similar proportion of essential genes (PE between duplicate genes and singletons. In this paper, we conducted an extensive and careful analysis for the mouse KO phenotype data and corroborated a strong effect of duplicate genes on mouse genetics robustness. Moreover, the effect of duplicate genes on mouse genetic robustness is duplication-age dependent, which holds after ruling out the potential confounding effect from coding-sequence conservation, protein-protein connectivity, functional bias, or the bias of duplicates generated by whole genome duplication (WGD. Our findings suggest that two factors, the sampling bias toward ancient duplicates and very ancient duplicates with a proportion of essential genes higher than that of singletons, have caused the mouse knockout duplicate puzzle; meanwhile, the effect of genetic buffering may be correlated with sequence conservation as well as protein-protein interactivity.

  1. Dating and functional characterization of duplicated genes in the apple (Malus domestica Borkh. by analyzing EST data

    Directory of Open Access Journals (Sweden)

    Sanzol Javier

    2010-05-01

    Full Text Available Abstract Background Gene duplication is central to genome evolution. In plants, genes can be duplicated through small-scale events and large-scale duplications often involving polyploidy. The apple belongs to the subtribe Pyrinae (Rosaceae, a diverse lineage that originated via allopolyploidization. Both small-scale duplications and polyploidy may have been important mechanisms shaping the genome of this species. Results This study evaluates the gene duplication and polyploidy history of the apple by characterizing duplicated genes in this species using EST data. Overall, 68% of the apple genes were clustered into families with a mean copy-number of 4.6. Analysis of the age distribution of gene duplications supported a continuous mode of small-scale duplications, plus two episodes of large-scale duplicates of vastly different ages. The youngest was consistent with the polyploid origin of the Pyrinae 37-48 MYBP, whereas the older may be related to γ-triplication; an ancient hexapolyploidization previously characterized in the four sequenced eurosid genomes and basal to the eurosid-asterid divergence. Duplicated genes were studied for functional diversification with an emphasis on young paralogs; those originated during or after the formation of the Pyrinae lineage. Unequal assignment of single-copy genes and gene families to Gene Ontology categories suggested functional bias in the pattern of gene retention of paralogs. Young paralogs related to signal transduction, metabolism, and energy pathways have been preferentially retained. Non-random retention of duplicated genes seems to have mediated the expansion of gene families, some of which may have substantially increased their members after the origin of the Pyrinae. The joint analysis of over-duplicated functional categories and phylogenies, allowed evaluation of the role of both polyploidy and small-scale duplications during this process. Finally, gene expression analysis indicated that 82

  2. Gene duplication as a mechanism of genomic adaptation to a changing environment

    Science.gov (United States)

    Kondrashov, Fyodor A.

    2012-01-01

    A subject of extensive study in evolutionary theory has been the issue of how neutral, redundant copies can be maintained in the genome for long periods of time. Concurrently, examples of adaptive gene duplications to various environmental conditions in different species have been described. At this point, it is too early to tell whether or not a substantial fraction of gene copies have initially achieved fixation by positive selection for increased dosage. Nevertheless, enough examples have accumulated in the literature that such a possibility should be considered. Here, I review the recent examples of adaptive gene duplications and make an attempt to draw generalizations on what types of genes may be particularly prone to be selected for under certain environmental conditions. The identification of copy-number variation in ecological field studies of species adapting to stressful or novel environmental conditions may improve our understanding of gene duplications as a mechanism of adaptation and its relevance to the long-term persistence of gene duplications. PMID:22977152

  3. A critical assessment of cross-species detection of gene duplicates using comparative genomic hybridization

    Directory of Open Access Journals (Sweden)

    Renn Suzy CP

    2010-05-01

    Full Text Available Abstract Background Comparison of genomic DNA among closely related strains or species is a powerful approach for identifying variation in evolutionary processes. One potent source of genomic variation is gene duplication, which is prevalent among individuals and species. Array comparative genomic hybridization (aCGH has been successfully utilized to detect this variation among lineages. Here, beyond the demonstration that gene duplicates among species can be quantified with aCGH, we consider the effect of sequence divergence on the ability to detect gene duplicates. Results Using the X chromosome genomic content difference between male D. melanogaster and female D. yakuba and D. simulans, we describe a decrease in the ability to accurately measure genomic content (copy number for orthologs that are only 90% identical. We demonstrate that genome characteristics (e.g. chromatin environment and non-orthologous sequence similarity can also affect the ability to accurately measure genomic content. We describe a normalization strategy and statistical criteria to be used for the identification of gene duplicates among any species group for which an array platform is available from a closely related species. Conclusions Array CGH can be used to effectively identify gene duplication and genome content; however, certain biases are present due to sequence divergence and other genome characteristics resulting from the divergence between lineages. Highly conserved gene duplicates will be more readily recovered by aCGH. Duplicates that have been retained for a selective advantage due to directional selection acting on many loci in one or both gene copies are likely to be under-represented. The results of this study should inform the interpretation of both previously published and future work that employs this powerful technique.

  4. Restriction and recruitment-gene duplication and the origin and evolution of snake venom toxins.

    Science.gov (United States)

    Hargreaves, Adam D; Swain, Martin T; Hegarty, Matthew J; Logan, Darren W; Mulley, John F

    2014-08-01

    Snake venom has been hypothesized to have originated and diversified through a process that involves duplication of genes encoding body proteins with subsequent recruitment of the copy to the venom gland, where natural selection acts to develop or increase toxicity. However, gene duplication is known to be a rare event in vertebrate genomes, and the recruitment of duplicated genes to a novel expression domain (neofunctionalization) is an even rarer process that requires the evolution of novel combinations of transcription factor binding sites in upstream regulatory regions. Therefore, although this hypothesis concerning the evolution of snake venom is very unlikely and should be regarded with caution, it is nonetheless often assumed to be established fact, hindering research into the true origins of snake venom toxins. To critically evaluate this hypothesis, we have generated transcriptomic data for body tissues and salivary and venom glands from five species of venomous and nonvenomous reptiles. Our comparative transcriptomic analysis of these data reveals that snake venom does not evolve through the hypothesized process of duplication and recruitment of genes encoding body proteins. Indeed, our results show that many proposed venom toxins are in fact expressed in a wide variety of body tissues, including the salivary gland of nonvenomous reptiles and that these genes have therefore been restricted to the venom gland following duplication, not recruited. Thus, snake venom evolves through the duplication and subfunctionalization of genes encoding existing salivary proteins. These results highlight the danger of the elegant and intuitive "just-so story" in evolutionary biology.

  5. Duplication and amplification of antibiotic resistance genes enable increased resistance in isolates of multidrug-resistant Salmonella Typhimurium

    Science.gov (United States)

    During normal bacterial DNA replication, gene duplication and amplification (GDA) events occur randomly at a low frequency in the genome throughout a population. In the absence of selection, GDA events that increase the number of copies of a bacterial gene (or a set of genes) are lost. Antibiotic ...

  6. Gene duplication models for directed networks with limits on growth

    Science.gov (United States)

    Enemark, Jakob; Sneppen, Kim

    2007-11-01

    Background: Duplication of genes is important for evolution of molecular networks. Many authors have therefore considered gene duplication as a driving force in shaping the topology of molecular networks. In particular it has been noted that growth via duplication would act as an implicit means of preferential attachment, and thereby provide the observed broad degree distributions of molecular networks. Results: We extend current models of gene duplication and rewiring by including directions and the fact that molecular networks are not a result of unidirectional growth. We introduce upstream sites and downstream shapes to quantify potential links during duplication and rewiring. We find that this in itself generates the observed scaling of transcription factors for genome sites in prokaryotes. The dynamical model can generate a scale-free degree distribution, p(k)\\propto 1/k^{\\gamma } , with exponent γ = 1 in the non-growing case, and with γ>1 when the network is growing. Conclusions: We find that duplication of genes followed by substantial recombination of upstream regions could generate features of genetic regulatory networks. Our steady state degree distribution is however too broad to be consistent with data, thereby suggesting that selective pruning acts as a main additional constraint on duplicated genes. Our analysis shows that gene duplication can only be a main cause for the observed broad degree distributions if there are also substantial recombinations between upstream regions of genes.

  7. Trichomonas transmembrane cyclases result from massive gene duplication and concomitant development of pseudogenes.

    Directory of Open Access Journals (Sweden)

    Jike Cui

    2010-08-01

    Full Text Available Trichomonas vaginalis has an unusually large genome (approximately 160 Mb encoding approximately 60,000 proteins. With the goal of beginning to understand why some Trichomonas genes are present in so many copies, we characterized here a family of approximately 123 Trichomonas genes that encode transmembrane adenylyl cyclases (TMACs.The large family of TMACs genes is the result of recent duplications of a small set of ancestral genes that appear to be unique to trichomonads. Duplicated TMAC genes are not closely associated with repetitive elements, and duplications of flanking sequences are rare. However, there is evidence for TMAC gene replacements by homologous recombination. A high percentage of TMAC genes (approximately 46% are pseudogenes, as they contain stop codons and/or frame shifts, or the genes are truncated. Numerous stop codons present in the genome project G3 strain are not present in orthologous genes of two other Trichomonas strains (S1 and B7RC2. Each TMAC is composed of a series of N-terminal transmembrane helices and a single C-terminal cyclase domain that has adenylyl cyclase activity. Multiple TMAC genes are transcribed by Trichomonas cloned by limiting dilution.We conclude that one reason for the unusually large genome of Trichomonas is the presence of unstable families of genes such as those encoding TMACs that are undergoing massive gene duplication and concomitant development of pseudogenes.

  8. Histone modification pattern evolution after yeast gene duplication

    Directory of Open Access Journals (Sweden)

    Zou Yangyun

    2012-07-01

    Full Text Available Abstract Background Gene duplication and subsequent functional divergence especially expression divergence have been widely considered as main sources for evolutionary innovations. Many studies evidenced that genetic regulatory network evolved rapidly shortly after gene duplication, thus leading to accelerated expression divergence and diversification. However, little is known whether epigenetic factors have mediated the evolution of expression regulation since gene duplication. In this study, we conducted detailed analyses on yeast histone modification (HM, the major epigenetics type in this organism, as well as other available functional genomics data to address this issue. Results Duplicate genes, on average, share more common HM-code patterns than random singleton pairs in their promoters and open reading frames (ORF. Though HM-code divergence between duplicates in both promoter and ORF regions increase with their sequence divergence, the HM-code in ORF region evolves slower than that in promoter region, probably owing to the functional constraints imposed on protein sequences. After excluding the confounding effect of sequence divergence (or evolutionary time, we found the evidence supporting the notion that in yeast, the HM-code may co-evolve with cis- and trans-regulatory factors. Moreover, we observed that deletion of some yeast HM-related enzymes increases the expression divergence between duplicate genes, yet the effect is lower than the case of transcription factor (TF deletion or environmental stresses. Conclusions Our analyses demonstrate that after gene duplication, yeast histone modification profile between duplicates diverged with evolutionary time, similar to genetic regulatory elements. Moreover, we found the evidence of the co-evolution between genetic and epigenetic elements since gene duplication, together contributing to the expression divergence between duplicate genes.

  9. Subfunctionalization reduces the fitness cost of gene duplication in humans by buffering dosage imbalances

    Directory of Open Access Journals (Sweden)

    Fernández Ariel

    2011-12-01

    Full Text Available Abstract Background Driven essentially by random genetic drift, subfunctionalization has been identified as a possible non-adaptive mechanism for the retention of duplicate genes in small-population species, where widespread deleterious mutations are likely to cause complementary loss of subfunctions across gene copies. Through subfunctionalization, duplicates become indispensable to maintain the functional requirements of the ancestral locus. Yet, gene duplication produces a dosage imbalance in the encoded proteins and thus, as investigated in this paper, subfunctionalization must be subject to the selective forces arising from the fitness bottleneck introduced by the duplication event. Results We show that, while arising from random drift, subfunctionalization must be inescapably subject to selective forces, since the diversification of expression patterns across paralogs mitigates duplication-related dosage imbalances in the concentrations of encoded proteins. Dosage imbalance effects become paramount when proteins rely on obligatory associations to maintain their structural integrity, and are expected to be weaker when protein complexation is ephemeral or adventitious. To establish the buffering effect of subfunctionalization on selection pressure, we determine the packing quality of encoded proteins, an established indicator of dosage sensitivity, and correlate this parameter with the extent of paralog segregation in humans, using species with larger population -and more efficient selection- as controls. Conclusions Recognizing the role of subfunctionalization as a dosage-imbalance buffer in gene duplication events enabled us to reconcile its mechanistic nonadaptive origin with its adaptive role as an enabler of the evolution of genetic redundancy. This constructive role was established in this paper by proving the following assertion: If subfunctionalization is indeed adaptive, its effect on paralog segregation should scale with the dosage

  10. Detecting functional divergence after gene duplication through evolutionary changes in posttranslational regulatory sequences.

    Science.gov (United States)

    Nguyen Ba, Alex N; Strome, Bob; Hua, Jun Jie; Desmond, Jonathan; Gagnon-Arsenault, Isabelle; Weiss, Eric L; Landry, Christian R; Moses, Alan M

    2014-12-01

    Gene duplication is an important evolutionary mechanism that can result in functional divergence in paralogs due to neo-functionalization or sub-functionalization. Consistent with functional divergence after gene duplication, recent studies have shown accelerated evolution in retained paralogs. However, little is known in general about the impact of this accelerated evolution on the molecular functions of retained paralogs. For example, do new functions typically involve changes in enzymatic activities, or changes in protein regulation? Here we study the evolution of posttranslational regulation by examining the evolution of important regulatory sequences (short linear motifs) in retained duplicates created by the whole-genome duplication in budding yeast. To do so, we identified short linear motifs whose evolutionary constraint has relaxed after gene duplication with a likelihood-ratio test that can account for heterogeneity in the evolutionary process by using a non-central chi-squared null distribution. We find that short linear motifs are more likely to show changes in evolutionary constraints in retained duplicates compared to single-copy genes. We examine changes in constraints on known regulatory sequences and show that for the Rck1/Rck2, Fkh1/Fkh2, Ace2/Swi5 paralogs, they are associated with previously characterized differences in posttranslational regulation. Finally, we experimentally confirm our prediction that for the Ace2/Swi5 paralogs, Cbk1 regulated localization was lost along the lineage leading to SWI5 after gene duplication. Our analysis suggests that changes in posttranslational regulation mediated by short regulatory motifs systematically contribute to functional divergence after gene duplication.

  11. Gene and genome duplication in Acanthamoeba polyphaga Mimivirus.

    Science.gov (United States)

    Suhre, Karsten

    2005-11-01

    Gene duplication is key to molecular evolution in all three domains of life and may be the first step in the emergence of new gene function. It is a well-recognized feature in large DNA viruses but has not been studied extensively in the largest known virus to date, the recently discovered Acanthamoeba polyphaga Mimivirus. Here, I present a systematic analysis of gene and genome duplication events in the mimivirus genome. I found that one-third of the mimivirus genes are related to at least one other gene in the mimivirus genome, either through a large segmental genome duplication event that occurred in the more remote past or through more recent gene duplication events, which often occur in tandem. This shows that gene and genome duplication played a major role in shaping the mimivirus genome. Using multiple alignments, together with remote-homology detection methods based on Hidden Markov Model comparison, I assign putative functions to some of the paralogous gene families. I suggest that a large part of the duplicated mimivirus gene families are likely to interfere with important host cell processes, such as transcription control, protein degradation, and cell regulatory processes. My findings support the view that large DNA viruses are complex evolving organisms, possibly deeply rooted within the tree of life, and oppose the paradigm that viral evolution is dominated by lateral gene acquisition, at least in regard to large DNA viruses.

  12. Assessment and reconstruction of novel HSP90 genes: duplications, gains and losses in fungal and animal lineages.

    Directory of Open Access Journals (Sweden)

    Chrysoula N Pantzartzi

    Full Text Available Hsp90s, members of the Heat Shock Protein class, protect the structure and function of proteins and play a significant task in cellular homeostasis and signal transduction. In order to determine the number of hsp90 gene copies and encoded proteins in fungal and animal lineages and through that key duplication events that this family has undergone, we collected and evaluated Hsp90 protein sequences and corresponding Expressed Sequence Tags and analyzed available genomes from various taxa. We provide evidence for duplication events affecting either single species or wider taxonomic groups. With regard to Fungi, duplicated genes have been detected in several lineages. In invertebrates, we demonstrate key duplication events in certain clades of Arthropoda and Mollusca, and a possible gene loss event in a hymenopteran family. Finally, we infer that the duplication event responsible for the two (a and b isoforms in vertebrates occurred probably shortly after the split of Hyperoartia and Gnathostomata.

  13. Duplication and maintenance of the Myb genes of vertebrate animals

    Directory of Open Access Journals (Sweden)

    Colin J. Davidson

    2012-11-01

    Gene duplication is an important means of generating new genes. The major mechanisms by which duplicated genes are preserved in the face of purifying selection are thought to be neofunctionalization, subfunctionalization, and increased gene dosage. However, very few duplicated gene families in vertebrate species have been analyzed by functional tests in vivo. We have therefore examined the three vertebrate Myb genes (c-Myb, A-Myb, and B-Myb by cytogenetic map analysis, by sequence analysis, and by ectopic expression in Drosophila. We provide evidence that the vertebrate Myb genes arose by two rounds of regional genomic duplication. We found that ubiquitous expression of c-Myb and A-Myb, but not of B-Myb or Drosophila Myb, was lethal in Drosophila. Expression of any of these genes during early larval eye development was well tolerated. However, expression of c-Myb and A-Myb, but not of B-Myb or Drosophila Myb, during late larval eye development caused drastic alterations in adult eye morphology. Mosaic analysis implied that this eye phenotype was cell-autonomous. Interestingly, some of the eye phenotypes caused by the retroviral v-Myb oncogene and the normal c-Myb proto-oncogene from which v-Myb arose were quite distinct. Finally, we found that post-translational modifications of c-Myb by the GSK-3 protein kinase and by the Ubc9 SUMO-conjugating enzyme that normally occur in vertebrate cells can modify the eye phenotype caused by c-Myb in Drosophila. These results support a model in which the three Myb genes of vertebrates arose by two sequential duplications. The first duplication was followed by a subfunctionalization of gene expression, then neofunctionalization of protein function to yield a c/A-Myb progenitor. The duplication of this progenitor was followed by subfunctionalization of gene expression to give rise to tissue-specific c-Myb and A-Myb genes.

  14. Gene duplication in the genome of parasitic Giardia lamblia

    Directory of Open Access Journals (Sweden)

    Flores Roberto

    2010-02-01

    Full Text Available Abstract Background Giardia are a group of widespread intestinal protozoan parasites in a number of vertebrates. Much evidence from G. lamblia indicated they might be the most primitive extant eukaryotes. When and how such a group of the earliest branching unicellular eukaryotes developed the ability to successfully parasitize the latest branching higher eukaryotes (vertebrates is an intriguing question. Gene duplication has long been thought to be the most common mechanism in the production of primary resources for the origin of evolutionary novelties. In order to parse the evolutionary trajectory of Giardia parasitic lifestyle, here we carried out a genome-wide analysis about gene duplication patterns in G. lamblia. Results Although genomic comparison showed that in G. lamblia the contents of many fundamental biologic pathways are simplified and the whole genome is very compact, in our study 40% of its genes were identified as duplicated genes. Evolutionary distance analyses of these duplicated genes indicated two rounds of large scale duplication events had occurred in G. lamblia genome. Functional annotation of them further showed that the majority of recent duplicated genes are VSPs (Variant-specific Surface Proteins, which are essential for the successful parasitic life of Giardia in hosts. Based on evolutionary comparison with their hosts, it was found that the rapid expansion of VSPs in G. lamblia is consistent with the evolutionary radiation of placental mammals. Conclusions Based on the genome-wide analysis of duplicated genes in G. lamblia, we found that gene duplication was essential for the origin and evolution of Giardia parasitic lifestyle. The recent expansion of VSPs uniquely occurring in G. lamblia is consistent with the increment of its hosts. Therefore we proposed a hypothesis that the increment of Giradia hosts might be the driving force for the rapid expansion of VSPs.

  15. Complexity of Gene Expression Evolution after Duplication: Protein Dosage Rebalancing

    Directory of Open Access Journals (Sweden)

    Igor B. Rogozin

    2014-01-01

    Full Text Available Ongoing debates about functional importance of gene duplications have been recently intensified by a heated discussion of the “ortholog conjecture” (OC. Under the OC, which is central to functional annotation of genomes, orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of gene ontology (GO annotations and expression profiles, among within-species paralogs compared to orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. Subsequent studies suggested that the OC appears to be generally valid when applied to mammalian evolution but the complete picture of evolution of gene expression also has to incorporate lineage-specific aspects of paralogy. The observed complexity of gene expression evolution after duplication can be explained through selection for gene dosage effect combined with the duplication-degeneration-complementation model. This paper discusses expression divergence of recent duplications occurring before functional divergence of proteins encoded by duplicate genes.

  16. Functional divergence of gene duplicates – a domain-centric view

    Directory of Open Access Journals (Sweden)

    Khaladkar Mugdha

    2012-07-01

    Full Text Available Abstract Background Gene duplicates have been shown to evolve at different rates. Here we further investigate the mechanism and functional underpinning of this phenomenon by assessing asymmetric evolution specifically within functional domains of gene duplicates. Results Based on duplicate genes in five teleost fishes resulting from a whole genome duplication event, we first show that a Fisher Exact test based approach to detect asymmetry is more sensitive than the previously used Likelihood Ratio test. Using our Fisher Exact test, we found that the evolutionary rate asymmetry in the overall protein is largely explained by the asymmetric evolution within specific protein domains. Moreover, among cases of asymmetrically evolving domains, for the gene copy containing a fast evolving domain, the non-synonymous substitutions often cluster within the fast evolving domain. We found that rare substitutions were preferred within asymmetrically evolving domains suggestive of functional divergence. While overall ~32 % of the domains tested were found to be evolving asymmetrically, certain protein domains such as the Tyrosine and Ser/Thr Kinase domains had a much greater prevalence of asymmetric evolution. Finally, based on the spatial expression of Zebra fish duplicate proteins during development, we found that protein pairs containing asymmetrically evolving domains had a greater divergence in gene expression as compared to the duplicate proteins that did not exhibit asymmetric evolution. Conclusions Taken together, our results suggest that the previously observed asymmetry in the overall duplicate protein evolution is largely due to divergence of specific domains of the protein, and coincides with divergence in spatial expression domains.

  17. A rare case of plastid protein-coding gene duplication in the chloroplast genome of Euglena archaeoplastidiata (Euglenophyta).

    Science.gov (United States)

    Bennett, Matthew S; Shiu, Shin-Han; Triemer, Richard E

    2017-03-12

    Gene duplication is an important evolutionary process that allows duplicate functions to diverge, or, in some cases, allows for new functional gains. However, in contrast to the nuclear genome, gene duplications within the chloroplast are extremely rare. Here, we present the chloroplast genome of the photosynthetic protist Euglena archaeoplastidiata. Upon annotation, it was found that the chloroplast genome contained a novel tandem direct duplication that encoded a portion of RuBisCO large subunit (rbcL) followed by a complete copy of ribosomal protein L32 (rpl32), as well as the associated intergenic sequences. Analyses of the duplicated rpl32 were inconclusive regarding selective pressures, although it was found that substitutions in the duplicated region, all non-synonymous, likely had a neutral functional effect. The duplicated region did not exhibit patterns consistent with previously described mechanisms for tandem direct duplications, and demonstrated an unknown mechanism of duplication. In addition, a comparison of this chloroplast genome to other previously characterized chloroplast genomes from the same family revealed characteristics that indicated E. archaeoplastidiata was probably more closely related to taxa in the genera Monomorphina, Cryptoglena, and Euglenaria than it was to other Euglena taxa. Taken together, the chloroplast genome of E. archaeoplastidiata demonstrated multiple characteristics unique to the euglenoid world, and has justified the longstanding curiosity regarding this enigmatic taxon.

  18. Simultaneous identification of duplications and lateral gene transfers.

    Science.gov (United States)

    Tofigh, Ali; Hallett, Michael; Lagergren, Jens

    2011-01-01

    The incongruency between a gene tree and a corresponding species tree can be attributed to evolutionary events such as gene duplication and gene loss. This paper describes a combinatorial model where so-called DTL-scenarios are used to explain the differences between a gene tree and a corresponding species tree taking into account gene duplications, gene losses, and lateral gene transfers (also known as horizontal gene transfers). The reasonable biological constraint that a lateral gene transfer may only occur between contemporary species leads to the notion of acyclic DTL-scenarios. Parsimony methods are introduced by defining appropriate optimization problems. We show that finding most parsimonious acyclic DTL-scenarios is NP-hard. However, by dropping the condition of acyclicity, the problem becomes tractable, and we provide a dynamic programming algorithm as well as a fixed-parameter tractable algorithm for finding most parsimonious DTL-scenarios.

  19. Exon duplications in the ATP7A gene

    DEFF Research Database (Denmark)

    Mogensen, Mie; Skjørringe, Tina; Kodama, Hiroko

    2011-01-01

    BACKGROUND: Menkes disease (MD) is an X-linked, fatal neurodegenerative disorder of copper metabolism, caused by mutations in the ATP7A gene. Thirty-three Menkes patients in whom no mutation had been detected with standard diagnostic tools were screened for exon duplications in the ATP7A gene...

  20. Prevalent role of gene features in determining evolutionary fates of whole-genome duplication duplicated genes in flowering plants.

    Science.gov (United States)

    Jiang, Wen-kai; Liu, Yun-long; Xia, En-hua; Gao, Li-zhi

    2013-04-01

    The evolution of genes and genomes after polyploidization has been the subject of extensive studies in evolutionary biology and plant sciences. While a significant number of duplicated genes are rapidly removed during a process called fractionation, which operates after the whole-genome duplication (WGD), another considerable number of genes are retained preferentially, leading to the phenomenon of biased gene retention. However, the evolutionary mechanisms underlying gene retention after WGD remain largely unknown. Through genome-wide analyses of sequence and functional data, we comprehensively investigated the relationships between gene features and the retention probability of duplicated genes after WGDs in six plant genomes, Arabidopsis (Arabidopsis thaliana), poplar (Populus trichocarpa), soybean (Glycine max), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays). The results showed that multiple gene features were correlated with the probability of gene retention. Using a logistic regression model based on principal component analysis, we resolved evolutionary rate, structural complexity, and GC3 content as the three major contributors to gene retention. Cluster analysis of these features further classified retained genes into three distinct groups in terms of gene features and evolutionary behaviors. Type I genes are more prone to be selected by dosage balance; type II genes are possibly subject to subfunctionalization; and type III genes may serve as potential targets for neofunctionalization. This study highlights that gene features are able to act jointly as primary forces when determining the retention and evolution of WGD-derived duplicated genes in flowering plants. These findings thus may help to provide a resolution to the debate on different evolutionary models of gene fates after WGDs.

  1. EPSPS Gene Copy Number and Whole-Plant Glyphosate Resistance Level in Kochia scoparia

    OpenAIRE

    Gaines, Todd A.; Barker, Abigail L.; Patterson, Eric L.; Westra, Philip; Westra, Eric P.; Wilson, Robert G.; Jha, Prashant; Kumar, Vipan; Andrew R Kniss

    2016-01-01

    Glyphosate-resistant (GR) Kochia scoparia has evolved in dryland chemical fallow systems throughout North America and the mechanism of resistance involves 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene duplication. Agricultural fields in four states were surveyed for K. scoparia in 2013 and tested for glyphosate-resistance level and EPSPS gene copy number. Glyphosate resistance was confirmed in K. scoparia populations collected from sugarbeet fields in Colorado, Wyoming, and Nebrask...

  2. The evolution of pepsinogen C genes in vertebrates: duplication, loss and functional diversification.

    Directory of Open Access Journals (Sweden)

    Luís Filipe Costa Castro

    Full Text Available BACKGROUND: Aspartic proteases comprise a large group of enzymes involved in peptide proteolysis. This collection includes prominent enzymes globally categorized as pepsins, which are derived from pepsinogen precursors. Pepsins are involved in gastric digestion, a hallmark of vertebrate physiology. An important member among the pepsinogens is pepsinogen C (Pgc. A particular aspect of Pgc is its apparent single copy status, which contrasts with the numerous gene copies found for example in pepsinogen A (Pga. Although gene sequences with similarity to Pgc have been described in some vertebrate groups, no exhaustive evolutionary framework has been considered so far. METHODOLOGY/PRINCIPAL FINDINGS: By combining phylogenetics and genomic analysis, we find an unexpected Pgc diversity in the vertebrate sub-phylum. We were able to reconstruct gene duplication timings relative to the divergence of major vertebrate clades. Before tetrapod divergence, a single Pgc gene tandemly expanded to produce two gene lineages (Pgbc and Pgc2. These have been differentially retained in various classes. Accordingly, we find Pgc2 in sauropsids, amphibians and marsupials, but not in eutherian mammals. Pgbc was retained in amphibians, but duplicated in the ancestor of amniotes giving rise to Pgb and Pgc1. The latter was retained in mammals and probably in reptiles and marsupials but not in birds. Pgb was kept in all of the amniote clade with independent episodes of loss in some mammalian species. Lineage specific expansions of Pgc2 and Pgbc have also occurred in marsupials and amphibians respectively. We find that teleost and tetrapod Pgc genes reside in distinct genomic regions hinting at a possible translocation. CONCLUSIONS: We conclude that the repertoire of Pgc genes is larger than previously reported, and that tandem duplications have modelled the history of Pgc genes. We hypothesize that gene expansion lead to functional divergence in tetrapods, coincident with the

  3. Gene duplications circumvent trade-offs in enzyme function: Insect adaptation to toxic host plants.

    Science.gov (United States)

    Dalla, Safaa; Dobler, Susanne

    2016-12-01

    Herbivorous insects and their adaptations against plant toxins provide striking opportunities to investigate the genetic basis of traits involved in coevolutionary interactions. Target site insensitivity to cardenolides has evolved convergently across six orders of insects, involving identical substitutions in the Na,K-ATPase gene and repeated convergent gene duplications. The large milkweed bug, Oncopeltus fasciatus, has three copies of the Na,K-ATPase α-subunit gene that bear differing numbers of amino acid substitutions in the binding pocket for cardenolides. To analyze the effect of these substitutions on cardenolide resistance and to infer possible trade-offs in gene function, we expressed the cardenolide-sensitive Na,K-ATPase of Drosophila melanogaster in vitro and introduced four distinct combinations of substitutions observed in the three gene copies of O. fasciatus. With an increasing number of substitutions, the sensitivity of the Na,K-ATPase to a standard cardenolide decreased in a stepwise manner. At the same time, the enzyme's overall activity decreased significantly with increasing cardenolide resistance and only the least substituted mimic of the Na,K-ATPase α1C copy maintained activity similar to the wild-type enzyme. Our results suggest that the Na,K-ATPase copies in O. fasciatus have diverged in function, enabling specific adaptations to dietary cardenolides while maintaining the functionality of this critical ion carrier. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.

  4. Concerted evolution of duplicated protein-coding genes in Drosophila.

    OpenAIRE

    Hickey, D. A.; Bally-Cuif, L.; Abukashawa, S; Payant, V; Benkel, B F

    1991-01-01

    Very rapid rates of gene conversion were observed between duplicated alpha-amylase-coding sequences in Drosophila melanogaster. This gene conversion process was also seen in the related species Drosophila erecta. Specifically, there is virtual sequence identity between the coding regions of the two genes within each species, while the sequence divergence between species is close to that expected based on their phylogenetic relationship. The flanking, noncoding regions are much more highly div...

  5. Duplication and divergent evolution of the CHS and CHS-like genes in the chalcone synthase (CHS) superfamily

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    The enzymes of the CHS-superfamily are responsible for biosynthesis of a wide range of natural products in plants. They are important for flower pigmentation, protection against UV light and defense against phytopathogens. Many plants were found to contain multiple copies of CHS genes. This review summarizes the recent progress in the studies of the CHS-superfamily, focusing on the duplication and divergent evolution of the CHS and CHS-like genes. Comparative analyses of gene structure, expression patterns and catalytic properties revealed extensive differentiation in both regulation and function among duplicate CHS genes. It is also proposed that the CHS-like enzymes in the CHS-superfamily evolved from CHS at different times in various organisms. The CHS-superfamily thus offers a valuable model to study the rates and patterns of sequence divergence between duplicate genes.

  6. Correlating Traits of Gene Retention, Sequence Divergence, Duplicability and Essentiality in Vertebrates, Arthropods, and Fungi

    Science.gov (United States)

    Waterhouse, Robert M.; Zdobnov, Evgeny M.; Kriventseva, Evgenia V.

    2011-01-01

    Delineating ancestral gene relations among a large set of sequenced eukaryotic genomes allowed us to rigorously examine links between evolutionary and functional traits. We classified 86% of over 1.36 million protein-coding genes from 40 vertebrates, 23 arthropods, and 32 fungi into orthologous groups and linked over 90% of them to Gene Ontology or InterPro annotations. Quantifying properties of ortholog phyletic retention, copy-number variation, and sequence conservation, we examined correlations with gene essentiality and functional traits. More than half of vertebrate, arthropod, and fungal orthologs are universally present across each lineage. These universal orthologs are preferentially distributed in groups with almost all single-copy or all multicopy genes, and sequence evolution of the predominantly single-copy orthologous groups is markedly more constrained. Essential genes from representative model organisms, Mus musculus, Drosophila melanogaster, and Saccharomyces cerevisiae, are significantly enriched in universal orthologs within each lineage, and essential-gene-containing groups consistently exhibit greater sequence conservation than those without. This study of eukaryotic gene repertoire evolution identifies shared fundamental principles and highlights lineage-specific features, it also confirms that essential genes are highly retained and conclusively supports the “knockout-rate prediction” of stronger constraints on essential gene sequence evolution. However, the distinction between sequence conservation of single- versus multicopy orthologs is quantitatively more prominent than between orthologous groups with and without essential genes. The previously underappreciated difference in the tolerance of gene duplications and contrasting evolutionary modes of “single-copy control” versus “multicopy license” may reflect a major evolutionary mechanism that allows extended exploration of gene sequence space. PMID:21148284

  7. Concerted evolution of duplicated protein-coding genes in Drosophila.

    Science.gov (United States)

    Hickey, D A; Bally-Cuif, L; Abukashawa, S; Payant, V; Benkel, B F

    1991-03-01

    Very rapid rates of gene conversion were observed between duplicated alpha-amylase-coding sequences in Drosophila melanogaster. This gene conversion process was also seen in the related species Drosophila erecta. Specifically, there is virtual sequence identity between the coding regions of the two genes within each species, while the sequence divergence between species is close to that expected based on their phylogenetic relationship. The flanking, noncoding regions are much more highly diverged and do not appear to be subject to gene conversion. Comparison of amylase sequences between the two species provides a clear demonstration that recurrent gene conversion does indeed lead to the concerted evolution of the gene pair.

  8. Duplication of the NPHP1 gene in patients with autism spectrum disorder and normal intellectual ability: a case series.

    Science.gov (United States)

    Yasuda, Yuka; Hashimoto, Ryota; Fukai, Ryoko; Okamoto, Nobuhiko; Hiraki, Yoko; Yamamori, Hidenaga; Fujimoto, Michiko; Ohi, Kazutaka; Taniike, Masako; Mohri, Ikuko; Nakashima, Mitsuko; Tsurusaki, Yoshinori; Saitsu, Hirotomo; Matsumoto, Naomichi; Miyake, Noriko; Takeda, Masatoshi

    2014-01-01

    Autism spectrum disorder is a neurodevelopmental disorder characterized by impairments in social interactions, reduced verbal communication abilities, stereotyped repetitive behaviors, and restricted interests. It is a complex condition caused by genetic and environmental factors; the high heritability of this disorder supports the presence of a significant genetic contribution. Many studies have suggested that copy-number variants contribute to the etiology of autism spectrum disorder. Recently, copy-number variants of the nephronophthisis 1 gene have been reported in patients with autism spectrum disorder. To the best of our knowledge, only six autism spectrum disorder cases with duplications of the nephronophthisis 1 gene have been reported. These patients exhibited intellectual dysfunction, including verbal dysfunction in one patient, below-average verbal intellectual ability in one patient, and intellectual disability in four patients. In this study, we identified nephronophthisis 1 duplications in two unrelated Japanese patients with autism spectrum disorder using a high-resolution single-nucleotide polymorphism array. This report is the first to describe a nephronophthisis 1 duplication in an autism spectrum disorder patient with an average verbal intelligence quotient and an average performance intelligence quotient. However, the second autism spectrum disorder patient with a nephronophthisis 1 duplication had a below-average performance intelligence quotient. Neither patient exhibited physical dysfunction, motor developmental delay, or neurological abnormalities. This study supports the clinical observation of nephronophthisis 1 duplication in autism spectrum disorder cases and might contribute to our understanding of the clinical phenotype that arises from this duplication.

  9. Duplication, divergence and persistence in the Phytochrome photoreceptor gene family of cottons (Gossypium spp.

    Directory of Open Access Journals (Sweden)

    Abdukarimov Abdusattor

    2010-06-01

    Full Text Available Abstract Background Phytochromes are a family of red/far-red photoreceptors that regulate a number of important developmental traits in cotton (Gossypium spp., including plant architecture, fiber development, and photoperiodic flowering. Little is known about the composition and evolution of the phytochrome gene family in diploid (G. herbaceum, G. raimondii or allotetraploid (G. hirsutum, G. barbadense cotton species. The objective of this study was to obtain a preliminary inventory and molecular-evolutionary characterization of the phytochrome gene family in cotton. Results We used comparative sequence resources to design low-degeneracy PCR primers that amplify genomic sequence tags (GSTs for members of the PHYA, PHYB/D, PHYC and PHYE gene sub-families from A- and D-genome diploid and AD-genome allotetraploid Gossypium species. We identified two paralogous PHYA genes (designated PHYA1 and PHYA2 in diploid cottons, the result of a Malvaceae-specific PHYA gene duplication that occurred approximately 14 million years ago (MYA, before the divergence of the A- and D-genome ancestors. We identified a single gene copy of PHYB, PHYC, and PHYE in diploid cottons. The allotetraploid genomes have largely retained the complete gene complements inherited from both of the diploid genome ancestors, with at least four PHYA genes and two genes encoding PHYB, PHYC and PHYE in the AD-genomes. We did not identify a PHYD gene in any cotton genomes examined. Conclusions Detailed sequence analysis suggests that phytochrome genes retained after duplication by segmental duplication and allopolyploidy appear to be evolving independently under a birth-and-death-process with strong purifying selection. Our study provides a preliminary phytochrome gene inventory that is necessary and sufficient for further characterization of the biological functions of each of the cotton phytochrome genes, and for the development of 'candidate gene' markers that are potentially useful for

  10. A role for gene duplication and natural variation of gene expression in the evolution of metabolism.

    Directory of Open Access Journals (Sweden)

    Daniel J Kliebenstein

    Full Text Available BACKGROUND: Most eukaryotic genomes have undergone whole genome duplications during their evolutionary history. Recent studies have shown that the function of these duplicated genes can diverge from the ancestral gene via neo- or sub-functionalization within single genotypes. An additional possibility is that gene duplicates may also undergo partitioning of function among different genotypes of a species leading to genetic differentiation. Finally, the ability of gene duplicates to diverge may be limited by their biological function. METHODOLOGY/PRINCIPAL FINDINGS: To test these hypotheses, I estimated the impact of gene duplication and metabolic function upon intraspecific gene expression variation of segmental and tandem duplicated genes within Arabidopsis thaliana. In all instances, the younger tandem duplicated genes showed higher intraspecific gene expression variation than the average Arabidopsis gene. Surprisingly, the older segmental duplicates also showed evidence of elevated intraspecific gene expression variation albeit typically lower than for the tandem duplicates. The specific biological function of the gene as defined by metabolic pathway also modulated the level of intraspecific gene expression variation. The major energy metabolism and biosynthetic pathways showed decreased variation, suggesting that they are constrained in their ability to accumulate gene expression variation. In contrast, a major herbivory defense pathway showed significantly elevated intraspecific variation suggesting that it may be under pressure to maintain and/or generate diversity in response to fluctuating insect herbivory pressures. CONCLUSION: These data show that intraspecific variation in gene expression is facilitated by an interaction of gene duplication and biological activity. Further, this plays a role in controlling diversity of plant metabolism.

  11. EPSPS Gene Copy Number and Whole-Plant Glyphosate Resistance Level in Kochia scoparia.

    Science.gov (United States)

    Gaines, Todd A; Barker, Abigail L; Patterson, Eric L; Westra, Philip; Westra, Eric P; Wilson, Robert G; Jha, Prashant; Kumar, Vipan; Kniss, Andrew R

    2016-01-01

    Glyphosate-resistant (GR) Kochia scoparia has evolved in dryland chemical fallow systems throughout North America and the mechanism of resistance involves 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene duplication. Agricultural fields in four states were surveyed for K. scoparia in 2013 and tested for glyphosate-resistance level and EPSPS gene copy number. Glyphosate resistance was confirmed in K. scoparia populations collected from sugarbeet fields in Colorado, Wyoming, and Nebraska, and Montana. Glyphosate resistance was also confirmed in K. scoparia accessions collected from wheat-fallow fields in Montana. All GR samples had increased EPSPS gene copy number, with median population values up to 11 from sugarbeet fields and up to 13 in Montana wheat-fallow fields. The results indicate that glyphosate susceptibility can be accurately diagnosed using EPSPS gene copy number.

  12. Recombination facilitates neofunctionalization of duplicate genes via originalization

    Directory of Open Access Journals (Sweden)

    Huang Ren

    2010-06-01

    Full Text Available Abstract Background Recently originalization was proposed to be an effective way of duplicate-gene preservation, in which recombination provokes the high frequency of original (or wild-type allele on both duplicated loci. Because the high frequency of wild-type allele might drive the arising and accumulating of advantageous mutation, it is hypothesized that recombination might enlarge the probability of neofunctionalization (Pneo of duplicate genes. In this article this hypothesis has been tested theoretically. Results Results show that through originalization recombination might not only shorten mean time to neofunctionalizaiton, but also enlarge Pneo. Conclusions Therefore, recombination might facilitate neofunctionalization via originalization. Several extensive applications of these results on genomic evolution have been discussed: 1. Time to nonfunctionalization can be much longer than a few million generations expected before; 2. Homogenization on duplicated loci results from not only gene conversion, but also originalization; 3. Although the rate of advantageous mutation is much small compared with that of degenerative mutation, Pneo cannot be expected to be small.

  13. The transformer genes in the fig wasp Ceratosolen solmsi provide new evidence for duplications independent of complementary sex determination.

    Science.gov (United States)

    Jia, L-Y; Xiao, J-H; Xiong, T-L; Niu, L-M; Huang, D-W

    2016-06-01

    Transformer (tra) is the key gene that turns on the sex-determination cascade in Drosophila melanogaster and in some other insects. The honeybee Apis mellifera has two duplicates of tra, one of which (complementary sex determiner, csd) is the primary signal for complementary sex-determination (CSD), regulating the other duplicate (feminizer). Two tra duplicates have been found in some other hymenopteran species, resulting in the assumption that a single ancestral duplication of tra took place in the Hymenoptera. Here, we searched for tra homologues and pseudogenes in the Hymenoptera, focusing on five newly published hymenopteran genomes. We found three tra copies in the fig wasp Ceratosolen solmsi. Further evolutionary and expression analyses also showed that the two duplicates (Csoltra-B and Csoltra-C) are under positive selection, and have female-specific expression, suggesting possible sex-related functions. Moreover, Aculeata species exhibit many pseudogenes generated by lineage-specific duplications. We conclude that phylogenetic reconstruction and pseudogene screening provide novel evidence supporting the hypothesis of independent duplications rather an ancestral origin of multiple tra paralogues in the Hymenoptera. The case of C. solmsi is the first example of a non-CSD species with duplicated tra, contrary to the previous assumption that derived tra paralogues function as the CSD locus. © 2016 The Royal Entomological Society.

  14. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster

    Directory of Open Access Journals (Sweden)

    Dutartre Leslie

    2012-05-01

    Full Text Available Abstract Background The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA, are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(MBOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8 form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5 belonging to the same CYP71C subfamily. The origin of this cluster is unknown. Results We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. Conclusions These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2 at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.

  15. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster.

    Science.gov (United States)

    Dutartre, Leslie; Hilliou, Frédérique; Feyereisen, René

    2012-05-11

    The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA), are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.

  16. Transcriptional rewiring of the sex determining dmrt1 gene duplicate by transposable elements.

    Directory of Open Access Journals (Sweden)

    Amaury Herpin

    2010-02-01

    Full Text Available Control and coordination of eukaryotic gene expression rely on transcriptional and posttranscriptional regulatory networks. Evolutionary innovations and adaptations often require rapid changes of such networks. It has long been hypothesized that transposable elements (TE might contribute to the rewiring of regulatory interactions. More recently it emerged that TEs might bring in ready-to-use transcription factor binding sites to create alterations to the promoters by which they were captured. A process where the gene regulatory architecture is of remarkable plasticity is sex determination. While the more downstream components of the sex determination cascades are evolutionary conserved, the master regulators can switch between groups of organisms even on the interspecies level or between populations. In the medaka fish (Oryzias latipes a duplicated copy of dmrt1, designated dmrt1bY or DMY, on the Y chromosome was shown to be the master regulator of male development, similar to Sry in mammals. We found that the dmrt1bY gene has acquired a new feedback downregulation of its expression. Additionally, the autosomal dmrt1a gene is also able to regulate transcription of its duplicated paralog by binding to a unique target Dmrt1 site nested within the dmrt1bY proximal promoter region. We could trace back this novel regulatory element to a highly conserved sequence within a new type of TE that inserted into the upstream region of dmrt1bY shortly after the duplication event. Our data provide functional evidence for a role of TEs in transcriptional network rewiring for sub- and/or neo-functionalization of duplicated genes. In the particular case of dmrt1bY, this contributed to create new hierarchies of sex-determining genes.

  17. Signals of historical interlocus gene conversion in human segmental duplications.

    Directory of Open Access Journals (Sweden)

    Beth L Dumont

    Full Text Available Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC. Here, we use high quality multiple sequence alignments from well-annotated segmental duplications to systematically identify IGC signals in the human reference genome. Our analysis combines two complementary methods: (i a paralog quartet method that uses DNA sequence simulations to identify a statistical excess of sites consistent with inter-paralog exchange, and (ii the alignment-based method implemented in the GENECONV program. One-quarter (25.4% of the paralog families in our analysis harbor clear IGC signals by the quartet approach. Using GENECONV, we identify 1477 gene conversion tracks that cumulatively span 1.54 Mb of the genome. Our analyses confirm the previously reported high rates of IGC in subtelomeric regions and Y-chromosome palindromes, and identify multiple novel IGC hotspots, including the pregnancy specific glycoproteins and the neuroblastoma breakpoint gene families. Although the duplication history of a paralog family is described by a single tree, we show that IGC has introduced incredible site-to-site variation in the evolutionary relationships among paralogs in the human genome. Our findings indicate that IGC has left significant footprints in patterns of sequence diversity across segmental duplications in the human genome, out-pacing the contributions of single base mutation by orders of magnitude. Collectively, the IGC signals we report comprise a catalog that will provide a critical reference for interpreting observed patterns of DNA sequence variation across duplicated genomic regions, including targets of recent adaptive evolution in humans.

  18. Concomitant duplications of opioid peptide and receptor genes before the origin of jawed vertebrates.

    Directory of Open Access Journals (Sweden)

    Görel Sundström

    Full Text Available BACKGROUND: The opioid system is involved in reward and pain mechanisms and consists in mammals of four receptors and several peptides. The peptides are derived from four prepropeptide genes, PENK, PDYN, PNOC and POMC, encoding enkephalins, dynorphins, orphanin/nociceptin and beta-endorphin, respectively. Previously we have described how two rounds of genome doubling (2R before the origin of jawed vertebrates formed the receptor family. METHODOLOGY/PRINCIPAL FINDINGS: Opioid peptide gene family members were investigated using a combination of sequence-based phylogeny and chromosomal locations of the peptide genes in various vertebrates. Several adjacent gene families were investigated similarly. The results show that the ancestral peptide gene gave rise to two additional copies in the genome doublings. The fourth member was generated by a local gene duplication, as the genes encoding POMC and PNOC are located on the same chromosome in the chicken genome and all three teleost genomes that we have studied. A translocation has disrupted this synteny in mammals. The PDYN gene seems to have been lost in chicken, but not in zebra finch. Duplicates of some peptide genes have arisen in the teleost fishes. Within the prepropeptide precursors, peptides have been lost or gained in different lineages. CONCLUSIONS/SIGNIFICANCE: The ancestral peptide and receptor genes were located on the same chromosome and were thus duplicated concomitantly. However, subsequently genetic linkage has been lost. In conclusion, the system of opioid peptides and receptors was largely formed by the genome doublings that took place early in vertebrate evolution.

  19. Some novel intron positions in conserved Drosophila genes are caused by intron sliding or tandem duplication

    Directory of Open Access Journals (Sweden)

    Stadler Peter F

    2010-05-01

    Full Text Available Abstract Background Positions of spliceosomal introns are often conserved between remotely related genes. Introns that reside in non-conserved positions are either novel or remnants of frequent losses of introns in some evolutionary lineages. A recent gain of such introns is difficult to prove. However, introns verified as novel are needed to evaluate contemporary processes of intron gain. Results We identified 25 unambiguous cases of novel intron positions in 31 Drosophila genes that exhibit near intron pairs (NIPs. Here, a NIP consists of an ancient and a novel intron position that are separated by less than 32 nt. Within a single gene, such closely-spaced introns are very unlikely to have coexisted. In most cases, therefore, the ancient intron position must have disappeared in favour of the novel one. A survey for NIPs among 12 Drosophila genomes identifies intron sliding (migration as one of the more frequent causes of novel intron positions. Other novel introns seem to have been gained by regional tandem duplications of coding sequences containing a proto-splice site. Conclusions Recent intron gains sometimes appear to have arisen by duplication of exonic sequences and subsequent intronization of one of the copies. Intron migration and exon duplication together may account for a significant amount of novel intron positions in conserved coding sequences.

  20. Sox genes in grass carp (Ctenopharyngodon idella with their implications for genome duplication and evolution

    Directory of Open Access Journals (Sweden)

    Tong Jingou

    2006-11-01

    Full Text Available Abstract The Sox gene family is found in a broad range of animal taxa and encodes important gene regulatory proteins involved in a variety of developmental processes. We have obtained clones representing the HMG boxes of twelve Sox genes from grass carp (Ctenopharyngodon idella, one of the four major domestic carps in China. The cloned Sox genes belong to group B1, B2 and C. Our analyses show that whereas the human genome contains a single copy of Sox4, Sox11 and Sox14, each of these genes has two co-orthologs in grass carp, and the duplication of Sox4 and Sox11 occurred before the divergence of grass carp and zebrafish, which support the "fish-specific whole-genome duplication" theory. An estimation for the origin of grass carp based on the molecular clock using Sox1, Sox3 and Sox11 genes as markers indicates that grass carp (subfamily Leuciscinae and zebrafish (subfamily Danioninae diverged approximately 60 million years ago. The potential uses of Sox genes as markers in revealing the evolutionary history of grass carp are discussed.

  1. Hox gene duplications correlate with posterior heteronomy in scorpions.

    Science.gov (United States)

    Sharma, Prashant P; Schwager, Evelyn E; Extavour, Cassandra G; Wheeler, Ward C

    2014-10-07

    The evolutionary success of the largest animal phylum, Arthropoda, has been attributed to tagmatization, the coordinated evolution of adjacent metameres to form morphologically and functionally distinct segmental regions called tagmata. Specification of regional identity is regulated by the Hox genes, of which 10 are inferred to be present in the ancestor of arthropods. With six different posterior segmental identities divided into two tagmata, the bauplan of scorpions is the most heteronomous within Chelicerata. Expression domains of the anterior eight Hox genes are conserved in previously surveyed chelicerates, but it is unknown how Hox genes regionalize the three tagmata of scorpions. Here, we show that the scorpion Centruroides sculpturatus has two paralogues of all Hox genes except Hox3, suggesting cluster and/or whole genome duplication in this arachnid order. Embryonic anterior expression domain boundaries of each of the last four pairs of Hox genes (two paralogues each of Antp, Ubx, abd-A and Abd-B) are unique and distinguish segmental groups, such as pectines, book lungs and the characteristic tail, while maintaining spatial collinearity. These distinct expression domains suggest neofunctionalization of Hox gene paralogues subsequent to duplication. Our data reconcile previous understanding of Hox gene function across arthropods with the extreme heteronomy of scorpions.

  2. Profiling of gene duplication patterns of sequenced teleost genomes: evidence for rapid lineage-specific genome expansion mediated by recent tandem duplications

    Directory of Open Access Journals (Sweden)

    Lu Jianguo

    2012-06-01

    Full Text Available Abstract Background Gene duplication has had a major impact on genome evolution. Localized (or tandem duplication resulting from unequal crossing over and whole genome duplication are believed to be the two dominant mechanisms contributing to vertebrate genome evolution. While much scrutiny has been directed toward discerning patterns indicative of whole-genome duplication events in teleost species, less attention has been paid to the continuous nature of gene duplications and their impact on the size, gene content, functional diversity, and overall architecture of teleost genomes. Results Here, using a Markov clustering algorithm directed approach we catalogue and analyze patterns of gene duplication in the four model teleost species with chromosomal coordinates: zebrafish, medaka, stickleback, and Tetraodon. Our analyses based on set size, duplication type, synonymous substitution rate (Ks, and gene ontology emphasize shared and lineage-specific patterns of genome evolution via gene duplication. Most strikingly, our analyses highlight the extraordinary duplication and retention rate of recent duplicates in zebrafish and their likely role in the structural and functional expansion of the zebrafish genome. We find that the zebrafish genome is remarkable in its large number of duplicated genes, small duplicate set size, biased Ks distribution toward minimal mutational divergence, and proportion of tandem and intra-chromosomal duplicates when compared with the other teleost model genomes. The observed gene duplication patterns have played significant roles in shaping the architecture of teleost genomes and appear to have contributed to the recent functional diversification and divergence of important physiological processes in zebrafish. Conclusions We have analyzed gene duplication patterns and duplication types among the available teleost genomes and found that a large number of genes were tandemly and intrachromosomally duplicated, suggesting

  3. The role of human-specific gene duplications during brain development and evolution.

    Science.gov (United States)

    Sassa, Takayuki

    2013-09-01

    One of the most fascinating questions in evolutionary biology is how traits unique to humans, such as their high cognitive abilities, erect bipedalism, and hairless skin, are encoded in the genome. Recent advances in genomics have begun to reveal differences between the genomes of the great apes. It has become evident that one of the many mutation types, segmental duplication, has drastically increased in the primate genomes, and most remarkably in the human genome. Genes contained in these segmental duplications have a tremendous potential to cause genetic innovation, probably accounting for the acquisition of human-specific traits. In this review, I begin with an overview of the genes, which have increased their copy number specifically in the human lineage, following its separation from the common ancestor with our closest living relative, the chimpanzee. Then, I introduce the recent experimental approaches, focusing on SRGAP2, which has been partially duplicated, to elucidate the role of SRGAP2 protein and its human-specific paralogs in human brain development and evolution.

  4. A salmonid EST genomic study: genes, duplications, phylogeny and microarrays

    Directory of Open Access Journals (Sweden)

    Brahmbhatt Sonal

    2008-11-01

    Full Text Available Abstract Background Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most widely studied groups of fish. Results 298,304 expressed sequence tags (ESTs from Atlantic salmon (69% of the total, 11,664 chinook, 10,813 sockeye, 10,051 brook trout, 10,975 grayling, 8,630 lake whitefish, and 3,624 northern pike ESTs were obtained in this study and have been deposited into the public databases. Contigs were built and putative full-length Atlantic salmon clones have been identified. A database containing ESTs, assemblies, consensus sequences, open reading frames, gene predictions and putative annotation is available. The overall similarity between Atlantic salmon ESTs and those of rainbow trout, chinook, sockeye, brook trout, grayling, lake whitefish, northern pike and rainbow smelt is 93.4, 94.2, 94.6, 94.4, 92.5, 91.7, 89.6, and 86.2% respectively. An analysis of 78 transcript sets show Salmo as a sister group to Oncorhynchus and Salvelinus within Salmoninae, and Thymallinae as a sister group to Salmoninae and Coregoninae within Salmonidae. Extensive gene duplication is consistent with a genome duplication in the common ancestor of salmonids. Using all of the available EST data, a new expanded salmonid cDNA microarray of 32,000 features was created. Cross-species hybridizations to this cDNA microarray indicate that this resource will be useful for studies of all 68 salmonid species. Conclusion An extensive collection and analysis of salmonid RNA putative transcripts indicate that Pacific salmon, Atlantic salmon and charr are 94–96% similar while the more distant whitefish, grayling, pike and smelt are 93, 92, 89 and 86% similar to salmon. The salmonid transcriptome reveals a complex history of gene duplication that is

  5. Matrix Gla protein and osteocalcin: from gene duplication to neofunctionalization.

    Science.gov (United States)

    Cancela, M Leonor; Laizé, Vincent; Conceição, Natércia

    2014-11-01

    Osteocalcin (OC or bone Gla protein, BGP) and matrix Gla protein (MGP) are two members of the growing family of vitamin K-dependent (VKD) proteins. They were the first VKD proteins found not to be involved in coagulation and synthesized outside the liver. Both proteins were isolated from bone although it is now known that only OC is synthesized by bone cells under normal physiological conditions, but since both proteins can bind calcium and hydroxyapatite, they can also accumulate in bone. Both OC and MGP share similar structural features, both in terms of protein domains and gene organization. OC gene is likely to have appeared from MGP through a tandem gene duplication that occurred concomitantly with the appearance of the bony vertebrates. Despite their relatively close relationship and the fact that both can bind calcium and affect mineralization, their functions are not redundant and they also have other unrelated functions. Interestingly, these two proteins appear to have followed quite different evolutionary strategies in order to acquire novel functionalities, with OC following a gene duplication strategy while MGP variability was obtained mostly by the use of multiple promoters and alternative splicing, leading to proteins with additional functional characteristics and alternative gene regulatory pathways. Copyright © 2014 Elsevier Inc. All rights reserved.

  6. Gene duplications in prokaryotes can be associated with environmental adaptation

    Directory of Open Access Journals (Sweden)

    Lempicki Richard A

    2010-10-01

    Full Text Available Abstract Background Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Results Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Conclusions Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive

  7. Evolutionary Fates and Dynamic Functionalization of Young Duplicate Genes in Arabidopsis Genomes1[OPEN

    Science.gov (United States)

    Wang, Jun; Tao, Feng; Marowsky, Nicholas C.; Fan, Chuanzhu

    2016-01-01

    Gene duplication is a primary means to generate genomic novelties, playing an essential role in speciation and adaptation. Particularly in plants, a high abundance of duplicate genes has been maintained for significantly long periods of evolutionary time. To address the manner in which young duplicate genes were derived primarily from small-scale gene duplication and preserved in plant genomes and to determine the underlying driving mechanisms, we generated transcriptomes to produce the expression profiles of five tissues in Arabidopsis thaliana and the closely related species Arabidopsis lyrata and Capsella rubella. Based on the quantitative analysis metrics, we investigated the evolutionary processes of young duplicate genes in Arabidopsis. We determined that conservation, neofunctionalization, and specialization are three main evolutionary processes for Arabidopsis young duplicate genes. We explicitly demonstrated the dynamic functionalization of duplicate genes along the evolutionary time scale. Upon origination, duplicates tend to maintain their ancestral functions; but as they survive longer, they might be likely to develop distinct and novel functions. The temporal evolutionary processes and functionalization of plant duplicate genes are associated with their ancestral functions, dynamic DNA methylation levels, and histone modification abundances. Furthermore, duplicate genes tend to be initially expressed in pollen and then to gain more interaction partners over time. Altogether, our study provides novel insights into the dynamic retention processes of young duplicate genes in plant genomes. PMID:27485883

  8. Expression Divergence of Duplicate Genes in the Protein Kinase Superfamily in Pacific Oyster.

    Science.gov (United States)

    Gao, Dahai; Ko, Dennis C; Tian, Xinmin; Yang, Guang; Wang, Liuyang

    2015-01-01

    Gene duplication has been proposed to serve as the engine of evolutionary innovation. It is well recognized that eukaryotic genomes contain a large number of duplicated genes that evolve new functions or expression patterns. However, in mollusks, the evolutionary mechanisms underlying the divergence and the functional maintenance of duplicate genes remain little understood. In the present study, we performed a comprehensive analysis of duplicate genes in the protein kinase superfamily using whole genome and transcriptome data for the Pacific oyster. A total of 64 duplicated gene pairs were identified based on a phylogenetic approach and the reciprocal best BLAST method. By analyzing gene expression from RNA-seq data from 69 different developmental and stimuli-induced conditions (nine tissues, 38 developmental stages, eight dry treatments, seven heat treatments, and seven salty treatments), we found that expression patterns were significantly correlated for a number of duplicate gene pairs, suggesting the conservation of regulatory mechanisms following divergence. Our analysis also identified a subset of duplicate gene pairs with very high expression divergence, indicating that these gene pairs may have been subjected to transcriptional subfunctionalization or neofunctionalization after the initial duplication events. Further analysis revealed a significant correlation between expression and sequence divergence (as revealed by synonymous or nonsynonymous substitution rates) under certain conditions. Taken together, these results provide evidence for duplicate gene sequence and expression divergence in the Pacific oyster, accompanying its adaptation to harsh environments. Our results provide new insights into the evolution of duplicate genes and their expression levels in the Pacific oyster.

  9. Systematic Inference of Copy-Number Genotypes from Personal Genome Sequencing Data Reveals Extensive Olfactory Receptor Gene Content Diversity

    Science.gov (United States)

    Waszak, Sebastian M.; Hasin, Yehudit; Zichner, Thomas; Olender, Tsviya; Keydar, Ifat; Khen, Miriam; Stütz, Adrian M.; Schlattl, Andreas; Lancet, Doron; Korbel, Jan O.

    2010-01-01

    Copy-number variations (CNVs) are widespread in the human genome, but comprehensive assignments of integer locus copy-numbers (i.e., copy-number genotypes) that, for example, enable discrimination of homozygous from heterozygous CNVs, have remained challenging. Here we present CopySeq, a novel computational approach with an underlying statistical framework that analyzes the depth-of-coverage of high-throughput DNA sequencing reads, and can incorporate paired-end and breakpoint junction analysis based CNV-analysis approaches, to infer locus copy-number genotypes. We benchmarked CopySeq by genotyping 500 chromosome 1 CNV regions in 150 personal genomes sequenced at low-coverage. The assessed copy-number genotypes were highly concordant with our performed qPCR experiments (Pearson correlation coefficient 0.94), and with the published results of two microarray platforms (95–99% concordance). We further demonstrated the utility of CopySeq for analyzing gene regions enriched for segmental duplications by comprehensively inferring copy-number genotypes in the CNV-enriched >800 olfactory receptor (OR) human gene and pseudogene loci. CopySeq revealed that OR loci display an extensive range of locus copy-numbers across individuals, with zero to two copies in some OR loci, and two to nine copies in others. Among genetic variants affecting OR loci we identified deleterious variants including CNVs and SNPs affecting ∼15% and ∼20% of the human OR gene repertoire, respectively, implying that genetic variants with a possible impact on smell perception are widespread. Finally, we found that for several OR loci the reference genome appears to represent a minor-frequency variant, implying a necessary revision of the OR repertoire for future functional studies. CopySeq can ascertain genomic structural variation in specific gene families as well as at a genome-wide scale, where it may enable the quantitative evaluation of CNVs in genome-wide association studies involving high

  10. The major resistance gene cluster in lettuce is highly duplicated and spans several megabases.

    Science.gov (United States)

    Meyers, B C; Chin, D B; Shen, K A; Sivaramakrishnan, S; Lavelle, D O; Zhang, Z; Michelmore, R W

    1998-11-01

    At least 10 Dm genes conferring resistance to the oomycete downy mildew fungus Bremia lactucae map to the major resistance cluster in lettuce. We investigated the structure of this cluster in the lettuce cultivar Diana, which contains Dm3. A deletion breakpoint map of the chromosomal region flanking Dm3 was saturated with a variety of molecular markers. Several of these markers are components of a family of resistance gene candidates (RGC2) that encode a nucleotide binding site and a leucine-rich repeat region. These motifs are characteristic of plant disease resistance genes. Bacterial artificial chromosome clones were identified by using duplicated restriction fragment length polymorphism markers from the region, including the nucleotide binding site-encoding region of RGC2. Twenty-two distinct members of the RGC2 family were characterized from the bacterial artificial chromosomes; at least two additional family members exist. The RGC2 family is highly divergent; the nucleotide identity was as low as 53% between the most distantly related copies. These RGC2 genes span at least 3.5 Mb. Eighteen members were mapped on the deletion breakpoint map. A comparison between the phylogenetic and physical relationships of these sequences demonstrated that closely related copies are physically separated from one another and indicated that complex rearrangements have shaped this region. Analysis of low-copy genomic sequences detected no genes, including RGC2, in the Dm3 region, other than sequences related to retrotransposons and transposable elements. The related but divergent family of RGC2 genes may act as a resource for the generation of new resistance phenotypes through infrequent recombination or unequal crossing over.

  11. The Orphan Gene dauerless Regulates Dauer Development and Intraspecific Competition in Nematodes by Copy Number Variation.

    Science.gov (United States)

    Mayer, Melanie G; Rödelsperger, Christian; Witte, Hanh; Riebesell, Metta; Sommer, Ralf J

    2015-06-01

    Many nematodes form dauer larvae when exposed to unfavorable conditions, representing an example of phenotypic plasticity and a major survival and dispersal strategy. In Caenorhabditis elegans, the regulation of dauer induction is a model for pheromone, insulin, and steroid-hormone signaling. Recent studies in Pristionchus pacificus revealed substantial natural variation in various aspects of dauer development, i.e. pheromone production and sensing and dauer longevity and fitness. One intriguing example is a strain from Ohio, having extremely long-lived dauers associated with very high fitness and often forming the most dauers in response to other strains' pheromones, including the reference strain from California. While such examples have been suggested to represent intraspecific competition among strains, the molecular mechanisms underlying these dauer-associated patterns are currently unknown. We generated recombinant-inbred-lines between the Californian and Ohioan strains and used quantitative-trait-loci analysis to investigate the molecular mechanism determining natural variation in dauer development. Surprisingly, we discovered that the orphan gene dauerless controls dauer formation by copy number variation. The Ohioan strain has one dauerless copy causing high dauer formation, whereas the Californian strain has two copies, resulting in strongly reduced dauer formation. Transgenic animals expressing multiple copies do not form dauers. dauerless is exclusively expressed in CAN neurons, and both CAN ablation and dauerless mutations increase dauer formation. Strikingly, dauerless underwent several duplications and acts in parallel or downstream of steroid-hormone signaling but upstream of the nuclear-hormone-receptor daf-12. We identified the novel or fast-evolving gene dauerless as inhibitor of dauer development. Our findings reveal the importance of gene duplications and copy number variations for orphan gene function and suggest daf-12 as major target for

  12. The Orphan Gene dauerless Regulates Dauer Development and Intraspecific Competition in Nematodes by Copy Number Variation.

    Directory of Open Access Journals (Sweden)

    Melanie G Mayer

    2015-06-01

    Full Text Available Many nematodes form dauer larvae when exposed to unfavorable conditions, representing an example of phenotypic plasticity and a major survival and dispersal strategy. In Caenorhabditis elegans, the regulation of dauer induction is a model for pheromone, insulin, and steroid-hormone signaling. Recent studies in Pristionchus pacificus revealed substantial natural variation in various aspects of dauer development, i.e. pheromone production and sensing and dauer longevity and fitness. One intriguing example is a strain from Ohio, having extremely long-lived dauers associated with very high fitness and often forming the most dauers in response to other strains' pheromones, including the reference strain from California. While such examples have been suggested to represent intraspecific competition among strains, the molecular mechanisms underlying these dauer-associated patterns are currently unknown. We generated recombinant-inbred-lines between the Californian and Ohioan strains and used quantitative-trait-loci analysis to investigate the molecular mechanism determining natural variation in dauer development. Surprisingly, we discovered that the orphan gene dauerless controls dauer formation by copy number variation. The Ohioan strain has one dauerless copy causing high dauer formation, whereas the Californian strain has two copies, resulting in strongly reduced dauer formation. Transgenic animals expressing multiple copies do not form dauers. dauerless is exclusively expressed in CAN neurons, and both CAN ablation and dauerless mutations increase dauer formation. Strikingly, dauerless underwent several duplications and acts in parallel or downstream of steroid-hormone signaling but upstream of the nuclear-hormone-receptor daf-12. We identified the novel or fast-evolving gene dauerless as inhibitor of dauer development. Our findings reveal the importance of gene duplications and copy number variations for orphan gene function and suggest daf-12 as

  13. Neutral and Non-Neutral Evolution of Duplicated Genes with Gene Conversion

    Directory of Open Access Journals (Sweden)

    Jeffrey A. Fawcett

    2011-02-01

    Full Text Available Gene conversion is one of the major mutational mechanisms involved in the DNA sequence evolution of duplicated genes. It contributes to create unique patters of DNA polymorphism within species and divergence between species. A typical pattern is so-called concerted evolution, in which the divergence between duplicates is maintained low for a long time because of frequent exchanges of DNA fragments. In addition, gene conversion affects the DNA evolution of duplicates in various ways especially when selection operates. Here, we review theoretical models to understand the evolution of duplicates in both neutral and non-neutral cases. We also explain how these theories contribute to interpreting real polymorphism and divergence data by using some intriguing examples.

  14. Evolution vs the number of gene copies per primitive cell.

    Science.gov (United States)

    Koch, A L

    1984-01-01

    Computer simulations are presented of the rate at which an advantageous mutant would displace the prototype in a replicating system without an accurate segregation mechanism. If the number of gene copies in the system is indefinitely large, Darwinian evolution is essentially stopped because there is no coupling of phenotype with genotype, i.e., there is no growth advantage to the advantageous gene relative to the prototype and therefore no "survival of the fittest." The inhibition of evolution due to a number of gene copies less than 100 would have been not insurmountable. Although the presence of multiple copies would have allowed replacement by an advantageous mutant, it provided a way for the primitive cell to conserve less immediately useful genes that could evolve into different or more effective genes. This possibility was lost as accurate segregation mechanisms evolved and cells with few copies of each gene, such as modern procaryotes, arose.

  15. North Carolina macular dystrophy (MCDR1) caused by a novel tandem duplication of the PRDM13 gene

    Science.gov (United States)

    Sullivan, Lori S.; Wheaton, Dianna K.; Locke, Kirsten G.; Jones, Kaylie D.; Koboldt, Daniel C.; Fulton, Robert S.; Wilson, Richard K.; Blanton, Susan H.; Birch, David G.; Daiger, Stephen P.

    2016-01-01

    Purpose To identify the underlying cause of disease in a large family with North Carolina macular dystrophy (NCMD). Methods A large four-generation family (RFS355) with an autosomal dominant form of NCMD was ascertained. Family members underwent comprehensive visual function evaluations. Blood or saliva from six affected family members and three unaffected spouses was collected and DNA tested for linkage to the MCDR1 locus on chromosome 6q12. Three affected family members and two unaffected spouses underwent whole exome sequencing (WES) and subsequently, custom capture of the linkage region followed by next-generation sequencing (NGS). Standard PCR and dideoxy sequencing were used to further characterize the mutation. Results Of the 12 eyes examined in six affected individuals, all but two had Gass grade 3 macular degeneration features. Large central excavation of the retinal and choroid layers, referred to as a macular caldera, was seen in an age-independent manner in the grade 3 eyes. The calderas are unique to affected individuals with MCDR1. Genome-wide linkage mapping and haplotype analysis of markers from the chromosome 6q region were consistent with linkage to the MCDR1 locus. Whole exome sequencing and custom-capture NGS failed to reveal any rare coding variants segregating with the phenotype. Analysis of the custom-capture NGS sequencing data for copy number variants uncovered a tandem duplication of approximately 60 kb on chromosome 6q. This region contains two genes, CCNC and PRDM13. The duplication creates a partial copy of CCNC and a complete copy of PRDM13. The duplication was found in all affected members of the family and is not present in any unaffected members. The duplication was not seen in 200 ethnically matched normal chromosomes. Conclusions The cause of disease in the original family with MCDR1 and several others has been recently reported to be dysregulation of the PRDM13 gene, caused by either single base substitutions in a DNase 1

  16. Duplication and Diversification of the Hypoxia-Inducible IGFBP-1 Gene in Zebrafish

    DEFF Research Database (Denmark)

    Kamei, Hiroyasu; Lu, Ling; Jiao, Shuang

    2008-01-01

    Background: Gene duplication is the primary force of new gene evolution. Deciphering whether a pair of duplicated genes has evolved divergent functions is often challenging. The zebrafish is uniquely positioned to provide insight into the process of functional gene evolution due to its amenabilit...

  17. Divergence of Recently Duplicated Mg-Type MADS-Box Genes in Petunia

    NARCIS (Netherlands)

    Bemer, M.; Gordon, J.; Weterings, K.; Angenent, G.C.

    2010-01-01

    The MADS-box transcription factor family has expanded considerably in plants via gene and genome duplications and can be subdivided into type I and MIKC-type genes. The two gene classes show a different evolutionary history. Whereas the MIKC-type genes originated during ancient genome duplications,

  18. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    Science.gov (United States)

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  19. Copy-number changes in evolution: rates, fitness effects and adaptive significance

    Directory of Open Access Journals (Sweden)

    Vaishali eKatju

    2013-12-01

    Full Text Available Gene copy-number differences due to gene duplications and deletions are rampant in natural populations and play a crucial role in the evolution of genome complexity. Per-locus analyses of gene duplication rates in the pre-genomic era revealed that gene duplication rates are much higher than the per nucleotide substitution rate. Analyses of gene duplication and deletion rates in mutation accumulation lines of model organisms have revealed that these high rates of copy-number mutations occur at a genome-wide scale. Furthermore, comparisons of the spontaneous duplication and deletion rates to copy-number polymorphism data and bioinformatic-based estimates of duplication rates from sequenced genomes suggest that the vast majority of gene duplications are detrimental and removed by natural selection. The rate at which new gene copies appear in populations greatly influences their evolutionary dynamics and standing gene copy-number variation in populations. The opportunity for mutations that result in the maintenance of duplicate copies, either through neofunctionalization or subfunctionalization, also depends on the equilibrium frequency of additional gene copies in the population, and hence on the spontaneous gene duplication (and loss rate. The duplication rate may therefore have profound effects on the role of adaptation in the evolution of duplicated genes as well as important consequences for the evolutionary potential of organisms. We further discuss the broad ramifications of this standing gene copy-number variation on fitness and adaptive potential from a population-genetic and genome-wide perspective.

  20. DNA Copy Number Variants of Known Glaucoma Genes in Relation to Primary Open-Angle Glaucoma

    Science.gov (United States)

    Liu, Yutao; Garrett, Melanie E.; Yaspan, Brian L.; Bailey, Jessica Cooke; Loomis, Stephanie J.; Brilliant, Murray; Budenz, Donald L.; Christen, William G.; Fingert, John H.; Gaasterland, Douglas; Gaasterland, Terry; Kang, Jae H.; Lee, Richard K.; Lichter, Paul; Moroi, Sayoko E.; Realini, Anthony; Richards, Julia E.; Schuman, Joel S.; Scott, William K.; Singh, Kuldev; Sit, Arthur J.; Vollrath, Douglas; Weinreb, Robert; Wollstein, Gadi; Zack, Donald J.; Zhang, Kang; Pericak-Vance, Margaret A.; Haines, Jonathan L.; Pasquale, Louis R.; Wiggs, Janey L.; Allingham, R. Rand; Ashley-Koch, Allison E.; Hauser, Michael A.

    2014-01-01

    Purpose. We examined the role of DNA copy number variants (CNVs) of known glaucoma genes in relation to primary open angle glaucoma (POAG). Methods. Our study included DNA samples from two studies (NEIGHBOR and GLAUGEN). All the samples were genotyped with the Illumina Human660W_Quad_v1 BeadChip. After removing non–blood-derived and amplified DNA samples, we applied quality control steps based on the mean Log R Ratio and the mean B allele frequency. Subsequently, data from 3057 DNA samples (1599 cases and 1458 controls) were analyzed with PennCNV software. We defined CNVs as those ≥5 kilobases (kb) in size and interrogated by ≥5 consecutive probes. We further limited our investigation to CNVs in known POAG-related genes, including CDKN2B-AS1, TMCO1, SIX1/SIX6, CAV1/CAV2, the LRP12-ZFPM2 region, GAS7, ATOH7, FNDC3B, CYP1B1, MYOC, OPTN, WDR36, SRBD1, TBK1, and GALC. Results. Genomic duplications of CDKN2B-AS1 and TMCO1 were each found in a single case. Two cases carried duplications in the GAS7 region. Genomic deletions of SIX6 and ATOH7 were each identified in one case. One case carried a TBK1 deletion and another case carried a TBK1 duplication. No controls had duplications or deletions in these six genes. A single control had a duplication in the MYOC region. Deletions of GALC were observed in five cases and two controls. Conclusions. The CNV analysis of a large set of cases and controls revealed the presence of rare CNVs in known POAG susceptibility genes. Our data suggest that these rare CNVs may contribute to POAG pathogenesis and merit functional evaluation. PMID:25414181

  1. Are duplicated genes responsible for anthracnose resistance in common bean?

    Science.gov (United States)

    Costa, Larissa Carvalho; Nalin, Rafael Storto; Ramalho, Magno Antonio Patto; de Souza, Elaine Aparecida

    2017-01-01

    The race 65 of Colletotrichum lindemuthianum, etiologic agent of anthracnose in common bean, is distributed worldwide, having great importance in breeding programs for anthracnose resistance. Several resistance alleles have been identified promoting resistance to this race. However, the variability that has been detected within race has made it difficult to obtain cultivars with durable resistance, because cultivars may have different reactions to each strain of race 65. Thus, this work aimed at studying the resistance inheritance of common bean lines to different strains of C. lindemuthianum, race 65. We used six C. lindemuthianum strains previously characterized as belonging to the race 65 through the international set of differential cultivars of anthracnose and nine commercial cultivars, adapted to the Brazilian growing conditions and with potential ability to discriminate the variability within this race. To obtain information on the resistance inheritance related to nine commercial cultivars to six strains of race 65, these cultivars were crossed two by two in all possible combinations, resulting in 36 hybrids. Segregation in the F2 generations revealed that the resistance to each strain is conditioned by two independent genes with the same function, suggesting that they are duplicated genes, where the dominant allele promotes resistance. These results indicate that the specificity between host resistance genes and pathogen avirulence genes is not limited to races, it also occurs within strains of the same race. Further research may be carried out in order to establish if the alleles identified in these cultivars are different from those described in the literature.

  2. Determination of Cytochrome P450 2D6 (CYP2D6 Gene Copy Number by Real-Time Quantitative PCR

    Directory of Open Access Journals (Sweden)

    Laurent Bodin

    2005-01-01

    Full Text Available Gene dosage by real-time quantitative PCR has proved to be accurate for measuring gene copy number. The aim of this study was to apply this approach to the CYP2D6 gene to allow for rapid identification of poor and ultrarapid metabolizers (0, 1, or more than 2 gene copy number. Using the 2−ΔΔCt calculation method and a duplex reaction, the number of CYP2D6 gene copies was determined. Quantitative PCR was performed on 43 samples previously analyzed by Southern blotting and long PCR including 20 samples with a heterozygous deletion, 11 with normal copy number (2 copies, and 12 samples with duplicated genes. The average ratio ranged from 1.02 to 1.28, 1.85 to 2.21, and 2.55 to 3.30, respectively, for the samples with 1 copy, 2 copies, and 3 copies. This study shows that this method is sensitive enough to detect either a heterozygous gene deletion or duplication.

  3. Duplications of the neuropeptide receptor gene VIPR2 confer significant risk for schizophrenia.

    LENUS (Irish Health Repository)

    Vacic, Vladimir

    2011-03-24

    Rare copy number variants (CNVs) have a prominent role in the aetiology of schizophrenia and other neuropsychiatric disorders. Substantial risk for schizophrenia is conferred by large (>500-kilobase) CNVs at several loci, including microdeletions at 1q21.1 (ref. 2), 3q29 (ref. 3), 15q13.3 (ref. 2) and 22q11.2 (ref. 4) and microduplication at 16p11.2 (ref. 5). However, these CNVs collectively account for a small fraction (2-4%) of cases, and the relevant genes and neurobiological mechanisms are not well understood. Here we performed a large two-stage genome-wide scan of rare CNVs and report the significant association of copy number gains at chromosome 7q36.3 with schizophrenia. Microduplications with variable breakpoints occurred within a 362-kilobase region and were detected in 29 of 8,290 (0.35%) patients versus 2 of 7,431 (0.03%) controls in the combined sample. All duplications overlapped or were located within 89 kilobases upstream of the vasoactive intestinal peptide receptor gene VIPR2. VIPR2 transcription and cyclic-AMP signalling were significantly increased in cultured lymphocytes from patients with microduplications of 7q36.3. These findings implicate altered vasoactive intestinal peptide signalling in the pathogenesis of schizophrenia and indicate the VPAC2 receptor as a potential target for the development of new antipsychotic drugs.

  4. Duplication of pilus gene complexes of Haemophilus influenzae biogroup aegyptius.

    Science.gov (United States)

    Read, T D; Dowdell, M; Satola, S W; Farley, M M

    1996-11-01

    Brazilian purpuric fever (BPF) is a recently described pediatric septicemia caused by a strain of Haemophilus influenzae biogroup aegyptius. The pilus specified by this bacterium may be important in BPF pathogenesis, enhancing attachment to host tissue. Here, we report the cloning of two haf (for H. influenzae biogroup aegyptius fimbriae) gene clusters from a cosmid library of strain F3031. We sequenced a 6.8-kb segment of the haf1 cluster and identified five genes (hafA to hafE). The predicted protein products, HafA to HafD, are 72, 95, 98, and 90% similar, respectively, to HifA to HifD of the closely related H. influenzae type b pilus. Strikingly, the putative pilus adhesion, HifE, shares only 44% identity with HafE, suggesting that the proteins may differ in receptor specificity. Insertion of a mini-gammadelta transposon in the hafE gene eliminated hemadsorption. The nucleotide sequences of the haf1 and haf2 clusters are more than 99% identical. Using the recently published sequence of the H. influenzae Rd genome, we determined that the haf1 complex lies at a unique position in the chromosome between the pmbA gene and a hypothetical open reading frame, HI1153. The location of the haf2 cluster, inserted between the purE and pepN genes, is analogous to the hif genes on H. influenzae type b. BPF fimbrial phase switching appears to involve slip-strand mispairing of repeated dinucleotides in the pilus promoter. The BPF-associated H. influenzae biogroup aegyptius pilus system generally resembles other H. influenzae, but the possession of a second fimbrial gene cluster, which appears to have arisen by a recent duplication event, and the novel sequence of the HafE adhesin may be significant in the unusual pathogenesis of BPF.

  5. Comparative Inference of Duplicated Genes Produced by Polyploidization in Soybean Genome

    Directory of Open Access Journals (Sweden)

    Yanmei Yang

    2013-01-01

    Full Text Available Soybean (Glycine max is one of the most important crop plants for providing protein and oil. It is important to investigate soybean genome for its economic and scientific value. Polyploidy is a widespread and recursive phenomenon during plant evolution, and it could generate massive duplicated genes which is an important resource for genetic innovation. Improved sequence alignment criteria and statistical analysis are used to identify and characterize duplicated genes produced by polyploidization in soybean. Based on the collinearity method, duplicated genes by whole genome duplication account for 70.3% in soybean. From the statistical analysis of the molecular distances between duplicated genes, our study indicates that the whole genome duplication event occurred more than once in the genome evolution of soybean, which is often distributed near the ends of chromosomes.

  6. Evolution of Three Parent Genes and Their Retrogene Copies in Drosophila Species

    Directory of Open Access Journals (Sweden)

    Ryan S. O'Neill

    2013-01-01

    Full Text Available Retrogenes form a class of gene duplicate lacking the regulatory sequences found outside of the mRNA-coding regions of the parent gene. It is not clear how a retrogene’s lack of parental regulatory sequences affects the evolution of the gene pair. To explore the evolution of parent genes and retrogenes, we investigated three such gene pairs in the family Drosophilidae; in Drosophila melanogaster, these gene pairs are CG8331 and CG4960, CG17734 and CG11825, and Sep2 and Sep5. We investigated the embryonic expression patterns of these gene pairs across multiple Drosophila species. Expression patterns of the parent genes and their single copy orthologs are relatively conserved across species, whether or not a species has a retrogene copy, although there is some variation in CG8331 and CG17734. In contrast, expression patterns of the retrogene orthologs have diversified. We used the genome sequences of 20 Drosophila species to investigate coding sequence evolution. The coding sequences of the three gene pairs appear to be evolving predominantly under negative selection; however, the parent genes and retrogenes show some distinct differences in amino acid sequence. Therefore, in general, retrogene expression patterns and coding sequences are distinct compared to their parents and, in some cases, retrogene expression patterns diversify.

  7. The Evolutionary Relationship between Alternative Splicing and Gene Duplication

    Science.gov (United States)

    Iñiguez, Luis P.; Hernández, Georgina

    2017-01-01

    The protein diversity that exists today has resulted from various evolutionary processes. It is well known that gene duplication (GD) along with the accumulation of mutations are responsible, among other factors, for an increase in the number of different proteins. The gene structure in eukaryotes requires the removal of non-coding sequences, introns, to produce mature mRNAs. This process, known as cis-splicing, referred to here as splicing, is regulated by several factors which can lead to numerous splicing arrangements, commonly designated as alternative splicing (AS). AS, producing several transcripts isoforms form a single gene, also increases the protein diversity. However, the evolution and manner for increasing protein variation differs between AS and GD. An important question is how are patterns of AS affected after a GD event. Here, we review the current knowledge of AS and GD, focusing on their evolutionary relationship. These two processes are now considered the main contributors to the increasing protein diversity and therefore their relationship is a relevant, yet understudied, area of evolutionary study. PMID:28261262

  8. Copy number variations exploration of multiple genes in Graves' disease.

    Science.gov (United States)

    Song, Rong-Hua; Shao, Xiao-Qing; Li, Ling; Wang, Wen; Zhang, Jin-An

    2017-01-01

    Few previous published papers reported copy number variations of genes could affect the predisposition of Graves' disease (GD). Herein, the aim of this study was to explore the association between copy number variations (CNV) profile and GD. The preliminary copy number microarray used to screen copy number variant genes was performed in 6 GD patients. Five CNV candidate genes (CFH, CFHR1, KIAA0125, UGT2B15, and UGT2B17) were then validated in an independent set of samples (50 GD patients and 50 matched healthy ones) by the Accucopy assay method. The CNV of the other 2 genes TRY6 and CCL3L1 was investigated in 144 GD patients and 144 healthy volunteers by the definitive genotyping technique using the Taqman quantitative polymerase-chain-reaction (Taqman qPCR). TRY6 gene-associated single nucleotide polymorphism (SNP), rs13230029, was genotyped by the PCR-ligase detection reaction (LDR) in 675 GD patients and 898 healthy controls. There were no correlation of the gene copy number (GCN) of CFH, CFHR1, KIAA0125, UGT2B15, and UGT2B17 with GD. In comparison with that of controls, the GCN distribution of TRY6 and CCL3L1 in GD patients did not show significantly differ (P > 0.05). Furthermore, TRY6-related polymorphism (rs13230029) showed no difference between GD patients and controls. No correlation was found between CNV or SNP genotype and clinical phenotypes. Generally, there were no link of the copy numbers of several genes, including CFH, CFHR1, KIAA0125, UGT2B15, UGT2B17, TRY6, and CCL3L1 to GD. Our results clearly indicated that the copy number variations of multiple genes, namely CFH, CFHR1, KIAA0125, UGT2B15, UGT2B17, TRY6, and CCL3L1, were not associated with the development of GD.

  9. Evidence of neofunctionalization after the duplication of the highly conserved Polycomb group gene Caf1-55 in the obscura group of Drosophila.

    Science.gov (United States)

    Calvo-Martín, Juan M; Papaceit, Montserrat; Segarra, Carmen

    2017-01-17

    Drosophila CAF1-55 protein is a subunit of the Polycomb repressive complex PRC2 and other protein complexes. It is a multifunctional and evolutionarily conserved protein that participates in nucleosome assembly and remodelling, as well as in the epigenetic regulation of a large set of target genes. Here, we describe and analyze the duplication of Caf1-55 in the obscura group of Drosophila. Paralogs exhibited a strong asymmetry in evolutionary rates, which suggests that they have evolved according to a neofunctionalization process. During this process, the ancestral copy has been kept under steady purifying selection to retain the ancestral function and the derived copy (Caf1-55dup) that originated via a DNA-mediated duplication event ~18 Mya, has been under clear episodic selection. Different maximum likelihood approaches confirmed the action of positive selection, in contrast to relaxed selection, on Caf1-55dup after the duplication. This adaptive process has also taken place more recently during the divergence of D. subobscura and D. guanche. The possible association of this duplication with a previously detected acceleration in the evolutionary rate of three CAF1-55 partners in PRC2 complexes is discussed. Finally, the timing and functional consequences of the Caf1-55 duplication is compared to other duplications of Polycomb genes.

  10. Evolution of vertebrate central nervous system is accompanied by novel expression changes of duplicate genes.

    Science.gov (United States)

    Chen, Yuan; Ding, Yun; Zhang, Zuming; Wang, Wen; Chen, Jun-Yuan; Ueno, Naoto; Mao, Bingyu

    2011-12-20

    The evolution of the central nervous system (CNS) is one of the most striking changes during the transition from invertebrates to vertebrates. As a major source of genetic novelties, gene duplication might play an important role in the functional innovation of vertebrate CNS. In this study, we focused on a group of CNS-biased genes that duplicated during early vertebrate evolution. We investigated the tempo-spatial expression patterns of 33 duplicate gene families and their orthologs during the embryonic development of the vertebrate Xenopus laevis and the cephalochordate Brachiostoma belcheri. Almost all the identified duplicate genes are differentially expressed in the CNS in Xenopus embryos, and more than 50% and 30% duplicate genes are expressed in the telencephalon and mid-hindbrain boundary, respectively, which are mostly considered as two innovations in the vertebrate CNS. Interestingly, more than 50% of the amphioxus orthologs do not show apparent expression in the CNS in amphioxus embryos as detected by in situ hybridization, indicating that some of the vertebrate CNS-biased duplicate genes might arise from non-CNS genes in invertebrates. Our data accentuate the functional contribution of gene duplication in the CNS evolution of vertebrate and uncover an invertebrate non-CNS history for some vertebrate CNS-biased duplicate genes. Copyright © 2011. Published by Elsevier Ltd.

  11. Voltage-gated sodium channel gene repertoire of lampreys: gene duplications, tissue-specific expression and discovery of a long-lost gene.

    Science.gov (United States)

    Zakon, Harold H; Li, Weiming; Pillai, Nisha E; Tohari, Sumanty; Shingate, Prashant; Ren, Jianfeng; Venkatesh, Byrappa

    2017-09-27

    Studies of the voltage-gated sodium (Nav) channels of extant gnathostomes have made it possible to deduce that ancestral gnathostomes possessed four voltage-gated sodium channel genes derived from a single ancestral chordate gene following two rounds of genome duplication early in vertebrates. We investigated the Nav gene family in two species of lampreys (the Japanese lamprey Lethenteron japonicum and sea lamprey Petromyzon marinus) (jawless vertebrates-agnatha) and compared them with those of basal vertebrates to better understand the origin of Nav genes in vertebrates. We noted six Nav genes in both lamprey species, but orthology with gnathostome (jawed vertebrate) channels was inconclusive. Surprisingly, the Nav2 gene, ubiquitously found in invertebrates and believed to have been lost in vertebrates, is present in lampreys, elephant shark (Callorhinchus milii) and coelacanth (Latimeria chalumnae). Despite repeated duplication of the Nav1 family in vertebrates, Nav2 is only in single copy in those vertebrates in which it is retained, and was independently lost in ray-finned fishes and tetrapods. Of the other five Nav channel genes, most were expressed in brain, one in brain and heart, and one exclusively in skeletal muscle. Invertebrates do not express Nav channel genes in muscle. Thus, early in the vertebrate lineage Nav channels began to diversify and different genes began to express in heart and muscle. © 2017 The Author(s).

  12. Candidate gene copy number analysis by PCR and multicapillary electrophoresis.

    Science.gov (United States)

    Szantai, Eszter; Elek, Zsuzsanna; Guttman, András; Sasvari-Szekely, Maria

    2009-04-01

    Genetic polymorphisms are often considered as risk factors of complex diseases serving as valuable and easily detectable biomarkers, also stable during the whole lifespan. A novel type of genetic polymorphism has been identified just recently, referred to as gene copy number variation (CNV) or copy number polymorphism. CNV of glycogen synthase kinase 3 beta and its adjacent gene, Nr1i2 (pregnane X receptor isoform), has been reported to associate with bipolar depression. In our study we introduced multicapillary electrophoresis for gene copy number analysis as an affordable alternative to real-time PCR quantification with TaqMan gene probes. Our results show the reliability of the developed method based on conventional PCR followed by separation of products by multicapillary electrophoresis with quantitative evaluation. This method can be readily implemented for the analysis of candidate gene CNVs in high throughput clinical laboratories and also in personalized medicine care of depression-related risk factors.

  13. Duplication of OsHAP family genes and their association with heading date in rice.

    Science.gov (United States)

    Li, Qiuping; Yan, Wenhao; Chen, Huaxia; Tan, Cong; Han, Zhongmin; Yao, Wen; Li, Guangwei; Yuan, Mengqi; Xing, Yongzhong

    2016-03-01

    Heterotrimeric Heme Activator Protein (HAP) family genes are involved in the regulation of flowering in plants. It is not clear how many HAP genes regulate heading date in rice. In this study, we identified 35 HAP genes, including seven newly identified genes, and performed gene duplication and candidate gene-based association analyses. Analyses showed that segmental duplication and tandem duplication are the main mechanisms of HAP gene duplication. Expression profiling and functional identification indicated that duplication probably diversifies the functions of HAP genes. A nucleotide diversity analysis revealed that 13 HAP genes underwent selection. A candidate gene-based association analysis detected four HAP genes related to heading date. An investigation of transgenic plants or mutants of 23 HAP genes confirmed that overexpression of at least four genes delayed heading date under long-day conditions, including the previously cloned Ghd8/OsHAP3H. Our results indicate that the large number of HAP genes in rice was mainly produced by gene duplication, and a few HAP genes function to regulate heading date. Selection of HAP genes is probably caused by their diverse functions rather than regulation of heading.

  14. Integrated analysis of DNA copy number and gene expression microarray data using gene sets

    NARCIS (Netherlands)

    R.X. de Menezes (Renee); M. Boetzer (Marten); M. Sieswerda (Melle); G.J.B. van Ommen; J.M. Boer (Judith)

    2009-01-01

    textabstractBackground: Genes that play an important role in tumorigenesis are expected to show association between DNA copy number and RNA expression. Optimal power to find such associations can only be achieved if analysing copy number and gene expression jointly. Furthermore, some copy number

  15. Divergence of gene body DNA methylation and evolution of plant duplicate genes.

    Directory of Open Access Journals (Sweden)

    Jun Wang

    Full Text Available It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes.

  16. Phylogenetics of Lophotrochozoan bHLH Genes and the Evolution of Lineage-Specific Gene Duplicates

    Science.gov (United States)

    Bao, Yongbo

    2017-01-01

    The gain and loss of genes encoding transcription factors is of importance to understanding the evolution of gene regulatory complexity. The basic helix–loop–helix (bHLH) genes encode a large superfamily of transcription factors. We systematically classify the bHLH genes from five mollusc, two annelid and one brachiopod genomes, tracing the pattern of bHLH gene evolution across these poorly studied Phyla. In total, 56–88 bHLH genes were identified in each genome, with most identifiable as members of previously described bilaterian families, or of new families we define. Of such families only one, Mesp, appears lost by all these species. Additional duplications have also played a role in the evolution of the bHLH gene repertoire, with many new lophotrochozoan-, mollusc-, bivalve-, or gastropod-specific genes defined. Using a combination of transcriptome mining, RT-PCR, and in situ hybridization we compared the expression of several of these novel genes in tissues and embryos of the molluscs Crassostrea gigas and Patella vulgata, finding both conserved expression and evidence for neofunctionalization. We also map the positions of the genes across these genomes, identifying numerous gene linkages. Some reflect recent paralog divergence by tandem duplication, others are remnants of ancient tandem duplications dating to the lophotrochozoan or bilaterian common ancestors. These data are built into a model of the evolution of bHLH genes in molluscs, showing formidable evolutionary stasis at the family level but considerable within-family diversification by tandem gene duplication. PMID:28338988

  17. Recurrent deletions and reciprocal duplications of 10q11.21q11.23 including CHAT and SLC18A3 are likely mediated by complex low-copy repeats.

    Science.gov (United States)

    Stankiewicz, Paweł; Kulkarni, Shashikant; Dharmadhikari, Avinash V; Sampath, Srirangan; Bhatt, Samarth S; Shaikh, Tamim H; Xia, Zhilian; Pursley, Amber N; Cooper, M Lance; Shinawi, Marwan; Paciorkowski, Alex R; Grange, Dorothy K; Noetzel, Michael J; Saunders, Scott; Simons, Paul; Summar, Marshall; Lee, Brendan; Scaglia, Fernando; Fellmann, Florence; Martinet, Danielle; Beckmann, Jacques S; Asamoah, Alexander; Platky, Kathryn; Sparks, Susan; Martin, Ann S; Madan-Khetarpal, Suneeta; Hoover, Jacqueline; Medne, Livija; Bonnemann, Carsten G; Moeschler, John B; Vallee, Stephanie E; Parikh, Sumit; Irwin, Polly; Dalzell, Victoria P; Smith, Wendy E; Banks, Valerie C; Flannery, David B; Lovell, Carolyn M; Bellus, Gary A; Golden-Grant, Kathryn; Gorski, Jerome L; Kussmann, Jennifer L; McGregor, Tracy L; Hamid, Rizwan; Pfotenhauer, Jean; Ballif, Blake C; Shaw, Chad A; Kang, Sung-Hae L; Bacino, Carlos A; Patel, Ankita; Rosenfeld, Jill A; Cheung, Sau Wai; Shaffer, Lisa G

    2012-01-01

    We report 24 unrelated individuals with deletions and 17 additional cases with duplications at 10q11.21q21.1 identified by chromosomal microarray analysis. The rearrangements range in size from 0.3 to 12 Mb. Nineteen of the deletions and eight duplications are flanked by large, directly oriented segmental duplications of >98% sequence identity, suggesting that nonallelic homologous recombination (NAHR) caused these genomic rearrangements. Nine individuals with deletions and five with duplications have additional copy number changes. Detailed clinical evaluation of 20 patients with deletions revealed variable clinical features, with developmental delay (DD) and/or intellectual disability (ID) as the only features common to a majority of individuals. We suggest that some of the other features present in more than one patient with deletion, including hypotonia, sleep apnea, chronic constipation, gastroesophageal and vesicoureteral refluxes, epilepsy, ataxia, dysphagia, nystagmus, and ptosis may result from deletion of the CHAT gene, encoding choline acetyltransferase, and the SLC18A3 gene, mapping in the first intron of CHAT and encoding vesicular acetylcholine transporter. The phenotypic diversity and presence of the deletion in apparently normal carrier parents suggest that subjects carrying 10q11.21q11.23 deletions may exhibit variable phenotypic expressivity and incomplete penetrance influenced by additional genetic and nongenetic modifiers.

  18. Consensus properties and their large-scale applications for the gene duplication problem.

    Science.gov (United States)

    Moon, Jucheol; Lin, Harris T; Eulenstein, Oliver

    2016-06-01

    Solving the gene duplication problem is a classical approach for species tree inference from gene trees that are confounded by gene duplications. This problem takes a collection of gene trees and seeks a species tree that implies the minimum number of gene duplications. Wilkinson et al. posed the conjecture that the gene duplication problem satisfies the desirable Pareto property for clusters. That is, for every instance of the problem, all clusters that are commonly present in the input gene trees of this instance, called strict consensus, will also be found in every solution to this instance. We prove that this conjecture does not generally hold. Despite this negative result we show that the gene duplication problem satisfies a weaker version of the Pareto property where the strict consensus is found in at least one solution (rather than all solutions). This weaker property contributes to our design of an efficient scalable algorithm for the gene duplication problem. We demonstrate the performance of our algorithm in analyzing large-scale empirical datasets. Finally, we utilize the algorithm to evaluate the accuracy of standard heuristics for the gene duplication problem using simulated datasets.

  19. Buffering by gene duplicates: an analysis of molecular correlates and evolutionary conservation

    Directory of Open Access Journals (Sweden)

    Vogel Christine

    2008-12-01

    Full Text Available Abstract Background One mechanism to account for robustness against gene knockouts or knockdowns is through buffering by gene duplicates, but the extent and general correlates of this process in organisms is still a matter of debate. To reveal general trends of this process, we provide a comprehensive comparison of gene essentiality, duplication and buffering by duplicates across seven bacteria (Mycoplasma genitalium, Bacillus subtilis, Helicobacter pylori, Haemophilus influenzae, Mycobacterium tuberculosis, Pseudomonas aeruginosa, Escherichia coli, and four eukaryotes (Saccharomyces cerevisiae (yeast, Caenorhabditis elegans (worm, Drosophila melanogaster (fly, Mus musculus (mouse. Results In nine of the eleven organisms, duplicates significantly increase chances of survival upon gene deletion (P-value ≤ 0.05, but only by up to 13%. Given that duplicates make up to 80% of eukaryotic genomes, the small contribution is surprising and points to dominant roles of other buffering processes, such as alternative metabolic pathways. The buffering capacity of duplicates appears to be independent of the degree of gene essentiality and tends to be higher for genes with high expression levels. For example, buffering capacity increases to 23% amongst highly expressed genes in E. coli. Sequence similarity and the number of duplicates per gene are weak predictors of the duplicate's buffering capacity. In a case study we show that buffering gene duplicates in yeast and worm are somewhat more similar in their functions than non-buffering duplicates and have increased transcriptional and translational activity. Conclusion In sum, the extent of gene essentiality and buffering by duplicates is not conserved across organisms and does not correlate with the organisms' apparent complexity. This heterogeneity goes beyond what would be expected from differences in experimental approaches alone. Buffering by duplicates contributes to robustness in several organisms

  20. Duplication of the dystroglycan gene in most branches of teleost fish

    Directory of Open Access Journals (Sweden)

    Giardina Bruno

    2007-05-01

    Full Text Available Abstract Background The dystroglycan (DG complex is a major non-integrin cell adhesion system whose multiple biological roles involve, among others, skeletal muscle stability, embryonic development and synapse maturation. DG is composed of two subunits: α-DG, extracellular and highly glycosylated, and the transmembrane β-DG, linking the cytoskeleton to the surrounding basement membrane in a wide variety of tissues. A single copy of the DG gene (DAG1 has been identified so far in humans and other mammals, encoding for a precursor protein which is post-translationally cleaved to liberate the two DG subunits. Similarly, D. rerio (zebrafish seems to have a single copy of DAG1, whose removal was shown to cause a severe dystrophic phenotype in adult animals, although it is known that during evolution, due to a whole genome duplication (WGD event, many teleost fish acquired multiple copies of several genes (paralogues. Results Data mining of pufferfish (T. nigroviridis and T. rubripes and other teleost fish (O. latipes and G. aculeatus available nucleotide sequences revealed the presence of two functional paralogous DG sequences. RT-PCR analysis proved that both the DG sequences are transcribed in T. nigroviridis. One of the two DG sequences harbours an additional mini-intronic sequence, 137 bp long, interrupting the uncomplicated exon-intron-exon pattern displayed by DAG1 in mammals and D. rerio. A similar scenario emerged also in D. labrax (sea bass, from whose genome we have cloned and sequenced a new DG sequence that also harbours a shorter additional intronic sequence of 116 bp. Western blot analysis confirmed the presence of DG protein products in all the species analysed including two teleost Antarctic species (T. bernacchii and C. hamatus. Conclusion Our evolutionary analysis has shown that the whole-genome duplication event in the Class Actinopterygii (ray-finned fish involved also DAG1. We unravelled new important molecular genetic details

  1. Subfunctionalization of duplicated zebrafish pax6 genes by cis-regulatory divergence

    National Research Council Canada - National Science Library

    Kleinjan, Dirk A; Bancewicz, Ruth M; Gautier, Philippe; Dahm, Ralf; Schonthaler, Helia B; Damante, Giuseppe; Seawright, Anne; Hever, Ann M; Yeyati, Patricia L; van Heyningen, Veronica; Coutinho, Pedro

    2008-01-01

    Gene duplication is a major driver of evolutionary divergence. In most vertebrates a single PAX6 gene encodes a transcription factor required for eye, brain, olfactory system, and pancreas development...

  2. Copy number variants in patients with intellectual disability affect the regulation of ARX transcription factor gene.

    Science.gov (United States)

    Ishibashi, Minaka; Manning, Elizabeth; Shoubridge, Cheryl; Krecsmarik, Monika; Hawkins, Thomas A; Giacomotto, Jean; Zhao, Ting; Mueller, Thomas; Bader, Patricia I; Cheung, Sau W; Stankiewicz, Pawel; Bain, Nicole L; Hackett, Anna; Reddy, Chilamakuri C S; Mechaly, Alejandro S; Peers, Bernard; Wilson, Stephen W; Lenhard, Boris; Bally-Cuif, Laure; Gecz, Jozef; Becker, Thomas S; Rinkwitz, Silke

    2015-11-01

    Protein-coding mutations in the transcription factor-encoding gene ARX cause various forms of intellectual disability (ID) and epilepsy. In contrast, variations in surrounding non-coding sequences are correlated with milder forms of non-syndromic ID and autism and had suggested the importance of ARX gene regulation in the etiology of these disorders. We compile data on several novel and some already identified patients with or without ID that carry duplications of ARX genomic region and consider likely genetic mechanisms underlying the neurodevelopmental defects. We establish the long-range regulatory domain of ARX and identify its brain region-specific autoregulation. We conclude that neurodevelopmental disturbances in the patients may not simply arise from increased dosage due to ARX duplication. This is further exemplified by a small duplication involving a non-functional ARX copy, but with duplicated enhancers. ARX enhancers are located within a 504-kb region and regulate expression specifically in the forebrain in developing and adult zebrafish. Transgenic enhancer-reporter lines were used as in vivo tools to delineate a brain region-specific negative and positive autoregulation of ARX. We find autorepression of ARX in the telencephalon and autoactivation in the ventral thalamus. Fluorescently labeled brain regions in the transgenic lines facilitated the identification of neuronal outgrowth and pathfinding disturbances in the ventral thalamus and telencephalon that occur when arxa dosage is diminished. In summary, we have established a model for how breakpoints in long-range gene regulation alter the expression levels of a target gene brain region-specifically, and how this can cause subtle neuronal phenotypes relating to the etiology of associated neuropsychiatric disease.

  3. Copy number variations of the ATP-binding cassette transporter ABCC6 gene and its pseudogenes

    Directory of Open Access Journals (Sweden)

    Kringen Marianne K

    2012-08-01

    Full Text Available Abstract Background The ATP-binding cassette transporter ABCC6 gene is located on chromosome 16 between its two pseudogenes (ABCC6P1 and ABCC6P2. Previously, we have shown that ABCC6P1 is transcribed and affects ABCC6 at the transcriptional level. In this study we aimed to determine copy number variations of ABCC6, ABCC6P1 and ABCC6P2 in different populations. Moreover, we sought to study the transcription pattern of ABCC6 and ABCC6 pseudogenes in 39 different human tissues. Findings Genomic DNA from healthy individuals from five populations, Chinese (n = 24, Middle East (n = 20, Mexicans (n = 24, Caucasians (n = 50 and Africans (n = 24, were examined for copy number variations of ABCC6 and its pseudogenes by pyrosequencing and quantitative PCR. Copy number variation of ABCC6 was very rare (2/142; 1.4%. However, one or three copies of ABCC6P1 were relatively common (3% and 8%, respectively. Only one person had a single copy of ABCC6P2 while none had three copies. In Chinese, deletions or duplications of ABCC6P1 were more frequent than in any other population (9/24; 37.5%. The transcription pattern of ABCC6P2 was highly similar to ABCC6 and ABCC6P1, with highest transcription in liver and kidney. Interestingly, the total transcription level of pseudogenes, ABCC6P1 + ABCC6P2, was higher than ABCC6 in most tissues, including liver and kidney. Conclusions Copy number variations of the ABCC6 pseudogenes are quite common, especially in populations of Chinese ancestry. The expression pattern of ABCC6P2 in 39 human tissues was highly similar to that of ABCC6 and ABCC6P1 suggesting similar regulatory mechanisms for ABCC6 and its pseudogenes.

  4. Mosaic supernumerary inv dup(15) chromosome with four copies of the P gene in a boy with pigmentary dysplasia.

    Science.gov (United States)

    Akahoshi, Keiko; Spritz, Richard A; Fukai, Kazuyoshi; Mitsui, Norimasa; Matsushima, Kazushige; Ohashi, Hirofumi

    2004-04-30

    Association of the pink-eye-dilution gene (P) with hypopigmentation is seen in patients who have oculocutaneous albinism type 2 (OCA2) and Prader-Willi syndrome (PWS) or Angelman syndrome (AS). However, it remains unknown whether duplication or amplification of the P gene causes hyperpigmentation. We previously reported a woman who had hyperpigmentation with a duplication of the proximal part of 15q, including the P gene. Here, we describe an additional patient with mosaicism of inv dup(15) and clinical manifestations of severe psychmoter retardation, epilepsy, and pigmentary dysplasia showing mottled and linear patterns of hyperpigmentation. His karyotype was 47,XY,+idic(15)(pter-->q14::q14-->pter)[38]/46,XY[12] de novo. Chromosomal fluorescence in situ hybridization (FISH) showed six copies of the P gene. Therefore, his cutaneous mosaicism might be caused by the presence of both normal and hyperpigmented skin due to multicopies of the P gene.

  5. Copy number change: evolving views on gene amplification.

    Science.gov (United States)

    Elliott, Kathryn T; Cuff, Laura E; Neidle, Ellen L

    2013-07-01

    The rapid pace of genomic sequence analysis is increasing the awareness of intrinsically dynamic genetic landscapes. Gene duplication and amplification (GDA) contribute to adaptation and evolution by allowing DNA regions to expand and contract in an accordion-like fashion. This process affects diverse aspects of bacterial infection, including antibiotic resistance and host-pathogen interactions. In this review, microbial GDA is discussed, primarily using recent bacterial examples that demonstrate medical and evolutionary consequences. Interplay between GDA and horizontal gene transfer further impact evolutionary trajectories. Complementing the discovery of gene duplication in clinical and environmental settings, experimental evolution provides a powerful method to document genetic change over time. New methods for GDA detection highlight both its importance and its potential application for genetic engineering, synthetic biology and biotechnology.

  6. Reconciling gene and genome duplication events: using multiple nuclear gene families to infer the phylogeny of the aquatic plant family Pontederiaceae.

    Science.gov (United States)

    Ness, Rob W; Graham, Sean W; Barrett, Spencer C H

    2011-11-01

    Most plant phylogenetic inference has used DNA sequence data from the plastid genome. This genome represents a single genealogical sample with no recombination among genes, potentially limiting the resolution of evolutionary relationships in some contexts. In contrast, nuclear DNA is inherently more difficult to employ for phylogeny reconstruction because major mutational events in the genome, including polyploidization, gene duplication, and gene extinction can result in homologous gene copies that are difficult to identify as orthologs or paralogs. Gene tree parsimony (GTP) can be used to infer the rooted species tree by fitting gene genealogies to species trees while simultaneously minimizing the estimated number of duplications needed to reconcile conflicts among them. Here, we use GTP for five nuclear gene families and a previously published plastid data set to reconstruct the phylogenetic backbone of the aquatic plant family Pontederiaceae. Plastid-based phylogenetic studies strongly supported extensive paraphyly of Eichhornia (one of the four major genera) but also depicted considerable ambiguity concerning the true root placement for the family. Our results indicate that species trees inferred from the nuclear genes (alone and in combination with the plastid data) are highly congruent with gene trees inferred from plastid data alone. Consideration of optimal and suboptimal gene tree reconciliations place the root of the family at (or near) a branch leading to the rare and locally restricted E. meyeri. We also explore methods to incorporate uncertainty in individual gene trees during reconciliation by considering their individual bootstrap profiles and relate inferred excesses of gene duplication events on individual branches to whole-genome duplication events inferred for the same branches. Our study improves understanding of the phylogenetic history of Pontederiaceae and also demonstrates the utility of GTP for phylogenetic analysis.

  7. Multiple recurrent de novo copy number variations (CNVs), including duplications of the 7q11.23 Williams-Beuren syndrome region, are strongly associated with autism

    Science.gov (United States)

    Sanders, Stephan J.; Ercan-Sencicek, A. Gulhan; Hus, Vanessa; Luo, Rui; Murtha, Michael T.; Moreno-De-Luca, Daniel; Chu, Su H.; Moreau, Michael P.; Gupta, Abha R.; Thomson, Susanne A.; Mason, Christopher E.; Bilguvar, Kaya; Celestino-Soper, Patricia B. S.; Choi, Murim; Crawford, Emily L.; Davis, Lea; Wright, Nicole R. Davis; Dhodapkar, Rahul M.; DiCola, Michael; DiLullo, Nicholas M.; Fernandez, Thomas V.; Fielding-Singh, Vikram; Fishman, Daniel O.; Frahm, Stephanie; Garagaloyan, Rouben; Goh, Gerald S.; Kammela, Sindhuja; Klei, Lambertus; Lowe, Jennifer K.; Lund, Sabata C.; McGrew, Anna D.; Meyer, Kyle A.; Moffat, William J.; Murdoch, John D.; O'Roak, Brian J.; Ober, Gordon T.; Pottenger, Rebecca S.; Raubeson, Melanie J.; Song, Youeun; Wang, Qi; Yaspan, Brian L.; Yu, Timothy W.; Yurkiewicz, Ilana R.; Beaudet, Arthur L.; Cantor, Rita M.; Curland, Martin; Grice, Dorothy E.; Günel, Murat; Lifton, Richard P.; Mane, Shrikant M.; Martin, Donna M.; Shaw, Chad A.; Sheldon, Michael; Tischfield, Jay A.; Walsh, Christopher A.; Morrow, Eric M.; Ledbetter, David H.; Fombonne, Eric; Lord, Catherine; Martin, Christa Lese; Brooks, Andrew I.; Sutcliffe, James S.; Cook, Edwin H.; Geschwind, Daniel; Roeder, Kathryn; Devlin, Bernie; State, Matthew W.

    2014-01-01

    Summary Given prior evidence for the contribution of rare copy number variations (CNVs) to autism spectrum disorders (ASD), we studied these events in 4,457 individuals from 1,174 simplex families, composed of parents, a proband and, in most kindreds, an unaffected sibling. We find significant association of ASD with de novo duplications of 7q11.23, where the reciprocal deletion causes Williams-Beuren syndrome, featuring a highly social personality. We identify rare recurrent de novo CNVs at five additional regions including two novel ASD loci, 16p13.2 (including the genes USP7 and C16orf72) and Cadherin13, and implement a rigorous new approach to evaluating the statistical significance of these observations. Overall, we find large de novo CNVs carry substantial risk (OR=3.55; CI =2.16-7.46, p=6.9 × 10−6); estimate the presence of 130-234 distinct ASD-related CNV intervals across the genome; and, based on data from multiple studies, present compelling evidence for the association of rare de novo events at 7q11.23, 15q11.2-13.1, 16p11.2, and Neurexin1. PMID:21658581

  8. The evolutionary fate of alternatively spliced homologous exons after gene duplication.

    Science.gov (United States)

    Abascal, Federico; Tress, Michael L; Valencia, Alfonso

    2015-04-29

    Alternative splicing and gene duplication are the two main processes responsible for expanding protein functional diversity. Although gene duplication can generate new genes and alternative splicing can introduce variation through alternative gene products, the interplay between the two processes is complex and poorly understood. Here, we have carried out a study of the evolution of alternatively spliced exons after gene duplication to better understand the interaction between the two processes. We created a manually curated set of 97 human genes with mutually exclusively spliced homologous exons and analyzed the evolution of these exons across five distantly related vertebrates (lamprey, spotted gar, zebrafish, fugu, and coelacanth). Most of these exons had an ancient origin (more than 400 Ma). We found examples supporting two extreme evolutionary models for the behaviour of homologous axons after gene duplication. We observed 11 events in which gene duplication was accompanied by splice isoform separation, that is, each paralog specifically conserved just one distinct ancestral homologous exon. At other extreme, we identified genes in which the homologous exons were always conserved within paralogs, suggesting that the alternative splicing event cannot easily be separated from the function in these genes. That many homologous exons fall in between these two extremes highlights the diversity of biological systems and suggests that the subtle balance between alternative splicing and gene duplication is adjusted to the specific cellular context of each gene. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  9. Gene duplication and divergence affecting drug content in Cannabis sativa.

    Science.gov (United States)

    Weiblen, George D; Wenger, Jonathan P; Craft, Kathleen J; ElSohly, Mahmoud A; Mehmedic, Zlatko; Treiber, Erin L; Marks, M David

    2015-12-01

    Cannabis sativa is an economically important source of durable fibers, nutritious seeds, and psychoactive drugs but few economic plants are so poorly understood genetically. Marijuana and hemp were crossed to evaluate competing models of cannabinoid inheritance and to explain the predominance of tetrahydrocannabinolic acid (THCA) in marijuana compared with cannabidiolic acid (CBDA) in hemp. Individuals in the resulting F2 population were assessed for differential expression of cannabinoid synthase genes and were used in linkage mapping. Genetic markers associated with divergent cannabinoid phenotypes were identified. Although phenotypic segregation and a major quantitative trait locus (QTL) for the THCA/CBDA ratio were consistent with a simple model of codominant alleles at a single locus, the diversity of THCA and CBDA synthase sequences observed in the mapping population, the position of enzyme coding loci on the map, and patterns of expression suggest multiple linked loci. Phylogenetic analysis further suggests a history of duplication and divergence affecting drug content. Marijuana is distinguished from hemp by a nonfunctional CBDA synthase that appears to have been positively selected to enhance psychoactivity. An unlinked QTL for cannabinoid quantity may also have played a role in the recent escalation of drug potency.

  10. Gene duplications and losses among vertebrate deoxyribonucleoside kinases of the non-TK1 Family

    DEFF Research Database (Denmark)

    Mutahir, Zeeshan; Christiansen, Louise Slot; Clausen, Anders R.;

    2016-01-01

    , among vertebrates only four mammalian dNKs have been studied for their substrate specificity and kinetic properties. However, some vertebrates, such as fish, frogs, and birds, apparently possess a duplicated homolog of deoxycytidine kinase (dCK). In this study, we characterized a family of d......CK/deoxyguanosine kinase (dGK)-like enzymes from a frog Xenopus laevis and a bird Gallus gallus. We showed that X. laevis has a duplicated dCK gene and a dGK gene, whereas G. gallus has a duplicated dCK gene but has lost the dGK gene. We cloned, expressed, purified, and subsequently determined the kinetic parameters...

  11. A new resource for characterizing X-linked genes in Drosophila melanogaster: systematic coverage and subdivision of the X chromosome with nested, Y-linked duplications.

    Science.gov (United States)

    Cook, R Kimberley; Deal, Megan E; Deal, Jennifer A; Garton, Russell D; Brown, C Adam; Ward, Megan E; Andrade, Rachel S; Spana, Eric P; Kaufman, Thomas C; Cook, Kevin R

    2010-12-01

    Interchromosomal duplications are especially important for the study of X-linked genes. Males inheriting a mutation in a vital X-linked gene cannot survive unless there is a wild-type copy of the gene duplicated elsewhere in the genome. Rescuing the lethality of an X-linked mutation with a duplication allows the mutation to be used experimentally in complementation tests and other genetic crosses and it maps the mutated gene to a defined chromosomal region. Duplications can also be used to screen for dosage-dependent enhancers and suppressors of mutant phenotypes as a way to identify genes involved in the same biological process. We describe an ongoing project in Drosophila melanogaster to generate comprehensive coverage and extensive breakpoint subdivision of the X chromosome with megabase-scale X segments borne on Y chromosomes. The in vivo method involves the creation of X inversions on attached-XY chromosomes by FLP-FRT site-specific recombination technology followed by irradiation to induce large internal X deletions. The resulting chromosomes consist of the X tip, a medial X segment placed near the tip by an inversion, and a full Y. A nested set of medial duplicated segments is derived from each inversion precursor. We have constructed a set of inversions on attached-XY chromosomes that enable us to isolate nested duplicated segments from all X regions. To date, our screens have provided a minimum of 78% X coverage with duplication breakpoints spaced a median of nine genes apart. These duplication chromosomes will be valuable resources for rescuing and mapping X-linked mutations and identifying dosage-dependent modifiers of mutant phenotypes.

  12. Tandem gene arrays in Trypanosoma brucei: Comparative phylogenomic analysis of duplicate sequence variation

    Directory of Open Access Journals (Sweden)

    Jackson Andrew P

    2007-04-01

    Full Text Available Abstract Background The genome sequence of the protistan parasite Trypanosoma brucei contains many tandem gene arrays. Gene duplicates are created through tandem duplication and are expressed through polycistronic transcription, suggesting that the primary purpose of long, tandem arrays is to increase gene dosage in an environment where individual gene promoters are absent. This report presents the first account of the tandem gene arrays in the T. brucei genome, employing several related genome sequences to establish how variation is created and removed. Results A systematic survey of tandem gene arrays showed that substantial sequence variation existed across the genome; variation from different regions of an array often produced inconsistent phylogenetic affinities. Phylogenetic relationships of gene duplicates were consistent with concerted evolution being a widespread homogenising force. However, tandem duplicates were not usually identical; therefore, any homogenising effect was coincident with divergence among duplicates. Allelic gene conversion was detected using various criteria and was apparently able to both remove and introduce sequence variation. Tandem arrays containing structural heterogeneity demonstrated how sequence homogenisation and differentiation can occur within a single locus. Conclusion The use of multiple genome sequences in a comparative analysis of tandem gene arrays identified substantial sequence variation among gene duplicates. The distribution of sequence variation is determined by a dynamic balance of conservative and innovative evolutionary forces. Gene trees from various species showed that intraspecific duplicates evolve in concert, perhaps through frequent gene conversion, although this does not prevent sequence divergence, especially where structural heterogeneity physically separates a duplicate from its neighbours. In describing dynamics of sequence variation that have consequences beyond gene dosage, this

  13. Effect of Incomplete Lineage Sorting On Tree-Reconciliation-Based Inference of Gene Duplication.

    Science.gov (United States)

    Zheng, Yu; Zhang, Louxin

    2014-01-01

    In the tree reconciliation approach to infer the duplication history of a gene family, the gene (family) tree is compared to the corresponding species tree. Incomplete lineage sorting (ILS) gives rise to stochastic variation in the topology of a gene tree and hence likely introduces false duplication events when a tree reconciliation method is used. We quantify the effect of ILS on gene duplication inference in a species tree in terms of the expected number of false duplication events inferred from reconciling a random gene tree, which occurs with a probability predicted in coalescent theory, and the species tree. We computationally examine the relationship between the effect of ILS on duplication inference in a species tree and its topological parameters. Our findings suggest that ILS may cause non-negligible bias on duplication inference, particularly on an asymmetric species tree. Hence, when gene duplication is inferred via tree reconciliation or any other approach that takes gene tree topology into account, the ILS-induced bias should be examined cautiously.

  14. Pinda: a web service for detection and analysis of intraspecies gene duplication events.

    Science.gov (United States)

    Kontopoulos, Dimitrios-Georgios; Glykos, Nicholas M

    2013-09-01

    We present Pinda, a Web service for the detection and analysis of possible duplications of a given protein or DNA sequence within a source species. Pinda fully automates the whole gene duplication detection procedure, from performing the initial similarity searches, to generating the multiple sequence alignments and the corresponding phylogenetic trees, to bootstrapping the trees and producing a Z-score-based list of duplication candidates for the input sequence. Pinda has been cross-validated using an extensive set of known and bibliographically characterized duplication events. The service facilitates the automatic and dependable identification of gene duplication events, using some of the most successful bioinformatics software to perform an extensive analysis protocol. Pinda will prove of use for the analysis of newly discovered genes and proteins, thus also assisting the study of recently sequenced genomes. The service's location is http://orion.mbg.duth.gr/Pinda. The source code is freely available via https://github.com/dgkontopoulos/Pinda/.

  15. Comparative study of human mitochondrial proteome reveals extensive protein subcellular relocalization after gene duplications

    Directory of Open Access Journals (Sweden)

    Huang Yong

    2009-11-01

    Full Text Available Abstract Background Gene and genome duplication is the principle creative force in evolution. Recently, protein subcellular relocalization, or neolocalization was proposed as one of the mechanisms responsible for the retention of duplicated genes. This hypothesis received support from the analysis of yeast genomes, but has not been tested thoroughly on animal genomes. In order to evaluate the importance of subcellular relocalizations for retention of duplicated genes in animal genomes, we systematically analyzed nuclear encoded mitochondrial proteins in the human genome by reconstructing phylogenies of mitochondrial multigene families. Results The 456 human mitochondrial proteins selected for this study were clustered into 305 gene families including 92 multigene families. Among the multigene families, 59 (64% consisted of both mitochondrial and cytosolic (non-mitochondrial proteins (mt-cy families while the remaining 33 (36% were composed of mitochondrial proteins (mt-mt families. Phylogenetic analyses of mt-cy families revealed three different scenarios of their neolocalization following gene duplication: 1 relocalization from mitochondria to cytosol, 2 from cytosol to mitochondria and 3 multiple subcellular relocalizations. The neolocalizations were most commonly enabled by the gain or loss of N-terminal mitochondrial targeting signals. The majority of detected subcellular relocalization events occurred early in animal evolution, preceding the evolution of tetrapods. Mt-mt protein families showed a somewhat different pattern, where gene duplication occurred more evenly in time. However, for both types of protein families, most duplication events appear to roughly coincide with two rounds of genome duplications early in vertebrate evolution. Finally, we evaluated the effects of inaccurate and incomplete annotation of mitochondrial proteins and found that our conclusion of the importance of subcellular relocalization after gene duplication on

  16. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution.

    Science.gov (United States)

    Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A

    2015-06-08

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  17. Extensive local gene duplication and functional divergence among paralogs in Atlantic salmon.

    Science.gov (United States)

    Warren, Ian A; Ciborowski, Kate L; Casadei, Elisa; Hazlerigg, David G; Martin, Sam; Jordan, William C; Sumner, Seirian

    2014-06-19

    Many organisms can generate alternative phenotypes from the same genome, enabling individuals to exploit diverse and variable environments. A prevailing hypothesis is that such adaptation has been favored by gene duplication events, which generate redundant genomic material that may evolve divergent functions. Vertebrate examples of recent whole-genome duplications are sparse although one example is the salmonids, which have undergone a whole-genome duplication event within the last 100 Myr. The life-cycle of the Atlantic salmon, Salmo salar, depends on the ability to produce alternating phenotypes from the same genome, to facilitate migration and maintain its anadromous life history. Here, we investigate the hypothesis that genome-wide and local gene duplication events have contributed to the salmonid adaptation. We used high-throughput sequencing to characterize the transcriptomes of three key organs involved in regulating migration in S. salar: Brain, pituitary, and olfactory epithelium. We identified over 10,000 undescribed S. salar sequences and designed an analytic workflow to distinguish between paralogs originating from local gene duplication events or from whole-genome duplication events. These data reveal that substantial local gene duplications took place shortly after the whole-genome duplication event. Many of the identified paralog pairs have either diverged in function or become noncoding. Future functional genomics studies will reveal to what extent this rich source of divergence in genetic sequence is likely to have facilitated the evolution of extreme phenotypic plasticity required for an anadromous life-cycle.

  18. Investigation of modifier genes within copy number variations in Rett syndrome.

    Science.gov (United States)

    Artuso, Rosangela; Papa, Filomena T; Grillo, Elisa; Mucciolo, Mafalda; Yasui, Dag H; Dunaway, Keith W; Disciglio, Vittoria; Mencarelli, Maria A; Pollazzon, Marzia; Zappella, Michele; Hayek, Giuseppe; Mari, Francesca; Renieri, Alessandra; Lasalle, Janine M; Ariani, Francesca

    2011-07-01

    MECP2 mutations are responsible for two different phenotypes in females, classical Rett syndrome and the milder Zappella variant (Z-RTT). We investigated whether copy number variants (CNVs) may modulate the phenotype by comparison of array-CGH data from two discordant pairs of sisters and four additional discordant pairs of unrelated girls matched by mutation type. We also searched for potential MeCP2 targets within CNVs by chromatin immunopreceipitation microarray (ChIP-chip) analysis. We did not identify one major common gene/region, suggesting that modifiers may be complex and variable between cases. However, we detected CNVs correlating with disease severity that contain candidate modifiers. CROCC (1p36.13) is a potential MeCP2 target, in which a duplication in a Z-RTT and a deletion in a classic patient were observed. CROCC encodes a structural component of ciliary motility that is required for correct brain development. CFHR1 and CFHR3, on 1q31.3, may be involved in the regulation of complement during synapse elimination, and were found to be deleted in a Z-RTT but duplicated in two classic patients. The duplication of 10q11.22, present in two Z-RTT patients, includes GPRIN2, a regulator of neurite outgrowth and PPYR1, involved in energy homeostasis. Functional analyses are necessary to confirm candidates and to define targets for future therapies.

  19. Interlocus gene conversion explains at least 2.7% of single nucleotide variants in human segmental duplications.

    Science.gov (United States)

    Dumont, Beth L

    2015-06-16

    Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped overall patterns of segregating variation in multi-copy regions of the human genome remains unknown. One expected manifestation of IGC in population genomic data is the presence of one-to-one paralogous SNPs that segregate identical alleles. Here, I use SNP genotype calls from the low-coverage phase 3 release of the 1000 Genomes Project to identify 15,790 parallel, shared SNPs in duplicated regions of the human genome. My approach for identifying these sites accounts for the potential redundancy of short read mapping in multi-copy genomic regions, thereby effectively eliminating false positive SNP calls arising from paralogous sequence variation. I demonstrate that independent mutation events to identical nucleotides at paralogous sites are not a significant source of shared polymorphisms in the human genome, consistent with the interpretation that these sites are the outcome of historical IGC events. These putative signals of IGC are enriched in genomic contexts previously associated with non-allelic homologous recombination, including clear signals in gene families that form tandem intra-chromosomal clusters. Taken together, my analyses implicate IGC, not point mutation, as the mechanism generating at least 2.7% of single nucleotide variants in duplicated regions of the human genome.

  20. Distinct Defects in Spine Formation or Pruning in Two Gene Duplication Mouse Models of Autism.

    Science.gov (United States)

    Wang, Miao; Li, Huiping; Takumi, Toru; Qiu, Zilong; Xu, Xiu; Yu, Xiang; Bian, Wen-Jie

    2017-04-01

    Autism spectrum disorder (ASD) encompasses a complex set of developmental neurological disorders, characterized by deficits in social communication and excessive repetitive behaviors. In recent years, ASD is increasingly being considered as a disease of the synapse. One main type of genetic aberration leading to ASD is gene duplication, and several mouse models have been generated mimicking these mutations. Here, we studied the effects of MECP2 duplication and human chromosome 15q11-13 duplication on synaptic development and neural circuit wiring in the mouse sensory cortices. We showed that mice carrying MECP2 duplication had specific defects in spine pruning, while the 15q11-13 duplication mouse model had impaired spine formation. Our results demonstrate that spine pathology varies significantly between autism models and that distinct aspects of neural circuit development may be targeted in different ASD mutations. Our results further underscore the importance of gene dosage in normal development and function of the brain.

  1. Increased RPA1 gene dosage affects genomic stability potentially contributing to 17p13.3 duplication syndrome.

    Directory of Open Access Journals (Sweden)

    Emily Outwin

    2011-08-01

    Full Text Available A novel microduplication syndrome involving various-sized contiguous duplications in 17p13.3 has recently been described, suggesting that increased copy number of genes in 17p13.3, particularly PAFAH1B1, is associated with clinical features including facial dysmorphism, developmental delay, and autism spectrum disorder. We have previously shown that patient-derived cell lines from individuals with haploinsufficiency of RPA1, a gene within 17p13.3, exhibit an impaired ATR-dependent DNA damage response (DDR. Here, we show that cell lines from patients with duplications specifically incorporating RPA1 exhibit a different although characteristic spectrum of DDR defects including abnormal S phase distribution, attenuated DNA double strand break (DSB-induced RAD51 chromatin retention, elevated genomic instability, and increased sensitivity to DNA damaging agents. Using controlled conditional over-expression of RPA1 in a human model cell system, we also see attenuated DSB-induced RAD51 chromatin retention. Furthermore, we find that transient over-expression of RPA1 can impact on homologous recombination (HR pathways following DSB formation, favouring engagement in aberrant forms of recombination and repair. Our data identifies unanticipated defects in the DDR associated with duplications in 17p13.3 in humans involving modest RPA1 over-expression.

  2. Tubulin evolution in insects: gene duplication and subfunctionalization provide specialized isoforms in a functionally constrained gene family

    Directory of Open Access Journals (Sweden)

    Gadagkar Sudhindra R

    2010-04-01

    Full Text Available Abstract Background The completion of 19 insect genome sequencing projects spanning six insect orders provides the opportunity to investigate the evolution of important gene families, here tubulins. Tubulins are a family of eukaryotic structural genes that form microtubules, fundamental components of the cytoskeleton that mediate cell division, shape, motility, and intracellular trafficking. Previous in vivo studies in Drosophila find a stringent relationship between tubulin structure and function; small, biochemically similar changes in the major alpha 1 or testis-specific beta 2 tubulin protein render each unable to generate a motile spermtail axoneme. This has evolutionary implications, not a single non-synonymous substitution is found in beta 2 among 17 species of Drosophila and Hirtodrosophila flies spanning 60 Myr of evolution. This raises an important question, How do tubulins evolve while maintaining their function? To answer, we use molecular evolutionary analyses to characterize the evolution of insect tubulins. Results Sixty-six alpha tubulins and eighty-six beta tubulin gene copies were retrieved and subjected to molecular evolutionary analyses. Four ancient clades of alpha and beta tubulins are found in insects, a major isoform clade (alpha 1, beta 1 and three minor, tissue-specific clades (alpha 2-4, beta 2-4. Based on a Homarus americanus (lobster outgroup, these were generated through gene duplication events on major beta and alpha tubulin ancestors, followed by subfunctionalization in expression domain. Strong purifying selection acts on all tubulins, yet maximum pairwise amino acid distances between tubulin paralogs are large (0.464 substitutions/site beta tubulins, 0.707 alpha tubulins. Conversely orthologs, with the exception of reproductive tissue isoforms, show little sequence variation except in the last 15 carboxy terminus tail (CTT residues, which serve as sites for post-translational modifications (PTMs and interactions

  3. Gene duplication and divergence of long wavelength-sensitive opsin genes in the guppy, Poecilia reticulata.

    Science.gov (United States)

    Watson, Corey T; Gray, Suzanne M; Hoffmann, Margarete; Lubieniecki, Krzysztof P; Joy, Jeffrey B; Sandkam, Ben A; Weigel, Detlef; Loew, Ellis; Dreyer, Christine; Davidson, William S; Breden, Felix

    2011-02-01

    Female preference for male orange coloration in the genus Poecilia suggests a role for duplicated long wavelength-sensitive (LWS) opsin genes in facilitating behaviors related to mate choice in these species. Previous work has shown that LWS gene duplication in this genus has resulted in expansion of long wavelength visual capacity as determined by microspectrophotometry (MSP). However, the relationship between LWS genomic repertoires and expression of LWS retinal cone classes within a given species is unclear. Our previous study in the related species, Xiphophorus helleri, was the first characterization of the complete LWS opsin genomic repertoire in conjunction with MSP expression data in the family Poeciliidae, and revealed the presence of four LWS loci and two distinct LWS cone classes. In this study we characterized the genomic organization of LWS opsin genes by BAC clone sequencing, and described the full range of cone cell types in the retina of the colorful Cumaná guppy, Poecilia reticulata. In contrast to X. helleri, MSP data from the Cumaná guppy revealed three LWS cone classes. Comparisons of LWS genomic organization described here for Cumaná to that of X. helleri indicate that gene divergence and not duplication was responsible for the evolution of a novel LWS haplotype in the Cumaná guppy. This lineage-specific divergence is likely responsible for a third additional retinal cone class not present in X. helleri, and may have facilitated the strong sexual selection driven by female preference for orange color patterns associated with the genus Poecilia.

  4. Rapid genome reshaping by multiple-gene loss after whole-genome duplication in teleost fish suggested by mathematical modeling.

    Science.gov (United States)

    Inoue, Jun; Sato, Yukuto; Sinclair, Robert; Tsukamoto, Katsumi; Nishida, Mutsumi

    2015-12-01

    Whole-genome duplication (WGD) is believed to be a significant source of major evolutionary innovation. Redundant genes resulting from WGD are thought to be lost or acquire new functions. However, the rates of gene loss and thus temporal process of genome reshaping after WGD remain unclear. The WGD shared by all teleost fish, one-half of all jawed vertebrates, was more recent than the two ancient WGDs that occurred before the origin of jawed vertebrates, and thus lends itself to analysis of gene loss and genome reshaping. Using a newly developed orthology identification pipeline, we inferred the post-teleost-specific WGD evolutionary histories of 6,892 protein-coding genes from nine phylogenetically representative teleost genomes on a time-calibrated tree. We found that rapid gene loss did occur in the first 60 My, with a loss of more than 70-80% of duplicated genes, and produced similar genomic gene arrangements within teleosts in that relatively short time. Mathematical modeling suggests that rapid gene loss occurred mainly by events involving simultaneous loss of multiple genes. We found that the subsequent 250 My were characterized by slow and steady loss of individual genes. Our pipeline also identified about 1,100 shared single-copy genes that are inferred to have become singletons before the divergence of clupeocephalan teleosts. Therefore, our comparative genome analysis suggests that rapid gene loss just after the WGD reshaped teleost genomes before the major divergence, and provides a useful set of marker genes for future phylogenetic analysis.

  5. Use of the MLPA assay in the molecular diagnosis of gene copy number alterations in human genetic diseases.

    Science.gov (United States)

    Stuppia, Liborio; Antonucci, Ivana; Palka, Giandomenico; Gatta, Valentina

    2012-01-01

    Multiplex Ligation-dependent Probe Amplification (MLPA) assay is a recently developed technique able to evidence variations in the copy number of several human genes. Due to this ability, MLPA can be used in the molecular diagnosis of several genetic diseases whose pathogenesis is related to the presence of deletions or duplications of specific genes. Moreover, MLPA assay can also be used in the molecular diagnosis of genetic diseases characterized by the presence of abnormal DNA methylation. Due to the large number of genes that can be analyzed by a single technique, MLPA assay represents the gold standard for molecular analysis of all pathologies derived from the presence of gene copy number variation. In this review, the main applications of the MLPA technique for the molecular diagnosis of human diseases are described.

  6. Ancient Duplications and Expression Divergence in the Globin Gene Superfamily of Vertebrates: Insights from the Elephant Shark Genome and Transcriptome.

    Science.gov (United States)

    Opazo, Juan C; Lee, Alison P; Hoffmann, Federico G; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F

    2015-07-01

    Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about

  7. Methods for identifying and mapping recent segmental and gene duplications in eukaryotic genomes.

    Science.gov (United States)

    Khaja, Razi; MacDonald, Jeffrey R; Zhang, Junjun; Scherer, Stephen W

    2006-01-01

    The aim of this chapter is to provide instruction for analyzing and mapping recent segmental and gene duplications in eukaryotic genomes. We describe a bioinformatics-based approach utilizing computational tools to manage eukaryotic genome sequences to characterize and understand the evolutionary fates and trajectories of duplicated genes. An introduction to bioinformatics tools and programs such as BLAST, Perl, BioPerl, and the GFF specification provides the necessary background to complete this analysis for any eukaryotic genome of interest.

  8. Divergence of recently duplicated M{gamma}-type MADS-box genes in Petunia.

    Science.gov (United States)

    Bemer, Marian; Gordon, Jonathan; Weterings, Koen; Angenent, Gerco C

    2010-02-01

    The MADS-box transcription factor family has expanded considerably in plants via gene and genome duplications and can be subdivided into type I and MIKC-type genes. The two gene classes show a different evolutionary history. Whereas the MIKC-type genes originated during ancient genome duplications, as well as during more recent events, the type I loci appear to experience high turnover with many recent duplications. This different mode of origin also suggests a different fate for the type I duplicates, which are thought to have a higher chance to become silenced or lost from the genome. To get more insight into the evolution of the type I MADS-box genes, we isolated nine type I genes from Petunia, which belong to the Mgamma subclass, and investigated the divergence of their coding and regulatory regions. The isolated genes could be subdivided into two categories: two genes were highly similar to Arabidopsis Mgamma-type genes, whereas the other seven genes showed less similarity to Arabidopsis genes and originated more recently. Two of the recently duplicated genes were found to contain deleterious mutations in their coding regions, and expression analysis revealed that a third paralog was silenced by mutations in its regulatory region. However, in addition to the three genes that were subjected to nonfunctionalization, we also found evidence for neofunctionalization of one of the Petunia Mgamma-type genes. Our study shows a rapid divergence of recently duplicated Mgamma-type MADS-box genes and suggests that redundancy among type I paralogs may be less common than expected.

  9. Expression, subcellular localization, and cis-regulatory structure of duplicated phytoene synthase genes in melon (Cucumis melo L.).

    Science.gov (United States)

    Qin, Xiaoqiong; Coku, Ardian; Inoue, Kentaro; Tian, Li

    2011-10-01

    Carotenoids perform many critical functions in plants, animals, and humans. It is therefore important to understand carotenoid biosynthesis and its regulation in plants. Phytoene synthase (PSY) catalyzes the first committed and rate-limiting step in carotenoid biosynthesis. While PSY is present as a single copy gene in Arabidopsis, duplicated PSY genes have been identified in many economically important monocot and dicot crops. CmPSY1 was previously identified from melon (Cucumis melo L.), but was not functionally characterized. We isolated a second PSY gene, CmPSY2, from melon in this work. CmPSY2 possesses a unique intron/exon structure that has not been observed in other plant PSYs. Both CmPSY1 and CmPSY2 are functional in vitro, but exhibit distinct expression patterns in different melon tissues and during fruit development, suggesting differential regulation of the duplicated melon PSY genes. In vitro chloroplast import assays verified the plastidic localization of CmPSY1 and CmPSY2 despite the lack of an obvious plastid target peptide in CmPSY2. Promoter motif analysis of the duplicated melon and tomato PSY genes and the Arabidopsis PSY revealed distinctive cis-regulatory structures of melon PSYs and identified gibberellin-responsive motifs in all PSYs except for SlPSY1, which has not been reported previously. Overall, these data provide new insights into the evolutionary history of plant PSY genes and the regulation of PSY expression by developmental and environmental signals that may involve different regulatory networks.

  10. Genome-wide analysis of copy number variants in attention deficit hyperactivity disorder: the role of rare variants and duplications at 15q13.3.

    NARCIS (Netherlands)

    Williams, N.M.; Franke, B.; Mick, E.; Anney, R.J.; Freitag, C.M.; Gill, M.; Thapar, A.; O'Donovan, M.C.; Owen, M.J.; Holmans, P.; Kent, L.; Middleton, F.; Zhang-James, Y.; Liu, L.; Meyer, J.; Nguyen, T.T.M.; Romanos, J.; Romanos, M.; Seitz, C.; Renner, T.J.; Walitza, S.; Warnke, A.; Palmason, H.; Buitelaar, J.K.; Rommelse, N.N.; Arias Vasquez, A.; Hawi, Z.; Langley, K.; Sergeant, J.A.; Steinhausen, H.C.; Roeyers, H.; Biederman, J.; Zaharieva, I.; Hakonarson, H.; Elia, J.; Lionel, A.C.; Crosbie, J.; Marshall, C.R.; Schachar, R.; Scherer, S.W.; Todorov, A.; Smalley, S.L.; Loo, S.; Nelson, S.; Shtir, C.; Asherson, P.; Reif, A.; Lesch, K.P.; Faraone, S.V.

    2012-01-01

    OBJECTIVE: Attention deficit hyperactivity disorder (ADHD) is a common, highly heritable psychiatric disorder. Because of its multifactorial etiology, however, identifying the genes involved has been difficult. The authors followed up on recent findings suggesting that rare copy number variants (CNV

  11. Copy Number Variation of UGT 2B Genes in Indian Families Using Whole Genome Scans

    Directory of Open Access Journals (Sweden)

    Avinash M. Veerappa

    2016-01-01

    Full Text Available Background and Objectives. Uridine diphospho-glucuronosyltransferase 2B (UGT2B is a family of genes involved in metabolizing steroid hormones and several other xenobiotics. These UGT2B genes are highly polymorphic in nature and have distinct polymorphisms associated with specific regions around the globe. Copy number variations (CNVs status of UGT2B17 in Indian population is not known and their disease associations have been inconclusive. It was therefore of interest to investigate the CNV profile of UGT2B genes. Methods. We investigated the presence of CNVs in UGT2B genes in 31 members from eight Indian families using Affymetrix Genome-Wide Human SNP Array 6.0 chip. Results. Our data revealed >50% of the study members carried CNVs in UGT2B genes, of which 76% showed deletion polymorphism. CNVs were observed more in UGT2B17 (76.4% than in UGT2B15 (17.6%. Molecular network and pathway analysis found enrichment related to steroid metabolic process, carboxylesterase activity, and sequence specific DNA binding. Interpretation and Conclusion. We report the presence of UGT2B gene deletion and duplication polymorphisms in Indian families. Network analysis indicates the substitutive role of other possible genes in the UGT activity. The CNVs of UGT2B genes are very common in individuals indicating that the effect is neutral in causing any suspected diseases.

  12. Long-term maintenance of stable copy number in the eukaryotic SMC family: origin of a vertebrate meiotic SMC1 and fate of recent segmental duplicates%真核生物SMC基因家族中拷贝数目的长期稳定进化

    Institute of Scientific and Technical Information of China (English)

    Alexandra SURCEL; Xiaofan ZHOU; Li QUAN; 马红

    2008-01-01

    Members of the Structural Maintenance of Chromosome (SMC) family have long been of interest to molecular and evolutionary biologists for their role in chromosome structural dynamics, particularly sister chromatid cohesion, condensation, and DNA repair. SMC and related proteins are found in all major groups of living organisms and share a common structure of conserved N and C globular domains separated from the conserved hinge domain by long coiled-coil regions. In eukaryotes there are six paralogous proteins that form three heterodimeric pairs, whereas in prokaryotes there is only one SMC protein that homodimerizes. From recently completed genome sequences, we have identified SMC genes from 34 eukaryotes that have not been described in previous reports. Our phylogenetic analysis of these and previously identified SMC genes supports an origin for the vertebrate meiotic SMC1 in the most recent common ancestor since the divergence from invertebrate animals. Additionally, we have identified duplicate copies due to segmental duplications for some of the SMC paralogs in plants and yeast, mainly SMC2 and SMC6, and detected evidence that duplicates of other paralogs were lost, suggesting differential evolution for these genes. Our analysis indicates that the SMC paralogs have been stably maintained at very low copy numbers, even after segmental (genome-wide) duplications. It is possible that such low copy numbers might be selected during eukaryotic evolution, although other possibilities are not ruled out.

  13. Ancestral gene duplication enabled the evolution of multifunctional cellulases in stick insects (Phasmatodea).

    Science.gov (United States)

    Shelomi, Matan; Heckel, David G; Pauchet, Yannick

    2016-04-01

    The Phasmatodea (stick insects) have multiple, endogenous, highly expressed copies of glycoside hydrolase family 9 (GH9) genes. The purpose for retaining so many was unknown. We cloned and expressed the enzymes in transfected insect cell lines, and tested the individual proteins against different plant cell wall component poly- and oligosaccharides. Nearly all isolated enzymes were active against carboxymethylcellulose, however most could also degrade glucomannan, and some also either xylan or xyloglucan. The latter two enzyme groups were each monophyletic, suggesting the evolution of these novel substrate specificities in an early ancestor of the order. Such enzymes are highly unusual for Metazoa, for which no xyloglucanases had been reported. Phasmatodea gut extracts could degrade multiple plant cell wall components fully into sugar monomers, suggesting that enzymatic breakdown of plant cell walls by the entire Phasmatodea digestome may contribute to the Phasmatodea nutritional budget. The duplication and neofunctionalization of GH9s in the ancestral Phasmatodea may have enabled them to specialize as folivores and diverge from their omnivorous ancestors. The structural changes enabling these unprecedented activities in the cellulases require further study.

  14. Copy number variations in alternative splicing gene networks impact lifespan.

    Directory of Open Access Journals (Sweden)

    Joseph T Glessner

    Full Text Available Longevity has a strong genetic component evidenced by family-based studies. Lipoprotein metabolism, FOXO proteins, and insulin/IGF-1 signaling pathways in model systems have shown polygenic variations predisposing to shorter lifespan. To test the hypothesis that rare variants could influence lifespan, we compared the rates of CNVs in healthy children (0-18 years of age with individuals 67 years or older. CNVs at a significantly higher frequency in the pediatric cohort were considered risk variants impacting lifespan, while those enriched in the geriatric cohort were considered longevity protective variants. We performed a whole-genome CNV analysis on 7,313 children and 2,701 adults of European ancestry genotyped with 302,108 SNP probes. Positive findings were evaluated in an independent cohort of 2,079 pediatric and 4,692 geriatric subjects. We detected 8 deletions and 10 duplications that were enriched in the pediatric group (P=3.33×10(-8-1.6×10(-2 unadjusted, while only one duplication was enriched in the geriatric cohort (P=6.3×10(-4. Population stratification correction resulted in 5 deletions and 3 duplications remaining significant (P=5.16×10(-5-4.26×10(-2 in the replication cohort. Three deletions and four duplications were significant combined (combined P=3.7×10(-4-3.9×10(-2. All associated loci were experimentally validated using qPCR. Evaluation of these genes for pathway enrichment demonstrated ~50% are involved in alternative splicing (P=0.0077 Benjamini and Hochberg corrected. We conclude that genetic variations disrupting RNA splicing could have long-term biological effects impacting lifespan.

  15. Temporal pattern of loss/persistence of duplicate genes involved in signal transduction and metabolic pathways after teleost-specific genome duplication

    Directory of Open Access Journals (Sweden)

    Sato Yukuto

    2009-06-01

    Full Text Available Abstract Background Recent genomic studies have revealed a teleost-specific third-round whole genome duplication (3R-WGD event occurred in a common ancestor of teleost fishes. However, it is unclear how the genes duplicated in this event were lost or persisted during the diversification of teleosts, and therefore, how many of the duplicated genes contribute to the genetic differences among teleosts. This subject is also important for understanding the process of vertebrate evolution through WGD events. We applied a comparative evolutionary approach to this question by focusing on the genes involved in long-term potentiation, taste and olfactory transduction, and the tricarboxylic acid cycle, based on the whole genome sequences of four teleosts; zebrafish, medaka, stickleback, and green spotted puffer fish. Results We applied a state-of-the-art method of maximum-likelihood phylogenetic inference and conserved synteny analyses to each of 130 genes involved in the above biological systems of human. These analyses identified 116 orthologous gene groups between teleosts and tetrapods, and 45 pairs of 3R-WGD-derived duplicate genes among them. This suggests that more than half [(45×2/(116+45] = 56.5% of the loci, probably more than ten thousand genes, present in a common ancestor of the four teleosts were still duplicated after the 3R-WGD. The estimated temporal pattern of gene loss suggested that, after the 3R-WGD, many (71/116 of the duplicated genes were rapidly lost during the initial 75 million years (MY, whereas on average more than half (27.3/45 of the duplicated genes remaining in the ancestor of the four teleosts (45/116 have persisted for about 275 MY. The 3R-WGD-derived duplicates that have persisted for a long evolutionary periods of time had significantly larger number of interacting partners and longer length of protein coding sequence, implying that they tend to be more multifunctional than the singletons after the 3R-WGD. Conclusion

  16. A young Drosophila duplicate gene plays essential roles in spermatogenesis by regulating several Y-linked male fertility genes.

    Directory of Open Access Journals (Sweden)

    Yun Ding

    Full Text Available Gene duplication is supposed to be the major source for genetic innovations. However, how a new duplicate gene acquires functions by integrating into a pathway and results in adaptively important phenotypes has remained largely unknown. Here, we investigated the biological roles and the underlying molecular mechanism of the young kep1 gene family in the Drosophila melanogaster species subgroup to understand the origin and evolution of new genes with new functions. Sequence and expression analysis demonstrates that one of the new duplicates, nsr (novel spermatogenesis regulator, exhibits positive selection signals and novel subcellular localization pattern. Targeted mutagenesis and whole-transcriptome sequencing analysis provide evidence that nsr is required for male reproduction associated with sperm individualization, coiling, and structural integrity of the sperm axoneme via regulation of several Y chromosome fertility genes post-transcriptionally. The absence of nsr-like expression pattern and the presence of the corresponding cis-regulatory elements of the parental gene kep1 in the pre-duplication species Drosophila yakuba indicate that kep1 might not be ancestrally required for male functions and that nsr possibly has experienced the neofunctionalization process, facilitated by changes of trans-regulatory repertories. These findings not only present a comprehensive picture about the evolution of a new duplicate gene but also show that recently originated duplicate genes can acquire multiple biological roles and establish novel functional pathways by regulating essential genes.

  17. Genome-wide linkage and copy number variation analysis reveals 710 kb duplication on chromosome 1p31.3 responsible for autosomal dominant omphalocele

    Science.gov (United States)

    Radhakrishna, Uppala; Nath, Swapan K; McElreavey, Ken; Ratnamala, Uppala; Sun, Celi; Maiti, Amit K; Gagnebin, Maryline; Béna, Frédérique; Newkirk, Heather L; Sharp, Andrew J; Everman, David B; Murray, Jeffrey C; Schwartz, Charles E; Antonarakis, Stylianos E; Butler, Merlin G

    2017-01-01

    Background Omphalocele is a congenital birth defect characterised by the presence of internal organs located outside of the ventral abdominal wall. The purpose of this study was to identify the underlying genetic mechanisms of a large autosomal dominant Caucasian family with omphalocele. Methods and findings A genetic linkage study was conducted in a large family with an autosomal dominant transmission of an omphalocele using a genome-wide single nucleotide polymorphism (SNP) array. The analysis revealed significant evidence of linkage (non-parametric NPL = 6.93, p=0.0001; parametric logarithm of odds (LOD) = 2.70 under a fully penetrant dominant model) at chromosome band 1p31.3. Haplotype analysis narrowed the locus to a 2.74 Mb region between markers rs2886770 (63014807 bp) and rs1343981 (65757349 bp). Molecular characterisation of this interval using array comparative genomic hybridisation followed by quantitative microsphere hybridisation analysis revealed a 710 kb duplication located at 63.5–64.2 Mb. All affected individuals who had an omphalocele and shared the haplotype were positive for this duplicated region, while the duplication was absent from all normal individuals of this family. Multipoint linkage analysis using the duplication as a marker yielded a maximum LOD score of 3.2 at 1p31.3 under a dominant model. The 710 kb duplication at 1p31.3 band contains seven known genes including FOXD3, ALG6, ITGB3BP, KIAA1799, DLEU2L, PGM1, and the proximal portion of ROR1. Importantly, this duplication is absent from the database of genomic variants. Conclusions The present study suggests that development of an omphalocele in this family is controlled by overexpression of one or more genes in the duplicated region. To the authors’ knowledge, this is the first reported association of an inherited omphalocele condition with a chromosomal rearrangement. PMID:22499347

  18. Ribosomal Protein Genes S23 and L35 from Amphioxus Branchiostoma belcheri tsingtauense: Identification and Copy Number

    Institute of Scientific and Technical Information of China (English)

    Xian LI; Shi-Cui ZHANG; Zhen-Hui LIU; Hong-Yan LI

    2005-01-01

    The complete cDNA and deduced amino acid sequences of the ribosomal proteins S23 (AmphiS23) and L35 (AmphiL35) from amphioxus Branchiostoma belcheri tsingtauense were identified in this study. AmphiS23 cDNA is 546 bp long and encodes a protein of 143 amino acids. It has a predicted molecular mass of 15,851 Da and a pI of 10.7. AmphiL35 cDNA comprises 473 bp, and codes for a protein of 123 amino acids with a predicted molecular mass of 14,543 Da and a pI of 10.8. AmphiS23 shares more than 83% identity with its homologues in the vertebrates and more than 84% identity with those in the invertebrates. AmphiL35 is more than 63% identical to its counterparts in the vertebrates and more than 52% identical to those in the invertebrates, Southern blot analysis demonstrated the existence of 1-2 copies of the S23 gene and 2-3 copies of the L35 gene in the genome of amphioxus B. belcheri tsingtauense. This is in sharp contrast to the presence of 6-13 copies of the S23 gene and 15-17 copies of the L35 gene in the rat genome. It is clear that the housekeeping genes like S23 and L35 underwent a large-scale duplication in the vertebrate lineage, reinforcing the gene/genome duplication hypothesis.

  19. Gene Duplication, Population Genomics, and Species-Level Differentiation within a Tropical Mountain Shrub

    Science.gov (United States)

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H.; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C.

    2014-01-01

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. PMID:25223767

  20. The vertebrate makorin ubiquitin ligase gene family has been shaped by large-scale duplication and retroposition from an ancestral gonad-specific, maternal-effect gene

    Directory of Open Access Journals (Sweden)

    Volff Jean-Nicolas

    2010-12-01

    fingers and even complete gene elimination from certain groups of vertebrates. Comparative expression analysis strongly suggests that the ancestral E3 ubiquitin ligase function of the single copy mkrn gene before duplication in vertebrates was gonad-specific, with maternal expression in early embryos.

  1. Confirmed rare copy number variants implicate novel genes in schizophrenia.

    Science.gov (United States)

    Tam, Gloria W C; van de Lagemaat, Louie N; Redon, Richard; Strathdee, Karen E; Croning, Mike D R; Malloy, Mary P; Muir, Walter J; Pickard, Ben S; Deary, Ian J; Blackwood, Douglas H R; Carter, Nigel P; Grant, Seth G N

    2010-04-01

    Understanding how cognitive processes including learning, memory, decision making and ideation are encoded by the genome is a key question in biology. Identification of sets of genes underlying human mental disorders is a path towards this objective. Schizophrenia is a common disease with cognitive symptoms, high heritability and complex genetics. We have identified genes involved with schizophrenia by measuring differences in DNA copy number across the entire genome in 91 schizophrenia cases and 92 controls in the Scottish population. Our data reproduce rare and common variants observed in public domain data from >3000 schizophrenia cases, confirming known disease loci as well as identifying novel loci. We found copy number variants in PDE10A (phosphodiesterase 10A), CYFIP1 [cytoplasmic FMR1 (Fragile X mental retardation 1)-interacting protein 1], K(+) channel genes KCNE1 and KCNE2, the Down's syndrome critical region 1 gene RCAN1 (regulator of calcineurin 1), cell-recognition protein CHL1 (cell adhesion molecule with homology with L1CAM), the transcription factor SP4 (specificity protein 4) and histone deacetylase HDAC9, among others (see http://www.genes2cognition.org/SCZ-CNV). Integrating the function of these many genes into a coherent model of schizophrenia and cognition is a major unanswered challenge.

  2. Copy number gain of VCX, X-linked multi-copy gene, leads to cell proliferation and apoptosis during spermatogenesis

    Science.gov (United States)

    Ji, Juan; Qin, Yufeng; Wang, Rong; Huang, Zhenyao; Zhang, Yan; Zhou, Ran; Song, Ling; Ling, Xiufeng; Hu, Zhibin; Miao, Dengshun; Shen, Hongbing; Xia, Yankai; Wang, Xinru; Lu, Chuncheng

    2016-01-01

    Male factor infertility affects one-sixth of couples worldwide, and non-obstructive azoospermia (NOA) is one of the most severe forms. In recent years there has been increasing evidence to implicate the participation of X chromosome in the process of spermatogenesis. To uncover the roles of X-linked multi-copy genes in spermatogenesis, we performed systematic analysis of X-linked gene copy number variations (CNVs) and Y chromosome haplogrouping in 447 idiopathic NOA patients and 485 healthy controls. Interestingly, the frequency of individuals with abnormal level copy of Variable charge, X-linked (VCX) was significantly different between cases and controls after multiple test correction (p = 5.10 × 10−5). To discriminate the effect of gain/loss copies in these genes, we analyzed the frequency of X-linked multi-copy genes in subjects among subdivided groups. Our results demonstrated that individuals with increased copy numbers of Nuclear RNA export factor 2 (NXF2) (p = 9.21 × 10−8) and VCX (p = 1.97 × 10−4) conferred the risk of NOA. In vitro analysis demonstrated that increasing copy number of VCX could upregulate the gene expression and regulate cell proliferation and apoptosis. Our study establishes a robust association between the VCX CNVs and NOA risk. PMID:27705943

  3. Copy number gain of VCX, X-linked multi-copy gene, leads to cell proliferation and apoptosis during spermatogenesis.

    Science.gov (United States)

    Ji, Juan; Qin, Yufeng; Wang, Rong; Huang, Zhenyao; Zhang, Yan; Zhou, Ran; Song, Ling; Ling, Xiufeng; Hu, Zhibin; Miao, Dengshun; Shen, Hongbing; Xia, Yankai; Wang, Xinru; Lu, Chuncheng

    2016-11-29

    Male factor infertility affects one-sixth of couples worldwide, and non-obstructive azoospermia (NOA) is one of the most severe forms. In recent years there has been increasing evidence to implicate the participation of X chromosome in the process of spermatogenesis. To uncover the roles of X-linked multi-copy genes in spermatogenesis, we performed systematic analysis of X-linked gene copy number variations (CNVs) and Y chromosome haplogrouping in 447 idiopathic NOA patients and 485 healthy controls. Interestingly, the frequency of individuals with abnormal level copy of Variable charge, X-linked (VCX) was significantly different between cases and controls after multiple test correction (p = 5.10 × 10-5). To discriminate the effect of gain/loss copies in these genes, we analyzed the frequency of X-linked multi-copy genes in subjects among subdivided groups. Our results demonstrated that individuals with increased copy numbers of Nuclear RNA export factor 2 (NXF2) (p = 9.21 × 10-8) and VCX (p = 1.97 × 10-4) conferred the risk of NOA. In vitro analysis demonstrated that increasing copy number of VCX could upregulate the gene expression and regulate cell proliferation and apoptosis. Our study establishes a robust association between the VCX CNVs and NOA risk.

  4. ssb gene duplication restores the viability of ΔholC and ΔholD Escherichia coli mutants.

    Directory of Open Access Journals (Sweden)

    Stéphane Duigou

    2014-10-01

    Full Text Available The HolC-HolD (χψ complex is part of the DNA polymerase III holoenzyme (Pol III HE clamp-loader. Several lines of evidence indicate that both leading- and lagging-strand synthesis are affected in the absence of this complex. The Escherichia coli ΔholD mutant grows poorly and suppressor mutations that restore growth appear spontaneously. Here we show that duplication of the ssb gene, encoding the single-stranded DNA binding protein (SSB, restores ΔholD mutant growth at all temperatures on both minimal and rich medium. RecFOR-dependent SOS induction, previously shown to occur in the ΔholD mutant, is unaffected by ssb gene duplication, suggesting that lagging-strand synthesis remains perturbed. The C-terminal SSB disordered tail, which interacts with several E. coli repair, recombination and replication proteins, must be intact in both copies of the gene in order to restore normal growth. This suggests that SSB-mediated ΔholD suppression involves interaction with one or more partner proteins. ssb gene duplication also suppresses ΔholC single mutant and ΔholC ΔholD double mutant growth defects, indicating that it bypasses the need for the entire χψ complex. We propose that doubling the amount of SSB stabilizes HolCD-less Pol III HE DNA binding through interactions between SSB and a replisome component, possibly DnaE. Given that SSB binds DNA in vitro via different binding modes depending on experimental conditions, including SSB protein concentration and SSB interactions with partner proteins, our results support the idea that controlling the balance between SSB binding modes is critical for DNA Pol III HE stability in vivo, with important implications for DNA replication and genome stability.

  5. Comparative Evolution of Duplicated Ddx3 Genes in Teleosts: Insights from Japanese Flounder, Paralichthys olivaceus.

    Science.gov (United States)

    Wang, Zhongkai; Liu, Wei; Song, Huayu; Wang, Huizhen; Liu, Jinxiang; Zhao, Haitao; Du, Xinxin; Zhang, Quanqi

    2015-06-24

    Following the two rounds of whole-genome duplication that occurred during deuterostome evolution, a third genome duplication event occurred in the stem lineage of ray-finned fishes. This teleost-specific genome duplication is thought to be responsible for the biological diversification of ray-finned fishes. DEAD-box polypeptide 3 (DDX3) belongs to the DEAD-box RNA helicase family. Although their functions in humans have been well studied, limited information is available regarding their function in teleosts. In this study, two teleost Ddx3 genes were first identified in the transcriptome of Japanese flounder (Paralichthys olivaceus). We confirmed that the two genes originated from teleost-specific genome duplication through synteny and phylogenetic analysis. Additionally, comparative analysis of genome structure, molecular evolution rate, and expression pattern of the two genes in Japanese flounder revealed evidence of subfunctionalization of the duplicated Ddx3 genes in teleosts. Thus, the results of this study reveal novel insights into the evolution of the teleost Ddx3 genes and constitute important groundwork for further research on this gene family.

  6. Differential transcriptional modulation of duplicated fatty acid-binding protein genes by dietary fatty acids in zebrafish (Danio rerio: evidence for subfunctionalization or neofunctionalization of duplicated genes

    Directory of Open Access Journals (Sweden)

    Denovan-Wright Eileen M

    2009-09-01

    Full Text Available Abstract Background In the Duplication-Degeneration-Complementation (DDC model, subfunctionalization and neofunctionalization have been proposed as important processes driving the retention of duplicated genes in the genome. These processes are thought to occur by gain or loss of regulatory elements in the promoters of duplicated genes. We tested the DDC model by determining the transcriptional induction of fatty acid-binding proteins (Fabps genes by dietary fatty acids (FAs in zebrafish. We chose zebrafish for this study for two reasons: extensive bioinformatics resources are available for zebrafish at zfin.org and zebrafish contains many duplicated genes owing to a whole genome duplication event that occurred early in the ray-finned fish lineage approximately 230-400 million years ago. Adult zebrafish were fed diets containing either fish oil (12% lipid, rich in highly unsaturated fatty acid, sunflower oil (12% lipid, rich in linoleic acid, linseed oil (12% lipid, rich in linolenic acid, or low fat (4% lipid, low fat diet for 10 weeks. FA profiles and the steady-state levels of fabp mRNA and heterogeneous nuclear RNA in intestine, liver, muscle and brain of zebrafish were determined. Result FA profiles assayed by gas chromatography differed in the intestine, brain, muscle and liver depending on diet. The steady-state level of mRNA for three sets of duplicated genes, fabp1a/fabp1b.1/fabp1b.2, fabp7a/fabp7b, and fabp11a/fabp11b, was determined by reverse transcription, quantitative polymerase chain reaction (RT-qPCR. In brain, the steady-state level of fabp7b mRNAs was induced in fish fed the linoleic acid-rich diet; in intestine, the transcript level of fabp1b.1 and fabp7b were elevated in fish fed the linolenic acid-rich diet; in liver, the level of fabp7a mRNAs was elevated in fish fed the low fat diet; and in muscle, the level of fabp7a and fabp11a mRNAs were elevated in fish fed the linolenic acid-rich or the low fat diets. In all cases

  7. Phylogenomic approaches to common problems encountered in the analysis of low copy repeats: The sulfotransferase 1A gene family example

    Directory of Open Access Journals (Sweden)

    Benner Steven A

    2005-03-01

    Full Text Available Abstract Background Blocks of duplicated genomic DNA sequence longer than 1000 base pairs are known as low copy repeats (LCRs. Identified by their sequence similarity, LCRs are abundant in the human genome, and are interesting because they may represent recent adaptive events, or potential future adaptive opportunities within the human lineage. Sequence analysis tools are needed, however, to decide whether these interpretations are likely, whether a particular set of LCRs represents nearly neutral drift creating junk DNA, or whether the appearance of LCRs reflects assembly error. Here we investigate an LCR family containing the sulfotransferase (SULT 1A genes involved in drug metabolism, cancer, hormone regulation, and neurotransmitter biology as a first step for defining the problems that those tools must manage. Results Sequence analysis here identified a fourth sulfotransferase gene, which may be transcriptionally active, located on human chromosome 16. Four regions of genomic sequence containing the four human SULT1A paralogs defined a new LCR family. The stem hominoid SULT1A progenitor locus was identified by comparative genomics involving complete human and rodent genomes, and a draft chimpanzee genome. SULT1A expansion in hominoid genomes was followed by positive selection acting on specific protein sites. This episode of adaptive evolution appears to be responsible for the dopamine sulfonation function of some SULT enzymes. Each of the conclusions that this bioinformatic analysis generated using data that has uncertain reliability (such as that from the chimpanzee genome sequencing project has been confirmed experimentally or by a "finished" chromosome 16 assembly, both of which were published after the submission of this manuscript. Conclusion SULT1A genes expanded from one to four copies in hominoids during intra-chromosomal LCR duplications, including (apparently one after the divergence of chimpanzees and humans. Thus, LCRs may

  8. Molecular evolution of the duplicated TFIIAγ genes in Oryzeae and its relatives

    Directory of Open Access Journals (Sweden)

    Sun Hong-Zheng

    2010-05-01

    Full Text Available Abstract Background Gene duplication provides raw genetic materials for evolutionary novelty and adaptation. The evolutionary fate of duplicated transcription factor genes is less studied although transcription factor gene plays important roles in many biological processes. TFIIAγ is a small subunit of TFIIA that is one of general transcription factors required by RNA polymerase II. Previous studies identified two TFIIAγ-like genes in rice genome and found that these genes either conferred resistance to rice bacterial blight or could be induced by pathogen invasion, raising the question as to their functional divergence and evolutionary fates after gene duplication. Results We reconstructed the evolutionary history of the TFIIAγ genes from main lineages of angiosperms and demonstrated that two TFIIAγ genes (TFIIAγ1 and TFIIAγ5 arose from a whole genome duplication that happened in the common ancestor of grasses. Likelihood-based analyses with branch, codon, and branch-site models showed no evidence of positive selection but a signature of relaxed selective constraint after the TFIIAγ duplication. In particular, we found that the nonsynonymous/synonymous rate ratio (ω = dN/dS of the TFIIAγ1 sequences was two times higher than that of TFIIAγ5 sequences, indicating highly asymmetric rates of protein evolution in rice tribe and its relatives, with an accelerated rate of TFIIAγ1 gene. Our expression data and EST database search further indicated that after whole genome duplication, the expression of TFIIAγ1 gene was significantly reduced while TFIIAγ5 remained constitutively expressed and maintained the ancestral role as a subunit of the TFIIA complex. Conclusion The evolutionary fate of TFIIAγ duplicates is not consistent with the neofunctionalization model that predicts that one of the duplicated genes acquires a new function because of positive Darwinian selection. Instead, we suggest that subfunctionalization might be involved in

  9. Selection shaped the evolution of mouse androgen-binding protein (ABP) function and promoted the duplication of Abp genes.

    Science.gov (United States)

    Karn, Robert C; Laukaitis, Christina M

    2014-08-01

    In the present article, we summarize two aspects of our work on mouse ABP (androgen-binding protein): (i) the sexual selection function producing incipient reinforcement on the European house mouse hybrid zone, and (ii) the mechanism behind the dramatic expansion of the Abp gene region in the mouse genome. Selection unifies these two components, although the ways in which selection has acted differ. At the functional level, strong positive selection has acted on key sites on the surface of one face of the ABP dimer, possibly to influence binding to a receptor. A different kind of selection has apparently driven the recent and rapid expansion of the gene region, probably by increasing the amount of Abp transcript, in one or both of two ways. We have shown previously that groups of Abp genes behave as LCRs (low-copy repeats), duplicating as relatively large blocks of genes by NAHR (non-allelic homologous recombination). The second type of selection involves the close link between the accumulation of L1 elements and the expansion of the Abp gene family by NAHR. It is probably predicated on an initial selection for increased transcription of existing Abp genes and/or an increase in Abp gene number providing more transcriptional sites. Either or both could increase initial transcript production, a quantitative change similar to increasing the volume of a radio transmission. In closing, we also provide a note on Abp gene nomenclature.

  10. Molecular evolution accompanying functional divergence of duplicated genes along the plant starch biosynthesis pathway.

    Science.gov (United States)

    Nougué, Odrade; Corbi, Jonathan; Ball, Steven G; Manicacci, Domenica; Tenaillon, Maud I

    2014-05-15

    Starch is the main source of carbon storage in the Archaeplastida. The starch biosynthesis pathway (sbp) emerged from cytosolic glycogen metabolism shortly after plastid endosymbiosis and was redirected to the plastid stroma during the green lineage divergence. The SBP is a complex network of genes, most of which are members of large multigene families. While some gene duplications occurred in the Archaeplastida ancestor, most were generated during the sbp redirection process, and the remaining few paralogs were generated through compartmentalization or tissue specialization during the evolution of the land plants. In the present study, we tested models of duplicated gene evolution in order to understand the evolutionary forces that have led to the development of SBP in angiosperms. We combined phylogenetic analyses and tests on the rates of evolution along branches emerging from major duplication events in six gene families encoding sbp enzymes. We found evidence of positive selection along branches following cytosolic or plastidial specialization in two starch phosphorylases and identified numerous residues that exhibited changes in volume, polarity or charge. Starch synthases, branching and debranching enzymes functional specializations were also accompanied by accelerated evolution. However, none of the sites targeted by selection corresponded to known functional domains, catalytic or regulatory. Interestingly, among the 13 duplications tested, 7 exhibited evidence of positive selection in both branches emerging from the duplication, 2 in only one branch, and 4 in none of the branches. The majority of duplications were followed by accelerated evolution targeting specific residues along both branches. This pattern was consistent with the optimization of the two sub-functions originally fulfilled by the ancestral gene before duplication. Our results thereby provide strong support to the so-called "Escape from Adaptive Conflict" (EAC) model. Because none of the

  11. Opposing phenotypes in mice with Smith-Magenis deletion and Potocki-Lupski duplication syndromes suggest gene dosage effects on fluid consumption behavior.

    Science.gov (United States)

    Heck, Detlef H; Gu, Wenli; Cao, Ying; Qi, Shuhua; Lacaria, Melanie; Lupski, James R

    2012-11-01

    A quantitative long-term fluid consumption and fluid-licking assay was performed in two mouse models with either an ∼2 Mb genomic deletion, Df(11)17, or the reciprocal duplication copy number variation (CNV), Dp(11)17, analogous to the human genomic rearrangements causing either Smith-Magenis syndrome [SMS; OMIM #182290] or Potocki-Lupski syndrome [PTLS; OMIM #610883], respectively. Both mouse strains display distinct quantitative alterations in fluid consumption compared to their wild-type littermates; several of these changes are diametrically opposing between the two chromosome engineered mouse models. Mice with duplication versus deletion showed longer versus shorter intervals between visits to the waterspout, generated more versus less licks per visit and had higher versus lower variability in the number of licks per lick-burst as compared to their respective wild-type littermates. These findings suggest that copy number variation can affect long-term fluid consumption behavior in mice. Other behavioral differences were unique for either the duplication or deletion mutants; the deletion CNV resulted in increased variability of the licking rhythm, and the duplication CNV resulted in a significant slowing of the licking rhythm. Our findings document a readily quantitated complex behavioral response that can be directly and reciprocally influenced by a gene dosage effect.

  12. Copy number of pilus gene clusters in Haemophilus influenzae and variation in the hifE pilin gene.

    Science.gov (United States)

    Read, T D; Satola, S W; Opdyke, J A; Farley, M M

    1998-04-01

    Brazilian purpuric fever (BPF)-associated Haemophilus influenzae biogroup aegyptius strain F3031 contains two identical copies of a five gene cluster (hifA to hifE) encoding pili similar to well-characterized Hif fimbriae of H. influenzae type b. HifE, the putative pilus tip adhesin of F3031, shares only 40% amino acid sequence similarity with the same molecule from type b strains, whereas the other four proteins have 75 to 95% identity. To determine whether pilus cluster duplication and the hifE(F3031) allele were special features of BPF-associated bacteria, we analyzed a collection of H. influenzae strains by PCR with hifA- and hifE-specific oligonucleotides, by Southern hybridization with a hifC gene probe, and by nucleotide sequencing. The presence of two pilus clusters was limited to some H. influenzae biogroup aegyptius strains. The hifE(F3031) allele was limited to H. influenzae biogroup aegyptius. Two strains contained one copy of hifE(F3031) and one copy of a variant hifE allele. We determined the nucleotide sequences of four hifE genes from H. influenzae biogroup aegyptius and H. influenzae capsule serotypes a and c. The predicted proteins produced by these genes demonstrated only 35 to 70% identity to the three published HifE proteins from nontypeable H. influenzae, serotype b, and BPF strains. The C-terminal third of the molecules implicated in chaperone binding was the most highly conserved region. Three conserved domains in the otherwise highly variable N-terminal putative receptor-binding region of HifE were similar to conserved portions in the N terminus of Neisseria pilus adhesin PilC. We concluded that two pilus clusters and hifE(F3031) were not specific for BPF-causing H. influenzae, and we also identified portions of HifE possibly involved in binding mammalian cell receptors.

  13. Partial duplications of the ATRX gene cause the ATR-X syndrome.

    Science.gov (United States)

    Thienpont, Bernard; de Ravel, Thomy; Van Esch, Hilde; Van Schoubroeck, Dominique; Moerman, Philippe; Vermeesch, Joris Robert; Fryns, Jean-Pierre; Froyen, Guy; Lacoste, Caroline; Badens, Catherine; Devriendt, Koen

    2007-10-01

    ATR-X syndrome is a rare syndromic X-linked mental retardation disorder. We report that some of the patients suspected of ATR-X carry large intragenic duplications in the ATRX gene, leading to an absence of ATRX mRNA and of the protein. These findings underscore the need for including quantitative analyses to mutation analysis of the ATRX gene.

  14. Compensatory Drift and the Evolutionary Dynamics of Dosage-Sensitive Duplicate Genes.

    Science.gov (United States)

    Thompson, Ammon; Zakon, Harold H; Kirkpatrick, Mark

    2016-02-01

    Dosage-balance selection preserves functionally redundant duplicates (paralogs) at the optimum for their combined expression. Here we present a model of the dynamics of duplicate genes coevolving under dosage-balance selection. We call this the compensatory drift model. Results show that even when strong dosage-balance selection constrains total expression to the optimum, expression of each duplicate can diverge by drift from its original level. The rate of divergence slows as the strength of stabilizing selection, the size of the mutation effect, and/or the size of the population increases. We show that dosage-balance selection impedes neofunctionalization early after duplication but can later facilitate it. We fit this model to data from sodium channel duplicates in 10 families of teleost fish; these include two convergent lineages of electric fish in which one of the duplicates neofunctionalized. Using the model, we estimated the strength of dosage-balance selection for these genes. The results indicate that functionally redundant paralogs still may undergo radical functional changes after a prolonged period of compensatory drift.

  15. Multiplex ligation-dependent probe amplification for rapid detection of deletions and duplications in the dystrophin gene

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Objective:Duchenne muscular dystrophy (DMD) and Becker muscular dystrophy (BMD) are X-linked disorders caused by mutations in the dystrophin gene. The majority of recognized mutations are copy number changes of individual exons. The objective of the present study was to assess the multiplex ligation-dependent probe amplification (MLPA) effects of detection of gene mutations. Methods: Samples of 20 control males and 80 males and their mothers referred to our diagnostic facility on the clinical suspicion of DMD or BMD were tested by MLPA and multiplex PCR. Results: The mean DQs for all peak of 20 control male samples was 1.02 (range from 0.83 to 1.21) by MLPA. Deletions or duplications were identified in 6 out of 31 families that had been previously tested as negative by multiplex PCR. One case of complex rearrangement involving a duplication of two regions: dupEX3-9 and dupEX 17-41 were found by MLPA. Conclusions: MLPA is a highly sensitive method and rapid alternative to multiplex PCR for detection of DMD and BMD.

  16. Whole-gene positive selection, elevated synonymous substitution rates, duplication, and indel evolution of the chloroplast clpP1 gene.

    Directory of Open Access Journals (Sweden)

    Per Erixon

    Full Text Available BACKGROUND: Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. METHODOLOGY/PRINCIPLE FINDINGS: We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family. Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. CONCLUSIONS/SIGNIFICANCE: We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the

  17. Genomic analysis reveals extensive gene duplication within the bovine TRB locus

    Directory of Open Access Journals (Sweden)

    Law Andy

    2009-04-01

    Full Text Available Abstract Background Diverse TR and IG repertoires are generated by V(DJ somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has lead to the generation of a third DJC cluster. Analyses of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically

  18. On the Complexity of Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees.

    Science.gov (United States)

    Kordi, Misagh; Bansal, Mukul

    2015-12-23

    Duplication-Transfer-Loss (DTL) reconciliation has emerged as a powerful technique for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation takes as input a gene family phylogeny and the corresponding species phylogeny, and reconciles the two by postulating speciation, gene duplication, horizontal gene transfer, and gene loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. However, gene trees are frequently non-binary. With such non-binary gene trees, the reconciliation problem seeks to find a binary resolution of the gene tree that minimizes the reconciliation cost. Given the prevalence of non-binary gene trees, many efficient algorithms have been developed for this problem in the context of the simpler Duplication-Loss (DL) reconciliation model. Yet, no efficient algorithms exist for DTL reconciliation with non-binary gene trees and the complexity of the problem remains unknown. In this work, we resolve this open question by showing that the problem is, in fact, NP-hard. Our reduction applies to both the dated and undated formulations of DTL reconciliation. By resolving this long-standing open problem, this work will spur the development of both exact and heuristic algorithms for this important problem.

  19. Evidence of duplicated Hox genes in the most recent common ancestor of extant scorpions.

    Science.gov (United States)

    Sharma, Prashant P; Santiago, Marc A; González-Santillán, Edmundo; Monod, Lionel; Wheeler, Ward C

    2015-01-01

    Scorpions (order Scorpiones) are unusual among arthropods, both for the extreme heteronomy of their bauplan and for the high gene family turnover exhibited in their genomes. These phenomena appear to be correlated, as two scorpion species have been shown to possess nearly twice the number of Hox genes present in most arthropods. Segmentally offset anterior expression boundaries of a subset of Hox paralogs have been shown to correspond to transitions in segmental identities in the scorpion posterior tagmata, suggesting that posterior heteronomy in scorpions may have been achieved by neofunctionalization of Hox paralogs. However, both the first scorpion genome sequenced and the developmental genetic data are based on exemplars of Buthidae, one of 19 families of scorpions. It is therefore not known whether Hox paralogy is limited to Buthidae or widespread among scorpions. We surveyed 24 high throughput transcriptomes and the single whole genome available for scorpions, in order to test the prediction that Hox gene duplications are common to the order. We used gene tree parsimony to infer whether the paralogy was consistent with a duplication event in the scorpion common ancestor. Here we show that duplicated Hox genes in non-buthid scorpions occur in six of the ten Hox classes. Gene tree topologies and parsimony-based reconciliation of the gene trees are consistent with a duplication event in the most recent common ancestor of scorpions. These results suggest that a Hox paralogy, and by extension the model of posterior patterning established in a buthid, can be extended to non-Buthidae scorpions.

  20. Segmental duplication as one of the driving forces underlying the diversity of the human immunoglobulin heavy chain variable gene region

    Directory of Open Access Journals (Sweden)

    Gao Richeng

    2011-01-01

    Full Text Available Abstract Background Segmental duplication and deletion were implicated for a region containing the human immunoglobulin heavy chain variable (IGHV gene segments, 1.9III/hv3005 (possible allelic variants of IGHV3-30 and hv3019b9 (a possible allelic variant of IGHV3-33. However, very little is known about the ranges of the duplication and the polymorphic region. This is mainly because of the difficulty associated with distinguishing between allelic and paralogous sequences in the IGHV region containing extensive repetitive sequences. Inability to separate the two parental haploid genomes in the subjects is another serious barrier. To address these issues, unique DNA sequence tags evenly distributed within and flanking the duplicated region implicated by the previous studies were selected. The selected tags in single sperm from six unrelated healthy donors were amplified by multiplex PCR followed by microarray detection. In this way, individual haplotypes of different parental origins in the sperm donors could be analyzed separately and precisely. The identified polymorphic region was further analyzed at the nucleotide sequence level using sequences from the three human genomic sequence assemblies in the database. Results A large polymorphic region was identified using the selected sequence tags. Four of the 12 haplotypes were shown to contain consecutively undetectable tags spanning in a variable range. Detailed analysis of sequences from the genomic sequence assemblies revealed two large duplicate sequence blocks of 24,696 bp and 24,387 bp, respectively, and an incomplete copy of 961 bp in this region. It contains up to 13 IGHV gene segments depending on haplotypes. A polymorphic region was found to be located within the duplicated blocks. The variants of this polymorphism unusually diverged at the nucleotide sequence level and in IGHV gene segment number, composition and organization, indicating a limited selection pressure in general. However

  1. Duplication and diversification of the hypoxia-inducible IGFBP-1 gene in zebrafish.

    Directory of Open Access Journals (Sweden)

    Hiroyasu Kamei

    Full Text Available BACKGROUND: Gene duplication is the primary force of new gene evolution. Deciphering whether a pair of duplicated genes has evolved divergent functions is often challenging. The zebrafish is uniquely positioned to provide insight into the process of functional gene evolution due to its amenability to genetic and experimental manipulation and because it possess a large number of duplicated genes. METHODOLOGY/PRINCIPAL FINDINGS: We report the identification and characterization of two hypoxia-inducible genes in zebrafish that are co-ortholgs of human IGF binding protein-1 (IGFBP-1. IGFBP-1 is a secreted protein that binds to IGF and modulates IGF actions in somatic growth, development, and aging. Like their human and mouse counterparts, in adult zebrafish igfbp-1a and igfbp-1b are exclusively expressed in the liver. During embryogenesis, the two genes are expressed in overlapping spatial domains but with distinct temporal patterns. While zebrafish IGFBP-1a mRNA was easily detected throughout embryogenesis, IGFBP-1b mRNA was detectable only in advanced stages. Hypoxia induces igfbp-1a expression in early embryogenesis, but induces the igfbp-1b expression later in embryogenesis. Both IGFBP-1a and -b are capable of IGF binding, but IGFBP-1b has much lower affinities for IGF-I and -II because of greater dissociation rates. Overexpression of IGFBP-1a and -1b in zebrafish embryos caused significant decreases in growth and developmental rates. When tested in cultured zebrafish embryonic cells, IGFBP-1a and -1b both inhibited IGF-1-induced cell proliferation but the activity of IGFBP-1b was significantly weaker. CONCLUSIONS/SIGNIFICANCE: These results indicate subfunction partitioning of the duplicated IGFBP-1 genes at the levels of gene expression, physiological regulation, protein structure, and biological actions. The duplicated IGFBP-1 may provide additional flexibility in fine-tuning IGF signaling activities under hypoxia and other catabolic

  2. Copy number variants in the kallikrein gene cluster.

    Directory of Open Access Journals (Sweden)

    Pernilla Lindahl

    Full Text Available The kallikrein gene family (KLK1-KLK15 is the largest contiguous group of protease genes within the human genome and is associated with both risk and outcome of cancer and other diseases. We searched for copy number variants in all KLK genes using quantitative PCR analysis and analysis of inheritance patterns of single nucleotide polymorphisms. Two deletions were identified: one 2235-bp deletion in KLK9 present in 1.2% of alleles, and one 3394-bp deletion in KLK15 present in 4.0% of alleles. Each deletion eliminated one complete exon and created out-of-frame coding that eliminated the catalytic triad of the resulting truncated gene product, which therefore likely is a non-functional protein. Deletion breakpoints identified by DNA sequencing located the KLK9 deletion breakpoint to a long interspersed element (LINE repeated sequence, while the deletion in KLK15 is located in a single copy sequence. To search for an association between each deletion and risk of prostate cancer (PC, we analyzed a cohort of 667 biopsied men (266 PC cases and 401 men with no evidence of PC at biopsy using short deletion-specific PCR assays. There was no association between evidence of PC in this cohort and the presence of either gene deletion. Haplotyping revealed a single origin of each deletion, with most recent common ancestor estimates of 3000-8000 and 6000-14 000 years for the deletions in KLK9 and KLK15, respectively. The presence of the deletions on the same haplotypes in 1000 Genomes data of both European and African populations indicate an early origin of both deletions. The old age in combination with homozygous presence of loss-of-function variants suggests that some kallikrein-related peptidases have non-essential functions.

  3. Diversity and population-genetic properties of copy number variations and multicopy genes in cattle.

    Science.gov (United States)

    Bickhart, Derek M; Xu, Lingyang; Hutchison, Jana L; Cole, John B; Null, Daniel J; Schroeder, Steven G; Song, Jiuzhou; Garcia, Jose Fernando; Sonstegard, Tad S; Van Tassell, Curtis P; Schnabel, Robert D; Taylor, Jeremy F; Lewin, Harris A; Liu, George E

    2016-06-01

    The diversity and population genetics of copy number variation (CNV) in domesticated animals are not well understood. In this study, we analysed 75 genomes of major taurine and indicine cattle breeds (including Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, and Romagnola), sequenced to 11-fold coverage to identify 1,853 non-redundant CNV regions. Supported by high validation rates in array comparative genomic hybridization (CGH) and qPCR experiments, these CNV regions accounted for 3.1% (87.5 Mb) of the cattle reference genome, representing a significant increase over previous estimates of the area of the genome that is copy number variable (∼2%). Further population genetics and evolutionary genomics analyses based on these CNVs revealed the population structures of the cattle taurine and indicine breeds and uncovered potential diversely selected CNVs near important functional genes, including AOX1, ASZ1, GAT, GLYAT, and KRTAP9-1 Additionally, 121 CNV gene regions were found to be either breed specific or differentially variable across breeds, such as RICTOR in dairy breeds and PNPLA3 in beef breeds. In contrast, clusters of the PRP and PAG genes were found to be duplicated in all sequenced animals, suggesting that subfunctionalization, neofunctionalization, or overdominance play roles in diversifying those fertility-related genes. These CNV results provide a new glimpse into the diverse selection histories of cattle breeds and a basis for correlating structural variation with complex traits in the future. Published by Oxford University Press on behalf of Kazusa DNA Research Institute 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  4. 7 CFR 505.3 - Fees for paper copying, duplicating, and reproduction of materials in library collections.

    Science.gov (United States)

    2010-01-01

    ... of materials in library collections. 505.3 Section 505.3 Agriculture Regulations of the Department of Agriculture (Continued) AGRICULTURAL RESEARCH SERVICE, DEPARTMENT OF AGRICULTURE NATIONAL AGRICULTURAL LIBRARY... in library collections. (a) Photocopy reproduction of paper copy will be set as a flat fee of...

  5. Functional characterization of duplicated Suppressor of Overexpression of Constans 1-like genes in petunia.

    Directory of Open Access Journals (Sweden)

    Jill C Preston

    Full Text Available Flowering time is strictly controlled by a combination of internal and external signals that match seed set with favorable environmental conditions. In the model plant species Arabidopsis thaliana (Brassicaceae, many of the genes underlying development and evolution of flowering have been discovered. However, much remains unknown about how conserved the flowering gene networks are in plants with different growth habits, gene duplication histories, and distributions. Here we functionally characterize three homologs of the flowering gene Suppressor Of Overexpression of Constans 1 (SOC1 in the short-lived perennial Petunia hybrida (petunia, Solanaceae. Similar to A. thaliana soc1 mutants, co-silencing of duplicated petunia SOC1-like genes results in late flowering. This phenotype is most severe when all three SOC1-like genes are silenced. Furthermore, expression levels of the SOC1-like genes Unshaven (UNS and Floral Binding Protein 21 (FBP21, but not FBP28, are positively correlated with developmental age. In contrast to A. thaliana, petunia SOC1-like gene expression did not increase with longer photoperiods, and FBP28 transcripts were actually more abundant under short days. Despite evidence of functional redundancy, differential spatio-temporal expression data suggest that SOC1-like genes might fine-tune petunia flowering in response to photoperiod and developmental stage. This likely resulted from modification of SOC1-like gene regulatory elements following recent duplication, and is a possible mechanism to ensure flowering under both inductive and non-inductive photoperiods.

  6. Early vertebrate chromosome duplications and the evolution of the neuropeptide Y receptor gene regions

    Directory of Open Access Journals (Sweden)

    Brenner Sydney

    2008-06-01

    Full Text Available Abstract Background One of the many gene families that expanded in early vertebrate evolution is the neuropeptide (NPY receptor family of G-protein coupled receptors. Earlier work by our lab suggested that several of the NPY receptor genes found in extant vertebrates resulted from two genome duplications before the origin of jawed vertebrates (gnathostomes and one additional genome duplication in the actinopterygian lineage, based on their location on chromosomes sharing several gene families. In this study we have investigated, in five vertebrate genomes, 45 gene families with members close to the NPY receptor genes in the compact genomes of the teleost fishes Tetraodon nigroviridis and Takifugu rubripes. These correspond to Homo sapiens chromosomes 4, 5, 8 and 10. Results Chromosome regions with conserved synteny were identified and confirmed by phylogenetic analyses in H. sapiens, M. musculus, D. rerio, T. rubripes and T. nigroviridis. 26 gene families, including the NPY receptor genes, (plus 3 described recently by other labs showed a tree topology consistent with duplications in early vertebrate evolution and in the actinopterygian lineage, thereby supporting expansion through block duplications. Eight gene families had complications that precluded analysis (such as short sequence length or variable number of repeated domains and another eight families did not support block duplications (because the paralogs in these families seem to have originated in another time window than the proposed genome duplication events. RT-PCR carried out with several tissues in T. rubripes revealed that all five NPY receptors were expressed in the brain and subtypes Y2, Y4 and Y8 were also expressed in peripheral organs. Conclusion We conclude that the phylogenetic analyses and chromosomal locations of these gene families support duplications of large blocks of genes or even entire chromosomes. Thus, these results are consistent with two early vertebrate

  7. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies.

    Science.gov (United States)

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D

    2016-09-02

    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes-and that the butterfly proboscis is involved in digestive enzyme production.

  8. Zebrafish IGF genes: gene duplication, conservation and divergence, and novel roles in midline and notochord development.

    Directory of Open Access Journals (Sweden)

    Shuming Zou

    Full Text Available Insulin-like growth factors (IGFs are key regulators of development, growth, and longevity. In most vertebrate species including humans, there is one IGF-1 gene and one IGF-2 gene. Here we report the identification and functional characterization of 4 distinct IGF genes (termed as igf-1a, -1b, -2a, and -2b in zebrafish. These genes encode 4 structurally distinct and functional IGF peptides. IGF-1a and IGF-2a mRNAs were detected in multiple tissues in adult fish. IGF-1b mRNA was detected only in the gonad and IGF-2b mRNA only in the liver. Functional analysis showed that all 4 IGFs caused similar developmental defects but with different potencies. Many of these embryos had fully or partially duplicated notochords, suggesting that an excess of IGF signaling causes defects in the midline formation and an expansion of the notochord. IGF-2a, the most potent IGF, was analyzed in depth. IGF-2a expression caused defects in the midline formation and expansion of the notochord but it did not alter the anterior neural patterning. These results not only provide new insights into the functional conservation and divergence of the multiple igf genes but also reveal a novel role of IGF signaling in midline formation and notochord development in a vertebrate model.

  9. Phylogeny and divergence times of gymnosperms inferred from single-copy nuclear genes.

    Science.gov (United States)

    Lu, Ying; Ran, Jin-Hua; Guo, Dong-Mei; Yang, Zu-Yu; Wang, Xiao-Quan

    2014-01-01

    Phylogenetic reconstruction is fundamental to study evolutionary biology and historical biogeography. However, there was not a molecular phylogeny of gymnosperms represented by extensive sampling at the genus level, and most published phylogenies of this group were constructed based on cytoplasmic DNA markers and/or the multi-copy nuclear ribosomal DNA. In this study, we use LFY and NLY, two single-copy nuclear genes that originated from an ancient gene duplication in the ancestor of seed plants, to reconstruct the phylogeny and estimate divergence times of gymnosperms based on a complete sampling of extant genera. The results indicate that the combined LFY and NLY coding sequences can resolve interfamilial relationships of gymnosperms and intergeneric relationships of most families. Moreover, the addition of intron sequences can improve the resolution in Podocarpaceae but not in cycads, although divergence times of the cycad genera are similar to or longer than those of the Podocarpaceae genera. Our study strongly supports cycads as the basal-most lineage of gymnosperms rather than sister to Ginkgoaceae, and a sister relationship between Podocarpaceae and Araucariaceae and between Cephalotaxaceae-Taxaceae and Cupressaceae. In addition, intergeneric relationships of some families that were controversial, and the relationships between Taxaceae and Cephalotaxaceae and between conifers and Gnetales are discussed based on the nuclear gene evidence. The molecular dating analysis suggests that drastic extinctions occurred in the early evolution of gymnosperms, and extant coniferous genera in the Northern Hemisphere are older than those in the Southern Hemisphere on average. This study provides an evolutionary framework for future studies on gymnosperms.

  10. Phylogeny and divergence times of gymnosperms inferred from single-copy nuclear genes.

    Directory of Open Access Journals (Sweden)

    Ying Lu

    Full Text Available Phylogenetic reconstruction is fundamental to study evolutionary biology and historical biogeography. However, there was not a molecular phylogeny of gymnosperms represented by extensive sampling at the genus level, and most published phylogenies of this group were constructed based on cytoplasmic DNA markers and/or the multi-copy nuclear ribosomal DNA. In this study, we use LFY and NLY, two single-copy nuclear genes that originated from an ancient gene duplication in the ancestor of seed plants, to reconstruct the phylogeny and estimate divergence times of gymnosperms based on a complete sampling of extant genera. The results indicate that the combined LFY and NLY coding sequences can resolve interfamilial relationships of gymnosperms and intergeneric relationships of most families. Moreover, the addition of intron sequences can improve the resolution in Podocarpaceae but not in cycads, although divergence times of the cycad genera are similar to or longer than those of the Podocarpaceae genera. Our study strongly supports cycads as the basal-most lineage of gymnosperms rather than sister to Ginkgoaceae, and a sister relationship between Podocarpaceae and Araucariaceae and between Cephalotaxaceae-Taxaceae and Cupressaceae. In addition, intergeneric relationships of some families that were controversial, and the relationships between Taxaceae and Cephalotaxaceae and between conifers and Gnetales are discussed based on the nuclear gene evidence. The molecular dating analysis suggests that drastic extinctions occurred in the early evolution of gymnosperms, and extant coniferous genera in the Northern Hemisphere are older than those in the Southern Hemisphere on average. This study provides an evolutionary framework for future studies on gymnosperms.

  11. Phylogeny and Divergence Times of Gymnosperms Inferred from Single-Copy Nuclear Genes

    Science.gov (United States)

    Guo, Dong-Mei; Yang, Zu-Yu; Wang, Xiao-Quan

    2014-01-01

    Phylogenetic reconstruction is fundamental to study evolutionary biology and historical biogeography. However, there was not a molecular phylogeny of gymnosperms represented by extensive sampling at the genus level, and most published phylogenies of this group were constructed based on cytoplasmic DNA markers and/or the multi-copy nuclear ribosomal DNA. In this study, we use LFY and NLY, two single-copy nuclear genes that originated from an ancient gene duplication in the ancestor of seed plants, to reconstruct the phylogeny and estimate divergence times of gymnosperms based on a complete sampling of extant genera. The results indicate that the combined LFY and NLY coding sequences can resolve interfamilial relationships of gymnosperms and intergeneric relationships of most families. Moreover, the addition of intron sequences can improve the resolution in Podocarpaceae but not in cycads, although divergence times of the cycad genera are similar to or longer than those of the Podocarpaceae genera. Our study strongly supports cycads as the basal-most lineage of gymnosperms rather than sister to Ginkgoaceae, and a sister relationship between Podocarpaceae and Araucariaceae and between Cephalotaxaceae-Taxaceae and Cupressaceae. In addition, intergeneric relationships of some families that were controversial, and the relationships between Taxaceae and Cephalotaxaceae and between conifers and Gnetales are discussed based on the nuclear gene evidence. The molecular dating analysis suggests that drastic extinctions occurred in the early evolution of gymnosperms, and extant coniferous genera in the Northern Hemisphere are older than those in the Southern Hemisphere on average. This study provides an evolutionary framework for future studies on gymnosperms. PMID:25222863

  12. A duplicated PLP gene causing Pelizaeus-Merzbacher disease detected by comparative multiplex PCR

    Energy Technology Data Exchange (ETDEWEB)

    Inoue, K.; Sugiyama, N.; Kawanishi, C. [Yokohama City Univ., Yokohama (Japan)] [and others

    1996-07-01

    Pelizaeus-Merzbacher disease (PMD) is an X-linked dysmyelinating disorder caused by abnormalities in the proteolipid protein (PLP) gene, which is essential for oligodendrocyte differentiation and CNS myelin formation. Although linkage analysis has shown the homogeneity at the PLP locus in patients with PMD, exonic mutations in the PLP gene have been identified in only 10% - 25% of all cases, which suggests the presence of other genetic aberrations, including gene duplication. In this study, we examined five families with PMD not carrying exonic mutations in PLP gene, using comparative multiplex PCR (CM-PCR) as a semiquantitative assay of gene dosage. PLP gene duplications were identified in four families by CM-PCR and confirmed in three families by densitometric RFLP analysis. Because a homologous myelin protein gene, PMP22, is duplicated in the majority of patients with Charcot-Marie-Tooth 1A, PLP gene overdosage may be an important genetic abnormality in PMD and affect myelin formation. 38 ref., 5 figs., 2 tabs.

  13. Preferential duplication of intermodular hub genes: an evolutionary signature in eukaryotes genome networks.

    Directory of Open Access Journals (Sweden)

    Ricardo M Ferreira

    Full Text Available Whole genome protein-protein association networks are not random and their topological properties stem from genome evolution mechanisms. In fact, more connected, but less clustered proteins are related to genes that, in general, present more paralogs as compared to other genes, indicating frequent previous gene duplication episodes. On the other hand, genes related to conserved biological functions present few or no paralogs and yield proteins that are highly connected and clustered. These general network characteristics must have an evolutionary explanation. Considering data from STRING database, we present here experimental evidence that, more than not being scale free, protein degree distributions of organisms present an increased probability for high degree nodes. Furthermore, based on this experimental evidence, we propose a simulation model for genome evolution, where genes in a network are either acquired de novo using a preferential attachment rule, or duplicated with a probability that linearly grows with gene degree and decreases with its clustering coefficient. For the first time a model yields results that simultaneously describe different topological distributions. Also, this model correctly predicts that, to produce protein-protein association networks with number of links and number of nodes in the observed range for Eukaryotes, it is necessary 90% of gene duplication and 10% of de novo gene acquisition. This scenario implies a universal mechanism for genome evolution.

  14. A gene duplication led to specialized gamma-aminobutyrate and beta-alanine aminotransferase in yeast

    DEFF Research Database (Denmark)

    Andersen, Gorm; Andersen, Birgit; Dobritzsch, D.

    2007-01-01

    and related yeasts have two different genes/enzymes to apparently 'distinguish' between the two reactions in a single cell. It is likely that upon duplication similar to 200 million years ago, a specialized Uga1p evolved into a 'novel' transaminase enzyme with broader substrate specificity....

  15. Duplication and Divergence of Floral MADS-Box Genes in Grasses: Evidence for the Generation and Modification of Novel Regulators

    Institute of Scientific and Technical Information of China (English)

    Guixia Xu; Hongzhi Kong

    2007-01-01

    The process of flowering is controlled by a hierarchy of floral genes that act as flowering time genes, inflorescence/floral meristem identity genes, and/or floral organ-identity genes. The most important and well-characterized floral genes are those that belong to the MADS-box family of transcription factors. Compelling evidence suggests that floral MADS-box genes have experienced a few large-scale duplication events. In particular, the pre-core eudicot duplication events have been considered to correlate with the emergence and diversification of core eudicots. Duplication of floral MADS-box genes has also been documented in monocots, particularly in grasses, although a systematic study is lacking. In the present study, by conducting extensive phylogenetic analyses, we identified pre-Poaceae gene duplication events in each of the AP1, PI, AG, AGL11, AGL2/3/4, and AGL9gene lineages. Comparative genomic studies further indicated that some of these duplications actually resulted from the genome doubling event that occurred 66-70 million years ago (MYA). In addition, we found that after gene duplication, exonization (of intron sequences) and pseudoexonization (of exon sequences) have contributed to the divergence of duplicate genes in sequence structure and, possibly, gene function.

  16. Exact Algorithms for Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees.

    Science.gov (United States)

    Kordi, Misagh; Bansal, Mukul S

    2017-06-01

    Duplication-Transfer-Loss (DTL) reconciliation is a powerful method for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation seeks to reconcile gene trees with species trees by postulating speciation, duplication, transfer, and loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. In practice, however, gene trees are often non-binary due to uncertainty in the gene tree topologies, and DTL reconciliation with non-binary gene trees is known to be NP-hard. In this paper, we present the first exact algorithms for DTL reconciliation with non-binary gene trees. Specifically, we (i) show that the DTL reconciliation problem for non-binary gene trees is fixed-parameter tractable in the maximum degree of the gene tree, (ii) present an exponential-time, but in-practice efficient, algorithm to track and enumerate all optimal binary resolutions of a non-binary input gene tree, and (iii) apply our algorithms to a large empirical data set of over 4700 gene trees from 100 species to study the impact of gene tree uncertainty on DTL-reconciliation and to demonstrate the applicability and utility of our algorithms. The new techniques and algorithms introduced in this paper will help biologists avoid incorrect evolutionary inferences caused by gene tree uncertainty.

  17. A duplicated coxI gene is associated with cytoplasmic male sterility in an alloplasmic Brassica juncea line derived from somatic hybridization with Diplotaxis catholica

    Indian Academy of Sciences (India)

    Aruna Pathania; Rajesh Kumar; V. Dinesh Kumar; Ashutosh; K. K. Dwivedi; P. B. Kirti; P. Prakash; V. L. Chopra; S. R. Bhat

    2007-08-01

    A cytoplasmic male sterile (CMS) line of Brassica juncea was derived by repeated backcrossing of the somatic hybrid (Diplotaxis catholica + B. juncea) to B. juncea. The new CMS line is comparable to euplasmic lines for almost all characters, except for flowers which bear slender, needle-like anthers with aborted pollen. Detailed Southern analysis revealed two copies of coxI gene in the CMS line. One copy, coxI-1 is similar to the coxI gene of B. juncea, whereas the second copy, coxI-2 is present in a novel rearranged region. Northern analysis with eight mitochondrial gene probes showed altered transcript pattern only for the coxI gene. Two transcripts of 2.0 and 2.4 kb, respectively, were detected in the CMS line. The novel 2.4 kb transcript was present in floral bud tissue but absent in the leaf tissue. In plants where male sterility broke down under high temperature during the later part of the growing season, the 2.4 kb coxI transcript was absent, which suggested its association with the CMS. The two coxI genes from the CMS line showed two amino acid changes in the coding region. The novel coxI gene showed unique repeats in the 5′ region suggesting recombination of mitochondrial genomes of the two species. The possible role of the duplicated coxI gene in causing male sterility is discussed.

  18. Restriction fragment length polymorphism and multiple copies of DNA sequences homologous with probes for P-fimbriae and hemolysin genes among uropathogenic Escherichia coli.

    Science.gov (United States)

    Hull, S I; Bieler, S; Hull, R A

    1988-03-01

    Hemolysin and P-fimbriae are two virulence traits frequently found together in uropathogenic Escherichia coli. Previous studies have discovered evidence both for linkage between the genes for these traits and for their duplication in the chromosomes of a limited number of strains. To test whether these observations are characteristic of uropathogenic Escherichia coli, the method of DNA hybridization to DNA restriction fragments separated by electrophoresis and transferred to nylon was used to determine copy number of genes for P-fimbriae (pap) among 51 E. coli strains isolated from symptomatic urinary tract infections. Twenty percent of the strains had more than one copy of pap homologous sequences. Fifteen strains, each representing a unique clone, were examined for the presence of sequences homologous with cloned hemolysin genes (hly). Samples of DNA from 14 of the 15 strains hybridized with hly probes. In eight strains the number of copies of pap equalled the number of copies of hly, including one strain with two apparent copies of each. Five strains appeared to have one more copy of pap than of hly, and one strain had an extra copy of hly.

  19. Species-specific duplications of NBS-encoding genes in Chinese chestnut (Castanea mollissima)

    Science.gov (United States)

    Zhong, Yan; Li, Yingjun; Huang, Kaihui; Cheng, Zong-Ming

    2015-01-01

    The disease resistance (R) genes play an important role in protecting plants from infection by diverse pathogens in the environment. The nucleotide-binding site (NBS)-leucine-rich repeat (LRR) class of genes is one of the largest R gene families. Chinese chestnut (Castanea mollissima) is resistant to Chestnut Blight Disease, but relatively little is known about the resistance mechanism. We identified 519 NBS-encoding genes, including 374 NBS-LRR genes and 145 NBS-only genes. The majority of Ka/Ks were less than 1, suggesting the purifying selection operated during the evolutionary history of NBS-encoding genes. A minority (4/34) of Ka/Ks in non-TIR gene families were greater than 1, showing that some genes were under positive selection pressure. Furthermore, Ks peaked at a range of 0.4 to 0.5, indicating that ancient duplications arose during the evolution. The relationship between Ka/Ks and Ks indicated greater selective pressure on the newer and older genes with the critical value of Ks = 0.4–0.5. Notably, species-specific duplications were detected in NBS-encoding genes. In addition, the group of RPW8-NBS-encoding genes clustered together as an independent clade located at a relatively basal position in the phylogenetic tree. Many cis-acting elements related to plant defense responses were detected in promoters of NBS-encoding genes. PMID:26559332

  20. Insight into transcription factor gene duplication from Caenorhabditis elegans Promoterome-driven expression patterns

    Directory of Open Access Journals (Sweden)

    Vidal Marc

    2007-01-01

    Full Text Available Abstract Background The C. elegans Promoterome is a powerful resource for revealing the regulatory mechanisms by which transcription is controlled pan-genomically. Transcription factors will form the core of any systems biology model of genome control and therefore the promoter activity of Promoterome inserts for C. elegans transcription factor genes was examined, in vivo, with a reporter gene approach. Results Transgenic C. elegans strains were generated for 366 transcription factor promoter/gfp reporter gene fusions. GFP distributions were determined, and then summarized with reference to developmental stage and cell type. Reliability of these data was demonstrated by comparison to previously described gene product distributions. A detailed consideration of the results for one C. elegans transcription factor gene family, the Six family, comprising ceh-32, ceh-33, ceh-34 and unc-39 illustrates the value of these analyses. The high proportion of Promoterome reporter fusions that drove GFP expression, compared to previous studies, led to the hypothesis that transcription factor genes might be involved in local gene duplication events less frequently than other genes. Comparison of transcription factor genes of C. elegans and Caenorhabditis briggsae was therefore carried out and revealed very few examples of functional gene duplication since the divergence of these species for most, but not all, transcription factor gene families. Conclusion Examining reporter expression patterns for hundreds of promoters informs, and thereby improves, interpretation of this data type. Genes encoding transcription factors involved in intrinsic developmental control processes appear acutely sensitive to changes in gene dosage through local gene duplication, on an evolutionary time scale.

  1. Copy-number and gene dependency analysis reveals partial copy loss of wild-type SF3B1 as a novel cancer vulnerability. | Office of Cancer Genomics

    Science.gov (United States)

    Genomic instability is a hallmark of human cancer, and results in widespread somatic copy number alterations. We used a genome-scale shRNA viability screen in human cancer cell lines to systematically identify genes that are essential in the context of particular copy-number alterations (copy-number associated gene dependencies). The most enriched class of copy-number associated gene dependencies was CYCLOPS (Copy-number alterations Yielding Cancer Liabilities Owing to Partial losS) genes, and spliceosome components were the most prevalent.

  2. Ancient gene duplication provided a key molecular step for anaerobic growth of Baker's yeast.

    Science.gov (United States)

    Hayashi, Masaya; Schilke, Brenda; Marszalek, Jaroslaw; Williams, Barry; Craig, Elizabeth A

    2011-07-01

    Mitochondria are essential organelles required for a number of key cellular processes. As most mitochondrial proteins are nuclear encoded, their efficient translocation into the organelle is critical. Transport of proteins across the inner membrane is driven by a multicomponent, matrix-localized "import motor," which is based on the activity of the molecular chaperone Hsp70 and a J-protein cochaperone. In Saccharomyces cerevisiae, two paralogous J-proteins, Pam18 and Mdj2, can form the import motor. Both contain transmembrane and matrix domains, with Pam18 having an additional intermembrane space (IMS) domain. Evolutionary analyses revealed that the origin of the IMS domain of S. cerevisiae Pam18 coincides with a gene duplication event that generated the PAM18/MDJ2 gene pair. The duplication event and origin of the Pam18 IMS domain occurred at the relatively ancient divergence of the fungal subphylum Saccharomycotina. The timing of the duplication event also corresponds with a number of additional functional changes related to mitochondrial function and respiration. Physiological and genetic studies revealed that the IMS domain of Pam18 is required for efficient growth under anaerobic conditions, even though it is dispensable when oxygen is present. Thus, the gene duplication was beneficial for growth capacity under particular environmental conditions as well as diversification of the import motor components.

  3. A single enhancer regulating the differential expression of duplicated red-sensitive opsin genes in zebrafish.

    Directory of Open Access Journals (Sweden)

    Taro Tsujimura

    2010-12-01

    Full Text Available A fundamental step in the evolution of the visual system is the gene duplication of visual opsins and differentiation between the duplicates in absorption spectra and expression pattern in the retina. However, our understanding of the mechanism of expression differentiation is far behind that of spectral tuning of opsins. Zebrafish (Danio rerio have two red-sensitive cone opsin genes, LWS-1 and LWS-2. These genes are arrayed in a tail-to-head manner, in this order, and are both expressed in the long member of double cones (LDCs in the retina. Expression of the longer-wave sensitive LWS-1 occurs later in development and is thus confined to the peripheral, especially ventral-nasal region of the adult retina, whereas expression of LWS-2 occurs earlier and is confined to the central region of the adult retina, shifted slightly to the dorsal-temporal region. In this study, we employed a transgenic reporter assay using fluorescent proteins and P1-artificial chromosome (PAC clones encompassing the two genes and identified a 0.6-kb "LWS-activating region" (LAR upstream of LWS-1, which regulates expression of both genes. Under the 2.6-kb flanking upstream region containing the LAR, the expression pattern of LWS-1 was recapitulated by the fluorescent reporter. On the other hand, when LAR was directly conjugated to the LWS-2 upstream region, the reporter was expressed in the LDCs but also across the entire outer nuclear layer. Deletion of LAR from the PAC clones drastically lowered the reporter expression of the two genes. These results suggest that LAR regulates both LWS-1 and LWS-2 by enhancing their expression and that interaction of LAR with the promoters is competitive between the two genes in a developmentally restricted manner. Sharing a regulatory region between duplicated genes could be a general way to facilitate the expression differentiation in duplicated visual opsins.

  4. New Organelles by Gene Duplication in a Biophysical Model of Eukaryote Endomembrane Evolution

    OpenAIRE

    Ramadas, Rohini; Thattai, Mukund

    2013-01-01

    Extant eukaryotic cells have a dynamic traffic network that consists of diverse membrane-bound organelles exchanging matter via vesicles. This endomembrane system arose and diversified during a period characterized by massive expansions of gene families involved in trafficking after the acquisition of a mitochondrial endosymbiont by a prokaryotic host cell >1.8 billion years ago. Here we investigate the mechanistic link between gene duplication and the emergence of new nonendosymbiotic organe...

  5. CTDGFinder: A Novel Homology-Based Algorithm for Identifying Closely Spaced Clusters of Tandemly Duplicated Genes.

    Science.gov (United States)

    Ortiz, Juan F; Rokas, Antonis

    2017-01-01

    Closely spaced clusters of tandemly duplicated genes (CTDGs) contribute to the diversity of many phenotypes, including chemosensation, snake venom, and animal body plans. CTDGs have traditionally been identified subjectively as genomic neighborhoods containing several gene duplicates in close proximity; however, CTDGs are often highly variable with respect to gene number, intergenic distance, and synteny. This lack of formal definition hampers the study of CTDG evolutionary dynamics and the discovery of novel CTDGs in the exponentially growing body of genomic data. To address this gap, we developed a novel homology-based algorithm, CTDGFinder, which formalizes and automates the identification of CTDGs by examining the physical distribution of individual members of families of duplicated genes across chromosomes. Application of CTDGFinder accurately identified CTDGs for many well-known gene clusters (e.g., Hox and beta-globin gene clusters) in the human, mouse and 20 other mammalian genomes. Differences between previously annotated gene clusters and our inferred CTDGs were due to the exclusion of nonhomologs that have historically been considered parts of specific gene clusters, the inclusion or absence of genes between the CTDGs and their corresponding gene clusters, and the splitting of certain gene clusters into distinct CTDGs. Examination of human genes showing tissue-specific enhancement of their expression by CTDGFinder identified members of several well-known gene clusters (e.g., cytochrome P450s and olfactory receptors) and revealed that they were unequally distributed across tissues. By formalizing and automating CTDG identification, CTDGFinder will facilitate understanding of CTDG evolutionary dynamics, their functional implications, and how they are associated with phenotypic diversity. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e

  6. Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels

    Directory of Open Access Journals (Sweden)

    Ma Hong

    2010-02-01

    Full Text Available Abstract Background Although the overwhelming majority of genes found in angiosperms are members of gene families, and both gene- and genome-duplication are pervasive forces in plant genomes, some genes are sufficiently distinct from all other genes in a genome that they can be operationally defined as 'single copy'. Using the gene clustering algorithm MCL-tribe, we have identified a set of 959 single copy genes that are shared single copy genes in the genomes of Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa. To characterize these genes, we have performed a number of analyses examining GO annotations, coding sequence length, number of exons, number of domains, presence in distant lineages, such as Selaginella and Physcomitrella, and phylogenetic analysis to estimate copy number in other seed plants and to demonstrate their phylogenetic utility. We then provide examples of how these genes may be used in phylogenetic analyses to reconstruct organismal history, both by using extant coverage in EST databases for seed plants and de novo amplification via RT-PCR in the family Brassicaceae. Results There are 959 single copy nuclear genes shared in Arabidopsis, Populus, Vitis and Oryza ["APVO SSC genes"]. The majority of these genes are also present in the Selaginella and Physcomitrella genomes. Public EST sets for 197 species suggest that most of these genes are present across a diverse collection of seed plants, and appear to exist as single or very low copy genes, though exceptions are seen in recently polyploid taxa and in lineages where there is significant evidence for a shared large-scale duplication event. Genes encoding proteins localized in organelles are more commonly single copy than expected by chance, but the evolutionary forces responsible for this bias are unknown. Regardless of the evolutionary mechanisms responsible for the large number of shared single copy genes in diverse flowering plant lineages, these

  7. Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels

    Science.gov (United States)

    2010-01-01

    Background Although the overwhelming majority of genes found in angiosperms are members of gene families, and both gene- and genome-duplication are pervasive forces in plant genomes, some genes are sufficiently distinct from all other genes in a genome that they can be operationally defined as 'single copy'. Using the gene clustering algorithm MCL-tribe, we have identified a set of 959 single copy genes that are shared single copy genes in the genomes of Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa. To characterize these genes, we have performed a number of analyses examining GO annotations, coding sequence length, number of exons, number of domains, presence in distant lineages, such as Selaginella and Physcomitrella, and phylogenetic analysis to estimate copy number in other seed plants and to demonstrate their phylogenetic utility. We then provide examples of how these genes may be used in phylogenetic analyses to reconstruct organismal history, both by using extant coverage in EST databases for seed plants and de novo amplification via RT-PCR in the family Brassicaceae. Results There are 959 single copy nuclear genes shared in Arabidopsis, Populus, Vitis and Oryza ["APVO SSC genes"]. The majority of these genes are also present in the Selaginella and Physcomitrella genomes. Public EST sets for 197 species suggest that most of these genes are present across a diverse collection of seed plants, and appear to exist as single or very low copy genes, though exceptions are seen in recently polyploid taxa and in lineages where there is significant evidence for a shared large-scale duplication event. Genes encoding proteins localized in organelles are more commonly single copy than expected by chance, but the evolutionary forces responsible for this bias are unknown. Regardless of the evolutionary mechanisms responsible for the large number of shared single copy genes in diverse flowering plant lineages, these genes are valuable for

  8. Application of droplet digital PCR to determine copy number of endogenous genes and transgenes in sugarcane.

    Science.gov (United States)

    Sun, Yue; Joyce, Priya Aiyar

    2017-08-28

    Droplet digital PCR combined with the low copy ACT allele as endogenous reference gene, makes accurate and rapid estimation of gene copy number in Q208 (A) and Q240 (A) attainable. Sugarcane is an important cultivated crop with both high polyploidy and aneuploidy in its 10 Gb genome. Without a known copy number reference gene, it is difficult to accurately estimate the copy number of any gene of interest by PCR-based methods in sugarcane. Recently, a new technology, known as droplet digital PCR (ddPCR) has been developed which can measure the absolute amount of the target DNA in a given sample. In this study, we deduced the true copy number of three endogenous genes, actin depolymerizing factor (ADF), adenine phosphoribosyltransferase (APRT) and actin (ACT) in three Australian sugarcane varieties, using ddPCR by comparing the absolute amounts of the above genes with a transgene of known copy number. A single copy of the ACT allele was detected in Q208 (A) , two copies in Q240 (A) , but was absent in Q117. Copy number variation was also observed for both APRT and ADF, and ranged from 9 to 11 in the three tested varieties. Using this newly developed ddPCR method, transgene copy number was successfully determined in 19 transgenic Q208 (A) and Q240 (A) events using ACT as the reference endogenous gene. Our study demonstrates that ddPCR can be used for high-throughput genetic analysis and is a quick, accurate and reliable alternative method for gene copy number determination in sugarcane. This discovered ACT allele would be a suitable endogenous reference gene for future gene copy number variation and dosage studies of functional genes in Q208 (A) and Q240 (A) .

  9. Genomics 4.0 : syntenic gene and genome duplication drives diversification of plant secondary metabolism and innate immunity in flowering plants : advanced pattern analytics in duplicate genomes

    NARCIS (Netherlands)

    Hofberger, J.A.

    2015-01-01

    Genomics 4.0 - Syntenic Gene and Genome Duplication Drives Diversification of Plant Secondary Metabolism and Innate Immunity in Flowering Plants   Johannes A. Hofberger1, 2, 3 1 Biosystematics Group, Wageningen University & Research Center, Droevendaalsesteeg 1, 6708 PB Wageningen, The Neth

  10. Higher primates, but not New World monkeys, have a duplicate set of enhancers flanking their apoC-I genes.

    Science.gov (United States)

    Puppione, Donald L

    2014-09-01

    Previous studies have demonstrated that the apoC-I gene and its pseudogene on human chromosome 19 are flanked by a duplicate set of enhancers. Multienhancers, ME.1 and ME.2, are located upstream from the genes and the hepatic control region enhancers, HCR.1 and HCR.2, are located downstream. The duplication of the enhancers has been thought to have occurred when the apoC-I gene was duplicated during primate evolution. Currently, the only primate data are for the human enhancers. Examining the genome of other primates (great and lesser apes, Old and New World monkeys), it was possible to locate the duplicate set of enhancers in apes and Old World monkeys. However, only a single set was found in New World monkeys. These observations provide additional evidence that the apoC-I gene and the flanking enhancers underwent duplication after the divergence of Old and New World monkeys.

  11. Physical Mapping of Amplified Copies of the 5-Enolpyruvylshikimate-3-Phosphate Synthase Gene in Glyphosate-Resistant Amaranthus tuberculatus1[OPEN

    Science.gov (United States)

    Dillon, Andrew; Varanasi, Vijay K.; Koo, Dal-Hoe; Nakka, Sridevi; Peterson, Dallas E.; Friebe, Bernd

    2017-01-01

    Recent and rapid evolution of resistance to glyphosate, the most widely used herbicides, in several weed species, including common waterhemp (Amaranthus tuberculatus), poses a serious threat to sustained crop production. We report that glyphosate resistance in A. tuberculatus was due to amplification of the 5-enolpyruvylshikimate-3-P synthase (EPSPS) gene, which encodes the molecular target of glyphosate. There was a positive correlation between EPSPS gene copies and its transcript expression. We analyzed the distribution of EPSPS copies in the genome of A. tuberculatus using fluorescence in situ hybridization on mitotic metaphase chromosomes and interphase nuclei. Fluorescence in situ hybridization analysis mapped the EPSPS gene to pericentromeric regions of two homologous chromosomes in glyphosate sensitive A. tuberculatus. In glyphosate-resistant plants, a cluster of EPSPS genes on the pericentromeric region on one pair of homologous chromosomes was detected. Intriguingly, two highly glyphosate-resistant plants harbored an additional chromosome with several EPSPS copies besides the native chromosome pair with EPSPS copies. These results suggest that the initial event of EPSPS gene duplication may have occurred because of unequal recombination mediated by repetitive DNA. Subsequently, gene amplification may have resulted via several other mechanisms, such as chromosomal rearrangements, deletion/insertion, transposon-mediated dispersion, or possibly by interspecific hybridization. This report illustrates the physical mapping of amplified EPSPS copies in A. tuberculatus. PMID:27956489

  12. Cheetahs have 4 serum amyloid a genes evolved through repeated duplication events.

    Science.gov (United States)

    Chen, Lei; Une, Yumi; Higuchi, Keiichi; Mori, Masayuki

    2012-01-01

    Amyloid A (AA) amyloidosis is a leading cause of mortality in captive cheetahs (Acinonyx jubatus). We performed genome walking and PCR cloning and revealed that cheetahs have 4 SAA genes (provisionally named SAA1A, SAA1B, SAA3A, and SAA3B). In addition, we identified multiple nucleotide polymorphisms in the 4 SAA genes by screening 51 cheetahs. The polymorphisms defined 4, 7, 6, and 4 alleles for SAA1A, SAA3A, SAA1B, and SAA3B, respectively. Pedigree analysis of the inheritance of genotypes for the SAA genes revealed that specific combinations of alleles for the 4 SAA genes cosegregated as a unit (haplotype) in pedigrees, indicating that the 4 genes were linked on the same chromosome. Notably, cheetah SAA1A and SAA1B were highly homologous in their nucleotide sequences. Likewise, SAA3A and SAA3B genes were homologous. These observations suggested a model for the evolution of the 4 SAA genes in cheetahs in which duplication of an ancestral SAA gene first gave rise to SAA1 and SAA3. Subsequently, each gene duplicated one more time, uniquely making 4 genes in the cheetah genome. The monomorphism of the cheetah SAA1A protein might be one of the factors responsible for the high incidence of AA amyloidosis in this species.

  13. Specific duplication and dorsoventrally asymmetric expression patterns of Cycloidea-like genes in zygomorphic species of Ranunculaceae.

    Directory of Open Access Journals (Sweden)

    Florian Jabbour

    Full Text Available Floral bilateral symmetry (zygomorphy has evolved several times independently in angiosperms from radially symmetrical (actinomorphic ancestral states. Homologs of the Antirrhinum majus Cycloidea gene (Cyc have been shown to control floral symmetry in diverse groups in core eudicots. In the basal eudicot family Ranunculaceae, there is a single evolutionary transition from actinomorphy to zygomorphy in the stem lineage of the tribe Delphinieae. We characterized Cyc homologs in 18 genera of Ranunculaceae, including the four genera of Delphinieae, in a sampling that represents the floral morphological diversity of this tribe, and reconstructed the evolutionary history of this gene family in Ranunculaceae. Within each of the two RanaCyL (Ranunculaceae Cycloidea-like lineages previously identified, an additional duplication possibly predating the emergence of the Delphinieae was found, resulting in up to four gene copies in zygomorphic species. Expression analyses indicate that the RanaCyL paralogs are expressed early in floral buds and that the duration of their expression varies between species and paralog class. At most one RanaCyL paralog was expressed during the late stages of floral development in the actinomorphic species studied whereas all paralogs from the zygomorphic species were expressed, composing a species-specific identity code for perianth organs. The contrasted asymmetric patterns of expression observed in the two zygomorphic species is discussed in relation to their distinct perianth architecture.

  14. Reticulate evolution in diploid and tetraploid species of Polystachya (Orchidaceae) as shown by plastid DNA sequences and low-copy nuclear genes

    Science.gov (United States)

    Russell, Anton; Samuel, Rosabelle; Klejna, Verena; Barfuss, Michael H. J.; Rupp, Barbara; Chase, Mark W.

    2010-01-01

    Background and Aims Here evidence for reticulation in the pantropical orchid genus Polystachya is presented, using gene trees from five nuclear and plastid DNA data sets, first among only diploid samples (homoploid hybridization) and then with the inclusion of cloned tetraploid sequences (allopolyploids). Two groups of tetraploids are compared with respect to their origins and phylogenetic relationships. Methods Sequences from plastid regions, three low-copy nuclear genes and ITS nuclear ribosomal DNA were analysed for 56 diploid and 17 tetraploid accessions using maximum parsimony and Bayesian inference. Reticulation was inferred from incongruence between gene trees using supernetwork and consensus network analyses and from cloning and sequencing duplicated loci in tetraploids. Key Results Diploid trees from individual loci showed considerable incongruity but little reticulation signal when support from more than one gene tree was required to infer reticulation. This was coupled with generally low support in the individual gene trees. Sequencing the duplicated gene copies in tetraploids showed clearer evidence of hybrid evolution, including multiple origins of one group of tetraploids included in the study. Conclusions A combination of cloning duplicate gene copies in allotetraploids and consensus network comparison of gene trees allowed a phylogenetic framework for reticulation in Polystachya to be built. There was little evidence for homoploid hybridization, but our knowledge of the origins and relationships of three groups of allotetraploids are greatly improved by this study. One group showed evidence of multiple long-distance dispersals to achieve a pantropical distribution; another showed no evidence of multiple origins or long-distance dispersal but had greater morphological variation, consistent with hybridization between more distantly related parents. PMID:20525745

  15. Gains, losses and changes of function after gene duplication: study of the metallothionein family.

    Directory of Open Access Journals (Sweden)

    Ana Moleirinho

    Full Text Available Metallothioneins (MT are small proteins involved in heavy metal detoxification and protection against oxidative stress and cancer. The mammalian MT family originated through a series of duplication events which generated four major genes (MT1 to MT4. MT1 and MT2 encode for ubiquitous proteins, while MT3 and MT4 evolved to accomplish specific roles in brain and epithelium, respectively. Herein, phylogenetic, transcriptional and polymorphic analyses are carried out to expose gains, losses and diversification of functions that characterize the evolutionary history of the MT family. The phylogenetic analyses show that all four major genes originated through a single duplication event prior to the radiation of mammals. Further expansion of the MT1 gene has occurred in the primate lineage reaching in humans a total of 13 paralogs, five of which are pseudogenes. In humans, the reading frame of all five MT1 pseudogenes is reconstructed by sequence homology with a functional duplicate revealing that loss of invariant cysteines is the most frequent event accounting for pseudogeneisation. Expression analyses based on EST counts and RT-PCR experiments show that, as for MT1 and MT2, human MT3 is also ubiquitously expressed while MT4 transcripts are present in brain, testes, esophagus and mainly in thymus. Polymorphic variation reveals two deleterious mutations (Cys30Tyr and Arg31Trp in MT4 with frequencies reaching about 30% in African and Asian populations suggesting the gene is inactive in some individuals and physiological compensation for its loss must arise from a functional equivalent. Altogether our findings provide novel data on the evolution and diversification of MT gene duplicates, a valuable resource for understanding the vast set of biological processes in which these proteins are involved.

  16. The effect of functional compensation among duplicate genes can constrain their evolutionary divergence

    Indian Academy of Sciences (India)

    Joseph Esfandiar Hannon Bozorgmehr

    2011-04-01

    Gene duplicates have the inherent property of initially being functionally redundant. This means that they can compensate for the effect of deleterious variation occurring at one or more sister sites. Here, I present data bearing on evolutionary theory that illustrates the manner in which any functional adaptation in duplicate genes is markedly constrained because of the compensatory utility provided by a sustained genetic redundancy. Specifically, a two-locus epistatic model of paralogous genes was simulated to investigate the degree of purifying selection imposed, and whether this would serve to impede any possible biochemical innovation. Three population sizes were considered to see if, as expected, there was a significant difference in any selection for robustness. Interestingly, physical linkage between tandem duplicates was actually found to increase the probability of any neofunctionalization and the efficacy of selection, contrary to what is expected in the case of singleton genes. The results indicate that an evolutionary trade-off often exists between any functional change under either positive or relaxed selection and the need to compensate for failures due to degenerative mutations, thereby guaranteeing the reliability of protein production.

  17. Duplication of 7q36.3 encompassing the Sonic Hedgehog (SHH) gene is associated with congenital muscular hypertrophy

    DEFF Research Database (Denmark)

    Kroeldrup, L; Kjaergaard, S; Kirchhoff, Eva Maria

    2012-01-01

    with muscular hypertrophy and mildly retarded psychomotor development. Array-CGH identified a small duplication of 7q36.3 including the Sonic Hedgehog (SHH) gene in both the aborted foetus and the live born male sib. Neither of the parents carried the 7q36.3 duplication. The consequences of overexpression...

  18. Genome-wide copy number analysis uncovers a new HSCR gene: NRG3.

    Directory of Open Access Journals (Sweden)

    Clara Sze-Man Tang

    Full Text Available Hirschsprung disease (HSCR is a congenital disorder characterized by aganglionosis of the distal intestine. To assess the contribution of copy number variants (CNVs to HSCR, we analysed the data generated from our previous genome-wide association study on HSCR patients, whereby we identified NRG1 as a new HSCR susceptibility locus. Analysis of 129 Chinese patients and 331 ethnically matched controls showed that HSCR patients have a greater burden of rare CNVs (p = 1.50 × 10(-5, particularly for those encompassing genes (p = 5.00 × 10(-6. Our study identified 246 rare-genic CNVs exclusive to patients. Among those, we detected a NRG3 deletion (p = 1.64 × 10(-3. Subsequent follow-up (96 additional patients and 220 controls on NRG3 revealed 9 deletions (combined p = 3.36 × 10(-5 and 2 de novo duplications among patients and two deletions among controls. Importantly, NRG3 is a paralog of NRG1. Stratification of patients by presence/absence of HSCR-associated syndromes showed that while syndromic-HSCR patients carried significantly longer CNVs than the non-syndromic or controls (p = 1.50 × 10(-5, non-syndromic patients were enriched in CNV number when compared to controls (p = 4.00 × 10(-6 or the syndromic counterpart. Our results suggest a role for NRG3 in HSCR etiology and provide insights into the relative contribution of structural variants in both syndromic and non-syndromic HSCR. This would be the first genome-wide catalog of copy number variants identified in HSCR.

  19. The butterfly plant arms-race escalated by gene and genome duplications.

    Science.gov (United States)

    Edger, Patrick P; Heidel-Fischer, Hanna M; Bekaert, Michaël; Rota, Jadranka; Glöckner, Gernot; Platts, Adrian E; Heckel, David G; Der, Joshua P; Wafula, Eric K; Tang, Michelle; Hofberger, Johannes A; Smithson, Ann; Hall, Jocelyn C; Blanchette, Matthieu; Bureau, Thomas E; Wright, Stephen I; dePamphilis, Claude W; Eric Schranz, M; Barker, Michael S; Conant, Gavin C; Wahlberg, Niklas; Vogel, Heiko; Pires, J Chris; Wheat, Christopher W

    2015-07-07

    Coevolutionary interactions are thought to have spurred the evolution of key innovations and driven the diversification of much of life on Earth. However, the genetic and evolutionary basis of the innovations that facilitate such interactions remains poorly understood. We examined the coevolutionary interactions between plants (Brassicales) and butterflies (Pieridae), and uncovered evidence for an escalating evolutionary arms-race. Although gradual changes in trait complexity appear to have been facilitated by allelic turnover, key innovations are associated with gene and genome duplications. Furthermore, we show that the origins of both chemical defenses and of molecular counter adaptations were associated with shifts in diversification rates during the arms-race. These findings provide an important connection between the origins of biodiversity, coevolution, and the role of gene and genome duplications as a substrate for novel traits.

  20. Droplet digital PCR-aided screening and characterization of Pichia pastoris multiple gene copy strains.

    Science.gov (United States)

    Cámara, Elena; Albiol, Joan; Ferrer, Pau

    2016-07-01

    Pichia (syn. Komagataella) pastoris is a widely used yeast platform for heterologous protein production. Expression cassettes are usually stably integrated into the genome of this host via homologous recombination. Although increasing gene dosage is a powerful strategy to improve recombinant protein production, an excess in the number of gene copies often leads to decreased product yields and increased metabolic burden, particularly for secreted proteins. We have constructed a series of strains harboring different copy numbers of a Rhizopus oryzae lipase gene (ROL), aiming to find the optimum gene dosage for secreted Rol production. In order to accurately determine ROL gene dosage, we implemented a novel protocol based on droplet digital PCR (ddPCR), and cross validated it with conventional real-time PCR. Gene copy number determination based on ddPCR allowed for an accurate ranking of transformants according to their ROL gene dosage. Results indicated that ddPCR was particularly superior at lower gene dosages (one to five copies) over quantitative real-time PCR (qPCR). This facilitated the determination of the optimal ROL gene dosage as low as two copies. The ranking of ROL gene dosage versus Rol yield was consistent at both small scale and bioreactor chemostat cultures, thereby easing clone characterization in terms of gene dosage dependent physiological effects, which could be discriminated even among strains differing by only one ROL copy. A selected two-copy strain showed twofold increase in Rol specific production in a chemostat culture over the single copy strain. Conversely, strains harboring more than two copies of the ROL gene showed decreased product and biomass yields, as well as altered substrate consumption specific rates, compared to the reference (one-copy) strain. Biotechnol. Bioeng. 2016;113: 1542-1551. © 2015 Wiley Periodicals, Inc.

  1. Gene duplication of the human peptide YY gene (PYY) generated the pancreatic polypeptide gene (PPY) on chromosome 17q21.1

    Energy Technology Data Exchange (ETDEWEB)

    Hort, Y.; Shine, J.; Herzog, H. [Garvan Inst. of Medical Research, Sydney (Australia)

    1995-03-01

    Neuropeptide Y (NPY), peptide YY (PYY), and pancreatic polypeptide (PP) are structurally related but functionally diverse peptides, encoded by separate genes and expressed in different tissues. Although the human NPY gene has been mapped to chromosome 7, the authors demonstrate here that the genes for human PYY and PP (PPY) are localized only 10 kb apart from each another on chromosome 17q21.1. The high degree of homology between the members of this gene family, both in primary sequence and exon/intron structure, suggests that the NYP and the PYY genes arose from an initial gene duplication event, with a subsequent tandem duplication of the PYY gene being responsible for the creation of the PPY gene. A second weaker hybridization signal also found on chromosome 17q11 and results obtained by Southern blot analysis suggest that the entire PYY-PPY region has undergone a further duplication event. 27 refs., 5 figs.

  2. Gene duplications and losses among vertebrate deoxyribonucleoside kinases of the non-TK1 Family

    DEFF Research Database (Denmark)

    Mutahir, Zeeshan; Christiansen, Louise Slot; Clausen, Anders R.;

    2016-01-01

    of the dCK/dGK enzymes encoded by these genes. The two dCK enzymes in G. gallus have broader substrate specificity than their human or X. laevis counterparts. Additionally, the duplicated dCK enzyme in G. gallus might have become mitochondria. Based on our study we postulate that changing and adapting...... substrate specificities and subcellular localization are likely the drivers behind the evolution of vertebrate dNKs...

  3. Whole-Genome Duplications Spurred the Functional Diversification of the Globin Gene Superfamily in Vertebrates

    OpenAIRE

    Hoffmann, Federico G.; Opazo, Juan C; Storz, Jay F.

    2011-01-01

    It has been hypothesized that two successive rounds of whole-genome duplication (WGD) in the stem lineage of vertebrates provided genetic raw materials for the evolutionary innovation of many vertebrate-specific features. However, it has seldom been possible to trace such innovations to specific functional differences between paralogous gene products that derive from a WGD event. Here, we report genomic evidence for a direct link between WGD and key physiological innovations in the vertebrate...

  4. Adaptations to endosymbiosis in a cnidarian-dinoflagellate association: differential gene expression and specific gene duplications.

    Directory of Open Access Journals (Sweden)

    Philippe Ganot

    2011-07-01

    Full Text Available Trophic endosymbiosis between anthozoans and photosynthetic dinoflagellates forms the key foundation of reef ecosystems. Dysfunction and collapse of symbiosis lead to bleaching (symbiont expulsion, which is responsible for the severe worldwide decline of coral reefs. Molecular signals are central to the stability of this partnership and are therefore closely related to coral health. To decipher inter-partner signaling, we developed genomic resources (cDNA library and microarrays from the symbiotic sea anemone Anemonia viridis. Here we describe differential expression between symbiotic (also called zooxanthellate anemones or aposymbiotic (also called bleached A. viridis specimens, using microarray hybridizations and qPCR experiments. We mapped, for the first time, transcript abundance separately in the epidermal cell layer and the gastrodermal cells that host photosynthetic symbionts. Transcriptomic profiles showed large inter-individual variability, indicating that aposymbiosis could be induced by different pathways. We defined a restricted subset of 39 common genes that are characteristic of the symbiotic or aposymbiotic states. We demonstrated that transcription of many genes belonging to this set is specifically enhanced in the symbiotic cells (gastroderm. A model is proposed where the aposymbiotic and therefore heterotrophic state triggers vesicular trafficking, whereas the symbiotic and therefore autotrophic state favors metabolic exchanges between host and symbiont. Several genetic pathways were investigated in more detail: i a key vitamin K-dependant process involved in the dinoflagellate-cnidarian recognition; ii two cnidarian tissue-specific carbonic anhydrases involved in the carbon transfer from the environment to the intracellular symbionts; iii host collagen synthesis, mostly supported by the symbiotic tissue. Further, we identified specific gene duplications and showed that the cnidarian-specific isoform was also up-regulated both

  5. Sub-functionalization to ovule development following duplication of a floral organ identity gene.

    Science.gov (United States)

    Galimba, Kelsey D; Di Stilio, Verónica S

    2015-09-01

    Gene duplications result in paralogs that may be maintained due to the gain of novel functions (neo-functionalization) or the partitioning of ancestral function (sub-functionalization). Plant genomes are especially prone to duplication; paralogs are particularly widespread in the floral MADS box transcription factors that control organ identity through the ABC model of flower development. C class genes establish stamen and carpel identity and control floral meristem determinacy, and are largely conserved across the angiosperm phylogeny. Originally, an additional D class had been identified as controlling ovule identity; yet subsequent studies indicated that both C and D lineage genes more commonly control ovule development redundantly. The ranunculid Thalictrum thalictroides has two orthologs of the Arabidopsis thaliana C class gene AGAMOUS (AG), ThtAG1 and ThtAG2 (Thalictrum thalictroides AGAMOUS1/2). We previously showed that ThtAG1 exhibits typical C class function; here we examine the role of its paralog, ThtAG2. Our phylogenetic analysis shows that ThtAG2 falls within the C lineage, together with ThtAG1, and is consistent with previous findings of a Ranunculales-specific duplication in this clade. However, ThtAG2 is not expressed in stamens, but rather solely in carpels and ovules. This female-specific expression pattern is consistent with D lineage genes, and with other C lineage genes known to be involved in ovule identity. Given the divergent expression of ThtAG2, we tested the hypothesis that it has acquired ovule identity function. Molecular evolution analyses showed evidence of positive selection on ThtAG2-a pattern that supports divergence of function by sub-functionalization. Down-regulation of ThtAG2 by virus-induced gene silencing resulted in homeotic conversions of ovules into carpel-like structures. Taken together, our results suggest that, although ThtAG2 falls within the C lineage, it has diverged to acquire "D function" as an ovule identity gene

  6. Comparative genomic organization and tissue-specific transcription of the duplicated fabp7 and fabp10 genes in teleost fishes.

    Science.gov (United States)

    Parmar, Manoj B; Wright, Jonathan M

    2013-11-01

    A whole-genome duplication (WGD) early in the teleost fish lineage makes fish ideal organisms to study the fate of duplicated genes and underlying evolutionary trajectories that have led to the retention of ohnologous gene duplicates in fish genomes. Here, we compare the genomic organization and tissue-specific transcription of the ohnologous fabp7 and fabp10 genes in medaka, three-spined stickleback, and spotted green pufferfish to the well-studied duplicated fabp7 and fabp10 genes of zebrafish. Teleost fabp7 and fabp10 genes contain four exons interrupted by three introns. Polypeptide sequences of Fabp7 and Fabp10 show the highest sequence identity and similarity with their orthologs from vertebrates. Orthology was evident as the ohnologous Fabp7 and Fabp10 polypeptides of teleost fishes each formed distinct clades and clustered together with their orthologs from other vertebrates in a phylogenetic tree. Furthermore, ohnologous teleost fabp7 and fabp10 genes exhibit conserved gene synteny with human FABP7 and chicken FABP10, respectively, which provides compelling evidence that the duplicated fabp7 and fabp10 genes of teleost fishes most likely arose from the well-documented WGD. The tissue-specific distribution of fabp7a, fabp7b, fabp10a, and fabp10b transcripts provides evidence of diverged spatial transcriptional regulation between ohnologous gene duplicates of fabp7 and fabp10 in teleost fishes.

  7. Evolution history of duplicated smad3 genes in teleost: insights from Japanese flounder, Paralichthys olivaceus

    Directory of Open Access Journals (Sweden)

    Xinxin Du

    2016-09-01

    Full Text Available Following the two rounds of whole-genome duplication (WGD during deuterosome evolution, a third genome duplication occurred in the ray-fined fish lineage and is considered to be responsible for the teleost-specific lineage diversification and regulation mechanisms. As a receptor-regulated SMAD (R-SMAD, the function of SMAD3 was widely studied in mammals. However, limited information of its role or putative paralogs is available in ray-finned fishes. In this study, two SMAD3 paralogs were first identified in the transcriptome and genome of Japanese flounder (Paralichthys olivaceus. We also explored SMAD3 duplication in other selected species. Following identification, genomic structure, phylogenetic reconstruction, and synteny analyses performed by MrBayes and online bioinformatic tools confirmed that smad3a/3b most likely originated from the teleost-specific WGD. Additionally, selection pressure analysis and expression pattern of the two genes performed by PAML and quantitative real-time PCR (qRT-PCR revealed evidence of subfunctionalization of the two SMAD3 paralogs in teleost. Our results indicate that two SMAD3 genes originate from teleost-specific WGD, remain transcriptionally active, and may have likely undergone subfunctionalization. This study provides novel insights to the evolution fates of smad3a/3b and draws attentions to future function analysis of SMAD3 gene family.

  8. Host mitochondrial association evolved in the human parasite Toxoplasma gondii via neofunctionalization of a gene duplicate

    Science.gov (United States)

    In Toxoplasma gondii, an intracellular parasite of humans and other warm-blooded animals, the ability to associate with host mitochondria (HMA) is driven by a locally expanded gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. The importance of copy number in the e...

  9. Abundant copy-number loss of CYCLOPS and STOP genes in gastric adenocarcinoma.

    Science.gov (United States)

    Cutcutache, Ioana; Wu, Alice Yingting; Suzuki, Yuka; McPherson, John Richard; Lei, Zhengdeng; Deng, Niantao; Zhang, Shenli; Wong, Wai Keong; Soo, Khee Chee; Chan, Weng Hoong; Ooi, London Lucien; Welsch, Roy; Tan, Patrick; Rozen, Steven G

    2016-04-01

    Gastric cancer, a leading cause of cancer death worldwide, has been little studied compared with other cancers that impose similar health burdens. Our goal is to assess genomic copy-number loss and the possible functional consequences and therapeutic implications thereof across a large series of gastric adenocarcinomas. We used high-density single-nucleotide polymorphism microarrays to determine patterns of copy-number loss and allelic imbalance in 74 gastric adenocarcinomas. We investigated whether suppressor of tumorigenesis and/or proliferation (STOP) genes are associated with genomic copy-number loss. We also analyzed the extent to which copy-number loss affects Copy-number alterations Yielding Cancer Liabilities Owing to Partial losS (CYCLOPS) genes-genes that may be attractive targets for therapeutic inhibition when partially deleted. The proportion of the genome subject to copy-number loss varies considerably from tumor to tumor, with a median of 5.5 %, and a mean of 12 % (range 0-58.5 %). On average, 91 STOP genes were subject to copy-number loss per tumor (median 35, range 0-452), and STOP genes tended to have lower copy-number compared with the rest of the genes. Furthermore, on average, 1.6 CYCLOPS genes per tumor were both subject to copy-number loss and downregulated, and 51.4 % of the tumors had at least one such gene. The enrichment of STOP genes in regions of copy-number loss indicates that their deletion may contribute to gastric carcinogenesis. Furthermore, the presence of several deleted and downregulated CYCLOPS genes in some tumors suggests potential therapeutic targets in these tumors.

  10. Cut, copy, move, delete: The study of human interferon genes reveal multiple mechanisms underlying their evolution in amniotes.

    Science.gov (United States)

    Krause, Christopher D; Pestka, Sidney

    2015-12-01

    Interferons (IFNs) are rapidly evolving cytokines released when viral infections are detected in cells. Previous research suggests that genes encoding IFNs and their receptors duplicated extensively throughout vertebrate evolution. We present molecular genetic evidence that supports the use of nonallelic homologous recombination (NAHR) to expand select IFN genes during amniote evolution. The duplication of long regions of genome (encompassing at least one functional IFN gene) followed by the insertion of this genome fragment near its parent's location, is commonly observed in many amniote genomes. Duplicates inserted away from duplication hotspots are not as frequently perturbed with new duplicates, and tend to survive long periods of evolution, sometimes becoming new IFN subtypes. Although most duplicates are inserted parallel to and near the original sequence, the insertion of the Kelch-like 9 gene within the Type I IFN locus of placental mammals promoted antiparallel insertion of gene duplicates between the Kelch-like 9 and IFN-ε loci. Genetic exchange between highly similar Type I gene duplicates as well as between Type III IFN gene duplicates homogenized their diversification. Oddly, Type III IFN genes migrated long distances throughout the genome more frequently than did Type I IFN genes. The inter-chromosomal movement of Type I IFN genes in amniotes correlated with complete intron loss in their gene structure, and repeatedly occurred with occasional Type III IFN genes.

  11. Divergent Evolutionary Patterns of NAC Transcription Factors Are Associated with Diversification and Gene Duplications in Angiosperm.

    Science.gov (United States)

    Jin, Xiaoli; Ren, Jing; Nevo, Eviatar; Yin, Xuegui; Sun, Dongfa; Peng, Junhua

    2017-01-01

    NAC (NAM/ATAF/CUC) proteins constitute one of the biggest plant-specific transcription factor (TF) families and have crucial roles in diverse developmental programs during plant growth. Phylogenetic analyses have revealed both conserved and lineage-specific NAC subfamilies, among which various origins and distinct features were observed. It is reasonable to hypothesize that there should be divergent evolutionary patterns of NAC TFs both between dicots and monocots, and among NAC subfamilies. In this study, we compared the gene duplication and loss, evolutionary rate, and selective pattern among non-lineage specific NAC subfamilies, as well as those between dicots and monocots, through genome-wide analyses of sequence and functional data in six dicot and five grass lineages. The number of genes gained in the dicot lineages was much larger than that in the grass lineages, while fewer gene losses were observed in the grass than that in the dicots. We revealed (1) uneven constitution of Clusters of Orthologous Groups (COGs) and contrasting birth/death rates among subfamilies, and (2) two distinct evolutionary scenarios of NAC TFs between dicots and grasses. Our results demonstrated that relaxed selection, resulting from concerted gene duplications, may have permitted substitutions responsible for functional divergence of NAC genes into new lineages. The underlying mechanism of distinct evolutionary fates of NAC TFs shed lights on how evolutionary divergence contributes to differences in establishing NAC gene subfamilies and thus impacts the distinct features between dicots and grasses.

  12. Divergent Evolutionary Patterns of NAC Transcription Factors Are Associated with Diversification and Gene Duplications in Angiosperm

    Directory of Open Access Journals (Sweden)

    Xiaoli Jin

    2017-06-01

    Full Text Available NAC (NAM/ATAF/CUC proteins constitute one of the biggest plant-specific transcription factor (TF families and have crucial roles in diverse developmental programs during plant growth. Phylogenetic analyses have revealed both conserved and lineage-specific NAC subfamilies, among which various origins and distinct features were observed. It is reasonable to hypothesize that there should be divergent evolutionary patterns of NAC TFs both between dicots and monocots, and among NAC subfamilies. In this study, we compared the gene duplication and loss, evolutionary rate, and selective pattern among non-lineage specific NAC subfamilies, as well as those between dicots and monocots, through genome-wide analyses of sequence and functional data in six dicot and five grass lineages. The number of genes gained in the dicot lineages was much larger than that in the grass lineages, while fewer gene losses were observed in the grass than that in the dicots. We revealed (1 uneven constitution of Clusters of Orthologous Groups (COGs and contrasting birth/death rates among subfamilies, and (2 two distinct evolutionary scenarios of NAC TFs between dicots and grasses. Our results demonstrated that relaxed selection, resulting from concerted gene duplications, may have permitted substitutions responsible for functional divergence of NAC genes into new lineages. The underlying mechanism of distinct evolutionary fates of NAC TFs shed lights on how evolutionary divergence contributes to differences in establishing NAC gene subfamilies and thus impacts the distinct features between dicots and grasses.

  13. CBF gene copy number variation at Frost Resistance-2 is associated with levels of freezing tolerance in temperate-climate cereals.

    Science.gov (United States)

    Knox, Andrea K; Dhillon, Taniya; Cheng, Hongmei; Tondelli, Alessandro; Pecchioni, Nicola; Stockinger, Eric J

    2010-06-01

    Frost Resistance-1 (FR-1) and FR-2 are two loci affecting freezing tolerance and winter hardiness of the temperate-climate cereals. FR-1 is hypothesized to be due to the pleiotropic effects of VRN-1. FR-2 spans a cluster of C-Repeat Binding Factor (CBF) genes. These loci are genetically and functionally linked. Recent studies indicate CBF transcripts are downregulated by the VRN-1 encoded MADS-box protein or a factor in the VRN-1 pathway. Here, we report that barley genotypes 'Dicktoo' and 'Nure' carrying a vrn-H1 winter allele at VRN-H1 harbor increased copy numbers of CBF coding sequences relative to Vrn-H1 spring allele genotypes 'Morex' and 'Tremois'. Sequencing bacteriophage lambda genomic clones from these four genotypes alongside DNA blot hybridizations indicate approximately half of the eleven CBF orthologs at FR-H2 are duplicated in individual genomes. One of these duplications discriminates vrn-H1 genotypes from Vrn-H1 genotypes. The vrn-H1 winter allele genotypes harbor tandem segmental duplications through the CBF2A-CBF4B genomic region and maintain two distinct CBF2 paralogs, while the Vrn-H1 spring allele genotypes harbor single copies of CBF2 and CBF4. An additional CBF gene, CBF13, is a pseudogene interrupted by multiple non-sense codons in 'Tremois' whereas CBF13 is a complete uninterrupted coding sequence in 'Dicktoo' and 'Nure'. DNA blot hybridization with wheat DNAs reveals greater copy numbers of CBF14 also occurs in winter wheats than in spring wheats. These data indicate that variation in CBF gene copy numbers is widespread in the Triticeae and suggest selection for winter hardiness co-selects winter alleles at both VRN-1 and FR-2.

  14. The Role of Cis-Regulatory Motifs and Genetical Control of Expression in the Divergence of Yeast Duplicate Genes

    National Research Council Canada - National Science Library

    Leach, Lindsey J; Zhang, Ze; Lu, Chenqi; Kearsey, Michael J; Luo, Zewei

    2007-01-01

    Expression divergence of duplicate genes is widely believed to be important for their retention and evolution of new function, although the mechanism that determines their expression divergence remains unclear...

  15. Low copy number of the salivary amylase gene predisposes to obesity.

    Science.gov (United States)

    Falchi, Mario; El-Sayed Moustafa, Julia Sarah; Takousis, Petros; Pesce, Francesco; Bonnefond, Amélie; Andersson-Assarsson, Johanna C; Sudmant, Peter H; Dorajoo, Rajkumar; Al-Shafai, Mashael Nedham; Bottolo, Leonardo; Ozdemir, Erdal; So, Hon-Cheong; Davies, Robert W; Patrice, Alexandre; Dent, Robert; Mangino, Massimo; Hysi, Pirro G; Dechaume, Aurélie; Huyvaert, Marlène; Skinner, Jane; Pigeyre, Marie; Caiazzo, Robert; Raverdy, Violeta; Vaillant, Emmanuel; Field, Sarah; Balkau, Beverley; Marre, Michel; Visvikis-Siest, Sophie; Weill, Jacques; Poulain-Godefroy, Odile; Jacobson, Peter; Sjostrom, Lars; Hammond, Christopher J; Deloukas, Panos; Sham, Pak Chung; McPherson, Ruth; Lee, Jeannette; Tai, E Shyong; Sladek, Robert; Carlsson, Lena M S; Walley, Andrew; Eichler, Evan E; Pattou, Francois; Spector, Timothy D; Froguel, Philippe

    2014-05-01

    Common multi-allelic copy number variants (CNVs) appear enriched for phenotypic associations compared to their biallelic counterparts. Here we investigated the influence of gene dosage effects on adiposity through a CNV association study of gene expression levels in adipose tissue. We identified significant association of a multi-allelic CNV encompassing the salivary amylase gene (AMY1) with body mass index (BMI) and obesity, and we replicated this finding in 6,200 subjects. Increased AMY1 copy number was positively associated with both amylase gene expression (P = 2.31 × 10(-14)) and serum enzyme levels (P copy number was associated with increased BMI (change in BMI per estimated copy = -0.15 (0.02) kg/m(2); P = 6.93 × 10(-10)) and obesity risk (odds ratio (OR) per estimated copy = 1.19, 95% confidence interval (CI) = 1.13-1.26; P = 1.46 × 10(-10)). The OR value of 1.19 per copy of AMY1 translates into about an eightfold difference in risk of obesity between subjects in the top (copy number > 9) and bottom (copy number copy number distribution. Our study provides a first genetic link between carbohydrate metabolism and BMI and demonstrates the power of integrated genomic approaches beyond genome-wide association studies.

  16. Case history and genome-wide scans for copy number variants in a family with patient having 15q11.1-q11.2 duplication and 22q11.2 deletion, and schizophrenia.

    Science.gov (United States)

    Takahashi, Sakae; Suzuki, Takahiro; Nakamura-Tomizuka, Sakura; Osaki, Koichi; Sotome, Yuta; Sagawa, Tomoaki; Uchiyama, Makoto

    2015-06-01

    Many studies have indicated that chromosomes 15q11 and 22q11 may be associated with the genetic etiologies of schizophrenia. We have followed an adult schizophrenia case with 15q11.1-q11.2 duplication and 22q11.2 deletion. Here we report his clinical history, and copy number variants (CNVs) identified by microarray and real-time PCR in the patient and his parents. This is the first report describing a detailed phenotype of an adult schizophrenic case with both 15q11 and 22q11 CNVs as revealed by novel and trustworthy technologies. Subjects were a 33-year-old male patient with 15q11 and 22q11 CNVs, and his normal parents. He fulfilled the DSM-IV criteria for schizophrenia at age 18 years. He was also diagnosed with 22q11.2 deletion syndrome by fluorescence in situ hybridization (FISH) at age 18 years. To search for CNVs in more detail, whole-genome array-CGH analyses including ∼ 420,000 probes were carried out in the patient and his parents. For validations of the CNVs detected by array-CGH, real-time PCR analyses of these CNVs were performed. The patient had two disease-specific CNVs, 15q11.1-q11.2 duplication (∼ 2.7 Mb) and 22q11.21 deletion (∼ 2.9 Mb). These two regions are important for the development of schizophrenia, and this patient had shown symptoms of schizophrenia. Thus, the two areas may contain causal genes for schizophrenia. © 2015 Wiley Periodicals, Inc.

  17. Copy-number and gene dependency analysis reveals partial copy loss of wild-type SF3B1 as a novel cancer vulnerability.

    Science.gov (United States)

    Paolella, Brenton R; Gibson, William J; Urbanski, Laura M; Alberta, John A; Zack, Travis I; Bandopadhayay, Pratiti; Nichols, Caitlin A; Agarwalla, Pankaj K; Brown, Meredith S; Lamothe, Rebecca; Yu, Yong; Choi, Peter S; Obeng, Esther A; Heckl, Dirk; Wei, Guo; Wang, Belinda; Tsherniak, Aviad; Vazquez, Francisca; Weir, Barbara A; Root, David E; Cowley, Glenn S; Buhrlage, Sara J; Stiles, Charles D; Ebert, Benjamin L; Hahn, William C; Reed, Robin; Beroukhim, Rameen

    2017-02-08

    Genomic instability is a hallmark of human cancer, and results in widespread somatic copy number alterations. We used a genome-scale shRNA viability screen in human cancer cell lines to systematically identify genes that are essential in the context of particular copy-number alterations (copy-number associated gene dependencies). The most enriched class of copy-number associated gene dependencies was CYCLOPS (Copy-number alterations Yielding Cancer Liabilities Owing to Partial losS) genes, and spliceosome components were the most prevalent. One of these, the pre-mRNA splicing factor SF3B1, is also frequently mutated in cancer. We validated SF3B1 as a CYCLOPS gene and found that human cancer cells harboring partial SF3B1 copy-loss lack a reservoir of SF3b complex that protects cells with normal SF3B1 copy number from cell death upon partial SF3B1 suppression. These data provide a catalog of copy-number associated gene dependencies and identify partial copy-loss of wild-type SF3B1 as a novel, non-driver cancer gene dependency.

  18. Rapid evolution and copy number variation of primate RHOXF2, an X-linked homeobox gene involved in male reproduction and possibly brain function

    Directory of Open Access Journals (Sweden)

    Zhang Rui

    2011-10-01

    Full Text Available Abstract Background Homeobox genes are the key regulators during development, and they are in general highly conserved with only a few reported cases of rapid evolution. RHOXF2 is an X-linked homeobox gene in primates. It is highly expressed in the testicle and may play an important role in spermatogenesis. As male reproductive system is often the target of natural and/or sexual selection during evolution, in this study, we aim to dissect the pattern of molecular evolution of RHOXF2 in primates and its potential functional consequence. Results We studied sequences and copy number variation of RHOXF2 in humans and 16 nonhuman primate species as well as the expression patterns in human, chimpanzee, white-browed gibbon and rhesus macaque. The gene copy number analysis showed that there had been parallel gene duplications/losses in multiple primate lineages. Our evidence suggests that 11 nonhuman primate species have one RHOXF2 copy, and two copies are present in humans and four Old World monkey species, and at least 6 copies in chimpanzees. Further analysis indicated that the gene duplications in primates had likely been mediated by endogenous retrovirus (ERV sequences flanking the gene regions. In striking contrast to non-human primates, humans appear to have homogenized their two RHOXF2 copies by the ERV-mediated non-allelic recombination mechanism. Coding sequence and phylogenetic analysis suggested multi-lineage strong positive selection on RHOXF2 during primate evolution, especially during the origins of humans and chimpanzees. All the 8 coding region polymorphic sites in human populations are non-synonymous, implying on-going selection. Gene expression analysis demonstrated that besides the preferential expression in the reproductive system, RHOXF2 is also expressed in the brain. The quantitative data suggests expression pattern divergence among primate species. Conclusions RHOXF2 is a fast-evolving homeobox gene in primates. The rapid

  19. Historical profiling of maize duplicate genes sheds light on the evolution of C4 photosynthesis in grasses.

    Science.gov (United States)

    Chang, Yao-Ming; Chang, Chia-Lin; Li, Wen-Hsiung; Shih, Arthur Chun-Chieh

    2013-02-01

    C4 plants evolved from C3 plants through a series of complex evolutionary steps. On the basis of the evolution of key C4 enzyme genes, the evolution of C4 photosynthesis has been considered a story of gene/genome duplications and subsequent modifications of gene function. If whole-genome duplication has contributed to the evolution of C4 photosynthesis, other genes should have been duplicated together with these C4 genes. However, which genes were co-duplicated with C4 genes and whether they have also played a role in C4 evolution are largely unknown. In this study, we developed a simple method to characterize the historical profile of the paralogs of a gene by tracing back to the most recent common ancestor (MRCA) of the gene and its paralog(s) and then counting the number of paralogs at each MRCA. We clustered the genes into clusters with similar duplication profiles and inferred their functional enrichments. Applying our method to maize, a familiar C4 plant, we identified many genes that show similar duplication profiles with those of the key C4 enzyme genes and found that the functional preferences of the C4 gene clusters are not only similar to those identified by an experimental approach in a recent study but also highly consistent with the functions required for the C4 photosynthesis evolutionary model proposed by S.F. Sage. Some of these genes might have co-evolved with the key C4 enzyme genes to increase the strength of C4 photosynthesis. Moreover, our results suggested that most key C4 enzyme genes had different origins and have undergone a long evolutionary process before the emergence of C4 grasses (Andropogoneae), consistent with the conclusion proposed by previous authors. Copyright © 2012 Elsevier Inc. All rights reserved.

  20. Identification of genes that are essential to restrict genome duplication to once per cell division

    Science.gov (United States)

    Vassilev, Alex; Lee, Chrissie Y.; Vassilev, Boris; Zhu, Wenge; Ormanoglu, Pinar; Martin, Scott E.; DePamphilis, Melvin L.

    2016-01-01

    Nuclear genome duplication is normally restricted to once per cell division, but aberrant events that allow excess DNA replication (EDR) promote genomic instability and aneuploidy, both of which are characteristics of cancer development. Here we provide the first comprehensive identification of genes that are essential to restrict genome duplication to once per cell division. An siRNA library of 21,584 human genes was screened for those that prevent EDR in cancer cells with undetectable chromosomal instability. Candidates were validated by testing multiple siRNAs and chemical inhibitors on both TP53+ and TP53- cells to reveal the relevance of this ubiquitous tumor suppressor to preventing EDR, and in the presence of an apoptosis inhibitor to reveal the full extent of EDR. The results revealed 42 genes that prevented either DNA re-replication or unscheduled endoreplication. All of them participate in one or more of eight cell cycle events. Seventeen of them have not been identified previously in this capacity. Remarkably, 14 of the 42 genes have been shown to prevent aneuploidy in mice. Moreover, suppressing a gene that prevents EDR increased the ability of the chemotherapeutic drug Paclitaxel to induce EDR, suggesting new opportunities for synthetic lethalities in the treatment of human cancers. PMID:27144335

  1. The maize auxotrophic mutant orange pericarp is defective in duplicate genes for tryptophan synthase beta.

    Science.gov (United States)

    Wright, A D; Moehlenkamp, C A; Perrot, G H; Neuffer, M G; Cone, K C

    1992-06-01

    orange pericarp (orp) is a seedling lethal mutant of maize caused by mutations in the duplicate unlinked recessive loci orp1 and orp2. Mutant seedlings accumulate two tryptophan precursors, anthranilate and indole, suggesting a block in tryptophan biosynthesis. Results from feeding studies and enzyme assays indicate that the orp mutant is defective in tryptophan synthase beta activity. Thus, orp is one of only a few amino acid auxotrophic mutants to be characterized in plants. Two genes encoding tryptophan synthase beta were isolated from maize and sequenced. Both genes encode polypeptides with high homology to tryptophan synthase beta enzymes from other organisms. The cloned genes were mapped by restriction fragment length polymorphism analysis to approximately the same chromosomal locations as the genetically mapped factors orp1 and orp2. RNA analysis indicates that both genes are expressed in all tissues examined from normal plants. Together, the biochemical, genetic, and molecular data verify the identity of orp1 and orp2 as duplicate structural genes for the beta subunit of tryptophan synthase.

  2. Adaptive evolution after gene duplication in alpha-KT x 14 subfamily from Buthus martensii Karsch.

    Science.gov (United States)

    Cao, Zhijian; Mao, Xin; Xu, Xiuling; Sheng, Jiqun; Dai, Chao; Wu, Yingliang; Luo, Feng; Sha, Yonggang; Jiang, Dahe; Li, Wenxin

    2005-07-01

    A series of isoforms of alpha-KT x 14 (short chain potassium channel scorpion toxins) were isolated from the venom of Buthus martensii Karsch by RACE and screening cDNA library methods. These isoforms adding BmKK1--3 and BmSKTx1--2 together shared high homology (more than 97%) with each other. The result of genomic sequence analysis showed that a length 79 bp intron is inserted Ala codes between the first and the second base at the 17th amino acid of signal peptide. The introns of these isoforms also share high homology with those of BmKK2 and BmSKT x 1 reported previously. Sequence analysis of many clones of cDNA and genomic DNA showed that a species population or individual polymorphism of alpha-KT x 14 genes took place in scorpion Buthus martensii Karsch and accelerated evolution played an important role in the forming process of alpha-KT x 14 scorpion toxins subfamily. The result of southern hybridization indicated that alpha-KT x 14 toxin genes existed in scorpion chromosome with multicopies. All findings maybe provided an important evidence for an extensive evolutionary process of the scorpion "pharmacological factory": at the early course of evolution, the ancestor toxic gene duplicated into a series of multicopy genes integrated at the different chromosome; at the late course of evolution, subsequent functional divergence of duplicate genes was generated by mutations, deletions and insertion.

  3. High-Resolution Analysis of Gene Copy Number Alterations in Human Prostate Cancer Using CGH on cDNA Microarrays: Impact of Copy Number on Gene Expression

    Directory of Open Access Journals (Sweden)

    Maija Wolf

    2004-05-01

    Full Text Available Identification of target genes for genetic rearrangements in prostate cancer and the impact of copy number changes on gene expression are currently not well understood. Here, we applied high-resolution comparative genomic hybridization (CGH on cDNA microarrays for analysis of prostate cancer cell lines. CGH microarrays identified most of the alterations detected by classical chromosomal CGH, as well as a number of previously unreported alterations. Specific recurrent regions of gain (28 and loss (18 were found, their boundaries defined with sub-megabasepair accuracy. The most common changes included copy number decreases at 13% and gains at iq and 5p. Refined mapping identified several sites, such as at 13q (33-44, 49-51, 74-76 Mbp from the p-telomere, which matched with minimal regions of loss seen in extensive loss of heterozygosity mapping studies of large numbers of tumors. Previously unreported recurrent changes were found at 2p, 2q, 3p, 17q (losses, at 3q, 5p, 6p (gains. Integration of genomic and transcriptomic data revealed the role of individual candidate target genes for genomic alterations as well as a highly significant (P < .0001 overall association between copy number levels and the percentage of differentially expressed genes. Across the genome, the overall impact of copy number on gene expression levels was, to a large extent, attributable to low-level gains and losses of copy number, corresponding to common deletions and gains of often large chromosomal regions.

  4. Importance of rare gene copy number alterations for personalized tumor characterization and survival analysis.

    Science.gov (United States)

    Seifert, Michael; Friedrich, Betty; Beyer, Andreas

    2016-10-03

    It has proven exceedingly difficult to ascertain rare copy number alterations (CNAs) that may have strong effects in individual tumors. We show that a regulatory network inferred from gene expression and gene copy number data of 768 human cancer cell lines can be used to quantify the impact of patient-specific CNAs on survival signature genes. A focused analysis of tumors from six tissues reveals that rare patient-specific gene CNAs often have stronger effects on signature genes than frequent gene CNAs. Further comparison to a related network-based approach shows that the integration of indirectly acting gene CNAs significantly improves the survival analysis.

  5. Comparative genomic analysis of duplicated homoeologous regions involved in the resistance of Brassica napus to stem canker

    Directory of Open Access Journals (Sweden)

    Berline eFopa Fomeju

    2015-09-01

    Full Text Available All crop species are current or ancient polyploids. Following whole genome duplication, structural and functional modifications result in differential gene content or regulation in the duplicated regions, which can play a fundamental role in the diversification of genes underlying complex traits. We have investigated this issue in Brassica napus, a species with a highly duplicated genome, with the aim of studying the structural and functional organization of duplicated regions involved in quantitative resistance to stem canker, a disease caused by the fungal pathogen Leptosphaeria maculans. Genome-wide association analysis on two oilseed rape panels confirmed that duplicated regions of ancestral blocks E, J, R, U and W were involved in resistance to stem canker. The structural analysis of the duplicated genomic regions showed a higher gene density on the A genome than on the C genome and a better collinearity between homoeologous regions than paralogous regions, as overall in the whole B. napus genome. The three ancestral sub-genomes were involved in the resistance to stem canker and the fractionation profile of the duplicated regions corresponded to what was expected from results on the B. napus progenitors. About 60% of the genes identified in these duplicated regions were single-copy genes while less than 5% were retained in all the duplicated copies of a given ancestral block. Genes retained in several copies were mainly involved in response to stress, signaling or transcription regulation. Genes with resistance-associated markers were mainly retained in more than two copies. These results suggested that some genes underlying quantitative resistance to stem canker might be duplicated genes. Genes with a hydrolase activity that were retained in one copy or R-like genes might also account for resistance in some regions. Further analyses need to be conducted to indicate to what extent duplicated genes contribute to the expression of the

  6. Gene expression profiling and gene copy-number changes in malignant mesothelioma cell lines.

    Science.gov (United States)

    Zanazzi, Claudia; Hersmus, Remko; Veltman, Imke M; Gillis, Ad J M; van Drunen, Ellen; Beverloo, H Berna; Hegmans, Joost P J J; Verweij, Marielle; Lambrecht, Bart N; Oosterhuis, J Wolter; Looijenga, Leendert H J

    2007-10-01

    Malignant mesothelioma (MM) is an asbestos-induced tumor that acquires aneuploid DNA content during the tumorigenic process. We used instable MM cell lines as an in vitro model to study the impact of DNA copy-number changes on gene expression profiling, in the course of their chromosomal redistribution process. Two MM cell lines, PMR-MM2 (early passages of in vitro culture) and PMR-MM7 (both early and late passages of in vitro culture), were cytogenetically characterized. Genomic gains and losses were precisely defined using microarray-based comparative genomic hybridization (array-CGH), and minimal overlapping analysis led to the identification of the common unbalanced genomic regions. Using the U133Plus 2.0 Affymetrix gene chip array, we analyzed PMR-MM7 early and late passages for genome-wide gene expression, and correlated the differentially expressed genes with copy-number changes. The presence of a high number of genetic imbalances occurring from early to late culture steps reflected the tendency of MM cells toward genomic instability. The selection of specific chromosomal abnormalities observed during subsequent cultures demonstrated the spontaneous evolution of the cancer cells in an in vitro environment. MM cell lines were characterized by copy-number changes associated with the TP53 apoptotic pathway already present at the first steps of in vitro culture. Prolonged culture led to acquisition of additional chromosomal copy-number changes associated with dysregulation of genes involved in cell adhesion, regulation of mitotic cell cycle, signal transduction, carbohydrate metabolism, motor activity, glycosaminoglycan biosynthesis, protein binding activity, lipid transport, ATP synthesis, and methyltransferase activity.

  7. Large inverted duplications in the human genome form via a fold-back mechanism.

    Directory of Open Access Journals (Sweden)

    Karen E Hermetz

    2014-01-01

    Full Text Available Inverted duplications are a common type of copy number variation (CNV in germline and somatic genomes. Large duplications that include many genes can lead to both neurodevelopmental phenotypes in children and gene amplifications in tumors. There are several models for inverted duplication formation, most of which include a dicentric chromosome intermediate followed by breakage-fusion-bridge (BFB cycles, but the mechanisms that give rise to the inverted dicentric chromosome in most inverted duplications remain unknown. Here we have combined high-resolution array CGH, custom sequence capture, next-generation sequencing, and long-range PCR to analyze the breakpoints of 50 nonrecurrent inverted duplications in patients with intellectual disability, autism, and congenital anomalies. For half of the rearrangements in our study, we sequenced at least one breakpoint junction. Sequence analysis of breakpoint junctions reveals a normal-copy disomic spacer between inverted and non-inverted copies of the duplication. Further, short inverted sequences are present at the boundary of the disomic spacer and the inverted duplication. These data support a mechanism of inverted duplication formation whereby a chromosome with a double-strand break intrastrand pairs with itself to form a "fold-back" intermediate that, after DNA replication, produces a dicentric inverted chromosome with a disomic spacer corresponding to the site of the fold-back loop. This process can lead to inverted duplications adjacent to terminal deletions, inverted duplications juxtaposed to translocations, and inverted duplication ring chromosomes.

  8. Variations in CCL3L gene cluster sequence and non-specific gene copy numbers

    Directory of Open Access Journals (Sweden)

    Edberg Jeffrey C

    2010-03-01

    Full Text Available Abstract Background Copy number variations (CNVs of the gene CC chemokine ligand 3-like1 (CCL3L1 have been implicated in HIV-1 susceptibility, but the association has been inconsistent. CCL3L1 shares homology with a cluster of genes localized to chromosome 17q12, namely CCL3, CCL3L2, and, CCL3L3. These genes are involved in host defense and inflammatory processes. Several CNV assays have been developed for the CCL3L1 gene. Findings Through pairwise and multiple alignments of these genes, we have shown that the homology between these genes ranges from 50% to 99% in complete gene sequences and from 70-100% in the exonic regions, with CCL3L1 and CCL3L3 being identical. By use of MEGA 4 and BioEdit, we aligned sense primers, anti-sense primers, and probes used in several previously described assays against pre-multiple alignments of all four chemokine genes. Each set of probes and primers aligned and matched with overlapping sequences in at least two of the four genes, indicating that previously utilized RT-PCR based CNV assays are not specific for only CCL3L1. The four available assays measured median copies of 2 and 3-4 in European and African American, respectively. The concordance between the assays ranged from 0.44-0.83 suggesting individual discordant calls and inconsistencies with the assays from the expected gene coverage from the known sequence. Conclusions This indicates that some of the inconsistencies in the association studies could be due to assays that provide heterogenous results. Sequence information to determine CNV of the three genes separately would allow to test whether their association with the pathogenesis of a human disease or phenotype is affected by an individual gene or by a combination of these genes.

  9. Diversification of genes encoding granule-bound starch synthase in monocots and dicots is marked by multiple genome-wide duplication events.

    Directory of Open Access Journals (Sweden)

    Jun Cheng

    Full Text Available Starch is one of the major components of cereals, tubers, and fruits. Genes encoding granule-bound starch synthase (GBSS, which is responsible for amylose synthesis, have been extensively studied in cereals but little is known about them in fruits. Due to their low copy gene number, GBSS genes have been used to study plant phylogenetic and evolutionary relationships. In this study, GBSS genes have been isolated and characterized in three fruit trees, including apple, peach, and orange. Moreover, a comprehensive evolutionary study of GBSS genes has also been conducted between both monocots and eudicots. Results have revealed that genomic structures of GBSS genes in plants are conserved, suggesting they all have evolved from a common ancestor. In addition, the GBSS gene in an ancestral angiosperm must have undergone genome duplication ∼251 million years ago (MYA to generate two families, GBSSI and GBSSII. Both GBSSI and GBSSII are found in monocots; however, GBSSI is absent in eudicots. The ancestral GBSSII must have undergone further divergence when monocots and eudicots split ∼165 MYA. This is consistent with expression profiles of GBSS genes, wherein these profiles are more similar to those of GBSSII in eudicots than to those of GBSSI genes in monocots. In dicots, GBSSII must have undergone further divergence when rosids and asterids split from each other ∼126 MYA. Taken together, these findings suggest that it is GBSSII rather than GBSSI of monocots that have orthologous relationships with GBSS genes of eudicots. Moreover, diversification of GBSS genes is mainly associated with genome-wide duplication events throughout the evolutionary course of history of monocots and eudicots.

  10. Intrinsic karyotype stability and gene copy number variations may have laid the foundation for tetraploid wheat formation.

    Science.gov (United States)

    Zhang, Huakun; Bian, Yao; Gou, Xiaowan; Dong, Yuzhu; Rustgi, Sachin; Zhang, Bangjiao; Xu, Chunming; Li, Ning; Qi, Bao; Han, Fangpu; von Wettstein, Diter; Liu, Bao

    2013-11-26

    Polyploidy or whole-genome duplication is recurrent in plant evolution, yet only a small fraction of whole-genome duplications has led to successful speciation. A major challenge in the establishment of nascent polyploids is sustained karyotype instability, which compromises fitness. The three putative diploid progenitors of bread wheat, with AA, SS (S ∼ B), and DD genomes occurred sympatrically, and their cross-fertilization in different combinations may have resulted in fertile allotetraploids with various genomic constitutions. However, only SSAA or closely related genome combinations have led to the speciation of tetraploid wheats like Triticum turgidum and Triticum timopheevii. We analyzed early generations of four newly synthesized allotetraploid wheats with genome compositions S(sh)S(sh)A(m)A(m), S(l)S(l)AA, S(b)S(b)DD, and AADD by combined fluorescence and genomic in situ hybridization-based karyotyping. Results of karyotype analyses showed that although S(sh)S(sh)A(m)A(m) and S(l)S(l)AA are characterized by immediate and persistent karyotype stability, massive aneuploidy and extensive chromosome restructuring are associated with S(b)S(b)DD and AADD in which parental subgenomes showed markedly different propensities for chromosome gain/loss and rearrangements. Although compensating aneuploidy and reciprocal translocation between homeologs prevailed, reproductive fitness was substantially compromised due to chromosome instability. Strikingly, localized genomic changes in repetitive DNA and copy-number variations in gene homologs occurred in both chromosome stable lines, S(sh)S(sh)A(m)A(m) and S(l)S(l)AA. Our data demonstrated that immediate and persistent karyotype stability is intrinsic to newly formed allotetraploid wheat with genome combinations analogous to natural tetraploid wheats. This property, coupled with rapid gene copy-number variations, may have laid the foundation of tetraploid wheat establishment.

  11. Gene Duplication and the Evolution of Plant MADS-box Transcription Factors

    Institute of Scientific and Technical Information of China (English)

    Chiara A. Airoldi; Brendan Davies

    2012-01-01

    Since the first MADS-box transcription factor genes were implicated in the establishment of floral organ identity in a couple of model plants,the size and scope of this gene family has begun to be appreciated in a much wider range of species.Over the course of millions of years the number of MADS-box genes in plants has increased to the point that the Arabidopsis genome contains more than 100.The understanding gained from studying the evolution,regulation and function of multiple MADS-box genes in an increasing set of species,makes this large plant transcription factor gene family an ideal subject to study the processes that lead to an increase in gene number and the selective birth,death and repurposing of its component members.Here we will use examples taken from the MADS-box gene family to review what is known about the factors that influence the loss and retention of genes duplicated in different ways and examine the varied fates of the retained genes and their associated biological outcomes.

  12. Evolution of C, D and S-type cystatins in mammals: an extensive gene duplication in primates.

    Science.gov (United States)

    de Sousa-Pereira, Patrícia; Abrantes, Joana; Pinheiro, Ana; Colaço, Bruno; Vitorino, Rui; Esteves, Pedro J

    2014-01-01

    Cystatins are a family of inhibitors of cysteine peptidases that comprises the salivary cystatins (D and S-type cystatins) and cystatin C. These cystatins are encoded by a multigene family (CST3, CST5, CST4, CST1 and CST2) organized in tandem in the human genome. Their presence and functional importance in human saliva has been reported, however the distribution of these proteins in other mammals is still unclear. Here, we performed a proteomic analysis of the saliva of several mammals and studied the evolution of this multigene family. The proteomic analysis detected S-type cystatins (S, SA, and SN) in human saliva and cystatin D in rat saliva. The evolutionary analysis showed that the cystatin C encoding gene is present in species of the most representative mammalian groups, i.e. Artiodactyla, Rodentia, Lagomorpha, Carnivora and Primates. On the other hand, D and S-type cystatins are mainly retrieved from Primates, and especially the evolution of S-type cystatins seems to be a dynamic process as seen in Pongo abelii genome where several copies of CST1-like gene (cystatin SN) were found. In Rodents, a group of cystatins previously identified as D and S has also evolved. Despite the high divergence of the amino acid sequence, their position in the phylogenetic tree and their genome organization suggests a common origin with those of the Primates. These results suggest that the D and S type cystatins have emerged before the mammalian radiation and were retained only in Primates and Rodents. Although the mechanisms driving the evolution of cystatins are unknown, it seems to be a dynamic process with several gene duplications evolving according to the birth-and-death model of evolution. The factors that led to the appearance of a group of saliva-specific cystatins in Primates and its rapid evolution remain undetermined, but may be associated with an adaptive advantage.

  13. Evolution of C, D and S-type cystatins in mammals: an extensive gene duplication in primates.

    Directory of Open Access Journals (Sweden)

    Patrícia de Sousa-Pereira

    Full Text Available Cystatins are a family of inhibitors of cysteine peptidases that comprises the salivary cystatins (D and S-type cystatins and cystatin C. These cystatins are encoded by a multigene family (CST3, CST5, CST4, CST1 and CST2 organized in tandem in the human genome. Their presence and functional importance in human saliva has been reported, however the distribution of these proteins in other mammals is still unclear. Here, we performed a proteomic analysis of the saliva of several mammals and studied the evolution of this multigene family. The proteomic analysis detected S-type cystatins (S, SA, and SN in human saliva and cystatin D in rat saliva. The evolutionary analysis showed that the cystatin C encoding gene is present in species of the most representative mammalian groups, i.e. Artiodactyla, Rodentia, Lagomorpha, Carnivora and Primates. On the other hand, D and S-type cystatins are mainly retrieved from Primates, and especially the evolution of S-type cystatins seems to be a dynamic process as seen in Pongo abelii genome where several copies of CST1-like gene (cystatin SN were found. In Rodents, a group of cystatins previously identified as D and S has also evolved. Despite the high divergence of the amino acid sequence, their position in the phylogenetic tree and their genome organization suggests a common origin with those of the Primates. These results suggest that the D and S type cystatins have emerged before the mammalian radiation and were retained only in Primates and Rodents. Although the mechanisms driving the evolution of cystatins are unknown, it seems to be a dynamic process with several gene duplications evolving according to the birth-and-death model of evolution. The factors that led to the appearance of a group of saliva-specific cystatins in Primates and its rapid evolution remain undetermined, but may be associated with an adaptive advantage.

  14. Multiple tandem duplication of the phenylalanine ammonia-lyase genes in Cucumis sativus L.

    Science.gov (United States)

    Shang, Qing-Mao; Li, Liang; Dong, Chun-Juan

    2012-10-01

    Phenylalanine ammonia-lyase (PAL) is the first entry enzyme of the phenylpropanoid pathway, and therefore plays a key role in both plant development and stress defense. In many plants, PAL is encoded by a multi-gene family, and each member is differentially regulated in response to environmental stimuli. In the present study, we report that PAL in cucumber (Cucumis sativus L.) is encoded for by a family of seven genes (designated as CsPAL1-7). All seven CsPALs are arranged in tandem in two duplication blocks, which are located on chromosomes 4 and 6, respectively. The cDNA and protein sequences of the CsPALs share an overall high identity to each other. Homology modeling reveals similarities in their protein structures, besides several slight differences, implying the different activities in conversion of phenylalanine. Phylogenic analysis places CsPAL1-7 in a separate cluster rather than clustering with other plant PALs. Analyses of expression profiles in different cucumber tissues or in response to various stress or plant hormone treatments indicate that CsPAL1-7 play redundant, but divergent roles in cucumber development and stress response. This is consistent with our finding that CsPALs possess overlapping but different cis-elements in their promoter regions. Finally, several duplication events are discussed to explain the evolution of the cucumber PAL genes.

  15. Insights into the coupling of duplication events and macroevolution from an age profile of animal transmembrane gene families.

    Directory of Open Access Journals (Sweden)

    Guohui Ding

    2006-08-01

    Full Text Available The evolution of new gene families subsequent to gene duplication may be coupled to the fluctuation of population and environment variables. Based upon that, we presented a systematic analysis of the animal transmembrane gene duplication events on a macroevolutionary scale by integrating the palaeontology repository. The age of duplication events was calculated by maximum likelihood method, and the age distribution was estimated by density histogram and normal kernel density estimation. We showed that the density of the duplicates displays a positive correlation with the estimates of maximum number of cell types of common ancestors, and the oxidation events played a key role in the major transitions of this density trace. Next, we focused on the Phanerozoic phase, during which more macroevolution data are available. The pulse mass extinction timepoints coincide with the local peaks of the age distribution, suggesting that the transmembrane gene duplicates fixed frequently when the environment changed dramatically. Moreover, a 61-million-year cycle is the most possible cycle in this phase by spectral analysis, which is consistent with the cycles recently detected in biodiversity. Our data thus elucidate a strong coupling of duplication events and macroevolution; furthermore, our method also provides a new way to address these questions.

  16. Characterization of genes encoding poly(A polymerases in plants: evidence for duplication and functional specialization.

    Directory of Open Access Journals (Sweden)

    Lisa R Meeks

    Full Text Available BACKGROUND: Poly(A polymerase is a key enzyme in the machinery that mediates mRNA 3' end formation in eukaryotes. In plants, poly(A polymerases are encoded by modest gene families. To better understand this multiplicity of genes, poly(A polymerase-encoding genes from several other plants, as well as from Selaginella, Physcomitrella, and Chlamydomonas, were studied. METHODOLOGY/PRINCIPAL FINDINGS: Using bioinformatics tools, poly(A polymerase-encoding genes were identified in the genomes of eight species in the plant lineage. Whereas Chlamydomonas reinhardtii was found to possess a single poly(A polymerase gene, other species possessed between two and six possible poly(A polymerase genes. With the exception of four intron-lacking genes, all of the plant poly(A polymerase genes (but not the C. reinhardtii gene possessed almost identical intron positions within the poly(A polymerase coding sequences, suggesting that all plant poly(A polymerase genes derive from a single ancestral gene. The four Arabidopsis poly(A polymerase genes were found to be essential, based on genetic analysis of T-DNA insertion mutants. GFP fusion proteins containing three of the four Arabidopsis poly(A polymerases localized to the nucleus, while one such fusion protein was localized in the cytoplasm. The fact that this latter protein is largely pollen-specific suggests that it has important roles in male gametogenesis. CONCLUSIONS/SIGNIFICANCE: Our results indicate that poly(A polymerase genes have expanded from a single ancestral gene by a series of duplication events during the evolution of higher plants, and that individual members have undergone sorts of functional specialization so as to render them essential for plant growth and development. Perhaps the most interesting of the plant poly(A polymerases is a novel cytoplasmic poly(A polymerase that is expressed in pollen in Arabidopsis; this is reminiscent of spermatocyte-specific cytoplasmic poly(A polymerases in

  17. Finding all sorting tandem duplication random loss operations

    DEFF Research Database (Denmark)

    Bernt, Matthias; Chen, Kuan Yu; Chen, Ming Chiang

    2011-01-01

    A tandem duplication random loss (TDRL) operation duplicates a contiguous segment of genes, followed by the random loss of one copy of each of the duplicated genes. Although the importance of this operation is founded by several recent biological studies, it has been investigated only rarely from...... a theoretical point of view. Of particular interest are sorting TDRLs which are TDRLs that, when applied to a permutation representing a genome, reduce the distance towards another given permutation. The identification of sorting genome rearrangement operations in general is a key ingredient of many algorithms...

  18. Evolutionary changes in gene expression, coding sequence and copy-number at the Cyp6g1 locus contribute to resistance to multiple insecticides in Drosophila.

    Directory of Open Access Journals (Sweden)

    Thomas W R Harrop

    Full Text Available Widespread use of insecticides has led to insecticide resistance in many populations of insects. In some populations, resistance has evolved to multiple pesticides. In Drosophila melanogaster, resistance to multiple classes of insecticide is due to the overexpression of a single cytochrome P450 gene, Cyp6g1. Overexpression of Cyp6g1 appears to have evolved in parallel in Drosophila simulans, a sibling species of D. melanogaster, where it is also associated with insecticide resistance. However, it is not known whether the ability of the CYP6G1 enzyme to provide resistance to multiple insecticides evolved recently in D. melanogaster or if this function is present in all Drosophila species. Here we show that duplication of the Cyp6g1 gene occurred at least four times during the evolution of different Drosophila species, and the ability of CYP6G1 to confer resistance to multiple insecticides exists in D. melanogaster and D. simulans but not in Drosophila willistoni or Drosophila virilis. In D. virilis, which has multiple copies of Cyp6g1, one copy confers resistance to DDT and another to nitenpyram, suggesting that the divergence of protein sequence between copies subsequent to the duplication affected the activity of the enzyme. All orthologs tested conferred resistance to one or more insecticides, suggesting that CYP6G1 had the capacity to provide resistance to anthropogenic chemicals before they existed. Finally, we show that expression of Cyp6g1 in the Malpighian tubules, which contributes to DDT resistance in D. melanogaster, is specific to the D. melanogaster-D. simulans lineage. Our results suggest that a combination of gene duplication, regulatory changes and protein coding changes has taken place at the Cyp6g1 locus during evolution and this locus may play a role in providing resistance to different environmental toxins in different Drosophila species.

  19. Neofunctionalization of a duplicate hatching enzyme gene during the evolution of teleost fishes.

    Science.gov (United States)

    Sano, Kaori; Kawaguchi, Mari; Watanabe, Satoshi; Yasumasu, Shigeki

    2014-10-19

    Duplication and subsequent neofunctionalization of the teleostean hatching enzyme gene occurred in the common ancestor of Euteleostei and Otocephala, producing two genes belonging to different phylogenetic clades (clade I and II). In euteleosts, the clade I enzyme inherited the activity of the ancestral enzyme of swelling the egg envelope by cleavage of the N-terminal region of egg envelope proteins. The clade II enzyme gained two specific cleavage sites, N-ZPd and mid-ZPd but lost the ancestral activity. Thus, euteleostean clade II enzymes assumed a new function; solubilization of the egg envelope by the cooperative action with clade I enzyme. However, in Otocephala, the clade II gene was lost during evolution. Consequently, in a late group of Otocephala, only the clade I enzyme is present to swell the egg envelope. We evaluated the egg envelope digestion properties of clade I and II enzymes in Gonorynchiformes, an early diverging group of Otocephala, using milkfish, and compared their digestion with those of other fishes. Finally, we propose a hypothesis of the neofunctionalization process. The milkfish clade II enzyme cleaved N-ZPd but not mid-ZPd, and did not cause solubilization of the egg envelope. We conclude that neofunctionalization is incomplete in the otocephalan clade II enzymes. Comparison of clade I and clade II enzyme characteristics implies that the specificity of the clade II enzymes gradually changed during evolution after the duplication event, and that a change in substrate was required for the addition of the mid-ZPd site and loss of activity at the N-terminal region. We infer the process of neofunctionalization of the clade II enzyme after duplication of the gene. The ancestral clade II gene gained N-ZPd cleavage activity in the common ancestral lineage of the Euteleostei and Otocephala. Subsequently, acquisition of cleavage activity at the mid-ZPd site and loss of cleavage activity in the N-terminal region occurred during the evolution of

  20. Identification of candidate growth promoting genes in ovarian cancer through integrated copy number and expression analysis.

    Science.gov (United States)

    Ramakrishna, Manasa; Williams, Louise H; Boyle, Samantha E; Bearfoot, Jennifer L; Sridhar, Anita; Speed, Terence P; Gorringe, Kylie L; Campbell, Ian G

    2010-04-08

    Ovarian cancer is a disease characterised by complex genomic rearrangements but the majority of the genes that are the target of these alterations remain unidentified. Cataloguing these target genes will provide useful insights into the disease etiology and may provide an opportunity to develop novel diagnostic and therapeutic interventions. High resolution genome wide copy number and matching expression data from 68 primary epithelial ovarian carcinomas of various histotypes was integrated to identify genes in regions of most frequent amplification with the strongest correlation with expression and copy number. Regions on chromosomes 3, 7, 8, and 20 were most frequently increased in copy number (> 40% of samples). Within these regions, 703/1370 (51%) unique gene expression probesets were differentially expressed when samples with gain were compared to samples without gain. 30% of these differentially expressed probesets also showed a strong positive correlation (r > or =0.6) between expression and copy number. We also identified 21 regions of high amplitude copy number gain, in which 32 known protein coding genes showed a strong positive correlation between expression and copy number. Overall, our data validates previously known ovarian cancer genes, such as ERBB2, and also identified novel potential drivers such as MYNN, PUF60 and TPX2.

  1. Identification of candidate growth promoting genes in ovarian cancer through integrated copy number and expression analysis.

    Directory of Open Access Journals (Sweden)

    Manasa Ramakrishna

    Full Text Available Ovarian cancer is a disease characterised by complex genomic rearrangements but the majority of the genes that are the target of these alterations remain unidentified. Cataloguing these target genes will provide useful insights into the disease etiology and may provide an opportunity to develop novel diagnostic and therapeutic interventions. High resolution genome wide copy number and matching expression data from 68 primary epithelial ovarian carcinomas of various histotypes was integrated to identify genes in regions of most frequent amplification with the strongest correlation with expression and copy number. Regions on chromosomes 3, 7, 8, and 20 were most frequently increased in copy number (> 40% of samples. Within these regions, 703/1370 (51% unique gene expression probesets were differentially expressed when samples with gain were compared to samples without gain. 30% of these differentially expressed probesets also showed a strong positive correlation (r > or =0.6 between expression and copy number. We also identified 21 regions of high amplitude copy number gain, in which 32 known protein coding genes showed a strong positive correlation between expression and copy number. Overall, our data validates previously known ovarian cancer genes, such as ERBB2, and also identified novel potential drivers such as MYNN, PUF60 and TPX2.

  2. Selection of suitable endogenous reference genes for relative copy number detection in sugarcane.

    Science.gov (United States)

    Xue, Bantong; Guo, Jinlong; Que, Youxiong; Fu, Zhiwei; Wu, Luguang; Xu, Liping

    2014-05-19

    Transgene copy number has a great impact on the expression level and stability of exogenous gene in transgenic plants. Proper selection of endogenous reference genes is necessary for detection of genetic components in genetically modification (GM) crops by quantitative real-time PCR (qPCR) or by qualitative PCR approach, especially in sugarcane with polyploid and aneuploid genomic structure. qPCR technique has been widely accepted as an accurate, time-saving method on determination of copy numbers in transgenic plants and on detection of genetically modified plants to meet the regulatory and legislative requirement. In this study, to find a suitable endogenous reference gene and its real-time PCR assay for sugarcane (Saccharum spp. hybrids) DNA content quantification, we evaluated a set of potential "single copy" genes including P4H, APRT, ENOL, CYC, TST and PRR, through qualitative PCR and absolute quantitative PCR. Based on copy number comparisons among different sugarcane genotypes, including five S. officinarum, one S. spontaneum and two S. spp. hybrids, these endogenous genes fell into three groups: ENOL-3--high copy number group, TST-1 and PRR-1--medium copy number group, P4H-1, APRT-2 and CYC-2--low copy number group. Among these tested genes, P4H, APRT and CYC were the most stable, while ENOL and TST were the least stable across different sugarcane genotypes. Therefore, three primer pairs of P4H-3, APRT-2 and CYC-2 were then selected as the suitable reference gene primer pairs for sugarcane. The test of multi-target reference genes revealed that the APRT gene was a specific amplicon, suggesting this gene is the most suitable to be used as an endogenous reference target for sugarcane DNA content quantification. These results should be helpful for establishing accurate and reliable qualitative and quantitative PCR analysis of GM sugarcane.

  3. Gene duplication and adaptive evolution of digestive proteases in Drosophila arizonae female reproductive tracts.

    Directory of Open Access Journals (Sweden)

    Erin S Kelleher

    2007-08-01

    Full Text Available It frequently has been postulated that intersexual coevolution between the male ejaculate and the female reproductive tract is a driving force in the rapid evolution of reproductive proteins. The dearth of research on female tracts, however, presents a major obstacle to empirical tests of this hypothesis. Here, we employ a comparative EST approach to identify 241 candidate female reproductive proteins in Drosophila arizonae, a repleta group species in which physiological ejaculate-female coevolution has been documented. Thirty-one of these proteins exhibit elevated amino acid substitution rates, making them candidates for molecular coevolution with the male ejaculate. Strikingly, we also discovered 12 unique digestive proteases whose expression is specific to the D. arizonae lower female reproductive tract. These enzymes belong to classes most commonly found in the gastrointestinal tracts of a diverse array of organisms. We show that these proteases are associated with recent, lineage-specific gene duplications in the Drosophila repleta species group, and exhibit strong signatures of positive selection. Observation of adaptive evolution in several female reproductive tract proteins indicates they are active players in the evolution of reproductive tract interactions. Additionally, pervasive gene duplication, adaptive evolution, and rapid acquisition of a novel digestive function by the female reproductive tract points to a novel coevolutionary mechanism of ejaculate-female interaction.

  4. On the origin of protein synthesis factors: a gene duplication/fusion model.

    Science.gov (United States)

    Cousineau, B; Leclerc, F; Cedergren, R

    1997-12-01

    Sequence similarity has given rise to the proposal that IF-2, EF-G, and EF-Tu are related through a common ancestor. We evaluate this proposition and whether the relationship can be extended to other factors of protein synthesis. Analysis of amino acid sequence similarity gives statistical support for an evolutionary affiliation among IF-1, IF-2, IF-3, EF-Tu, EF-Ts, and EF-G and suggests further that this association is a result of gene duplication/fusion events. In support of this mechanism, the three-dimensional structures of IF-3, EF-Tu, and EF-G display a predictable domain structure and overall conformational similarity. The model that we propose consists of three consecutives duplication/fusion events which would have taken place before the divergence of the three superkingdoms: eubacteria, archaea, and eukaryotes. The root of this protein superfamily tree would be an ancestor of the modern IF-1 gene sequence. The repeated fundamental motif of this protein superfamily is a small RNA binding domain composed of two alpha-helices packed along side of an antiparallel beta-sheet.

  5. New organelles by gene duplication in a biophysical model of eukaryote endomembrane evolution.

    Science.gov (United States)

    Ramadas, Rohini; Thattai, Mukund

    2013-06-04

    Extant eukaryotic cells have a dynamic traffic network that consists of diverse membrane-bound organelles exchanging matter via vesicles. This endomembrane system arose and diversified during a period characterized by massive expansions of gene families involved in trafficking after the acquisition of a mitochondrial endosymbiont by a prokaryotic host cell >1.8 billion years ago. Here we investigate the mechanistic link between gene duplication and the emergence of new nonendosymbiotic organelles, using a minimal biophysical model of traffic. Our model incorporates membrane-bound compartments, coat proteins and adaptors that drive vesicles to bud and segregate cargo from source compartments, and SNARE proteins and associated factors that cause vesicles to fuse into specific destination compartments. In simulations, arbitrary numbers of compartments with heterogeneous initial compositions segregate into a few compositionally distinct subsets that we term organelles. The global structure of the traffic system (i.e., the number, composition, and connectivity of organelles) is determined completely by local molecular interactions. On evolutionary timescales, duplication of the budding and fusion machinery followed by loss of cross-interactions leads to the emergence of new organelles, with increased molecular specificity being necessary to maintain larger organellar repertoires. These results clarify potential modes of early eukaryotic evolution as well as more recent eukaryotic diversification. Copyright © 2013 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  6. Inducible amplification of gene copy number and heterologous protein production in the yeast Kluyveromyces lactis.

    Science.gov (United States)

    Morlino, G B; Tizzani, L; Fleer, R; Frontali, L; Bianchi, M M

    1999-11-01

    Heterologous protein production can be doubled by increasing the copy number of the corresponding heterologous gene. We constructed a host-vector system in the yeast Kluyveromyces lactis that was able to induce copy number amplification of pKD1 plasmid-based vectors upon expression of an integrated copy of the plasmid recombinase gene. We increased the production and secretion of two heterologous proteins, glucoamylase from the yeast Arxula adeninivorans and mammalian interleukin-1beta, following gene dosage amplification when the heterologous genes were carried by pKD1-based vectors. The choice of the promoters for expression of the integrated recombinase gene and of the episomal heterologous genes are critical for the mitotic stability of the host-vector system.

  7. The impact of gene duplication, insertion, deletion, lateral gene transfer and sequencing error on orthology inference: a simulation study.

    Science.gov (United States)

    Dalquen, Daniel A; Altenhoff, Adrian M; Gonnet, Gaston H; Dessimoz, Christophe

    2013-01-01

    The identification of orthologous genes, a prerequisite for numerous analyses in comparative and functional genomics, is commonly performed computationally from protein sequences. Several previous studies have compared the accuracy of orthology inference methods, but simulated data has not typically been considered in cross-method assessment studies. Yet, while dependent on model assumptions, simulation-based benchmarking offers unique advantages: contrary to empirical data, all aspects of simulated data are known with certainty. Furthermore, the flexibility of simulation makes it possible to investigate performance factors in isolation of one another.Here, we use simulated data to dissect the performance of six methods for orthology inference available as standalone software packages (Inparanoid, OMA, OrthoInspector, OrthoMCL, QuartetS, SPIMAP) as well as two generic approaches (bidirectional best hit and reciprocal smallest distance). We investigate the impact of various evolutionary forces (gene duplication, insertion, deletion, and lateral gene transfer) and technological artefacts (ambiguous sequences) on orthology inference. We show that while gene duplication/loss and insertion/deletion are well handled by most methods (albeit for different trade-offs of precision and recall), lateral gene transfer disrupts all methods. As for ambiguous sequences, which might result from poor sequencing, assembly, or genome annotation, we show that they affect alignment score-based orthology methods more strongly than their distance-based counterparts.

  8. The impact of gene duplication, insertion, deletion, lateral gene transfer and sequencing error on orthology inference: a simulation study.

    Directory of Open Access Journals (Sweden)

    Daniel A Dalquen

    Full Text Available The identification of orthologous genes, a prerequisite for numerous analyses in comparative and functional genomics, is commonly performed computationally from protein sequences. Several previous studies have compared the accuracy of orthology inference methods, but simulated data has not typically been considered in cross-method assessment studies. Yet, while dependent on model assumptions, simulation-based benchmarking offers unique advantages: contrary to empirical data, all aspects of simulated data are known with certainty. Furthermore, the flexibility of simulation makes it possible to investigate performance factors in isolation of one another.Here, we use simulated data to dissect the performance of six methods for orthology inference available as standalone software packages (Inparanoid, OMA, OrthoInspector, OrthoMCL, QuartetS, SPIMAP as well as two generic approaches (bidirectional best hit and reciprocal smallest distance. We investigate the impact of various evolutionary forces (gene duplication, insertion, deletion, and lateral gene transfer and technological artefacts (ambiguous sequences on orthology inference. We show that while gene duplication/loss and insertion/deletion are well handled by most methods (albeit for different trade-offs of precision and recall, lateral gene transfer disrupts all methods. As for ambiguous sequences, which might result from poor sequencing, assembly, or genome annotation, we show that they affect alignment score-based orthology methods more strongly than their distance-based counterparts.

  9. ALK Gene Copy Number Gain and Immunohistochemical Expression Status Using Three Antibodies in Neuroblastoma.

    Science.gov (United States)

    Kim, Eun Kyung; Kim, Sewha

    2017-01-01

    Anaplastic lymphoma kinase ( ALK) gene aberrations-such as mutations, amplifications, and copy number gains-represent a major genetic predisposition to neuroblastoma (NB). This study aimed to evaluate the correlation between ALK gene copy number status, ALK protein expression, and clinicopathological parameters. We retrospectively retrieved 30 cases of poorly differentiated NB and constructed tissue microarrays (TMAs). ALK copy number changes were assessed by fluorescence in situ hybridization (FISH) assays, and ALK immunohistochemistry (IHC) testing was performed using three different antibodies (ALK1, D5F3, and 5A4 clones). ALK amplification and copy number gain were observed in 10% (3/30) and 53.3% (16/30) of the cohort, respectively. There were positive correlations between ALK copy number and IHC-positive rate in ALK1 and 5A4 antibodies ( P copy number gain differed among the three antibodies, with 75% sensitivity in D5F3 and 0% sensitivity in ALK1. ALK-amplified NBs were correlated with synchronous MYCN amplification and chromosome 1p deletion. ALK IHC positivity was frequently observed in INSS stage IV and high-risk group patients. In conclusion, this study identified that an increase in the ALK copy number is a frequent genetic alteration in poorly differentiated NB. ALK-amplified NBs showed consistent ALK IHC positivity with all kinds of antibodies. In contrast, the detection performance of ALK copy number gain was antibody dependent, with the D5F3 antibody showing the best sensitivity.

  10. Copy number variation of KIR genes influences HIV-1 control

    DEFF Research Database (Denmark)

    Pelak, Kimberly; Need, Anna C; Fellay, Jacques

    2011-01-01

    A genome-wide screen for large structural variants showed that a copy number variant (CNV) in the region encoding killer cell immunoglobulin-like receptors (KIR) associates with HIV-1 control as measured by plasma viral load at set point in individuals of European ancestry. This CNV encompasses...... the KIR3DL1-KIR3DS1 locus, encoding receptors that interact with specific HLA-Bw4 molecules to regulate the activation of lymphocyte subsets including natural killer (NK) cells. We quantified the number of copies of KIR3DS1 and KIR3DL1 in a large HIV-1 positive cohort, and showed that an increase in KIR3...... individuals with multiple copies of KIR3DL1, in the presence of KIR3DS1 and the appropriate ligands, inhibit HIV-1 replication more robustly, and associated with a significant expansion in the frequency of KIR3DS1+, but not KIR3DL1+, NK cells in their peripheral blood. Our results suggest that the relative...

  11. Sgs1 and Exo1 suppress targeted chromosome duplication during ends-in and ends-out gene targeting.

    Science.gov (United States)

    Štafa, Anamarija; Miklenić, Marina; Zunar, Bojan; Lisnić, Berislav; Symington, Lorraine S; Svetec, Ivan-Krešimir

    2014-10-01

    Gene targeting is extremely efficient in the yeast Saccharomyces cerevisiae. It is performed by transformation with a linear, non-replicative DNA fragment carrying a selectable marker and containing ends homologous to the particular locus in a genome. However, even in S. cerevisiae, transformation can result in unwanted (aberrant) integration events, the frequency and spectra of which are quite different for ends-out and ends-in transformation assays. It has been observed that gene replacement (ends-out gene targeting) can result in illegitimate integration, integration of the transforming DNA fragment next to the target sequence and duplication of a targeted chromosome. By contrast, plasmid integration (ends-in gene targeting) is often associated with multiple targeted integration events but illegitimate integration is extremely rare and a targeted chromosome duplication has not been reported. Here we systematically investigated the influence of design of the ends-out assay on the success of targeted genetic modification. We have determined transformation efficiency, fidelity of gene targeting and spectra of all aberrant events in several ends-out gene targeting assays designed to insert, delete or replace a particular sequence in the targeted region of the yeast genome. Furthermore, we have demonstrated for the first time that targeted chromosome duplications occur even during ends-in gene targeting. Most importantly, the whole chromosome duplication is POL32 dependent pointing to break-induced replication (BIR) as the underlying mechanism. Moreover, the occurrence of duplication of the targeted chromosome was strikingly increased in the exo1Δ sgs1Δ double mutant but not in the respective single mutants demonstrating that the Exo1 and Sgs1 proteins independently suppress whole chromosome duplication during gene targeting.

  12. Evolutionary history of c-myc in teleosts and characterization of the duplicated c-myca genes in goldfish embryos.

    Science.gov (United States)

    Marandel, Lucie; Labbe, Catherine; Bobe, Julien; Le Bail, Pierre-Yves

    2012-02-01

    c-Myc plays an important role during embryogenesis in mammals, but little is known about its function during embryonic development in teleosts. In addition, the evolutionary history of c-myc gene in teleosts remains unclear, and depending on the species, a variable number of gene duplicates exist in teleosts. To gain new insight into c-myc genes in teleosts, the present study was designed to clarify the evolutionary history of c-myc gene(s) in teleosts and to subsequently characterize DNA methylation and early embryonic expression patterns in a cyprinid fish. Our results show that a duplication of c-myc gene occurred before or around the teleost radiation, as a result of the teleost-specific whole genome duplication giving rise to c-myca and c-mycb in teleosts and was followed by a loss of the c-mycb gene in the Gasterosteiforms and Tetraodontiforms. Our data also demonstrate that both c-myc genes previously identified in carp and goldfish are co-orthologs of the zebrafish c-myca. These results indicate the presence of additional c-myca duplication in Cyprininae. We were able to identify differences between the expression patterns of the two goldfish c-myca genes in oocytes and early embryos. These differences suggest a partial sub-functionalization of c-myca genes after duplication. Despite differences in transcription patterns, both of the c-myca genes displayed similar DNA methylation patterns during early development and in gametes. Together, our results clarify the evolutionary history of the c-myc gene in teleosts and provide new insight into the involvement of c-myc in early embryonic development in cyprinids. Copyright © 2011 Wiley Periodicals, Inc.

  13. TOP1 gene copy numbers are increased in cancers of the bile duct and pancreas

    DEFF Research Database (Denmark)

    Grunnet, Mie; Calatayud, Dan; Schultz, Nicolai Aa.

    2015-01-01

    ) poison. Top1 protein, TOP1 gene copy number and mRNA expression, respectively, have been proposed as predictive biomarkers of response to irinotecan in other cancers. Here we investigate the occurrence of TOP1 gene aberrations in cancers of the bile ducts and pancreas. Material and methods. TOP1...... and centromere 20 (CEN-20) numbers were investigated by fluorescence in situ hybridization analyses in tumor tissue from 226 patients. The frequencies of aberration in the TOP1 gene copy number, the CEN-20 copy number and the TOP1/CEN-20 ratio were analyzed. As TOP1 is located on chromosome 20, the CEN-20 probe...... was included to distinguish between chromosomal and gene amplifications. Results. In PC, 29.8% had an increased TOP1 copy number (≥3.5n gene copies per cell) and 10.8% had a TOP1/CEN-20 ratio >1.5. In bile duct cancer, 12.8 % had an increased TOP1 copy number and 6.4% had a TOP1/CEN-20 ratio >1.5. Neither...

  14. A 380-kb Duplication in 7p22.3 Encompassing the LFNG Gene in a Boy with Asperger Syndrome

    NARCIS (Netherlands)

    Vulto-van Silfhout, A.T.; Brouwer, A.F. de; Leeuw, N. de; Obihara, C.C.; Brunner, H.G.; Vries, B.B. de

    2012-01-01

    De novo genomic aberrations are considered an important cause of autism spectrum disorders. We describe a de novo 380-kb gain in band p22.3 of chromosome 7 in a patient with Asperger syndrome. This duplicated region contains 9 genes including the LNFG gene that is an important regulator of NOTCH

  15. A 380-kb Duplication in 7p22.3 Encompassing the LFNG Gene in a Boy with Asperger Syndrome

    NARCIS (Netherlands)

    Vulto-van Silfhout, A.T.; Brouwer, A.F. de; Leeuw, N. de; Obihara, C.C.; Brunner, H.G.; Vries, B.B. de

    2012-01-01

    De novo genomic aberrations are considered an important cause of autism spectrum disorders. We describe a de novo 380-kb gain in band p22.3 of chromosome 7 in a patient with Asperger syndrome. This duplicated region contains 9 genes including the LNFG gene that is an important regulator of NOTCH sig

  16. A 380-kb Duplication in 7p22.3 Encompassing the LFNG Gene in a Boy with Asperger Syndrome

    NARCIS (Netherlands)

    Vulto-van Silfhout, A.T.; Brouwer, A.F. de; Leeuw, N. de; Obihara, C.C.; Brunner, H.G.; Vries, B.B. de

    2012-01-01

    De novo genomic aberrations are considered an important cause of autism spectrum disorders. We describe a de novo 380-kb gain in band p22.3 of chromosome 7 in a patient with Asperger syndrome. This duplicated region contains 9 genes including the LNFG gene that is an important regulator of NOTCH sig

  17. Copy number and orientation determine the susceptibility of a gene to silencing by nearby heterochromatin in Drosophila

    Energy Technology Data Exchange (ETDEWEB)

    Sabl, J.F. [Univ. of Washington, Seattle, WA (United States)]|[Fred Hutchinson Cancer Research Center, Seattle, WA (United States); Henikoff, S. [Fred Hutchinson Cancer Research Center, Seattle, WA (United States)]|[Howard Hughes Medical Institute, Seattle, WA (United States)

    1996-02-01

    The classical phenomenon of position-effect variegation (PEV) is the mosaic expression that occurs when a chromosomal rearrangements moves a euchromatic gene near heterochromatin. A striking feature of this phenomenon is that genes far away from the junction with heterochromatin can be affected, as if the heterochromatic state {open_quotes}spreads.{close_quotes} We have investigated classical PEV of a Drosophila brown transgene affected by a heterochromatic junction {approximately} 60 kb away. PEV was enhanced when the transgene was locally duplicated using P transposase. Successive rounds of P transpose mutagenesis and phenotypic selection produced a series of PEV alleles with differences in phenotype that depended on transgene copy number and orientation. As for other examples of classical PEV, nearby heterochromatin was required for gene silencing. Modifications of classical PEV by alterations at a single site are unexpected, and these observations contradict models for spreading that invoke propagation of heterochromatin along the chromosome. Rather, our results support a model in which local alterations affect the affinity of a gene region for nearby heterochromatin via homology-based pairing, suggesting an alternative explanation for this 65-year-old phenomenon. 63 refs., 6 figs., 1 tab.

  18. Integrated analyses of copy number variations and gene expression in lung adenocarcinoma.

    Directory of Open Access Journals (Sweden)

    Tzu-Pin Lu

    Full Text Available Numerous efforts have been made to elucidate the etiology and improve the treatment of lung cancer, but the overall five-year survival rate is still only 15%. Identification of prognostic biomarkers for lung cancer using gene expression microarrays poses a major challenge in that very few overlapping genes have been reported among different studies. To address this issue, we have performed concurrent genome-wide analyses of copy number variation and gene expression to identify genes reproducibly associated with tumorigenesis and survival in non-smoking female lung adenocarcinoma. The genomic landscape of frequent copy number variable regions (CNVRs in at least 30% of samples was revealed, and their aberration patterns were highly similar to several studies reported previously. Further statistical analysis for genes located in the CNVRs identified 475 genes differentially expressed between tumor and normal tissues (p<10(-5. We demonstrated the reproducibility of these genes in another lung cancer study (p = 0.0034, Fisher's exact test, and showed the concordance between copy number variations and gene expression changes by elevated Pearson correlation coefficients. Pathway analysis revealed two major dysregulated functions in lung tumorigenesis: survival regulation via AKT signaling and cytoskeleton reorganization. Further validation of these enriched pathways using three independent cohorts demonstrated effective prediction of survival. In conclusion, by integrating gene expression profiles and copy number variations, we identified genes/pathways that may serve as prognostic biomarkers for lung tumorigenesis.

  19. Selection of Suitable Endogenous Reference Genes for Relative Copy Number Detection in Sugarcane

    Directory of Open Access Journals (Sweden)

    Bantong Xue

    2014-05-01

    Full Text Available Transgene copy number has a great impact on the expression level and stability of exogenous gene in transgenic plants. Proper selection of endogenous reference genes is necessary for detection of genetic components in genetically modification (GM crops by quantitative real-time PCR (qPCR or by qualitative PCR approach, especially in sugarcane with polyploid and aneuploid genomic structure. qPCR technique has been widely accepted as an accurate, time-saving method on determination of copy numbers in transgenic plants and on detection of genetically modified plants to meet the regulatory and legislative requirement. In this study, to find a suitable endogenous reference gene and its real-time PCR assay for sugarcane (Saccharum spp. hybrids DNA content quantification, we evaluated a set of potential “single copy” genes including P4H, APRT, ENOL, CYC, TST and PRR, through qualitative PCR and absolute quantitative PCR. Based on copy number comparisons among different sugarcane genotypes, including five S. officinarum, one S. spontaneum and two S. spp. hybrids, these endogenous genes fell into three groups: ENOL-3—high copy number group, TST-1 and PRR-1—medium copy number group, P4H-1, APRT-2 and CYC-2—low copy number group. Among these tested genes, P4H, APRT and CYC were the most stable, while ENOL and TST were the least stable across different sugarcane genotypes. Therefore, three primer pairs of P4H-3, APRT-2 and CYC-2 were then selected as the suitable reference gene primer pairs for sugarcane. The test of multi-target reference genes revealed that the APRT gene was a specific amplicon, suggesting this gene is the most suitable to be used as an endogenous reference target for sugarcane DNA content quantification. These results should be helpful for establishing accurate and reliable qualitative and quantitative PCR analysis of GM sugarcane.

  20. Single-Copy Genes as Molecular Markers for Phylogenomic Studies in Seed Plants.

    Science.gov (United States)

    Li, Zhen; De La Torre, Amanda R; Sterck, Lieven; Cánovas, Francisco M; Avila, Concepción; Merino, Irene; Cabezas, José Antonio; Cervera, María Teresa; Ingvarsson, Pär K; Van de Peer, Yves

    2017-05-01

    Phylogenetic relationships among seed plant taxa, especially within the gymnosperms, remain contested. In contrast to angiosperms, for which several genomic, transcriptomic and phylogenetic resources are available, there are few, if any, molecular markers that allow broad comparisons among gymnosperm species. With few gymnosperm genomes available, recently obtained transcriptomes in gymnosperms are a great addition to identifying single-copy gene families as molecular markers for phylogenomic analysis in seed plants. Taking advantage of an increasing number of available genomes and transcriptomes, we identified single-copy genes in a broad collection of seed plants and used these to infer phylogenetic relationships between major seed plant taxa. This study aims at extending the current phylogenetic toolkit for seed plants, assessing its ability for resolving seed plant phylogeny, and discussing potential factors affecting phylogenetic reconstruction. In total, we identified 3,072 single-copy genes in 31 gymnosperms and 2,156 single-copy genes in 34 angiosperms. All studied seed plants shared 1,469 single-copy genes, which are generally involved in functions like DNA metabolism, cell cycle, and photosynthesis. A selected set of 106 single-copy genes provided good resolution for the seed plant phylogeny except for gnetophytes. Although some of our analyses support a sister relationship between gnetophytes and other gymnosperms, phylogenetic trees from concatenated alignments without 3rd codon positions and amino acid alignments under the CAT + GTR model, support gnetophytes as a sister group to Pinaceae. Our phylogenomic analyses demonstrate that, in general, single-copy genes can uncover both recent and deep divergences of seed plant phylogeny. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  1. Copy number variation of KIR genes influences HIV-1 control

    DEFF Research Database (Denmark)

    Pelak, Kimberly; Need, Anna C; Fellay, Jacques;

    2011-01-01

    A genome-wide screen for large structural variants showed that a copy number variant (CNV) in the region encoding killer cell immunoglobulin-like receptors (KIR) associates with HIV-1 control as measured by plasma viral load at set point in individuals of European ancestry. This CNV encompasses...... the KIR3DL1-KIR3DS1 locus, encoding receptors that interact with specific HLA-Bw4 molecules to regulate the activation of lymphocyte subsets including natural killer (NK) cells. We quantified the number of copies of KIR3DS1 and KIR3DL1 in a large HIV-1 positive cohort, and showed that an increase in KIR3......DS1 count associates with a lower viral set point if its putative ligand is present (p = 0.00028), as does an increase in KIR3DL1 count in the presence of KIR3DS1 and appropriate ligands for both receptors (p = 0.0015). We further provide functional data that demonstrate that NK cells from...

  2. Gene duplication and fragmentation in the zebra finch major histocompatibility complex

    Directory of Open Access Journals (Sweden)

    Burt David W

    2010-04-01

    Full Text Available Abstract Background Due to its high polymorphism and importance for disease resistance, the major histocompatibility complex (MHC has been an important focus of many vertebrate genome projects. Avian MHC organization is of particular interest because the chicken Gallus gallus, the avian species with the best characterized MHC, possesses a highly streamlined minimal essential MHC, which is linked to resistance against specific pathogens. It remains unclear the extent to which this organization describes the situation in other birds and whether it represents a derived or ancestral condition. The sequencing of the zebra finch Taeniopygia guttata genome, in combination with targeted bacterial artificial chromosome (BAC sequencing, has allowed us to characterize an MHC from a highly divergent and diverse avian lineage, the passerines. Results The zebra finch MHC exhibits a complex structure and history involving gene duplication and fragmentation. The zebra finch MHC includes multiple Class I and Class II genes, some of which appear to be pseudogenes, and spans a much more extensive genomic region than the chicken MHC, as evidenced by the presence of MHC genes on each of seven BACs spanning 739 kb. Cytogenetic (FISH evidence and the genome assembly itself place core MHC genes on as many as four chromosomes with TAP and Class I genes mapping to different chromosomes. MHC Class II regions are further characterized by high endogenous retroviral content. Lastly, we find strong evidence of selection acting on sites within passerine MHC Class I and Class II genes. Conclusion The zebra finch MHC differs markedly from that of the chicken, the only other bird species with a complete genome sequence. The apparent lack of synteny between TAP and the expressed MHC Class I locus is in fact reminiscent of a pattern seen in some mammalian lineages and may represent convergent evolution. Our analyses of the zebra finch MHC suggest a complex history involving

  3. The polyphenol oxidase gene family in land plants: Lineage-specific duplication and expansion

    Directory of Open Access Journals (Sweden)

    Tran Lan T

    2012-08-01

    Full Text Available Abstract Background Plant polyphenol oxidases (PPOs are enzymes that typically use molecular oxygen to oxidize ortho-diphenols to ortho-quinones. These commonly cause browning reactions following tissue damage, and may be important in plant defense. Some PPOs function as hydroxylases or in cross-linking reactions, but in most plants their physiological roles are not known. To better understand the importance of PPOs in the plant kingdom, we surveyed PPO gene families in 25 sequenced genomes from chlorophytes, bryophytes, lycophytes, and flowering plants. The PPO genes were then analyzed in silico for gene structure, phylogenetic relationships, and targeting signals. Results Many previously uncharacterized PPO genes were uncovered. The moss, Physcomitrella patens, contained 13 PPO genes and Selaginella moellendorffii (spike moss and Glycine max (soybean each had 11 genes. Populus trichocarpa (poplar contained a highly diversified gene family with 11 PPO genes, but several flowering plants had only a single PPO gene. By contrast, no PPO-like sequences were identified in several chlorophyte (green algae genomes or Arabidopsis (A. lyrata and A. thaliana. We found that many PPOs contained one or two introns often near the 3’ terminus. Furthermore, N-terminal amino acid sequence analysis using ChloroP and TargetP 1.1 predicted that several putative PPOs are synthesized via the secretory pathway, a unique finding as most PPOs are predicted to be chloroplast proteins. Phylogenetic reconstruction of these sequences revealed that large PPO gene repertoires in some species are mostly a consequence of independent bursts of gene duplication, while the lineage leading to Arabidopsis must have lost all PPO genes. Conclusion Our survey identified PPOs in gene families of varying sizes in all land plants except in the genus Arabidopsis. While we found variation in intron numbers and positions, overall PPO gene structure is congruent with the phylogenetic

  4. A survey of innovation through duplication in the reduced genomes of twelve parasites.

    Directory of Open Access Journals (Sweden)

    Jeremy D DeBarry

    Full Text Available We characterize the prevalence, distribution, divergence, and putative functions of detectable two-copy paralogs and segmental duplications in the Apicomplexa, a phylum of parasitic protists. Apicomplexans are mostly obligate intracellular parasites responsible for human and animal diseases (e.g. malaria and toxoplasmosis. Gene loss is a major force in the phylum. Genomes are small and protein-encoding gene repertoires are reduced. Despite this genomic streamlining, duplications and gene family amplifications are present. The potential for innovation introduced by duplications is of particular interest. We compared genomes of twelve apicomplexans across four lineages and used orthology and genome cartography to map distributions of duplications against genome architectures. Segmental duplications appear limited to five species. Where present, they correspond to regions enriched for multi-copy and species-specific genes, pointing toward roles in adaptation and innovation. We found a phylum-wide association of duplications with dynamic chromosome regions and syntenic breakpoints. Trends in the distribution of duplicated genes indicate that recent, species-specific duplicates are often tandem while most others have been dispersed by genome rearrangements. These trends show a relationship between genome architecture and gene duplication. Functional analysis reveals: proteases, which are vital to a parasitic lifecycle, to be prominent in putative recent duplications; a pair of paralogous genes in Toxoplasma gondii previously shown to produce the rate-limiting step in dopamine synthesis in mammalian cells, a possible link to the modification of host behavior; and phylum-wide differences in expression and subcellular localization, indicative of modes of divergence. We have uncovered trends in multiple modes of duplicate divergence including sequence, intron content, expression, subcellular localization, and functions of putative recent duplicates that

  5. High-Throughput Amplicon-Based Copy Number Detection of 11 Genes in Formalin-Fixed Paraffin-Embedded Ovarian Tumour Samples by MLPA-Seq.

    Science.gov (United States)

    Kondrashova, Olga; Love, Clare J; Lunke, Sebastian; Hsu, Arthur L; Waring, Paul M; Taylor, Graham R

    2015-01-01

    Whilst next generation sequencing can report point mutations in fixed tissue tumour samples reliably, the accurate determination of copy number is more challenging. The conventional Multiplex Ligation-dependent Probe Amplification (MLPA) assay is an effective tool for measurement of gene dosage, but is restricted to around 50 targets due to size resolution of the MLPA probes. By switching from a size-resolved format, to a sequence-resolved format we developed a scalable, high-throughput, quantitative assay. MLPA-seq is capable of detecting deletions, duplications, and amplifications in as little as 5ng of genomic DNA, including from formalin-fixed paraffin-embedded (FFPE) tumour samples. We show that this method can detect BRCA1, BRCA2, ERBB2 and CCNE1 copy number changes in DNA extracted from snap-frozen and FFPE tumour tissue, with 100% sensitivity and >99.5% specificity.

  6. Gene duplication and fragment recombination drive functional diversification of a superfamily of cytoplasmic effectors in Phytophthora sojae.

    Science.gov (United States)

    Shen, Danyu; Liu, Tingli; Ye, Wenwu; Liu, Li; Liu, Peihan; Wu, Yuren; Wang, Yuanchao; Dou, Daolong

    2013-01-01

    Phytophthora and other oomycetes secrete a large number of putative host cytoplasmic effectors with conserved FLAK motifs following signal peptides, termed crinkling and necrosis inducing proteins (CRN), or Crinkler. Here, we first investigated the evolutionary patterns and mechanisms of CRN effectors in Phytophthora sojae and compared them to two other Phytophthora species. The genes encoding CRN effectors could be divided into 45 orthologous gene groups (OGG), and most OGGs unequally distributed in the three species, in which each underwent large number of gene gains or losses, indicating that the CRN genes expanded after species evolution in Phytophthora and evolved through pathoadaptation. The 134 expanded genes in P. sojae encoded family proteins including 82 functional genes and expressed at higher levels while the other 68 genes encoding orphan proteins were less expressed and contained 50 pseudogenes. Furthermore, we demonstrated that most expanded genes underwent gene duplication or/and fragment recombination. Three different mechanisms that drove gene duplication or recombination were identified. Finally, the expanded CRN effectors exhibited varying pathogenic functions, including induction of programmed cell death (PCD) and suppression of PCD through PAMP-triggered immunity or/and effector-triggered immunity. Overall, these results suggest that gene duplication and fragment recombination may be two mechanisms that drive the expansion and neofunctionalization of the CRN family in P. sojae, which aids in understanding the roles of CRN effectors within each oomycete pathogen.

  7. Gene duplication and fragment recombination drive functional diversification of a superfamily of cytoplasmic effectors in Phytophthora sojae.

    Directory of Open Access Journals (Sweden)

    Danyu Shen

    Full Text Available Phytophthora and other oomycetes secrete a large number of putative host cytoplasmic effectors with conserved FLAK motifs following signal peptides, termed crinkling and necrosis inducing proteins (CRN, or Crinkler. Here, we first investigated the evolutionary patterns and mechanisms of CRN effectors in Phytophthora sojae and compared them to two other Phytophthora species. The genes encoding CRN effectors could be divided into 45 orthologous gene groups (OGG, and most OGGs unequally distributed in the three species, in which each underwent large number of gene gains or losses, indicating that the CRN genes expanded after species evolution in Phytophthora and evolved through pathoadaptation. The 134 expanded genes in P. sojae encoded family proteins including 82 functional genes and expressed at higher levels while the other 68 genes encoding orphan proteins were less expressed and contained 50 pseudogenes. Furthermore, we demonstrated that most expanded genes underwent gene duplication or/and fragment recombination. Three different mechanisms that drove gene duplication or recombination were identified. Finally, the expanded CRN effectors exhibited varying pathogenic functions, including induction of programmed cell death (PCD and suppression of PCD through PAMP-triggered immunity or/and effector-triggered immunity. Overall, these results suggest that gene duplication and fragment recombination may be two mechanisms that drive the expansion and neofunctionalization of the CRN family in P. sojae, which aids in understanding the roles of CRN effectors within each oomycete pathogen.

  8. Gene duplication, loss and selection in the evolution of saxitoxin biosynthesis in alveolates.

    Science.gov (United States)

    Murray, Shauna A; Diwan, Rutuja; Orr, Russell J S; Kohli, Gurjeet S; John, Uwe

    2015-11-01

    A group of marine dinoflagellates (Alveolata, Eukaryota), consisting of ∼10 species of the genus Alexandrium, Gymnodinium catenatum and Pyrodinium bahamense, produce the toxin saxitoxin and its analogues (STX), which can accumulate in shellfish, leading to ecosystem and human health impacts. The genes, sxt, putatively involved in STX biosynthesis, have recently been identified, however, the evolution of these genes within dinoflagellates is not clear. There are two reasons for this: uncertainty over the phylogeny of dinoflagellates; and that the sxt genes of many species of Alexandrium and other dinoflagellate genera are not known. Here, we determined the phylogeny of STX-producing and other dinoflagellates based on a concatenated eight-gene alignment. We determined the presence, diversity and phylogeny of sxtA, domains A1 and A4 and sxtG in 52 strains of Alexandrium, and a further 43 species of dinoflagellates and thirteen other alveolates. We confirmed the presence and high sequence conservation of sxtA, domain A4, in 40 strains (35 Alexandrium, 1 Pyrodinium, 4 Gymnodinium) of 8 species of STX-producing dinoflagellates, and absence from non-producing species. We found three paralogs of sxtA, domain A1, and a widespread distribution of sxtA1 in non-STX producing dinoflagellates, indicating duplication events in the evolution of this gene. One paralog, clade 2, of sxtA1 may be particularly related to STX biosynthesis. Similarly, sxtG appears to be generally restricted to STX-producing species, while three amidinotransferase gene paralogs were found in dinoflagellates. We investigated the role of positive (diversifying) selection following duplication in sxtA1 and sxtG, and found negative selection in clades of sxtG and sxtA1, clade 2, suggesting they were functionally constrained. Significant episodic diversifying selection was found in some strains in clade 3 of sxtA1, a clade that may not be involved in STX biosynthesis, indicating pressure for diversification

  9. Characterization of gene mutations and copy number changes in acute myeloid leukemia using a rapid target enrichment protocol.

    Science.gov (United States)

    Bolli, Niccolò; Manes, Nicla; McKerrell, Thomas; Chi, Jianxiang; Park, Naomi; Gundem, Gunes; Quail, Michael A; Sathiaseelan, Vijitha; Herman, Bram; Crawley, Charles; Craig, Jenny I O; Conte, Natalie; Grove, Carolyn; Papaemmanuil, Elli; Campbell, Peter J; Varela, Ignacio; Costeas, Paul; Vassiliou, George S

    2015-02-01

    Prognostic stratification is critical for making therapeutic decisions and maximizing survival of patients with acute myeloid leukemia. Advances in the genomics of acute myeloid leukemia have identified several recurrent gene mutations whose prognostic impact is being deciphered. We used HaloPlex target enrichment and Illumina-based next generation sequencing to study 24 recurrently mutated genes in 42 samples of acute myeloid leukemia with a normal karyotype. Read depth varied between and within genes for the same sample, but was predictable and highly consistent across samples. Consequently, we were able to detect copy number changes, such as an interstitial deletion of BCOR, three MLL partial tandem duplications, and a novel KRAS amplification. With regards to coding mutations, we identified likely oncogenic variants in 41 of 42 samples. NPM1 mutations were the most frequent, followed by FLT3, DNMT3A and TET2. NPM1 and FLT3 indels were reported with good efficiency. We also showed that DNMT3A mutations can persist post-chemotherapy and in 2 cases studied at diagnosis and relapse, we were able to delineate the dynamics of tumor evolution and give insights into order of acquisition of variants. HaloPlex is a quick and reliable target enrichment method that can aid diagnosis and prognostic stratification of acute myeloid leukemia patients.

  10. Evaluation of β-globin gene therapy constructs in single-copy transgenic mice.

    NARCIS (Netherlands)

    J. Ellis (James); K.C. Tan-Un; P. Pasceri; A. Harper; X. Wu; P.J. Fraser (Peter); F.G. Grosveld (Frank)

    1997-01-01

    textabstractEffective gene therapy constructs based on retrovirus or adeno-associated virus vectors will require regulatory elements that direct expression of genes transduced at single copy. Most beta-globin constructs designed for therapy of beta-thalassemias are regulated by the 5'HS2 component o

  11. Comparison of quantitative PCR assays for Escherichia coli targeting ribosomal RNA and single copy genes

    Science.gov (United States)

    Aims: Compare specificity and sensitivity of quantitative PCR (qPCR) assays targeting single and multi-copy gene regions of Escherichia coli. Methods and Results: A previously reported assay targeting the uidA gene (uidA405) was used as the basis for comparing the taxono...

  12. Genome-wide analysis of homeobox genes from Mesobuthus martensii reveals Hox gene duplication in scorpions.

    Science.gov (United States)

    Di, Zhiyong; Yu, Yao; Wu, Yingliang; Hao, Pei; He, Yawen; Zhao, Huabin; Li, Yixue; Zhao, Guoping; Li, Xuan; Li, Wenxin; Cao, Zhijian

    2015-06-01

    Homeobox genes belong to a large gene group, which encodes the famous DNA-binding homeodomain that plays a key role in development and cellular differentiation during embryogenesis in animals. Here, one hundred forty-nine homeobox genes were identified from the Asian scorpion, Mesobuthus martensii (Chelicerata: Arachnida: Scorpiones: Buthidae) based on our newly assembled genome sequence with approximately 248 × coverage. The identified homeobox genes were categorized into eight classes including 82 families: 67 ANTP class genes, 33 PRD genes, 11 LIM genes, five POU genes, six SINE genes, 14 TALE genes, five CUT genes, two ZF genes and six unclassified genes. Transcriptome data confirmed that more than half of the genes were expressed in adults. The homeobox gene diversity of the eight classes is similar to the previously analyzed Mandibulata arthropods. Interestingly, it is hypothesized that the scorpion M. martensii may have two Hox clusters. The first complete genome-wide analysis of homeobox genes in Chelicerata not only reveals the repertoire of scorpion, arachnid and chelicerate homeobox genes, but also shows some insights into the evolution of arthropod homeobox genes.

  13. Resolution and reconciliation of non-binary gene trees with transfers, duplications and losses.

    Science.gov (United States)

    Jacox, Edwin; Weller, Mathias; Tannier, Eric; Scornavacca, Celine

    2017-04-01

    Gene trees reconstructed from sequence alignments contain poorly supported branches when the phylogenetic signal in the sequences is insufficient to determine them all. When a species tree is available, the signal of gains and losses of genes can be used to correctly resolve the unsupported parts of the gene history. However finding a most parsimonious binary resolution of a non-binary tree obtained by contracting the unsupported branches is NP-hard if transfer events are considered as possible gene scale events, in addition to gene origination, duplication and loss. We propose an exact, parameterized algorithm to solve this problem in single-exponential time, where the parameter is the number of connected branches of the gene tree that show low support from the sequence alignment or, equivalently, the maximum number of children of any node of the gene tree once the low-support branches have been collapsed. This improves on the best known algorithm by an exponential factor. We propose a way to choose among optimal solutions based on the available information. We show the usability of this principle on several simulated and biological datasets. The results are comparable in quality to several other tested methods having similar goals, but our approach provides a lower running time and a guarantee that the produced solution is optimal. Our algorithm has been integrated into the ecceTERA phylogeny package, available at http://mbb.univ-montp2.fr/MBB/download_sources/16__ecceTERA and which can be run online at http://mbb.univ-montp2.fr/MBB/subsection/softExec.php?soft=eccetera . celine.scornavacca@umontpellier.fr. Supplementary data are available at Bioinformatics online.

  14. Low AMY1 Gene Copy Number Is Associated with Increased Body Mass Index in Prepubertal Boys.

    Directory of Open Access Journals (Sweden)

    M Loredana Marcovecchio

    Full Text Available Genome-wide association studies have identified more than 60 single nucleotide polymorphisms associated with Body Mass Index (BMI. Additional genetic variants, such as copy number variations (CNV, have also been investigated in relation to BMI. Recently, the highly polymorphic CNV in the salivary amylase (AMY1 gene, encoding an enzyme implicated in the first step of starch digestion, has been associated with obesity in adults and children. We assessed the potential association between AMY1 copy number and a wide range of BMI in a population of Italian school-children.744 children (354 boys, 390 girls, mean age (±SD: 8.4±1.4years underwent anthropometric assessments (height, weight and collection of saliva samples for DNA extraction. AMY1 copies were evaluated by quantitative PCR.A significant increase of BMI z-score by decreasing AMY1 copy number was observed in boys (β: -0.117, p = 0.033, but not in girls. Similarly, waist circumference (β: -0.155, p = 0.003, adjusted for age was negatively influenced by AMY1 copy number in boys. Boys with 8 or more AMY1 copy numbers presented a significant lower BMI z-score (p = 0.04 and waist circumference (p = 0.01 when compared to boys with less than 8 copy numbers.In this pediatric-only, population-based study, a lower AMY1 copy number emerged to be associated with increased BMI in boys. These data confirm previous findings from adult studies and support a potential role of a higher copy number of the salivary AMY1 gene in protecting from excess weight gain.

  15. Low AMY1 Gene Copy Number Is Associated with Increased Body Mass Index in Prepubertal Boys

    Science.gov (United States)

    Verginelli, Fabio; De Lellis, Laura; Capelli, Cristian; Verzilli, Delfina; Chiarelli, Francesco; Mohn, Angelika; Cama, Alessandro

    2016-01-01

    Background Genome-wide association studies have identified more than 60 single nucleotide polymorphisms associated with Body Mass Index (BMI). Additional genetic variants, such as copy number variations (CNV), have also been investigated in relation to BMI. Recently, the highly polymorphic CNV in the salivary amylase (AMY1) gene, encoding an enzyme implicated in the first step of starch digestion, has been associated with obesity in adults and children. We assessed the potential association between AMY1 copy number and a wide range of BMI in a population of Italian school-children. Methods 744 children (354 boys, 390 girls, mean age (±SD): 8.4±1.4years) underwent anthropometric assessments (height, weight) and collection of saliva samples for DNA extraction. AMY1 copies were evaluated by quantitative PCR. Results A significant increase of BMI z-score by decreasing AMY1 copy number was observed in boys (β: -0.117, p = 0.033), but not in girls. Similarly, waist circumference (β: -0.155, p = 0.003, adjusted for age) was negatively influenced by AMY1 copy number in boys. Boys with 8 or more AMY1 copy numbers presented a significant lower BMI z-score (p = 0.04) and waist circumference (p = 0.01) when compared to boys with less than 8 copy numbers. Conclusions In this pediatric-only, population-based study, a lower AMY1 copy number emerged to be associated with increased BMI in boys. These data confirm previous findings from adult studies and support a potential role of a higher copy number of the salivary AMY1 gene in protecting from excess weight gain. PMID:27149670

  16. Effects of Gene Duplication, Positive Selection, and Shifts in Gene Expression on the Evolution of the Venom Gland Transcriptome in Widow Spiders.

    Science.gov (United States)

    Haney, Robert A; Clarke, Thomas H; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y; Ayoub, Nadia A; Garb, Jessica E

    2016-01-05

    Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland-specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species.

  17. Phylogenetic relationships among Perissodactyla: secretoglobin 1A1 gene duplication and triplication in the Equidae family.

    Science.gov (United States)

    Côté, Olivier; Viel, Laurent; Bienzle, Dorothee

    2013-12-01

    Secretoglobin family 1A member 1 (SCGB 1A1) is a small anti-inflammatory and immunomodulatory protein that is abundantly secreted in airway surface fluids. We recently reported the existence of three distinct SCGB1A1 genes in the domestic horse genome as opposed to the single gene copy consensus present in other mammals. The origin of SCGB1A1 gene triplication and the evolutionary relationship of the three genes amongst Equidae family members are unknown. For this study, SCGB1A1 genomic data were collected from various Equus individuals including E. caballus, E. przewalskii, E. asinus, E. grevyi, and E. quagga. Three SCGB1A1 genes in E. przewalskii, two SCGB1A1 genes in E. asinus, and a single SCGB1A1 gene in E. grevyi and E. quagga were identified. Sequence analysis revealed that the non-synonymous nucleotide substitutions between the different equid genes coded for 17 amino acid changes. Most of these changes localized to the SCGB 1A1 central cavity that binds hydrophobic ligands, suggesting that this area of SCGB 1A1 evolved to accommodate diverse molecular interactions. Three-dimensional modeling of the proteins revealed that the size of the SCGB 1A1 central cavity is larger than that of SCGB 1A1A. Altogether, these findings suggest that evolution of the SCGB1A1 gene may parallel the separation of caballine and non-caballine species amongst Equidae, and may indicate an expansion of function for SCGB1A1 gene products. Copyright © 2013 Elsevier Inc. All rights reserved.

  18. Copy number variations of 11 macronuclear chromosomes and their gene expression in Oxytricha trifallax.

    Science.gov (United States)

    Xu, Ke; Doak, Thomas G; Lipps, Hans J; Wang, Jingmei; Swart, Estienne C; Chang, Wei-Jen

    2012-08-15

    Ciliated protozoa are peculiar for their nuclear dimorphism, wherein two types of nuclei divide nuclear functions: a germline micronucleus (MIC) is transcriptionally inert during vegetative growth, but serves as the genetic blueprint for the somatic macronucleus (MAC), which is responsible for all transcripts supporting cell growth and reproduction. While all the advantages/disadvantages associated with nuclear dimorphism are not clear, an essential advantage seems to be the ability to produce a highly polyploid MAC, which then allows for the maintenance of extremely large single cells - many ciliate cells are larger than small metazoa. In some ciliate classes, chromosomes in the MAC are extensively fragmented to create extremely short chromosomes that often carry single genes, and these chromosomes may be present in different copy numbers, resulting in different ploidies. While using gene copy number to regulate gene expression is limited in most eukaryotic systems, the extensive fragmentation in some ciliate classes provides this opportunity to every MAC gene. However, it is still unclear if this mechanism is in fact used extensively in these ciliates. To address this, we have quantified copy numbers of 11 MAC chromosomes and their gene expression in Oxytricha trifallax (CI: Spirotrichea). We compared copy numbers between two subpopulations of O. trifallax, and copy numbers of 7 orthologous genes between O. trifallax and the closely related Stylonychia lemnae. We show that copy numbers of MAC chromosomes are variable, dynamic, and positively correlated to gene expression. These features might be conserved in all spirotrichs, and might exist in other classes of ciliates with heavily fragmented MAC chromosomes.

  19. Topoisomerase-1 gene copy aberrations are frequent in patients with breast cancer

    DEFF Research Database (Denmark)

    Kümler, Iben; Balslev, Eva; Poulsen, Tim S.

    2015-01-01

    of TOP1 gene copy gain in BC. The prevalence of TOP1 gene copy gain was investigated by fluorescence in situ hybridization with a TOP1/CEN-20 probemix in normal breast tissue (N=100) and in tissue from patients with metastatic BC in a discovery (N=100) and a validation cohort (N=205). As amplification...... of 20q including CEN-20 is common in BC a TOP1/CEN-2 probemix was applied to the validation cohort. More than 30% of the patients had gene copy numbers of ≥ 4 and approximately 20% of the patients had TOP1/CEN-20 ratios ≥ 1.5. The CEN-2 probe did not add any information. Gain of the TOP1 gene appears...... to be common in BC making the gene a potential biomarker for response to treatment with Top1 inhibitors. As 20q amplification is a common finding in BC and as no other suitable reference gene has yet been identified, TOP1 copy number may be a more valid method of detecting gain than using a gene...

  20. Copy number polymorphism of the salivary amylase gene: implications in human nutrition research.

    Science.gov (United States)

    Santos, J L; Saus, E; Smalley, S V; Cataldo, L R; Alberti, G; Parada, J; Gratacòs, M; Estivill, X

    2012-01-01

    The salivary α-amylase is a calcium-binding enzyme that initiates starch digestion in the oral cavity. The α-amylase genes are located in a cluster on the chromosome that includes salivary amylase genes (AMY1), two pancreatic α-amylase genes (AMY2A and AMY2B) and a related pseudogene. The AMY1 genes show extensive copy number variation which is directly proportional to the salivary α-amylase content in saliva. The α-amylase amount in saliva is also influenced by other factors, such as hydration status, psychosocial stress level, and short-term dietary habits. It has been shown that the average copy number of AMY1 gene is higher in populations that evolved under high-starch diets versus low-starch diets, reflecting an intense positive selection imposed by diet on amylase copy number during evolution. In this context, a number of different aspects can be considered in evaluating the possible impact of copy number variation of the AMY1 gene on nutrition research, such as issues related to human diet gene evolution, action on starch digestion, effect on glycemic response after starch consumption, modulation of the action of α-amylases inhibitors, effect on taste perception and satiety, influence on psychosocial stress and relation to oral health.

  1. The opsin repertoire of Jenynsia onca: a new perspective on gene duplication and divergence in livebearers

    Directory of Open Access Journals (Sweden)

    Owens Gregory L

    2009-08-01

    Full Text Available Abstract Background Jenynsia onca, commonly known as the one sided livebearer, is a member of the family Anablepidae. The opsin gene repertoires of J. onca's close relatives, the four-eyed fish (Anableps anableps and the guppy (Poecilia reticulata, have been characterized and each found to include one unique LWS opsin. Currently, the relationship among LWS paralogs and orthologs in these species are unclear, making it difficult to test the hypotheses that link vision to morphology or life history traits. The phylogenetic signal appears to have been disrupted by gene conversion. Here we have sequenced the opsin genes of J. onca in order to resolve these relationships. Findings We identified nine visual opsins; LWS S180r, LWS S180, LWS P180, SWS1, SWS2A, SWS2B, RH1, RH2-1, and RH2-2. Key site analysis revealed only one unique haplotype, RH2-2, although this is unlikely to shift λmax significantly. LWS P180 was found to be a product of a gene conversion event with LWS S180, followed by convergence to a proline residue at the 180 site. Conclusion Jenynsia onca has at least 9 visual opsins: three LWS, one RH1, two RH2, one SWS1 and two SWS2. The presence of LWS P180 moves the location of the LWS P180-S180 tandem duplication event back to the base of the Poeciliidae-Anablepidae clade, expanding the number of species possessing this unusual blue shifted LWS opsin. The presence of the LWS P180 gene also confirms that gene conversion events have homogenized opsin paralogs in fish, just as they have in humans.

  2. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    Directory of Open Access Journals (Sweden)

    Yong Guo

    Full Text Available The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max. In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  3. Identification of coding exon 3 duplication in the BMPR1A gene in a patient with juvenile polyposis syndrome.

    Science.gov (United States)

    Yamaguchi, Junya; Nagayama, Satoshi; Chino, Akiko; Sakata, Ai; Yamamoto, Noriko; Sato, Yuri; Ashihara, Yuumi; Kita, Mizuho; Nomura, Sachio; Ishikawa, Yuichi; Igarashi, Masahiro; Ueno, Masashi; Arai, Masami

    2014-10-01

    Juvenile polyposis syndrome is an autosomal dominant inherited disorder characterized by multiple juvenile polyps arising in the gastrointestinal tract and an increased risk of gastrointestinal cancers, specifically colon cancer. BMPR1A and SMAD4 germline mutations have been found in patients with juvenile polyposis syndrome. We identified a BMPR1A mutation, which involves a duplication of coding exon 3 (c.230+452_333+441dup1995), on multiple ligation dependent probe amplification in a patient with juvenile polyposis syndrome. The mutation causes a frameshift, producing a truncated protein (p.D112NfsX2). Therefore, the mutation is believed to be pathogenic. We also identified a duplication breakpoint in which Alu sequences are located. These results suggest that the duplication event resulted from recombination between Alu sequences. To our knowledge, partial duplication in the BMPR1A gene has not been reported previously. This is the first case report to document coding exon 3 duplication in the BMPR1A gene in a patient with juvenile polyposis syndrome.

  4. Gyrase activity and number of copies of the gyrase B subunit gene in Haemophilus influenzae

    Energy Technology Data Exchange (ETDEWEB)

    Cabrera-Juarez, E.; Setlow, J.K.

    1985-11-01

    Gyrase activities in extracts of various strains of Haemophilus influenzae can differ by more than an order of magnitude. Measurements of in vitro activity and copy number indicated that most of these differences arose from variations in the number of copies of the gene for the gyrase B subunit, with some strains containing multicopy plasmids coding for that subunit. The quantitative relationship between gyrase and copy number depended on the mutations in the plasmids and in the host. The possibility that the in vivo gyrase activity did not reflect the in vitro data was explored by measurement of alkaline phosphatase and ATPase activity in the extracts. Alkaline phosphatase activity increased with increasing gyrase activity measured in vitro, but ATPase activity did not. The authors conclude that extra supercoiling enhanced transcription of the alkaline phosphatase gene but not the ATPase gene and that it is unlikely that there is much discrepancy between gyrase activity assayed in vitro and the activity in the cell.

  5. The evolution and maintenance of Hox gene clusters in vertebrates and the teleost-specific genome duplication.

    Science.gov (United States)

    Kuraku, Shigehiro; Meyer, Axel

    2009-01-01

    Hox genes are known to specify spatial identities along the anterior-posterior axis during embryogenesis. In vertebrates and most other deuterostomes, they are arranged in sets of uninterrupted clusters on chromosomes, and are in most cases expressed in a "colinear" fashion, in which genes closer to the 3-end of the Hox clusters are expressed earlier and more anteriorly and genes close to the 5-end of the clusters later and more posteriorly. In this review, we summarize the current understanding of how Hox gene clusters have been modified from basal lineages of deuterostomes to diverse taxa of vertebrates. Our parsimony reconstruction of Hox cluster architecture at various stages of vertebrate evolution highlights that the variation in Hox cluster structures among jawed vertebrates is mostly due to secondary lineage-specific gene losses and an additional genome duplication that occurred in the actinopterygian stem lineage, the teleost-specific genome duplication (TSGD).

  6. The role of gene duplication and unconstrained selective pressures in the melanopsin gene family evolution and vertebrate circadian rhythm regulation.

    Science.gov (United States)

    Borges, Rui; Johnson, Warren E; O'Brien, Stephen J; Vasconcelos, Vitor; Antunes, Agostinho

    2012-01-01

    Melanopsin is a photosensitive cell protein involved in regulating circadian rhythms and other non-visual responses to light. The melanopsin gene family is represented by two paralogs, OPN4x and OPN4m, which originated through gene duplication early in the emergence of vertebrates. Here we studied the melanopsin gene family using an integrated gene/protein evolutionary approach, which revealed that the rhabdomeric urbilaterian ancestor had the same amino acid patterns (DRY motif and the Y and E conterions) as extant vertebrate species, suggesting that the mechanism for light detection and regulation is similar to rhabdomeric rhodopsins. Both OPN4m and OPN4x paralogs are found in vertebrate genomic paralogons, suggesting that they diverged following this duplication event about 600 million years ago, when the complex eye emerged in the vertebrate ancestor. Melanopsins generally evolved under negative selection (ω = 0.171) with some minor episodes of positive selection (proportion of sites = 25%) and functional divergence (θ(I) = 0.349 and θ(II) = 0.126). The OPN4m and OPN4x melanopsin paralogs show evidence of spectral divergence at sites likely involved in melanopsin light absorbance (200F, 273S and 276A). Also, following the teleost lineage-specific whole genome duplication (3R) that prompted the teleost fish radiation, type I divergence (θ(I) = 0.181) and positive selection (affecting 11% of sites) contributed to amino acid variability that we related with the photo-activation stability of melanopsin. The melanopsin intracellular regions had unexpectedly high variability in their coupling specificity of G-proteins and we propose that Gq/11 and Gi/o are the two G-proteins most-likely to mediate the melanopsin phototransduction pathway. The selection signatures were mainly observed on retinal-related sites and the third and second intracellular loops, demonstrating the physiological plasticity of the melanopsin protein group. Our results provide new insights on

  7. The role of gene duplication and unconstrained selective pressures in the melanopsin gene family evolution and vertebrate circadian rhythm regulation.

    Directory of Open Access Journals (Sweden)

    Rui Borges

    Full Text Available Melanopsin is a photosensitive cell protein involved in regulating circadian rhythms and other non-visual responses to light. The melanopsin gene family is represented by two paralogs, OPN4x and OPN4m, which originated through gene duplication early in the emergence of vertebrates. Here we studied the melanopsin gene family using an integrated gene/protein evolutionary approach, which revealed that the rhabdomeric urbilaterian ancestor had the same amino acid patterns (DRY motif and the Y and E conterions as extant vertebrate species, suggesting that the mechanism for light detection and regulation is similar to rhabdomeric rhodopsins. Both OPN4m and OPN4x paralogs are found in vertebrate genomic paralogons, suggesting that they diverged following this duplication event about 600 million years ago, when the complex eye emerged in the vertebrate ancestor. Melanopsins generally evolved under negative selection (ω = 0.171 with some minor episodes of positive selection (proportion of sites = 25% and functional divergence (θ(I = 0.349 and θ(II = 0.126. The OPN4m and OPN4x melanopsin paralogs show evidence of spectral divergence at sites likely involved in melanopsin light absorbance (200F, 273S and 276A. Also, following the teleost lineage-specific whole genome duplication (3R that prompted the teleost fish radiation, type I divergence (θ(I = 0.181 and positive selection (affecting 11% of sites contributed to amino acid variability that we related with the photo-activation stability of melanopsin. The melanopsin intracellular regions had unexpectedly high variability in their coupling specificity of G-proteins and we propose that Gq/11 and Gi/o are the two G-proteins most-likely to mediate the melanopsin phototransduction pathway. The selection signatures were mainly observed on retinal-related sites and the third and second intracellular loops, demonstrating the physiological plasticity of the melanopsin protein group. Our results provide new

  8. Plant Genome Duplication Database.

    Science.gov (United States)

    Lee, Tae-Ho; Kim, Junah; Robertson, Jon S; Paterson, Andrew H

    2017-01-01

    Genome duplication, widespread in flowering plants, is a driving force in evolution. Genome alignments between/within genomes facilitate identification of homologous regions and individual genes to investigate evolutionary consequences of genome duplication. PGDD (the Plant Genome Duplication Database), a public web service database, provides intra- or interplant genome alignment information. At present, PGDD contains information for 47 plants whose genome sequences have been released. Here, we describe methods for identification and estimation of dates of genome duplication and speciation by functions of PGDD.The database is freely available at http://chibba.agtec.uga.edu/duplication/.

  9. The evolution and appearance of C3 duplications in fish originate an exclusive teleost c3 gene form with anti-inflammatory activity.

    Directory of Open Access Journals (Sweden)

    Gabriel Forn-Cuní

    Full Text Available The complement system acts as a first line of defense and promotes organism homeostasis by modulating the fates of diverse physiological processes. Multiple copies of component genes have been previously identified in fish, suggesting a key role for this system in aquatic organisms. Herein, we confirm the presence of three different previously reported complement c3 genes (c3.1, c3.2, c3.3 and identify five additional c3 genes (c3.4, c3.5, c3.6, c3.7, c3.8 in the zebrafish genome. Additionally, we evaluate the mRNA expression levels of the different c3 genes during ontogeny and in different tissues under steady-state and inflammatory conditions. Furthermore, while reconciling the phylogenetic tree with the fish species tree, we uncovered an event of c3 duplication common to all teleost fishes that gave rise to an exclusive c3 paralog (c3.7 and c3.8. These paralogs showed a distinct ability to regulate neutrophil migration in response to injury compared with the other c3 genes and may play a role in maintaining the balance between inflammatory and homeostatic processes in zebrafish.

  10. Pyruvate Kinase and Fcγ Receptor Gene Copy Numbers Associated With Malaria Phenotypes.

    Science.gov (United States)

    Faik, Imad; van Tong, Hoang; Lell, Bertrand; Meyer, Christian G; Kremsner, Peter G; Velavan, Thirumalaisamy P

    2017-07-15

    Genetic factors are associated with susceptibility to many infectious diseases and may be determinants of clinical progression. Gene copy number variation (CNV) has been shown to be associated with phenotypes of numerous diseases, including malaria. We quantified gene copy numbers of the pyruvate kinase, liver, and red blood cell (PKLR) gene as well as of the Fcγ receptor 2A and Fcγ receptor 2C (FCGR2A, FCGR2C) and Fcγ receptor 3 (FCGR3) genes using real-time quantitative polymerase chain reaction (RT-qPCR) assays in Gabonese children with severe (n = 184) or and mild (n = 189) malaria and in healthy Gabonese and white individuals (n = 76 each). The means of PKLR, FCGR2A, FCGR2C, and FCGR3 copy numbers were significantly higher among children with severe malaria compared to those with mild malaria (P malaria severity. Copy numbers of the FCGR2A and FCGR2C genes were significantly lower (P = .005) in Gabonese individuals compared with white individuals. In conclusion, CNV of the PKLR, FCGR2A, FCGR2C, and FCGR3 genes is associated with malaria severity, and our results provide evidence for a role of CNV in host responses to malaria. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.

  11. Molecular Characterization of Duplicate Cytosolic Phosphoglucose Isomerase Genes in Clarkia and Comparison to the Single Gene in Arabidopsis

    Science.gov (United States)

    Thomas, B. R.; Ford, V. S.; Pichersky, E.; Gottlieb, L. D.

    1993-01-01

    The nucleotide sequence of PgiC1-a which encodes a cytosolic isozyme of phosphoglucose isomerase (PGIC; EC 5.3.1.9) in Clarkia lewisii, a wildflower native to California, is described and compared to the previously published sequence of the duplicate PgiC2-a from the same genome. Both genes have the same structure of 23 exons and 22 introns located in identical positions, and they encode proteins of 569 amino acids. Exon and inferred protein sequences of the two genes are 96.4% and 97.2% identical, respectively. Intron sequences are 88.2% identical. The high nucleotide similarity of the two genes is consistent with previous genetic and biosystematic findings that suggest the duplication arose within Clarkia. A partial sequence of PgiC2-b was also obtained. It is 99.5% identical to PgiC2-a in exons and 99.7% in introns. The nucleotide sequence of the single PgiC from Arabidopsis thaliana was also determined for comparison to the Clarkia genes. The A. thaliana PgiC has 21 introns located at positions identical to those in Clarkia PgiC1 and PgiC2, but lacks the intron that divides Clarkia exons 21 and 22. The A. thaliana PGIC protein is shorter, with 560 amino acids, and differs by about 17% from the Clarkia PGICs. The PgiC in A. thaliana was mapped to a site 20 cM from restriction fragment length polymorphism marker 331 on chromosome 5. PMID:8293986

  12. Duplication in DNA Sequences

    Science.gov (United States)

    Ito, Masami; Kari, Lila; Kincaid, Zachary; Seki, Shinnosuke

    The duplication and repeat-deletion operations are the basis of a formal language theoretic model of errors that can occur during DNA replication. During DNA replication, subsequences of a strand of DNA may be copied several times (resulting in duplications) or skipped (resulting in repeat-deletions). As formal language operations, iterated duplication and repeat-deletion of words and languages have been well studied in the literature. However, little is known about single-step duplications and repeat-deletions. In this paper, we investigate several properties of these operations, including closure properties of language families in the Chomsky hierarchy and equations involving these operations. We also make progress toward a characterization of regular languages that are generated by duplicating a regular language.

  13. Allelic Polymorphism, Gene Duplication and Balancing Selection of MHC Class IIB Genes in the Omei Treefrog (Rhacophorus omeimontis)

    Institute of Scientific and Technical Information of China (English)

    Li HUANG; Mian ZHAO; Zhenhua LUO; Hua WU

    2016-01-01

    The worldwide declines in amphibian populations have largely been caused by infectious fungi and bacteria. Given that vertebrate immunity against these extracellular pathogens is primarily functioned by the major histocompatibility complex (MHC) class II molecules, the characterization and the evolution of amphibian MHC class II genes have attracted increasing attention. The polymorphism of MHC class II genes was found to be correlated with susceptibility to fungal pathogens in many amphibian species, suggesting the importance of studies on MHC class II genes for amphibians. However, such studies on MHC class II gene evolution have rarely been conducted on amphibians in China. In this study, we chose Omei treefrog (Rhacophorus omeimontis), which lived moist environments easy for breeding bacteria, to study the polymorphism of its MHC class II genes and the underlying evolutionary mechanisms. We amplified the entire MHC class IIB exon 2 sequence in the R. omeimontis using newly designed primers. We detected 102 putative alleles in 146 individuals. The number of alleles per individual ranged from one to seven, indicating that there are at least four loci containing MHC class IIB genes in R. omeimontis. The allelic polymorphism estimated from the 102 alleles in R. omeimontis was not high compared to that estimated in other anuran species. No significant gene recombination was detected in the 102 MHC class IIB exon 2 sequences. In contrast, both gene duplication and balancing selection greatly contributed to the variability in MHC class IIB exon 2 sequences of R. omeimontis. This study lays the groundwork for the future researches to comprehensively analyze the evolution of amphibian MHC genes and to assess the role of MHC gene polymorphisms in resistance against extracellular pathogens for amphibians in China.

  14. Whole-genome duplications spurred the functional diversification of the globin gene superfamily in vertebrates.

    Science.gov (United States)

    Hoffmann, Federico G; Opazo, Juan C; Storz, Jay F

    2012-01-01

    It has been hypothesized that two successive rounds of whole-genome duplication (WGD) in the stem lineage of vertebrates provided genetic raw materials for the evolutionary innovation of many vertebrate-specific features. However, it has seldom been possible to trace such innovations to specific functional differences between paralogous gene products that derive from a WGD event. Here, we report genomic evidence for a direct link between WGD and key physiological innovations in the vertebrate oxygen transport system. Specifically, we demonstrate that key globin proteins that evolved specialized functions in different aspects of oxidative metabolism (hemoglobin, myoglobin, and cytoglobin) represent paralogous products of two WGD events in the vertebrate common ancestor. Analysis of conserved macrosynteny between the genomes of vertebrates and amphioxus (subphylum Cephalochordata) revealed that homologous chromosomal segments defined by myoglobin + globin-E, cytoglobin, and the α-globin gene cluster each descend from the same linkage group in the reconstructed proto-karyotype of the chordate common ancestor. The physiological division of labor between the oxygen transport function of hemoglobin and the oxygen storage function of myoglobin played a pivotal role in the evolution of aerobic energy metabolism, supporting the hypothesis that WGDs helped fuel key innovations in vertebrate evolution.

  15. Mirror-image duplication of the primary axis and heart in Xenopus embryos by the overexpression of Msx-1 gene.

    Science.gov (United States)

    Chen, Y; Solursh, M

    1995-10-01

    The Msx-1 gene (formerly known as Hox-7) is a member of a discrete subclass of homeobox-containing genes. Examination of the expression pattern of Msx-1 in murine and avian embryos suggests that this gene may be involved in the regionalization of the medio-lateral axis during earlier development. We have examined the possible functions of Xenopus Msx-1 during early Xenopus embryonic development by overexpression of the Msx-1 gene. Overexpression of Msx-1 causes a left-right mirror-image duplication of primary axial structures, including notochord, neural tube, somites, suckers, and foregut. The embryonic developing heart is also mirror-image duplicated, including looping directions and polarity. These results indicate that Msx-1 may be involved in the mesoderm formation as well as left-right patterning in the early Xenopus embryonic development.

  16. Evolution of Vertebrate Adam Genes; Duplication of Testicular Adams from Ancient Adam9/9-like Loci.

    Science.gov (United States)

    Bahudhanapati, Harinath; Bhattacharya, Shashwati; Wei, Shuo

    2015-01-01

    Members of the disintegrin metalloproteinase (ADAM) family have important functions in regulating cell-cell and cell-matrix interactions as well as cell signaling. There are two major types of ADAMs: the somatic ADAMs (sADAMs) that have a significant presence in somatic tissues, and the testicular ADAMs (tADAMs) that are expressed predominantly in the testis. Genes encoding tADAMs can be further divided into two groups: group I (intronless) and group II (intron-containing). To date, tAdams have only been reported in placental mammals, and their evolutionary origin and relationship to sAdams remain largely unknown. Using phylogenetic and syntenic tools, we analyzed the Adam genes in various vertebrates ranging from fishes to placental mammals. Our analyses reveal duplication and loss of some sAdams in certain vertebrate species. In particular, there exists an Adam9-like gene in non-mammalian vertebrates but not mammals. We also identified putative group I and group II tAdams in all amniote species that have been examined. These tAdam homologues are more closely related to Adams 9 and 9-like than to other sAdams. In all amniote species examined, group II tAdams lie in close vicinity to Adam9 and hence likely arose from tandem duplication, whereas group I tAdams likely originated through retroposition because of their lack of introns. Clusters of multiple group I tAdams are also common, suggesting tandem duplication after retroposition. Therefore, Adam9/9-like and some of the derived tAdam loci are likely preferred targets for tandem duplication and/or retroposition. Consistent with this hypothesis, we identified a young retroposed gene that duplicated recently from Adam9 in the opossum. As a result of gene duplication, some tAdams were pseudogenized in certain species, whereas others acquired new expression patterns and functions. The rapid duplication of Adam genes has a major contribution to the diversity of ADAMs in various vertebrate species.

  17. Function of Partially Duplicated Human α7 Nicotinic Receptor Subunit CHRFAM7A Gene

    Science.gov (United States)

    de Lucas-Cerrillo, Ana M.; Maldifassi, M. Constanza; Arnalich, Francisco; Renart, Jaime; Atienza, Gema; Serantes, Rocío; Cruces, Jesús; Sánchez-Pacheco, Aurora; Andrés-Mateos, Eva; Montiel, Carmen

    2011-01-01

    The neuronal α7 nicotinic receptor subunit gene (CHRNA7) is partially duplicated in the human genome forming a hybrid gene (CHRFAM7A) with the novel FAM7A gene. The hybrid gene transcript, dupα7, has been identified in brain, immune cells, and the HL-60 cell line, although its translation and function are still unknown. In this study, dupα7 cDNA has been cloned and expressed in GH4C1 cells and Xenopus oocytes to study the pattern and functional role of the expressed protein. Our results reveal that dupα7 transcript was natively translated in HL-60 cells and heterologously expressed in GH4C1 cells and oocytes. Injection of dupα7 mRNA into oocytes failed to generate functional receptors, but when co-injected with α7 mRNA at α7/dupα7 ratios of 5:1, 2:1, 1:1, 1:5, and 1:10, it reduced the nicotine-elicited α7 current generated in control oocytes (α7 alone) by 26, 53, 75, 93, and 94%, respectively. This effect is mainly due to a reduction in the number of functional α7 receptors reaching the oocyte membrane, as deduced from α-bungarotoxin binding and fluorescent confocal assays. Two additional findings open the possibility that the dominant negative effect of dupα7 on α7 receptor activity observed in vitro could be extrapolated to in vivo situations. (i) Compared with α7 mRNA, basal dupα7 mRNA levels are substantial in human cerebral cortex and higher in macrophages. (ii) dupα7 mRNA levels in macrophages are down-regulated by IL-1β, LPS, and nicotine. Thus, dupα7 could modulate α7 receptor-mediated synaptic transmission and cholinergic anti-inflammatory response. PMID:21047781

  18. The fate of tandemly duplicated genes assessed by the expression analysis of a group of Arabidopsis thaliana RING-H2 ubiquitin ligase genes of the ATL family.

    Science.gov (United States)

    Aguilar-Hernández, Victor; Guzmán, Plinio

    2014-03-01

    Gene duplication events exert key functions on gene innovations during the evolution of the eukaryotic genomes. A large portion of the total gene content in plants arose from tandem duplications events, which often result in paralog genes with high sequence identity. Ubiquitin ligases or E3 enzymes are components of the ubiquitin proteasome system that function during the transfer of the ubiquitin molecule to the substrate. In plants, several E3s have expanded in their genomes as multigene families. To gain insight into the consequences of gene duplications on the expansion and diversification of E3s, we examined the evolutionary basis of a cluster of six genes, duplC-ATLs, which arose from segmental and tandem duplication events in Brassicaceae. The assessment of the expression suggested two patterns that are supported by lineage. While retention of expression domains was observed, an apparent absence or reduction of expression was also inferred. We found that two duplC-ATL genes underwent pseudogenization and that, in one case, gene expression is probably regained. Our findings provide insights into the evolution of gene families in plants, defining key events on the expansion of the Arabidopsis Tóxicos en Levadura family of E3 ligases.

  19. S-SCAM, a rare copy number variation gene, induces schizophrenia-related endophenotypes in transgenic mouse model.

    Science.gov (United States)

    Zhang, Nanyan; Zhong, Peng; Shin, Seung Min; Metallo, Jacob; Danielson, Eric; Olsen, Christopher M; Liu, Qing-song; Lee, Sang H

    2015-02-01

    Accumulating genetic evidence suggests that schizophrenia (SZ) is associated with individually rare copy number variations (CNVs) of diverse genes, often specific to single cases. However, the causality of these rare mutations remains unknown. One of the rare CNVs found in SZ cohorts is the duplication of Synaptic Scaffolding Molecule (S-SCAM, also called MAGI-2), which encodes a postsynaptic scaffolding protein controlling synaptic AMPA receptor levels, and thus the strength of excitatory synaptic transmission. Here we report that, in a transgenic mouse model simulating the duplication conditions, elevation of S-SCAM levels in excitatory neurons of the forebrain was sufficient to induce multiple SZ-related endophenotypes. S-SCAM transgenic mice showed an increased number of lateral ventricles and a reduced number of parvalbumin-stained neurons. In addition, the mice exhibited SZ-like behavioral abnormalities, including hyperlocomotor activity, deficits in prepulse inhibition, increased anxiety, impaired social interaction, and working memory deficit. Notably, the S-SCAM transgenic mice showed a unique sex difference in showing these behavioral symptoms, which is reminiscent of human conditions. These behavioral abnormalities were accompanied by hyperglutamatergic function associated with increased synaptic AMPA receptor levels and impaired long-term potentiation. Importantly, reducing glutamate release by the group 2 metabotropic glutamate receptor agonist LY379268 ameliorated the working memory deficits in the transgenic mice, suggesting that hyperglutamatergic function underlies the cognitive functional deficits. Together, these results contribute to validate a causal relationship of the rare S-SCAM CNV and provide supporting evidence for the rare CNV hypothesis in SZ pathogenesis. Furthermore, the S-SCAM transgenic mice provide a valuable new animal model for studying SZ pathogenesis.

  20. Genesis of the vertebrate FoxP subfamily member genes occurred during two ancestral whole genome duplication events.

    Science.gov (United States)

    Song, Xiaowei; Tang, Yezhong; Wang, Yajun

    2016-08-22

    The vertebrate FoxP subfamily genes play important roles in the construction of essential functional modules involved in physiological and developmental processes. To explore the adaptive evolution of functional modules associated with the FoxP subfamily member genes, it is necessary to study the gene duplication process. We detected four member genes of the FoxP subfamily in sea lampreys (a representative species of jawless vertebrates) through genome screenings and phylogenetic analyses. Reliable paralogons (i.e. paralogous chromosome segments) have rarely been detected in scaffolds of FoxP subfamily member genes in sea lampreys due to the considerable existence of HTH_Tnp_Tc3_2 transposases. However, these transposases did not alter gene numbers of the FoxP subfamily in sea lampreys. The coincidence between the "1-4" gene duplication pattern of FoxP subfamily genes from invertebrates to vertebrates and two rounds of ancestral whole genome duplication (1R- and 2R-WGD) events reveal that the FoxP subfamily of vertebrates was quadruplicated in the 1R- and 2R-WGD events. Furthermore, we deduced that a synchronous gene duplication process occurred for the FoxP subfamily and for three linked gene families/subfamilies (i.e. MIT family, mGluR group III and PLXNA subfamily) in the 1R- and 2R-WGD events using phylogenetic analyses and mirror-dendrogram methods (i.e. algorithms to test protein-protein interactions). Specifically, the ancestor of FoxP1 and FoxP3 and the ancestor of FoxP2 and FoxP4 were generated in 1R-WGD event. In the subsequent 2R-WGD event, these two ancestral genes were changed into FoxP1, FoxP2, FoxP3 and FoxP4. The elucidation of these gene duplication processes shed light on the phylogenetic relationships between functional modules of the FoxP subfamily member genes.

  1. Identification and expression analysis of multiple FRO gene copies in Medicago truncatula.

    Science.gov (United States)

    Del C Orozco-Mosqueda, Ma; Santoyo, G; Farías-Rodríguez, R; Macías-Rodríguez, L; Valencia-Cantero, E

    2012-12-17

    Iron (Fe) is an essential element for plant growth. Commonly, this element is found in an oxidized form in soil, which is poorly available for plants. Therefore, plants have evolved ferric-chelate reductase enzymes (FRO) to reduce iron into a more soluble ferrous form. Fe scarcity in plants induce the FRO enzyme activity. Although the legume Medicago truncatula has been employed as a model for FRO activity studies, only one copy of the M. truncatula MtFRO1 gene has been characterized so far. In this study, we identified multiple gene copies of the MtFRO gene in the genome of M. truncatula by an in silico search, using BLAST analysis in the database of the M. truncatula Genome Sequencing Project and the National Center for Biotechnology Information, and also determined whether they are functional. We identified five genes apart from MtFRO1, which had been already characterized. All of the MtFRO genes exhibited high identity with homologous FRO genes from Lycopersicon esculentum, Citrus junos and Arabidopsis thaliana. The gene copies also presented characteristic conserved FAD and NADPH motifs, transmembrane regions and oxidoreductase signature motifs. We also detected expression in five of the putative MtFRO sequences by semiquantitative RT-PCR analysis, performed with mRNA from root and shoot tissues. Iron scarcity might be a condition for an elevated expression of the MtFRO genes observed in different M. truncatula tissues.

  2. Deletion of a single-copy DAAM1 gene in congenital heart defect: a case report

    Directory of Open Access Journals (Sweden)

    Bao Bihui

    2012-08-01

    Full Text Available Abstract Background With an increasing incidence of congenital heart defects (CHDs in recent years, genotype-phenotype correlation and array-based methods have contributed to the genome-wide analysis and understanding of genetic variations in the CHD population. Here, we report a copy number deletion of chromosomal 14q23.1 in a female fetus with complex congenital heart defects. This is the first description of DAAM1 gene deletion associated with congenital heart anomalies. Case Presentation Compared with the control population, one CHD fetus showed a unique copy number deletion of 14q23.1, a region that harbored DAAM1 and KIAA0666 genes. Conclusions Results suggest that the copy number deletion on chromosome 14q23.1 may be critical for cardiogenesis. However, the exact relationship and mechanism of how DAAM1 and KIAA0666 deletion contributes to the onset of CHD is yet to be determined.

  3. Closely linked H2B genes in the marine copepod, Tigriopus californicus indicate a recent gene duplication or gene conversion event.

    Science.gov (United States)

    Brown, D; Cook, A; Wagner, M; Wells, D

    1992-01-01

    Two nonallelic histone gene clusters were characterized in the marine copepod, Tigriopus californicus. The DNA sequence of one of the clusters reveals six genes in the contiguous arrangement of H2B, H1, H3, H4, H2B and H2A. The order of genes within the second cluster is H3, H4, H2B and H2A. There is no evidence for the presence of an H1 gene in this cluster. Comparison of the three copepod H2B genes reveals a high degree of similarity between the 5' upstream regions and between the amino terminal halves of the two H2B genes found within the same cluster. From these data we infer that gene duplication and/or gene conversion events occurred within this cluster in the recent past.

  4. Biological Consequences of Ancient Gene Acquisition and Duplication in the Large Genome of Candidatus Solibacter usitatus Ellin6076

    Energy Technology Data Exchange (ETDEWEB)

    Challacombe, Jean F [ORNL; Eichorst, Stephanie A [Los Alamos National Laboratory (LANL); Hauser, Loren John [ORNL; Land, Miriam L [ORNL; Xie, Gary [Los Alamos National Laboratory (LANL); Kuske, Cheryl R [Los Alamos National Laboratory (LANL)

    2011-01-01

    Members of the bacterial phylum Acidobacteria are widespread in soils and sediments worldwide, and are abundant in many soils. Acidobacteria are challenging to culture in vitro, and many basic features of their biology and functional roles in the soil have not been determined. Candidatus Solibacter usitatus strain Ellin6076 has a 9.9 Mb genome that is approximately 2 5 times as large as the other sequenced Acidobacteria genomes. Bacterial genome sizes typically range from 0.5 to 10 Mb and are influenced by gene duplication, horizontal gene transfer, gene loss and other evolutionary processes. Our comparative genome analyses indicate that the Ellin6076 large genome has arisen by horizontal gene transfer via ancient bacteriophage and/or plasmid-mediated transduction, and widespread small-scale gene duplications, resulting in an increased number of paralogs. Low amino acid sequence identities among functional group members, and lack of conserved gene order and orientation in regions containing similar groups of paralogs, suggest that most of the paralogs are not the result of recent duplication events. The genome sizes of additional cultured Acidobacteria strains were estimated using pulsed-field gel electrophoresis to determine the prevalence of the large genome trait within the phylum. Members of subdivision 3 had larger genomes than those of subdivision 1, but none were as large as the Ellin6076 genome. The large genome of Ellin6076 may not be typical of the phylum, and encodes traits that could provide a selective metabolic, defensive and regulatory advantage in the soil environment.

  5. Biological consequences of ancient gene acquisition and duplication in the large genome soil bacterium, ""solibacter usitatus"" strain Ellin6076

    Energy Technology Data Exchange (ETDEWEB)

    Challacombe, Jean F [Los Alamos National Laboratory; Eichorst, Stephanie A [Los Alamos National Laboratory; Xie, Gary [Los Alamos National Laboratory; Kuske, Cheryl R [Los Alamos National Laboratory; Hauser, Loren [ORNL; Land, Miriam [ORNL

    2009-01-01

    Bacterial genome sizes range from ca. 0.5 to 10Mb and are influenced by gene duplication, horizontal gene transfer, gene loss and other evolutionary processes. Sequenced genomes of strains in the phylum Acidobacteria revealed that 'Solibacter usistatus' strain Ellin6076 harbors a 9.9 Mb genome. This large genome appears to have arisen by horizontal gene transfer via ancient bacteriophage and plasmid-mediated transduction, as well as widespread small-scale gene duplications. This has resulted in an increased number of paralogs that are potentially ecologically important (ecoparalogs). Low amino acid sequence identities among functional group members and lack of conserved gene order and orientation in the regions containing similar groups of paralogs suggest that most of the paralogs were not the result of recent duplication events. The genome sizes of cultured subdivision 1 and 3 strains in the phylum Acidobacteria were estimated using pulsed-field gel electrophoresis to determine the prevalence of the large genome trait within the phylum. Members of subdivision 1 were estimated to have smaller genome sizes ranging from ca. 2.0 to 4.8 Mb, whereas members of subdivision 3 had slightly larger genomes, from ca. 5.8 to 9.9 Mb. It is hypothesized that the large genome of strain Ellin6076 encodes traits that provide a selective metabolic, defensive and regulatory advantage in the variable soil environment.

  6. Copy number variation of age-related macular degeneration relevant genes in the Korean population.

    Directory of Open Access Journals (Sweden)

    Jung Hyun Park

    Full Text Available PURPOSE: Studies that analyzed single nucleotide polymorphisms (SNP in various genes have shown that genetic factors are strongly associated with age-related macular degeneration (AMD susceptibility. Copy number variation (CNV may be an additional type of genetic variation that contributes to AMD pathogenesis. This study investigated CNV in 4 AMD-relevant genes in Korean AMD patients and control subjects. METHODS: Four CNV candidate regions located in AMD-relevant genes (VEGFA, ARMS2/HTRA1, CFH and VLDLR, were selected based on the outcomes of our previous study which elucidated common CNVs in the Asian populations. Real-time PCR based TaqMan Copy Number Assays were performed on CNV candidates in 273 AMD patients and 257 control subjects. RESULTS: The predicted copy number (PCN, 0, 1, 2 or 3+ of each region was called using the CopyCaller program. All candidate genes except ARMS2/HTRA1 showed CNV in at least one individual, in which losses of VEGFA and VLDLR represent novel findings in the Asian population. When the frequencies of PCN were compared, only the gain in VLDLR showed significant differences between AMD patients and control subjects (p = 0.025. Comparisons of the raw copy values (RCV revealed that 3 of 4 candidate genes showed significant differences (2.03 vs. 1.92 for VEGFA, p<0.01; 2.01 vs. 1.97 for CFH, p<0.01; 1.97 vs. 2.01, p<0.01 for ARMS2/HTRA1. CONCLUSION: CNVs located in AMD-relevant genes may be associated with AMD susceptibility. Further investigations encompassing larger patient cohorts are needed to elucidate the role of CNV in AMD pathogenesis.

  7. Alpha-defensin DEFA1A3 gene copy number elevation in Danish Crohn's disease patients

    DEFF Research Database (Denmark)

    Jespersgaard, Cathrine; Fode, Peder; Dybdahl, Marianne

    2011-01-01

    BACKGROUND AND PURPOSE OF STUDY: Extensive copy number variation is observed for the DEFA1A3 gene encoding alpha-defensins 1-3. The objective of this study was to determine the involvement of alpha-defensins in colonic tissue from Crohn's disease (CD) patients and the possible genetic association...

  8. Dietary Variation and Evolution of Gene Copy Number among Dog Breeds.

    Science.gov (United States)

    Reiter, Taylor; Jagoda, Evelyn; Capellini, Terence D

    2016-01-01

    Prolonged human interactions and artificial selection have influenced the genotypic and phenotypic diversity among dog breeds. Because humans and dogs occupy diverse habitats, ecological contexts have likely contributed to breed-specific positive selection. Prior to the advent of modern dog-feeding practices, there was likely substantial variation in dietary landscapes among disparate dog breeds. As such, we investigated one type of genetic variant, copy number variation, in three metabolic genes: glucokinase regulatory protein (GCKR), phytanol-CoA 2-hydroxylase (PHYH), and pancreatic α-amylase 2B (AMY2B). These genes code for proteins that are responsible for metabolizing dietary products that originate from distinctly different food types: sugar, meat, and starch, respectively. After surveying copy number variation among dogs with diverse dietary histories, we found no correlation between diet and positive selection in either GCKR or PHYH. Although it has been previously demonstrated that dogs experienced a copy number increase in AMY2B relative to wolves during or after the dog domestication process, we demonstrate that positive selection continued to act on amylase copy number in dog breeds that consumed starch-rich diets in time periods after domestication. Furthermore, we found that introgression with wolves is not responsible for deterioration of positive selection on AMY2B among diverse dog breeds. Together, this supports the hypothesis that the amylase copy number expansion is found universally in dogs.

  9. Dietary Variation and Evolution of Gene Copy Number among Dog Breeds.

    Directory of Open Access Journals (Sweden)

    Taylor Reiter

    Full Text Available Prolonged human interactions and artificial selection have influenced the genotypic and phenotypic diversity among dog breeds. Because humans and dogs occupy diverse habitats, ecological contexts have likely contributed to breed-specific positive selection. Prior to the advent of modern dog-feeding practices, there was likely substantial variation in dietary landscapes among disparate dog breeds. As such, we investigated one type of genetic variant, copy number variation, in three metabolic genes: glucokinase regulatory protein (GCKR, phytanol-CoA 2-hydroxylase (PHYH, and pancreatic α-amylase 2B (AMY2B. These genes code for proteins that are responsible for metabolizing dietary products that originate from distinctly different food types: sugar, meat, and starch, respectively. After surveying copy number variation among dogs with diverse dietary histories, we found no correlation between diet and positive selection in either GCKR or PHYH. Although it has been previously demonstrated that dogs experienced a copy number increase in AMY2B relative to wolves during or after the dog domestication process, we demonstrate that positive selection continued to act on amylase copy number in dog breeds that consumed starch-rich diets in time periods after domestication. Furthermore, we found that introgression with wolves is not responsible for deterioration of positive selection on AMY2B among diverse dog breeds. Together, this supports the hypothesis that the amylase copy number expansion is found universally in dogs.

  10. Human-specific duplication and mosaic transcripts: the recent paralogous structure of chromosome 22.

    Science.gov (United States)

    Bailey, Jeffrey A; Yavor, Amy M; Viggiano, Luigi; Misceo, Doriana; Horvath, Juliann E; Archidiacono, Nicoletta; Schwartz, Stuart; Rocchi, Mariano; Eichler, Evan E

    2002-01-01

    In recent decades, comparative chromosomal banding, chromosome painting, and gene-order studies have shown strong conservation of gross chromosome structure and gene order in mammals. However, findings from the human genome sequence suggest an unprecedented degree of recent (homologous duplications (> or = 1 kb and > or = 90%) on chromosome 22. Overall, 10.8% (3.7/33.8 Mb) of chromosome 22 is duplicated, with an average sequence identity of 95.4%. To organize the duplications into tractable units, intron-exon structure and well-defined duplication boundaries were used to define 78 duplicated modules (minimally shared evolutionary segments) with 157 copies on chromosome 22. Analysis of these modules provides evidence for the creation or modification of 11 novel transcripts. Comparative FISH analyses of human, chimpanzee, gorilla, orangutan, and macaque reveal qualitative and quantitative differences in the distribution of these duplications--consistent with their recent origin. Several duplications appear to be human specific, including a approximately 400-kb duplication (99.4%-99.8% sequence identity) that transposed from chromosome 14 to the most proximal pericentromeric region of chromosome 22. Experimental and in silico data further support a pericentromeric gradient of duplications where the most recent duplications transpose adjacent to the centromere. Taken together, these data suggest that segmental duplications have been an ongoing process of primate genome evolution, contributing to recent gene innovation and the dynamic transformation of genome architecture within and among closely related species.

  11. MAP kinase pathway gene copy alterations in NRAS/BRAF wild-type advanced melanoma.

    Science.gov (United States)

    Orouji, Elias; Orouji, Azadeh; Gaiser, Timo; Larribère, Lionel; Gebhardt, Christoffer; Utikal, Jochen

    2016-05-01

    Recent therapeutic advances have improved melanoma patientś clinical outcome. Novel therapeutics targeting BRAF, NRAS and cKit mutant melanomas are widely used in clinical practice. However therapeutic options in NRAS(wild-type) /BRAF(wild-type) /cKit(wild-type) melanoma patients are limited. Our study shows that gene copy numbers of members of the MAPK signaling pathway vary in different melanoma subgroups. NRAS(wild-type) /BRAF(wild-type) melanoma metastases are characterized by significant gains of MAP2K1 (MEK1) and MAPK3 (ERK1) gene loci. These additional gene copies could lead to an activation of the MAPK signaling pathway via a gene-dosage effect. Our results suggest that downstream analyses of the pMEK and pERK expression status in NRAS(wild-type) /BRAF(wild-type) melanoma patients identify patients that could benefit from targeted therapies with MEK and ERK inhibitors.

  12. Incorporating 16S Gene Copy Number Information Improves Estimates of Microbial Diversity and Abundance

    Science.gov (United States)

    Kembel, Steven W.; Wu, Martin; Eisen, Jonathan A.; Green, Jessica L.

    2012-01-01

    The abundance of different SSU rRNA (“16S”) gene sequences in environmental samples is widely used in studies of microbial ecology as a measure of microbial community structure and diversity. However, the genomic copy number of the 16S gene varies greatly – from one in many species to up to 15 in some bacteria and to hundreds in some microbial eukaryotes. As a result of this variation the relative abundance of 16S genes in environmental samples can be attributed both to variation in the relative abundance of different organisms, and to variation in genomic 16S copy number among those organisms. Despite this fact, many studies assume that the abundance of 16S gene sequences is a surrogate measure of the relative abundance of the organisms containing those sequences. Here we present a method that uses data on sequences and genomic copy number of 16S genes along with phylogenetic placement and ancestral state estimation to estimate organismal abundances from environmental DNA sequence data. We use theory and simulations to demonstrate that 16S genomic copy number can be accurately estimated from the short reads typically obtained from high-throughput environmental sequencing of the 16S gene, and that organismal abundances in microbial communities are more strongly correlated with estimated abundances obtained from our method than with gene abundances. We re-analyze several published empirical data sets and demonstrate that the use of gene abundance versus estimated organismal abundance can lead to different inferences about community diversity and structure and the identity of the dominant taxa in microbial communities. Our approach will allow microbial ecologists to make more accurate inferences about microbial diversity and abundance based on 16S sequence data. PMID:23133348

  13. U3 snoRNA genes are multi-copy and frequently linked to U5 snRNA genes in Euglena gracilis§

    Directory of Open Access Journals (Sweden)

    Charette J Michael

    2009-11-01

    Full Text Available Abstract Background U3 snoRNA is a box C/D small nucleolar RNA (snoRNA involved in the processing events that liberate 18S rRNA from the ribosomal RNA precursor (pre-rRNA. Although U3 snoRNA is present in all eukaryotic organisms, most investigations of it have focused on fungi (particularly yeasts, animals and plants. Relatively little is known about U3 snoRNA and its gene(s in the phylogenetically broad assemblage of protists (mostly unicellular eukaryotes. In the euglenozoon Euglena gracilis, a distant relative of the kinetoplastid protozoa, Southern analysis had previously revealed at least 13 bands hybridizing with U3 snoRNA, suggesting the existence of multiple copies of U3 snoRNA genes. Results Through screening of a λ genomic library and PCR amplification, we recovered 14 U3 snoRNA gene variants, defined by sequence heterogeneities that are mostly located in the U3 3'-stem-loop domain. We identified three different genomic arrangements of Euglena U3 snoRNA genes: i stand-alone, ii linked to tRNAArg genes, and iii linked to a U5 snRNA gene. In arrangement ii, the U3 snoRNA gene is positioned upstream of two identical tRNAArg genes that are convergently transcribed relative to the U3 gene. This scenario is reminiscent of a U3 snoRNA-tRNA gene linkage previously described in trypanosomatids. We document here twelve different U3 snoRNA-U5 snRNA gene arrangements in Euglena; in each case, the U3 gene is linked to a downstream and convergently oriented U5 gene, with the intergenic region differing in length and sequence among the variants. Conclusion The multiple U3 snoRNA-U5 snRNA gene linkages, which cluster into distinct families based on sequence similarities within the intergenic spacer, presumably arose by genome, chromosome, and/or locus duplications. We discuss possible reasons for the existence of the unusually large number of U3 snoRNA genes in the Euglena genome. Variability in the signal intensities of the multiple Southern

  14. Somatic Copy Number Alterations at Oncogenic Loci Show Diverse Correlations with Gene Expression

    Science.gov (United States)

    Roszik, Jason; Wu, Chang-Jiun; Siroy, Alan E.; Lazar, Alexander J.; Davies, Michael A.; Woodman, Scott E.; Kwong, Lawrence N.

    2016-01-01

    Somatic copy number alterations (SCNAs) affecting oncogenic drivers have a firmly established role in promoting cancer. However, no agreed-upon standard exists for calling locus-specific amplifications and deletions in each patient sample. Here, we report the correlative analysis of copy number amplitude and length with gene expression across 6,109 samples from The Cancer Genome Atlas (TCGA) dataset across 16 cancer types. Using specificity, sensitivity, and precision-based scores, we assigned optimized amplitude and length cutoffs for nine recurrent SCNAs affecting known oncogenic drivers, using mRNA expression as a functional readout. These cutoffs captured the majority of SCNA-driven, highly-expression-altered samples. The majority of oncogenes required only amplitude cutoffs, as high amplitude samples were almost invariably focal; however, CDKN2A and PTEN uniquely required both amplitude and length cutoffs as primary predictors. For PTEN, these extended to downstream AKT activation. In contrast, SCNA genes located peri-telomerically or in fragile sites showed poor expression-copy number correlations. Overall, our analyses identify optimized amplitude and length cutoffs as efficient predictors of gene expression changes for specific oncogenic SCNAs, yet warn against one-size-fits-all interpretations across all loci. Our results have implications for cancer data analyses and the clinic, where copy number and mutation data are increasingly used to personalize cancer therapy.

  15. Construction and function of recombinant AcMNPV with double copies of v-cath gene

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Two recombinant baculoviruses, dciAcMNPV and dcdAcMNPV in which another copy of the v-cath gene controlled by ie1 promoter and polh promoter was inserted, were respectively constructed by the Bac-to-Bac system. The expression of the v-cath gene of the recombinant baculoviruses in Sf9 cells at different phases was investigated by SDS- PAGE and Western blot. The results showed that only recombinant virus dciAcMNPV containing late gene v-cath driven by early gene promoter could express V-CATH protein, cathepsin encoded by virus genome, 12 h post-infection and dcdAcMNPV containing late gene v-cath driven by late and very late gene promoters could express more V-CATH protein. Negative control ncAcMNPV, a mutant deleted v- cath gene, could not express V-CATH protein at all. The Spodopera exigua larvae were infected with viruses respectively and the results showed that the toxicity was as follows: dcdAcMNPV>dciAcMNPV>wtAcMNPV>ncAcMNPV. The toxicity of recombinant viruses and the characters of dead larvae showed that the v-cath gene was relative to viral toxicity and host liquefaction. Recombinant baculovirus dcdAcMNPV might be used as a new kind of safe viral-pes- ticide, because of its high toxicity obtained by adding another gene copy and changing the expression level of its own gene relative to virulence.

  16. Probing the evolution of biological nitrogen fixation by examining phylogenetic relationships of nitrogen fixation genes related by gene duplication

    Science.gov (United States)

    Peters, J.; Boyd, E. S.; Hamilton, T.

    2011-12-01

    Mounting evidence indicates the presence of a near complete biological nitrogen cycle in redox stratified oceans during the late Archean to early Proterozoic (~2.5 to 2.0 Ga). It has been suggested that the iron (Fe)-only or vanadium (V)-dependent alternative forms of nitrogenase rather than molybdenum (Mo)-dependent form was responsible for dinitrogen (N2) fixation during this time because oceans were depleted in Mo and rich in Fe. However, the only extant nitrogen fixing organisms that harbor alternative nitrogenases also harbor a Mo-dependent nitrogenase. Furthermore, our recent global gene expression analysis revealed that the alternative enzymes rely on genes encoding biosynthetic machinery to assemble active enzymes that are associated with the Mo-dependent nitrogenase. In our recent work we conducted an in-depth phylogenetic analysis of the proteins required for molybdenum (Mo)-nitrogenase that arose from gene fusion and duplication, expanding on previous analyses of single gene loci and multiple gene loci. The results of this analysis are highly suggestive that Mo-nitrogenase is unlikely to have been associated with the last universal common ancestor (LUCA). Rather, the oldest extant organisms harboring Mo-nitrogenase can be traced to hydrogenotrophic methanogens with acquisition in the bacterial domain via lateral gene transfer involving an anaerobic member of the Firmicutes. An origin and ensuing proliferation of Mo-nitrogenase under anoxic conditions would likely have occurred in an environment where anaerobic methanogens and Firmicutes coexisted and where Mo was at least episodically available, such as in a redox stratified Proterozoic ocean basin. In more recent work we have examined the hypothesis that the alternative forms predate the Mo-dependent nitrogenase by examining the phylogenetic relationships of the genetically distinct structural proteins of the Fe-only, V-, and Mo-nitrogenase that are required for activity. As a result, a clear and

  17. New insights into the nutritional regulation of gluconeogenesis in carnivorous rainbow trout (Oncorhynchus mykiss): a gene duplication trail.

    Science.gov (United States)

    Marandel, Lucie; Seiliez, Iban; Véron, Vincent; Skiba-Cassy, Sandrine; Panserat, Stéphane

    2015-07-01

    The rainbow trout (Oncorhynchus mykiss) is considered to be a strictly carnivorous fish species that is metabolically adapted for high catabolism of proteins and low utilization of dietary carbohydrates. This species consequently has a "glucose-intolerant" phenotype manifested by persistent hyperglycemia when fed a high-carbohydrate diet. Gluconeogenesis in adult fish is also poorly, if ever, regulated by carbohydrates, suggesting that this metabolic pathway is involved in this specific phenotype. In this study, we hypothesized that the fate of duplicated genes after the salmonid-specific 4th whole genome duplication (Ss4R) may have led to adaptive innovation and that their study might provide new elements to enhance our understanding of gluconeogenesis and poor dietary carbohydrate use in this species. Our evolutionary analysis of gluconeogenic genes revealed that pck1, pck2, fbp1a, and g6pca were retained as singletons after Ss4r, while g6pcb1, g6pcb2, and fbp1b ohnolog pairs were maintained. For all genes, duplication may have led to sub- or neofunctionalization. Expression profiles suggest that the gluconeogenesis pathway remained active in trout fed a no-carbohydrate diet. When trout were fed a high-carbohydrate diet (30%), most of the gluconeogenic genes were non- or downregulated, except for g6pbc2 ohnologs, whose RNA levels were surprisingly increased. This study demonstrates that Ss4R in trout involved adaptive innovation via gene duplication and via the outcome of the resulting ohnologs. Indeed, maintenance of ohnologous g6pcb2 pair may contribute in a significant way to the glucose-intolerant phenotype of trout and may partially explain its poor use of dietary carbohydrates.

  18. Genome-wide copy number profiling to detect gene amplifications in neural progenitor cells

    Directory of Open Access Journals (Sweden)

    U. Fischer

    2014-12-01

    Full Text Available DNA sequence amplification occurs at defined stages during normal development in amphibians and flies and seems to be restricted in humans to drug-resistant and tumor cells only. We used array-CGH to discover copy number changes including gene amplifications and deletions during differentiation of human neural progenitor cells. Here, we describe cell culture features, DNA extraction, and comparative genomic hybridization (CGH analysis tailored towards the identification of genomic copy number changes. Further detailed analysis of amplified chromosome regions associated with this experiment, was published by Fischer and colleagues in PLOS One in 2012 (Fischer et al., 2012. We provide detailed information on deleted chromosome regions during differentiation and give an overview on copy number changes during differentiation induction for two representative chromosome regions.

  19. Polymorphic segmental duplications at 8p23.1 challenge the determination of individual defensin gene repertoires and the assembly of a contiguous human reference sequence

    Directory of Open Access Journals (Sweden)

    Loncarevic Ivan F

    2004-12-01

    Full Text Available Abstract Background Defensins are important components of innate immunity to combat bacterial and viral infections, and can even elicit antitumor responses. Clusters of defensin (DEF genes are located in a 2 Mb range of the human chromosome 8p23.1. This DEF locus, however, represents one of the regions in the euchromatic part of the final human genome sequence which contains segmental duplications, and recalcitrant gaps indicating high structural dynamics. Results We find that inter- and intraindividual genetic variations within this locus prevent a correct automatic assembly of the human reference genome (NCBI Build 34 which currently even contains misassemblies. Manual clone-by-clone alignment and gene annotation as well as repeat and SNP/haplotype analyses result in an alternative alignment significantly improving the DEF locus representation. Our assembly better reflects the experimentally verified variability of DEF gene and DEF cluster copy numbers. It contains an additional DEF cluster which we propose to reside between two already known clusters. Furthermore, manual annotation revealed a novel DEF gene and several pseudogenes expanding the hitherto known DEF repertoire. Analyses of BAC and working draft sequences of the chimpanzee indicates that its DEF region is also complex as in humans and DEF genes and a cluster are multiplied. Comparative analysis of human and chimpanzee DEF genes identified differences affecting the protein structure. Whether this might contribute to differences in disease susceptibility between man and ape remains to be solved. For the determination of individual DEF gene repertoires we provide a molecular approach based on DEF haplotypes. Conclusions Complexity and variability seem to be essential genomic features of the human DEF locus at 8p23.1 and provides an ongoing challenge for the best possible representation in the human reference sequence. Dissection of paralogous sequence variations, duplicon SNPs ans

  20. Copy Number Deletion Has Little Impact on Gene Expression Levels in Racehorses

    Directory of Open Access Journals (Sweden)

    Kyung-Do Park

    2014-09-01

    Full Text Available Copy number variations (CNVs, important genetic factors for study of human diseases, may have as large of an effect on phenotype as do single nucleotide polymorphisms. Indeed, it is widely accepted that CNVs are associated with differential disease susceptibility. However, the relationships between CNVs and gene expression have not been characterized in the horse. In this study, we investigated the effects of copy number deletion in the blood and muscle transcriptomes of Thoroughbred racing horses. We identified a total of 1,246 CNVs of deletion polymorphisms using DNA re-sequencing data from 18 Thoroughbred racing horses. To discover the tendencies between CNV status and gene expression levels, we extracted CNVs of four Thoroughbred racing horses of which RNA sequencing was available. We found that 252 pairs of CNVs and genes were associated in the four horse samples. We did not observe a clear and consistent relationship between the deletion status of CNVs and gene expression levels before and after exercise in blood and muscle. However, we found some pairs of CNVs and associated genes that indicated relationships with gene expression levels: a positive relationship with genes responsible for membrane structure or cytoskeleton and a negative relationship with genes involved in disease. This study will lead to conceptual advances in understanding the relationship between CNVs and global gene expression in the horse.

  1. Spectrum of EGFR gene copy number changes and KRAS gene mutation status in Korean triple negative breast cancer patients.

    Directory of Open Access Journals (Sweden)

    Yoonjung Kim

    Full Text Available Anti-epidermal growth factor receptor (EGFR therapy has been tried in triple negative breast cancer (TNBC patients without evaluation of molecular and clinical predictors in several randomized clinical studies. Only fewer than 20% of metastatic TNBCs showed response to anti-EGFR therapy. In order to increase the overall response rate, first step would be to classify TNBC into good or poor responders according to oncogenic mutation profiles. This study provides the molecular characteristics of TNBCs including EGFR gene copy number changes and mutation status of EGFR and KRAS gene in Korean TNBC patients. Mutation analysis for EGFR, KRAS, BRAF and TP53 from a total of 105 TNBC tissue samples was performed by direct sequencing, peptide nucleic acid-mediated PCR clamping method and real-time PCR. Copy number changes of EGFR gene were evaluated using multiplex ligation-dependent probe amplification. Out of all 105 TNBCs, 15.2% (16/105 showed EGFR copy number changes. Among them, increased or decreased EGFR copy number was detected in 13 (5 single copy gain, 2 amplification and 4 high-copy number amplification and 3 cases (3 hemizygous deletion, respectively. The mutation frequencies of KRAS, EGFR and TP53 gene were 1.9% (G12V and G12D, 1.0% (exon 19 del and 31.4%, respectively. There was no BRAF V600E mutation found. Future studies are needed to evaluate the clinical outcomes of TNBC patients who undergo anti-EGFR therapy according to the genetic status of EGFR.

  2. Evolution of CONSTANS Regulation and Function after Gene Duplication Produced a Photoperiodic Flowering Switch in the Brassicaceae.

    Science.gov (United States)

    Simon, Samson; Rühl, Mark; de Montaigu, Amaury; Wötzel, Stefan; Coupland, George

    2015-09-01

    Environmental control of flowering allows plant reproduction to occur under optimal conditions and facilitates adaptation to different locations. At high latitude, flowering of many plants is controlled by seasonal changes in day length. The photoperiodic flowering pathway confers this response in the Brassicaceae, which colonized temperate latitudes after divergence from the Cleomaceae, their subtropical sister family. The CONSTANS (CO) transcription factor of Arabidopsis thaliana, a member of the Brassicaceae, is central to the photoperiodic flowering response and shows characteristic patterns of transcription required for day-length sensing. CO is believed to be widely conserved among flowering plants; however, we show that it arose after gene duplication at the root of the Brassicaceae followed by divergence of transcriptional regulation and protein function. CO has two close homologs, CONSTANS-LIKE1 (COL1) and COL2, which are related to CO by tandem duplication and whole-genome duplication, respectively. The single CO homolog present in the Cleomaceae shows transcriptional and functional features similar to those of COL1 and COL2, suggesting that these were ancestral. We detect cis-regulatory and codon changes characteristic of CO and use transgenic assays to demonstrate their significance in the day-length-dependent activation of the CO target gene FLOWERING LOCUS T. Thus, the function of CO as a potent photoperiodic flowering switch evolved in the Brassicaceae after gene duplication. The origin of CO may have contributed to the range expansion of the Brassicaceae and suggests that in other families CO genes involved in photoperiodic flowering arose by convergent evolution.

  3. Clinical Omics Analysis of Colorectal Cancer Incorporating Copy Number Aberrations and Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Tsuyoshi Yoshida

    2010-07-01

    Full Text Available Background: Colorectal cancer (CRC is one of the most frequently occurring cancers in Japan, and thus a wide range of methods have been deployed to study the molecular mechanisms of CRC. In this study, we performed a comprehensive analysis of CRC, incorporating copy number aberration (CRC and gene expression data. For the last four years, we have been collecting data from CRC cases and organizing the information as an “omics” study by integrating many kinds of analysis into a single comprehensive investigation. In our previous studies, we had experienced difficulty in finding genes related to CRC, as we observed higher noise levels in the expression data than in the data for other cancers. Because chromosomal aberrations are often observed in CRC, here, we have performed a combination of CNA analysis and expression analysis in order to identify some new genes responsible for CRC. This study was performed as part of the Clinical Omics Database Project at Tokyo Medical and Dental University. The purpose of this study was to investigate the mechanism of genetic instability in CRC by this combination of expression analysis and CNA, and to establish a new method for the diagnosis and treatment of CRC. Materials and methods: Comprehensive gene expression analysis was performed on 79 CRC cases using an Affymetrix Gene Chip, and comprehensive CNA analysis was performed using an Affymetrix DNA Sty array. To avoid the contamination of cancer tissue with normal cells, laser micro-dissection was performed before DNA/RNA extraction. Data analysis was performed using original software written in the R language. Result: We observed a high percentage of CNA in colorectal cancer, including copy number gains at 7, 8q, 13 and 20q, and copy number losses at 8p, 17p and 18. Gene expression analysis provided many candidates for CRC-related genes, but their association with CRC did not reach the level of statistical significance. The combination of CNA and gene

  4. Characterization of copy number variants for CCL3L1 gene in rheumatoid arthritis for French trio families and Tunisian cases and controls.

    Science.gov (United States)

    Ben Kilani, Mohamed Sahbi; Achour, Yosser; Perea, Javier; Cornelis, François; Bardin, Thomas; Chaudru, Valérie; Maalej, Abdellatif; Petit-Teixeira, Elisabeth

    2016-08-01

    Analyses of copy number variants (CNVs) for candidate genes in complex diseases are currently a promising research field. CNVs of C-C chemokine ligand 3-like 1 (CCL3L1) gene are candidate genomic factors in rheumatoid arthritis (RA). We investigated CCL3L1 CNVs association with a case-control study in Tunisians and a transmission analysis in French trio families. Relative copy number (rCN) of CCL3L1 gene was quantified by droplet digital PCR (ddPCR) in 100 French trio families (RA patients and their two parents) and in 166 RA cases and 102 healthy controls from Tunisia. We calculated odds ratio (OR) to investigate association risk for CCL3L1 CNVs in RA. rCN identified varied from 0 to 4 in the French population and from 0 to 7 in the Tunisian population. A significant difference was observed in the distribution of these rCNs between the two populations (p = 2.34 × 10(-10)), as when rCN from French and Tunisian RA patients were compared (p = 2.83 × 10(-5)). CNVs transmission in French RA trios allowed the characterization of genotypes with the presence of tandem duplication and triplication on the same chromosome. RA association tests highlighted a protective effect of rCN = 5 for CCL3L1 gene in the Tunisian population (OR = 0.056; CI 95 % [0.01-0.46]). Characterization of CCL3L1 CNVs with ddPCR methodology highlighted specific CN genotypes in a French family sample. A copy number polymorphism of a RA candidate gene was quantified, and its significant association with RA was revealed in a Tunisian sample.

  5. Detection of genomic copy number changes in patients with idiopathic mental retardation by high-resolution X-array-CGH: important role for increased gene dosage of XLMR genes.

    Science.gov (United States)

    Froyen, Guy; Van Esch, Hilde; Bauters, Marijke; Hollanders, Karen; Frints, Suzanna G M; Vermeesch, Joris R; Devriendt, Koen; Fryns, Jean-Pierre; Marynen, Peter

    2007-10-01

    A tiling X-chromosome-specific genomic array with a theoretical resolution of 80 kb was developed to screen patients with idiopathic mental retardation (MR) for submicroscopic copy number differences. Four patients with aberrations previously detected at lower resolution were first analyzed. This facilitated delineation of the location and extent of the aberration at high resolution and subsequently, more precise genotype-phenotype analyses. A cohort of 108 patients was screened, 57 of which were suspected of X-linked mental retardation (XLMR), 26 were probands of brother pairs, and 25 were sporadic cases. A total of 15 copy number changes in 14 patients (13%) were detected, which included two deletions and 13 duplications ranging from 0.1 to 2.7 Mb. The aberrations are associated with the phenotype in five patients (4.6%), based on the following criteria: de novo aberration; involvement of a known or candidate X-linked nonsyndromic(syndromic) MR (MRX(S)) gene; segregation with the disease in the family; absence in control individuals; and skewed X-inactivation in carrier females. These include deletions that contain the MRX(S) genes CDKL5, OPHN1, and CASK, and duplications harboring CDKL5, NXF5, MECP2, and GDI1. In addition, seven imbalances were apparent novel polymorphic regions because they do not fulfill the proposed criteria. Taken together, our data strongly suggest that not only deletions but also duplications on the X chromosome contribute to the phenotype more often than expected, supporting the increased gene dosage mechanism for deregulation of normal cognitive development.

  6. Critical evaluation of HPV16 gene copy number quantification by SYBR green PCR.

    Science.gov (United States)

    Roberts, Ian; Ng, Grace; Foster, Nicola; Stanley, Margaret; Herdman, Michael T; Pett, Mark R; Teschendorff, Andrew; Coleman, Nicholas

    2008-07-24

    Human papilloma virus (HPV) load and physical status are considered useful parameters for clinical evaluation of cervical squamous cell neoplasia. However, the errors implicit in HPV gene quantification by PCR are not well documented. We have undertaken the first rigorous evaluation of the errors that can be expected when using SYBR green qPCR for quantification of HPV type 16 gene copy numbers. We assessed a modified method, in which external calibration curves were generated from a single construct containing HPV16 E2, HPV16 E6 and the host gene hydroxymethylbilane synthase in a 1:1:1 ratio. When testing dilutions of mixed HPV/host DNA in replicate runs, we observed errors in quantifying E2 and E6 amplicons of 5-40%, with greatest error at the lowest DNA template concentration (3 ng/microl). Errors in determining viral copy numbers per diploid genome were 13-53%. Nevertheless, in cervical keratinocyte cell lines we observed reasonable agreement between viral loads determined by qPCR and Southern blotting. The mean E2/E6 ratio in episome-only cells was 1.04, but with a range of 0.76-1.32. In three integrant-only lines the mean E2/E6 ratios were 0.20, 0.72 and 2.61 (values confirmed by gene-specific Southern blotting). When E2/E6 ratios in fourteen HPV16-positive cervical carcinomas were analysed, conclusions regarding viral physical state could only be made in three cases, where the E2/E6 ratio was unavoidable inaccuracies that should be allowed for when quantifying HPV gene copy number. While E6 copy numbers can be considered to provide a useable indication of viral loads, the E2/E6 ratio is of limited value. Previous studies may have overestimated the frequency of mixed episomal/integrant HPV infections.

  7. Proteomic changes resulting from gene copy number variations in cancer cells.

    Directory of Open Access Journals (Sweden)

    Tamar Geiger

    2010-09-01

    Full Text Available Along the transformation process, cells accumulate DNA aberrations, including mutations, translocations, amplifications, and deletions. Despite numerous studies, the overall effects of amplifications and deletions on the end point of gene expression--the level of proteins--is generally unknown. Here we use large-scale and high-resolution proteomics combined with gene copy number analysis to investigate in a global manner to what extent these genomic changes have a proteomic output and therefore the ability to affect cellular transformation. We accurately measure expression levels of 6,735 proteins and directly compare them to the gene copy number. We find that the average effect of these alterations on the protein expression is only a few percent. Nevertheless, by using a novel algorithm, we find the combined impact that many of these regional chromosomal aberrations have at the protein level. We show that proteins encoded by amplified oncogenes are often overexpressed, while adjacent amplified genes, which presumably do not promote growth and survival, are attenuated. Furthermore, regulation of biological processes and molecular complexes is independent of general copy number changes. By connecting the primary genome alteration to their proteomic consequences, this approach helps to interpret the data from large-scale cancer genomics efforts.

  8. Chaperonin genes on the rise: new divergent classes and intense duplication in human and other vertebrate genomes

    Directory of Open Access Journals (Sweden)

    Macario Alberto JL

    2010-03-01

    Full Text Available Abstract Background Chaperonin proteins are well known for the critical role they play in protein folding and in disease. However, the recent identification of three diverged chaperonin paralogs associated with the human Bardet-Biedl and McKusick-Kaufman Syndromes (BBS and MKKS, respectively indicates that the eukaryotic chaperonin-gene family is larger and more differentiated than previously thought. The availability of complete genome sequences makes possible a definitive characterization of the complete set of chaperonin sequences in human and other species. Results We identified fifty-four chaperonin-like sequences in the human genome and similar numbers in the genomes of the model organisms mouse and rat. In mammal genomes we identified, besides the well-known CCT chaperonin genes and the three genes associated with the MKKS and BBS pathological conditions, a newly-defined class of chaperonin genes named CCT8L, represented in human by the two sequences CCT8L1 and CCT8L2. Comparative analyses from several vertebrate genomes established the monophyletic origin of chaperonin-like MKKS and BBS genes from the CCT8 lineage. The CCT8L gene originated from a later duplication also in the CCT8 lineage at the onset of mammal evolution and duplicated in primate genomes. The functionality of CCT8L genes in different species was confirmed by evolutionary analyses and in human by expression data. Detailed sequence analysis and structural predictions of MKKS, BBS and CCT8L proteins strongly suggested that they conserve a typical chaperonin-like core structure but that they are unlikely to form a CCT-like oligomeric complex. The characterization of many newly-discovered chaperonin pseudogenes uncovered the intense duplication activity of eukaryotic chaperonin genes. Conclusions In vertebrates, chaperonin genes, driven by intense duplication processes, have diversified into multiple classes and functionalities that extend beyond their well-known protein

  9. Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706)

    Science.gov (United States)

    Krsticevic, Flavia J.; Arce, Débora P.; Ezpeleta, Joaquín; Tapia, Elizabeth

    2016-01-01

    In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues. PMID:27565886

  10. Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706

    Directory of Open Access Journals (Sweden)

    Flavia J. Krsticevic

    2016-10-01

    Full Text Available In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706 genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues.

  11. An ancient history of gene duplications, fusions and losses in the evolution of APOBEC3 mutators in mammals

    Directory of Open Access Journals (Sweden)

    Münk Carsten

    2012-05-01

    Full Text Available Abstract Background The APOBEC3 (A3 genes play a key role in innate antiviral defense in mammals by introducing directed mutations in the DNA. The human genome encodes for seven A3 genes, with multiple splice alternatives. Different A3 proteins display different substrate specificity, but the very basic question on how discerning self from non-self still remains unresolved. Further, the expression of A3 activity/ies shapes the way both viral and host genomes evolve. Results We present here a detailed temporal analysis of the origin and expansion of the A3 repertoire in mammals. Our data support an evolutionary scenario where the genome of the mammalian ancestor encoded for at least one ancestral A3 gene, and where the genome of the ancestor of placental mammals (and possibly of the ancestor of all mammals already encoded for an A3Z1-A3Z2-A3Z3 arrangement. Duplication events of the A3 genes have occurred independently in different lineages: humans, cats and horses. In all of them, gene duplication has resulted in changes in enzyme activity and/or substrate specificity, in a paradigmatic example of convergent adaptive evolution at the genomic level. Finally, our results show that evolutionary rates for the three A3Z1, A3Z2 and A3Z3 motifs have significantly decreased in the last 100 Mya. The analysis constitutes a textbook example of the evolution of a gene locus by duplication and sub/neofunctionalization in the context of virus-host arms race. Conclusions Our results provide a time framework for identifying ancestral and derived genomic arrangements in the APOBEC loci, and to date the expansion of this gene family for different lineages through time, as a response to changes in viral/retroviral/retrotransposon pressure.

  12. Focal DNA copy number changes in neuroblastoma target MYCN regulated genes.

    Directory of Open Access Journals (Sweden)

    Candy Kumps

    Full Text Available Neuroblastoma is an embryonic tumor arising from immature sympathetic nervous system cells. Recurrent genomic alterations include MYCN and ALK amplification as well as recurrent patterns of gains and losses of whole or large partial chromosome segments. A recent whole genome sequencing effort yielded no frequently recurring mutations in genes other than those affecting ALK. However, the study further stresses the importance of DNA copy number alterations in this disease, in particular for genes implicated in neuritogenesis. Here we provide additional evidence for the importance of focal DNA copy number gains and losses, which are predominantly observed in MYCN amplified tumors. A focal 5 kb gain encompassing the MYCN regulated miR-17~92 cluster as sole gene was detected in a neuroblastoma cell line and further analyses of the array CGH data set demonstrated enrichment for other MYCN target genes in focal gains and amplifications. Next we applied an integrated genomics analysis to prioritize MYCN down regulated genes mediated by MYCN driven miRNAs within regions of focal heterozygous or homozygous deletion. We identified RGS5, a negative regulator of G-protein signaling implicated in vascular normalization, invasion and metastasis, targeted by a focal homozygous deletion, as a new MYCN target gene, down regulated through MYCN activated miRNAs. In addition, we expand the miR-17~92 regulatory network controlling TGFß signaling in neuroblastoma with the ring finger protein 11 encoding gene RNF11, which was previously shown to be targeted by the miR-17~92 member miR-19b. Taken together, our data indicate that focal DNA copy number imbalances in neuroblastoma (1 target genes that are implicated in MYCN signaling, possibly selected to reinforce MYCN oncogene addiction and (2 serve as a resource for identifying new molecular targets for treatment.

  13. Efficient inversions and duplications of mammalian regulatory DNA elements and gene clusters by CRISPR/Cas9

    Science.gov (United States)

    Li, Jinhuan; Shou, Jia; Guo, Ya; Tang, Yuanxiao; Wu, Yonghu; Jia, Zhilian; Zhai, Yanan; Chen, Zhifeng; Xu, Quan; Wu, Qiang

    2015-01-01

    The human genome contains millions of DNA regulatory elements and a large number of gene clusters, most of which have not been tested experimentally. The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated nuclease 9 (Cas9) programed with a synthetic single-guide RNA (sgRNA) emerges as a method for genome editing in virtually any organisms. Here we report that targeted DNA fragment inversions and duplications could easily be achieved in human and mouse genomes by CRISPR with two sgRNAs. Specifically, we found that, in cultured human cells and mice, efficient precise inversions of DNA fragments ranging in size from a few tens of bp to hundreds of kb could be generated. In addition, DNA fragment duplications and deletions could also be generated by CRISPR through trans-allelic recombination between the Cas9-induced double-strand breaks (DSBs) on two homologous chromosomes (chromatids). Moreover, junctions of combinatorial inversions and duplications of the protocadherin (Pcdh) gene clusters induced by Cas9 with four sgRNAs could be detected. In mice, we obtained founders with alleles of precise inversions, duplications, and deletions of DNA fragments of variable sizes by CRISPR. Interestingly, we found that very efficient inversions were mediated by microhomology-mediated end joining (MMEJ) through short inverted repeats. We showed for the first time that DNA fragment inversions could be transmitted through germlines in mice. Finally, we applied this CRISPR method to a regulatory element of the Pcdhα cluster and found a new role in the regulation of members of the Pcdhγ cluster. This simple and efficient method should be useful in manipulating mammalian genomes to study millions of regulatory DNA elements as well as vast numbers of gene clusters. PMID:25757625

  14. iGC-an integrated analysis package of gene expression and copy number alteration.

    Science.gov (United States)

    Lai, Yi-Pin; Wang, Liang-Bo; Wang, Wei-An; Lai, Liang-Chuan; Tsai, Mong-Hsun; Lu, Tzu-Pin; Chuang, Eric Y

    2017-01-14

    With the advancement in high-throughput technologies, researchers can simultaneously investigate gene expression and copy number alteration (CNA) data from individual patients at a lower cost. Traditional analysis methods analyze each type of data individually and integrate their results using Venn diagrams. Challenges arise, however, when the results are irreproducible and inconsistent across multiple platforms. To address these issues, one possible approach is to concurrently analyze both gene expression profiling and CNAs in the same individual. We have developed an open-source R/Bioconductor package (iGC). Multiple input formats are supported and users can define their own criteria for identifying differentially expressed genes driven by CNAs. The analysis of two real microarray datasets demonstrated that the CNA-driven genes identified by the iGC package showed significantly higher Pearson correlation coefficients with their gene expression levels and copy numbers than those genes located in a genomic region with CNA. Compared with the Venn diagram approach, the iGC package showed better performance. The iGC package is effective and useful for identifying CNA-driven genes. By simultaneously considering both comparative genomic and transcriptomic data, it can provide better understanding of biological and medical questions. The iGC package's source code and manual are freely available at https://www.bioconductor.org/packages/release/bioc/html/iGC.html .

  15. Primers for low-copy nuclear genes in the Melastomataceae 1

    OpenAIRE

    Reginato,Marcelo; Michelangeli, Fabián A.

    2016-01-01

    Premise of the study: Low-copy nuclear gene primers were developed for phylogenetic studies across the Melastomataceae. Methods and Results: Total genomic libraries from eight species in the Melastomataceae along with one transcriptome were used for marker identification and primer design. Eight exon-primed intron-crossing markers were amplified with success in taxa of nine tribes in the Melastomataceae. The new markers were directly sequenced for eight samples of closely related species of M...

  16. Gene fusions and gene duplications: relevance to genomic annotation and functional analysis

    Directory of Open Access Journals (Sweden)

    Riley Monica

    2005-03-01

    Full Text Available Abstract Background Escherichia coli a model organism provides information for annotation of other genomes. Our analysis of its genome has shown that proteins encoded by fused genes need special attention. Such composite (multimodular proteins consist of two or more components (modules encoding distinct functions. Multimodular proteins have been found to complicate both annotation and generation of sequence similar groups. Previous work overstated the number of multimodular proteins in E. coli. This work corrects the identification of modules by including sequence information from proteins in 50 sequenced microbial genomes. Results Multimodular E. coli K-12 proteins were identified from sequence similarities between their component modules and non-fused proteins in 50 genomes and from the literature. We found 109 multimodular proteins in E. coli containing either two or three modules. Most modules had standalone sequence relatives in other genomes. The separated modules together with all the single (un-fused proteins constitute the sum of all unimodular proteins of E. coli. Pairwise sequence relationships among all E. coli unimodular proteins generated 490 sequence similar, paralogous groups. Groups ranged in size from 92 to 2 members and had varying degrees of relatedness among their members. Some E. coli enzyme groups were compared to homologs in other bacterial genomes. Conclusion The deleterious effects of multimodular proteins on annotation and on the formation of groups of paralogs are emphasized. To improve annotation results, all multimodular proteins in an organism should be detected and when known each function should be connected with its location in the sequence of the protein. When transferring functions by sequence similarity, alignment locations must be noted, particularly when alignments cover only part of the sequences, in order to enable transfer of the correct function. Separating multimodular proteins into module units makes

  17. Duplication and Loss of Function of Genes Encoding RNA Polymerase III Subunit C4 Causes Hybrid Incompatibility in Rice

    Directory of Open Access Journals (Sweden)

    Giao Ngoc Nguyen

    2017-08-01

    Full Text Available Reproductive barriers are commonly observed in both animals and plants, in which they maintain species integrity and contribute to speciation. This report shows that a combination of loss-of-function alleles at two duplicated loci, DUPLICATED GAMETOPHYTIC STERILITY 1 (DGS1 on chromosome 4 and DGS2 on chromosome 7, causes pollen sterility in hybrid progeny derived from an interspecific cross between cultivated rice, Oryza sativa, and an Asian annual wild rice, O. nivara. Male gametes carrying the DGS1 allele from O. nivara (DGS1-nivaras and the DGS2 allele from O. sativa (DGS2-T65s were sterile, but female gametes carrying the same genotype were fertile. We isolated the causal gene, which encodes a protein homologous to DNA-dependent RNA polymerase (RNAP III subunit C4 (RPC4. RPC4 facilitates the transcription of 5S rRNAs and tRNAs. The loss-of-function alleles at DGS1-nivaras and DGS2-T65s were caused by weak or nonexpression of RPC4 and an absence of RPC4, respectively. Phylogenetic analysis demonstrated that gene duplication of RPC4 at DGS1 and DGS2 was a recent event that occurred after divergence of the ancestral population of Oryza from other Poaceae or during diversification of AA-genome species.

  18. Gene duplication and an accelerated evolutionary rate in 11S globulin genes are associated with higher protein synthesis in dicots as compared to monocots

    Directory of Open Access Journals (Sweden)

    Li Chun

    2012-01-01

    Full Text Available Abstract Background Seed storage proteins are a major source of dietary protein, and the content of such proteins determines both the quantity and quality of crop yield. Significantly, examination of the protein content in the seeds of crop plants shows a distinct difference between monocots and dicots. Thus, it is expected that there are different evolutionary patterns in the genes underlying protein synthesis in the seeds of these two groups of plants. Results Gene duplication, evolutionary rate and positive selection of a major gene family of seed storage proteins (the 11S globulin genes, were compared in dicots and monocots. The results, obtained from five species in each group, show more gene duplications, a higher evolutionary rate and positive selections of this gene family in dicots, which are rich in 11S globulins, but not in the monocots. Conclusion Our findings provide evidence to support the suggestion that gene duplication and an accelerated evolutionary rate may be associated with higher protein synthesis in dicots as compared to monocots.

  19. Heterogeneous expression pattern of tandem duplicated sHsps genes during fruit ripening in two tomato species

    Science.gov (United States)

    Arce, DP; Krsticevic, FJ; Ezpeleta, J.; Ponce, SD; Pratta, GR; Tapia, E.

    2016-04-01

    The small heat shock proteins (sHSPs) have been found to play a critical role in physiological stress conditions in protecting proteins from irreversible aggregation. To characterize the gene expression profile of four sHsps with a tandem gene structure arrangement in the domesticated Solanum lycopersicum (Heinz 1706) genome and its wild close relative Solanum pimpinellifolium (LA1589), differential gene expression analysis using RNA-Seq was conducted in three ripening stages in both cultivars fruits. Gene promoter analysis was performed to explain the heterogeneous pattern of gene expression found for these tandem duplicated sHsps. In silico analysis results contribute to refocus wet experiment analysis in tomato sHsp family proteins.

  20. Molecular and Population Analysis of Natural Selection on the Human Haptoglobin Duplication

    OpenAIRE

    Rodriguez, Santiago; Williams, Dylan M; Guthrie, Philip AI; McArdle, Wendy L.; Smith, George Davey; Evans, David M.; Gaunt, Tom R.; Day, Ian NM

    2012-01-01

    Haptoglobin binds free haemoglobin that prevents oxidative damage produced by haemolysis. There is a copy number variant (CNV) in the haptoglobin gene (HP) consisting of two alleles, Hp1 (no duplication), and Hp2 (1.7kb duplication involving two exons). The spread of the Hp2 allele is believed to have taken place under selective pressures conferred by malaria resistance. However, molecular evidence is lacking and Hp did not emerge in genomewide SNPs surveys for evidence of selection. In Europ...

  1. Copy number variation of mitochondrial genes in Pneumocystis jirovecii according to the fungal load in BAL specimens

    Directory of Open Access Journals (Sweden)

    clara valero

    2016-09-01

    Full Text Available AbstractPneumocystis jirovecii is an unculturable fungus and the causative agent of Pneumocystis pneumonia, a life-threatening opportunistic infection. Although molecular diagnosis is often based on the mtLSU rRNA mitochondrial gene due to its greater sensitivity, physiology and the dynamics of the mitochondria in this fungus remains largely unknown. We developed and optimized six real-time PCR assays in order to determine the copy number of four mitochondrial genes (mtSSU rRNA, mtLSU rRNA, NAD1 and CYTB in comparison to nuclear genome (DHPS and HSP70 and tested 84 bronchoalveolar fluids of patients at different stages of the infection. Unexpectedly, we found that copy number of mitochondrial genes varied from gene to gene with mtSSU rRNA gene being more represented (37 copies than NAD1 (23 copies, mtLSU rRNA (15 copies and CYTB (6 copies genes compared to nuclear genome. Hierarchical clustering analysis (HCA allowed us to define five major clusters, significantly associated with fungal load (p=0.029, in which copy number of mitochondrial genes was significantly different among them. More importantly, copy number of mtLSU rRNA, NAD1 and CYTB but not mtSSU rRNA differed according to P. jirovecii physiological state with a decreased number of copies when the fungal load is low. This suggests the existence of a mixture of various subspecies of mtDNA that can harbor different amplification rates. Overall, we revealed here an unexpected plasticity and dynamics of P. jirovecii mitochondrial DNA that vary according to P. jirovecii’s physiological state.

  2. Duplication of the IGFBP-2 gene in teleost fish: protein structure and functionality conservation and gene expression divergence.

    Directory of Open Access Journals (Sweden)

    Jianfeng Zhou

    growth and development primarily by binding to and inhibiting IGF actions in vivo. The duplicated IGFBP-2 genes may provide additional flexibility in the regulation of IGF activities.

  3. Opossum carboxylesterases: sequences, phylogeny and evidence for CES gene duplication events predating the marsupial-eutherian common ancestor

    Directory of Open Access Journals (Sweden)

    Chan Jeannie

    2008-02-01

    Full Text Available Abstract Background Carboxylesterases (CES perform diverse metabolic roles in mammalian organisms in the detoxification of a broad range of drugs and xenobiotics and may also serve in specific roles in lipid, cholesterol, pheromone and lung surfactant metabolism. Five CES families have been reported in mammals with human CES1 and CES2 the most extensively studied. Here we describe the genetics, expression and phylogeny of CES isozymes in the opossum and report on the sequences and locations of CES1, CES2 and CES6 'like' genes within two gene clusters on chromosome one. We also discuss the likely sequence of gene duplication events generating multiple CES genes during vertebrate evolution. Results We report a cDNA sequence for an opossum CES and present evidence for CES1 and CES2 like genes expressed in opossum liver and intestine and for distinct gene locations of five opossum CES genes,CES1, CES2.1, CES2.2, CES2.3 and CES6, on chromosome 1. Phylogenetic and sequence alignment studies compared the predicted amino acid sequences for opossum CES with those for human, mouse, chicken, frog, salmon and Drosophila CES gene products. Phylogenetic analyses produced congruent phylogenetic trees depicting a rapid early diversification into at least five distinct CES gene family clusters: CES2, CES1, CES7, CES3, and CES6. Molecular divergence estimates based on a Bayesian relaxed clock approach revealed an origin for the five mammalian CES gene families between 328–378 MYA. Conclusion The deduced amino acid sequence for an opossum cDNA was consistent with its identity as a mammalian CES2 gene product (designated CES2.1. Distinct gene locations for opossum CES1 (1: 446,222,550–446,274,850, three CES2 genes (1: 677,773,395–677,927,030 and a CES6 gene (1: 677,585,520–677,730,419 were observed on chromosome 1. Opossum CES1 and multiple CES2 genes were expressed in liver and intestine. Amino acid sequences for opossum CES1 and three CES2 gene products

  4. Low-copy piggyBac transposon mutagenesis in mice identifies genes driving melanoma.

    Science.gov (United States)

    Ni, Thomas K; Landrette, Sean F; Bjornson, Robert D; Bosenberg, Marcus W; Xu, Tian

    2013-09-17

    Despite considerable efforts to sequence hypermutated cancers such as melanoma, distinguishing cancer-driving genes from thousands of recurrently mutated genes remains a significant challenge. To circumvent the problematic background mutation rates and identify new melanoma driver genes, we carried out a low-copy piggyBac transposon mutagenesis screen in mice. We induced eleven melanomas with mutation burdens that were 100-fold lower relative to human melanomas. Thirty-eight implicated genes, including two known drivers of human melanoma, were classified into three groups based on high, low, or background-level mutation frequencies in human melanomas, and we further explored the functional significance of genes in each group. For two genes overlooked by prevailing discovery methods, we found that loss of membrane associated guanylate kinase, WW and PDZ domain containing 2 and protein tyrosine phosphatase, receptor type, O cooperated with the v-raf murine sarcoma viral oncogene homolog B (BRAF) recurrent V600E mutation to promote cellular transformation. Moreover, for infrequently mutated genes often disregarded by current methods, we discovered recurrent mitogen-activated protein kinase kinase kinase 1 (Map3k1)-activating insertions in our screen, mirroring recurrent MAP3K1 up-regulation in human melanomas. Aberrant expression of Map3k1 enabled growth factor-autonomous proliferation and drove BRAF-independent ERK signaling, thus shedding light on alternative means of activating this prominent signaling pathway in melanoma. In summary, our study contributes several previously undescribed genes involved in melanoma and establishes an important proof-of-principle for the utility of the low-copy transposon mutagenesis approach for identifying cancer-driving genes, especially those masked by hypermutation.

  5. The positioning logic and copy number control of genes in bacteria under stress

    Science.gov (United States)

    Zhang, Qiucen; Austin, Robert; Vyawahare, Saurabh; Lau, Alexandra

    2013-03-01

    Escherichia coli (E. coli) cells when challenged with sublethal concentrations of the genotoxic antibiotic ciprofloxacin cease to divide and form long filaments which contain multiple bacterial chromosomes. These filaments are individual mesoscopic environmental niches which provide protection for a community of chromosomes (as opposed to cells) under mutagenic stress and can provide an evolutionary fitness advantage within the niche. We use comparative genomic hybridization to show that the mesoscopic niche evolves within 20 minutes of ciprofloxacin exposure via replication of multiple copies of genes expressing ATP dependent transporters. We show that this rapid genomic amplification is done in a time efficient manner via placement of the genes encoding the pumps near the origin of replication on the bacterial chromosome. The de-amplification of multiple copies back to the wild type number is a function of the duration is a function of the ciprofloxacin exposure duration: the longer the exposure, the slower the removal of the multiple copies. The project described was supported by the National Science Foundation and the National Cancer Institute

  6. Positive selection in the adhesion domain of Mus sperm Adam genes through gene duplications and function-driven gene complex formations.

    Science.gov (United States)

    Grayson, Phil; Civetta, Alberto

    2013-09-30

    Sperm and testes-expressed Adam genes have been shown to undergo bouts of positive selection in mammals. Despite the pervasiveness of positive selection signals, it is unclear what has driven such selective bouts. The fact that only sperm surface Adam genes show signals of positive selection within their adhesion domain has led to speculation that selection might be driven by species-specific adaptations to fertilization or sperm competition. Alternatively, duplications and neofunctionalization of Adam sperm surface genes, particularly as it is now understood in rodents, might have contributed to an acceleration of evolutionary rates and possibly adaptive diversification. Here we sequenced and conducted tests of selection within the adhesion domain of sixteen known sperm-surface Adam genes among five species of the Mus genus. We find evidence of positive selection associated with all six Adam genes known to interact to form functional complexes on Mus sperm. A subset of these complex-forming sperm genes also displayed accelerated branch evolution with Adam5 evolving under positive selection. In contrast to our previous findings in primates, selective bouts within Mus sperm Adams showed no associations to proxies of sperm competition. Expanded phylogenetic analysis including sequence data from other placental mammals allowed us to uncover ancient and recent episodes of adaptive evolution. The prevailing signals of rapid divergence and positive selection detected within the adhesion domain of interacting sperm Adams is driven by duplications and potential neofunctionalizations that are in some cases ancient (Adams 2, 3 and 5) or more recent (Adams 1b, 4b and 6).

  7. DNA copy-number alterations underlie gene expression differences between microsatellite stable and unstable colorectal cancers

    DEFF Research Database (Denmark)

    Jorissen, Robert N; Lipton, Lara; Gibbs, Peter

    2008-01-01

    Purpose: About 15% of colorectal cancers harbor microsatellite instability (MSI). MSI-associated gene expression changes have been identified in colorectal cancers, but little overlap exists between signatures hindering an assessment of overall consistency. Little is known about the causes...... and downstream effects of differential gene expression. Experimental Design: DNA microarray data on 89 MSI and 140 microsatellite-stable (MSS) colorectal cancers from this study and 58 MSI and 77 MSS cases from three published reports were randomly divided into test and training sets. MSI-associated gene...... expression changes were assessed for cross-study consistency using training samples and validated as MSI classifier using test samples. Differences in biological pathways were identified by functional category analysis. Causation of differential gene expression was investigated by comparison to DNA copy...

  8. TOP1 gene copy number and TOP1/CEN-20 ratio in stage III colorectal cancer samples

    DEFF Research Database (Denmark)

    Rømer, Maria Unni Koefoed; Nygård, Sune Boris; Christensen, Ib Jarle

    AIM OF STUDY To investigate if TOP1 gene copy number and/or the TOP1/CEN-20 ratio in colorectal cancer (CRC) areassociated with prognosis. BACKGROUND TOP1, localized on chromosome 20, encodes topoisomerase I (TOP1), which is the sole molecular target of irinotecan. TOP1 immunoreactivity in formalin...... analyses on 50 FFPE primary CRC tissues. When compared with results from normal colorectal mucosa, 80 % of the tumors showed increased TOP1 gene copy number and 2/3 had increased TOP1/CEN-20 ratio. MATERIALS AND METHODS FFPE samples from 154 stage III CRC patients not receiving adjuvant chemotherapy were...... included. For each patient TOP1 gene copy number and CEN-20 reference number were determined in 60 nuclei from the malignant tumor by FISH using a TOP1/CEN-20 probe mix. Similarly, the TOP1 gene copy number and and CEN-20 reference number were dertermined in the normal colorectal mucosa in 105 of the 154...

  9. From DNA Copy Number to Gene Expression: Local aberrations, Trisomies and Monosomies

    Science.gov (United States)

    Shay, Tal

    The goal of my PhD research was to study the effect of DNA copy number changes on gene expression. DNA copy number aberrations may be local, encompassing several genes, or on the level of an entire chromosome, such as trisomy and monosomy. The main dataset I studied was of Glioblastoma, obtained in the framework of a collaboration, but I worked also with public datasets of cancer and Down's Syndrome. The molecular basis of expression changes in Glioblastoma. Glioblastoma is the most common and aggressive type of primary brain tumors in adults. In collaboration with Prof. Hegi (CHUV, Switzerland), we analyzed a rich Glioblastoma dataset including clinical information, DNA copy number (array CGH) and expression profiles. We explored the correlation between DNA copy number and gene expression at the level of chromosomal arms and local genomic aberrations. We detected known amplification and over expression of oncogenes, as well as deletion and down-regulation of tumor suppressor genes. We exploited that information to map alterations of pathways that are known to be disrupted in Glioblastoma, and tried to characterize samples that have no known alteration in any of the studied pathways. Identifying local DNA aberrations of biological significance. Many types of tumors exhibit chromosomal losses or gains and local amplifications and deletions. A region that is aberrant in many tumors, or whose copy number change is stronger, is more likely to be clinically relevant, and not just a by-product of genetic instability. We developed a novel method that defines and prioritizes aberrations by formalizing these intuitions. The method scores each aberration by the fraction of patients harboring it, its length and its amplitude, and assesses the significance of the score by comparing it to a null distribution obtained by permutations. This approach detects genetic locations that are significantly aberrant, generating a 'genomic aberration profile' for each sample. The 'genomic

  10. The Porcine TSPY Gene Is Tricopy but Not a Copy Number Variant.

    Directory of Open Access Journals (Sweden)

    Anh T Quach

    Full Text Available The testis-specific protein Y-encoded (TSPY gene is situated on the mammalian Y-chromosome and exhibits some remarkable biological characteristics. It has the highest known copy number (CN of all protein coding genes in the human and bovine genomes (up to 74 and 200, respectively and also shows high individual variability. Although the biological function of TSPY has not yet been elucidated, its specific expression in the testis and several identified binding domains within the protein suggests roles in male reproduction. Here we describe the porcine TSPY, as a multicopy gene with three copies located on the short arm of the Y-chromosome with no variation at three exon loci among 20 animals of normal reproductive health from four breeds of domestic pigs (Piétrain, Landrace, Duroc and Yorkshire. To further investigate the speculation that porcine TSPY is not a copy number variant, we have included five Low-fertility boars and five boars with exceptional High-fertility records. Interestingly, there was no difference between the High- and Low-fertile groups, but we detected slightly lower TSPY CN at all three exons (2.56-2.85 in both groups, as compared to normal animals, which could be attributed to technical variability or somatic mosaicism. The results are based on both relative quantitative real-time PCR (qPCR and droplet digital PCR (ddPCR. Chromosomal localization of the porcine TSPY was done using fluorescence in situ hybridization (FISH with gene specific PCR probes.

  11. Antigen-presenting genes and genomic copy number variations in the Tasmanian devil MHC

    Directory of Open Access Journals (Sweden)

    Cheng Yuanyuan

    2012-03-01

    Full Text Available Abstract Background The Tasmanian devil (Sarcophilus harrisii is currently under threat of extinction due to an unusual fatal contagious cancer called Devil Facial Tumour Disease (DFTD. DFTD is caused by a clonal tumour cell line that is transmitted between unrelated individuals as an allograft without triggering immune rejection due to low levels of Major Histocompatibility Complex (MHC diversity in Tasmanian devils. Results Here we report the characterization of the genomic regions encompassing MHC Class I and Class II genes in the Tasmanian devil. Four genomic regions approximately 960 kb in length were assembled and annotated using BAC contigs and physically mapped to devil Chromosome 4q. 34 genes and pseudogenes were identified, including five Class I and four Class II loci. Interestingly, when two haplotypes from two individuals were compared, three genomic copy number variants with sizes ranging from 1.6 to 17 kb were observed within the classical Class I gene region. One deletion is particularly important as it turns a Class Ia gene into a pseudogene in one of the haplotypes. This deletion explains the previously observed variation in the Class I allelic number between individuals. The frequency of this deletion is highest in the northwestern devil population and lowest in southeastern areas. Conclusions The third sequenced marsupial MHC provides insights into the evolution of this dynamic genomic region among the diverse marsupial species. The two sequenced devil MHC haplotypes revealed three copy number variations that are likely to significantly affect immune response and suggest that future work should focus on the role of copy number variations in disease susceptibility in this species.

  12. Formation of chimeric genes by copy-number variation as a mutational mechanism in schizophrenia.

    Science.gov (United States)

    Rippey, Caitlin; Walsh, Tom; Gulsuner, Suleyman; Brodsky, Matt; Nord, Alex S; Gasperini, Molly; Pierce, Sarah; Spurrell, Cailyn; Coe, Bradley P; Krumm, Niklas; Lee, Ming K; Sebat, Jonathan; McClellan, Jon M; King, Mary-Claire

    2013-10-03

    Chimeric genes can be caused by structural genomic rearrangements that fuse together portions of two different genes to create a novel gene. We hypothesize that brain-expressed chimeras may contribute to schizophrenia. Individuals with schizophrenia and control individuals were screened genome wide for copy-number variants (CNVs) that disrupted two genes on the same DNA strand. Candidate events were filtered for predicted brain expression and for frequency genes in localization, regulation, or function. Subcellular localizations of DNAJA2-NETO2 and MAP3K3-DDX42 differed from their parent genes. On the basis of the expression profile of the MATK promoter, MATK-ZFR2 is likely to be far more highly expressed in the brain during development than the ZFR2 parent gene. MATK-ZFR2 includes a ZFR2-derived isoform that we demonstrate localizes preferentially to neuronal dendritic branch sites. These results suggest that the formation of chimeric genes is a mechanism by which CNVs contribute to schizophrenia and that, by interfering with parent gene function, chimeras may disrupt critical brain processes, including neurogenesis, neuronal differentiation, and dendritic arborization.

  13. Enzymatic, expression and structural divergences among carboxyl O-methyltransferases after gene duplication and speciation in Nicotiana.

    Science.gov (United States)

    Hippauf, Frank; Michalsky, Elke; Huang, Ruiqi; Preissner, Robert; Barkman, Todd J; Piechulla, Birgit

    2010-02-01

    Methyl salicylate and methyl benzoate have important roles in a variety of processes including pollinator attraction and plant defence. These compounds are synthesized by salicylic acid, benzoic acid and benzoic acid/salicylic acid carboxyl methyltransferases (SAMT, BAMT and BSMT) which are members of the SABATH gene family. Both SAMT and BSMT were isolated from Nicotiana suaveolens, Nicotiana alata, and Nicotiana sylvestris allowing us to discern levels of enzyme divergence resulting from gene duplication in addition to species divergence. Phylogenetic analyses showed that Nicotiana SAMTs and BSMTs evolved in separate clades and the latter can be differentiated into the BSMT1 and the newly established BSMT2 branch. Although SAMT and BSMT orthologs showed minimal change coincident with species divergences, substantial evolutionary change of enzyme activity and expression patterns occurred following gene duplication. After duplication, the BSMT enzymes evolved higher preference for benzoic acid (BA) than salicylic acid (SA) whereas SAMTs maintained ancestral enzymatic preference for SA over BA. Expression patterns are largely complementary in that BSMT transcripts primarily accumulate in flowers, leaves and stems whereas SAMT is expressed mostly in roots. A novel enzyme, nicotinic acid carboxyl methyltransferase (NAMT), which displays a high degree of activity with nicotinic acid was discovered to have evolved in N. gossei from an ancestral BSMT. Furthermore a SAM-dependent synthesis of methyl anthranilate via BSMT2 is reported and contrasts with alternative biosynthetic routes previously proposed. While BSMT in flowers is clearly involved in methyl benzoate synthesis to attract pollinators, its function in other organs and tissues remains obscure.

  14. Critical evaluation of HPV16 gene copy number quantification by SYBR green PCR

    Directory of Open Access Journals (Sweden)

    Pett Mark R

    2008-07-01

    Full Text Available Abstract Background Human papilloma virus (HPV load and physical status are considered useful parameters for clinical evaluation of cervical squamous cell neoplasia. However, the errors implicit in HPV gene quantification by PCR are not well documented. We have undertaken the first rigorous evaluation of the errors that can be expected when using SYBR green qPCR for quantification of HPV type 16 gene copy numbers. We assessed a modified method, in which external calibration curves were generated from a single construct containing HPV16 E2, HPV16 E6 and the host gene hydroxymethylbilane synthase in a 1:1:1 ratio. Results When testing dilutions of mixed HPV/host DNA in replicate runs, we observed errors in quantifying E2 and E6 amplicons of 5–40%, with greatest error at the lowest DNA template concentration (3 ng/μl. Errors in determining viral copy numbers per diploid genome were 13–53%. Nevertheless, in cervical keratinocyte cell lines we observed reasonable agreement between viral loads determined by qPCR and Southern blotting. The mean E2/E6 ratio in episome-only cells was 1.04, but with a range of 0.76–1.32. In three integrant-only lines the mean E2/E6 ratios were 0.20, 0.72 and 2.61 (values confirmed by gene-specific Southern blotting. When E2/E6 ratios in fourteen HPV16-positive cervical carcinomas were analysed, conclusions regarding viral physical state could only be made in three cases, where the E2/E6 ratio was ≤ 0.06. Conclusion Run-to-run variation in SYBR green qPCR produces unavoidable inaccuracies that should be allowed for when quantifying HPV gene copy number. While E6 copy numbers can be considered to provide a useable indication of viral loads, the E2/E6 ratio is of limited value. Previous studies may have overestimated the frequency of mixed episomal/integrant HPV infections.

  15. Similarity of DMD gene deletion and duplication in the Chinese patients compared to global populations

    Directory of Open Access Journals (Sweden)

    Yan Ming

    2008-04-01

    Full Text Available Abstract Background DNA deletion and duplication were determined as the major mutation underlying Duchenne muscular dystrophy (DMD and Becker muscular dystrophy (BMD. Method Applying multiplex ligation-dependent probe amplification (MLPA, we have analyzed 179 unrelated DMD/BMD subjects from northern China. Results Seventy-three percent of the subjects were found having a deletion (66.25% or duplication (6.25%. Exons 51–52 were detected as the most common fragment deleted in single-exon deletion, and the region of exons 45–50 was the most common exons deleted in multi-exon deletions. About 90% of DMD/BMD cases carry a small size deletion that involves 10 exons or less, 26.67% of which carry a single-exon deletion. Most of the smaller deletions resulted in an out-of-frame mutation. The most common exons deleted were determined to be between exon 48 and exon 52, with exon 50 was the model allele. Verifying single-exon deletion, one sample with a deletion of exon 53 that was initially observed from MLPA showed that there was a single base deletion that abolished the ligation site in MLPA. Confirmation of single-exon deletion is recommended to exclude single base deletion or mutation at the MLPA ligation site. Conclusion The frequency of deletion and duplication in northern China is similar to global ethnic populations.

  16. Startling mosaicism of the Y-chromosome and tandem duplication of the SRY and DAZ genes in patients with Turner Syndrome.

    Directory of Open Access Journals (Sweden)

    Sanjay Premi

    Full Text Available Presence of the human Y-chromosome in females with Turner Syndrome (TS enhances the risk of development of gonadoblastoma besides causing several other phenotypic abnormalities. In the present study, we have analyzed the Y chromosome in 15 clinically diagnosed Turner Syndrome (TS patients and detected high level of mosaicisms ranging from 45,XO:46,XY = 100:0% in 4; 45,XO:46,XY:46XX = 4:94:2 in 8; and 45,XO:46,XY:46XX = 50:30:20 cells in 3 TS patients, unlike previous reports showing 5-8% cells with Y- material. Also, no ring, marker or di-centric Y was observed in any of the cases. Of the two TS patients having intact Y chromosome in >85% cells, one was exceptionally tall. Both the patients were positive for SRY, DAZ, CDY1, DBY, UTY and AZFa, b and c specific STSs. Real Time PCR and FISH demonstrated tandem duplication/multiplication of the SRY and DAZ genes. At sequence level, the SRY was normal in 8 TS patients while the remaining 7 showed either absence of this gene or known and novel mutations within and outside of the HMG box. SNV/SFV analysis showed normal four copies of the DAZ genes in these 8 patients. All the TS patients showed aplastic uterus with no ovaries and no symptom of gonadoblastoma. Present study demonstrates new types of polymorphisms indicating that no two TS patients have identical genotype-phenotype. Thus, a comprehensive analysis of more number of samples is warranted to uncover consensus on the loci affected, to be able to use them as potential diagnostic markers.

  17. Becker Muscular Dystrophy (BMD) caused by duplication of exons 3-6 of the dystrophin gene presenting as dilated cardiomyopathy

    Energy Technology Data Exchange (ETDEWEB)

    Tsai, A.C.; Allingham-Hawkins, D.J.; Becker, L. [Univ. of Toronto, Ontario (Canada)] [and others

    1994-09-01

    X-linked dilated cardiomyopathy (XLCM) is a progressive myocardial disease presenting with congestive heart failure in teenage males without clinical signs of skeletal myopathy. Tight linkage of XLCM to the DMD locus has been demonstrated; it has been suggested that, at least in some families, XLCM is a {open_quotes}dystrophinopathy.{close_quotes} We report a 14-year-old boy who presented with acute heart failure due to dilated cardiomyopathy. He had no history of muscle weakness, but physical examination revealed pseudohypertrophy of the calf muscles. He subsequently received a heart transplantation. Family history was negative. Serum CK level at the time of diagnosis was 10,416. Myocardial biopsy showed no evidence of carditis. Dystrophin staining of cardiac and skeletal muscle with anti-sera to COOH and NH{sub 2}termini showed a patchy distribution of positivity suggestive of Becker muscular dystrophy. Analysis of 18 of the 79 dystrophin exons detected a duplication that included exons 3-6. The proband`s mother has an elevated serum CK and was confirmed to be a carrier of the same duplication. A mutation in the muscle promotor region of the dystrophin gene has been implicated in the etiology of SLCM. However, Towbin et al. (1991) argued that other 5{prime} mutations in the dystrophin gene could cause selective cardiomyopathy. The findings in our patient support the latter hypothesis. This suggests that there are multiple regions in the dystrophin gene which, when disrupted, can cause isolated dilated cardiomyopathy.

  18. Original tandem duplication in FXIIIA gene with splicing site modification and four amino acids insertion causes factor XIII deficiency.

    Science.gov (United States)

    Louhichi, Nacim; Haj Salem, Ikhlass; Medhaffar, Moez; Miled, Nabil; Hadji, Ahmad F; Keskes, Leila; Fakhfakh, Faiza

    2017-04-01

    : Recessive mutations of F13A gene are reported to be responsible of FXIIIA subunit deficiency (FXIIIA). In all, some intronic nucleotide changes identified in this gene were investigated by in-silico analysis and occasionally supported by experimental data or reported in some cases as a polymorphism. To determine the molecular defects responsible of congenital factor XIII deficiency in Libyan patient, molecular analysis was performed by direct DNA sequencing of the coding regions and splice junctions of the FXIIIA subunit gene (F13A). A splicing minigene assay was used to study the effect of this mutation. Bioinformatics exploration was fulfilled to conceive consequences on protein. A 12-bp duplication straddling the border of intron 9 and exon 10 leads to two 3' acceptor splice sites, resulting in silencing of the downstream wild 3' splice site. It caused an in-frame insertion of 12 nucleotides into mRNA and four amino acids into protein. Bioinformatic analysis predicts that the insertion of four amino acids affects the site 3 of calcium binding site, which disturbs the smooth function of the FXIIIA peptide causing the factor XIII deficiency. This study showed that a small duplication seems to weaken the original 3' splice site and enhance the activation of a new splice site responsible for an alternative splicing. It would be interesting to examine the underlying molecular mechanism involved in this rearrangement.

  19. Systematic prioritization and integrative analysis of copy number variations in schizophrenia reveal key schizophrenia susceptibility genes.

    Science.gov (United States)

    Luo, Xiongjian; Huang, Liang; Han, Leng; Luo, Zhenwu; Hu, Fang; Tieu, Roger; Gan, Lin

    2014-11-01

    Schizophrenia is a common mental disorder with high heritability and strong genetic heterogeneity. Common disease-common variants hypothesis predicts that schizophrenia is attributable in part to common genetic variants. However, recent studies have clearly demonstrated that copy number variations (CNVs) also play pivotal roles in schizophrenia susceptibility and explain a proportion of missing heritability. Though numerous CNVs have been identified, many of the regions affected by CNVs show poor overlapping among different studies, and it is not known whether the genes disrupted by CNVs contribute to the risk of schizophrenia. By using cumulative scoring, we systematically prioritized the genes affected by CNVs in schizophrenia. We identified 8 top genes that are frequently disrupted by CNVs, including NRXN1, CHRNA7, BCL9, CYFIP1, GJA8, NDE1, SNAP29, and GJA5. Integration of genes affected by CNVs with known schizophrenia susceptibility genes (from previous genetic linkage and association studies) reveals that many genes disrupted by CNVs are also associated with schizophrenia. Further protein-protein interaction (PPI) analysis indicates that protein products of genes affected by CNVs frequently interact with known schizophrenia-associated proteins. Finally, systematic integration of CNVs prioritization data with genetic association and PPI data identifies key schizophrenia candidate genes. Our results provide a global overview of genes impacted by CNVs in schizophrenia and reveal a densely interconnected molecular network of de novo CNVs in schizophrenia. Though the prioritized top genes represent promising schizophrenia risk genes, further work with different prioritization methods and independent samples is needed to confirm these findings. Nevertheless, the identified key candidate genes may have important roles in the pathogenesis of schizophrenia, and further functional characterization of these genes may provide pivotal targets for future therapeutics and

  20. Association of variation in Fc gamma receptor 3B gene copy number with rheumatoid arthritis in Caucasian samples

    NARCIS (Netherlands)

    McKinney, Cushla; Fanciulli, Manuela; Merriman, Marilyn E.; Phipps-Green, Amanda; Alizadeh, Behrooz Z.; Koeleman, Bobby P. C.; Dalbeth, Nicola; Gow, Peter J.; Harrison, Andrew A.; Highton, John; Jones, Peter B.; Stamp, Lisa K.; Steer, Sophia; Barrera, Pilar; Coenen, Marieke J. H.; Franke, Barbara; van Riel, Piet L. C. M.; Vyse, Tim J.; Aitman, Tim J.; Radstake, Timothy R. D. J.; Merriman, Tony R.

    2010-01-01

    Objective There is increasing evidence that variation in gene copy number (CN) influences clinical phenotype. The low-affinity Fc gamma receptor 3B (FCGR3B) located in the FCGR gene cluster is a CN polymorphic gene involved in the recruitment to sites of inflammation and activation of polymorphonucl

  1. Antagonistic roles for KNOX1 and KNOX2 genes in patterning the land plant body plan following an ancient gene duplication.

    Science.gov (United States)

    Furumizu, Chihiro; Alvarez, John Paul; Sakakibara, Keiko; Bowman, John L

    2015-02-01

    Neofunctionalization following gene duplication is thought to be one of the key drivers in generating evolutionary novelty. A gene duplication in a common ancestor of land plants produced two classes of KNOTTED-like TALE homeobox genes, class I (KNOX1) and class II (KNOX2). KNOX1 genes are linked to tissue proliferation and maintenance of meristematic potentials of flowering plant and moss sporophytes, and modulation of KNOX1 activity is implicated in contributing to leaf shape diversity of flowering plants. While KNOX2 function has been shown to repress the gametophytic (haploid) developmental program during moss sporophyte (diploid) development, little is known about KNOX2 function in flowering plants, hindering syntheses regarding the relationship between two classes of KNOX genes in the context of land plant evolution. Arabidopsis plants harboring loss-of-function KNOX2 alleles exhibit impaired differentiation of all aerial organs and have highly complex leaves, phenocopying gain-of-function KNOX1 alleles. Conversely, gain-of-function KNOX2 alleles in conjunction with a presumptive heterodimeric BELL TALE homeobox partner suppressed SAM activity in Arabidopsis and reduced leaf complexity in the Arabidopsis relative Cardamine hirsuta, reminiscent of loss-of-function KNOX1 alleles. Little evidence was found indicative of epistasis or mutual repression between KNOX1 and KNOX2 genes. KNOX proteins heterodimerize with BELL TALE homeobox proteins to form functional complexes, and contrary to earlier reports based on in vitro and heterologous expression, we find high selectivity between KNOX and BELL partners in vivo. Thus, KNOX2 genes confer opposing activities rather than redundant roles with KNOX1 genes, and together they act to direct the development of all above-ground organs of the Arabidopsis sporophyte. We infer that following the KNOX1/KNOX2 gene duplication in an ancestor of land plants, neofunctionalization led to evolution of antagonistic biochemical

  2. Post-polyploidisation morphotype diversification associates with gene copy number variation

    Science.gov (United States)

    Schiessl, Sarah; Huettel, Bruno; Kuehn, Diana; Reinhardt, Richard; Snowdon, Rod

    2017-01-01

    Genetic models for polyploid crop adaptation provide important information relevant for future breeding prospects. A well-suited model is Brassica napus, a recent allopolyploid closely related to Arabidopsis thaliana. Flowering time is a major adaptation trait determining life cycle synchronization with the environment. Here we unravel natural genetic variation in B. napus flowering time regulators and investigate associations with evolutionary diversification into different life cycle morphotypes. Deep sequencing of 35 flowering regulators was performed in 280 diverse B. napus genotypes. High sequencing depth enabled high-quality calling of single-nucleotide polymorphisms (SNPs), insertion-deletions (InDels) and copy number variants (CNVs). By combining these data with genotyping data from the Brassica 60 K Illumina® Infinium SNP array, we performed a genome-wide marker distribution analysis across the 4 ecogeographical morphotypes. Twelve haplotypes, including Bna.FLC.A10, Bna.VIN3.A02 and the Bna.FT promoter on C02_random, were diagnostic for the diversification of winter and spring types. The subspecies split between oilseed/kale (B. napus ssp. napus) and swedes/rutabagas (B. napus ssp. napobrassica) was defined by 13 haplotypes, including genomic rearrangements encompassing copies of Bna.FLC, Bna.PHYA and Bna.GA3ox1. De novo variation in copies of important flowering-time genes in B. napus arose during allopolyploidisation, enabling sub-functionalisation that allowed different morphotypes to appropriately fine-tune their lifecycle. PMID:28165502

  3. NDRG2 gene copy number is not altered in colorectal carcinoma

    DEFF Research Database (Denmark)

    Lorentzen, Anders Blomkild; Mitchelmore, Cathy

    2017-01-01

    levels using quantitative reverse transcription-polymerase chain reaction (qRT-PCR); interaction of the MYC gene-regulatory protein with the NDRG2 promoter using chromatin immunoprecipitation; and NDRG2 promoter methylation using bisulfite sequencing. Furthermore, we performed qPCR to analyse the copy......AIM To investigate if the down-regulation of N-myc Downstream Regulated Gene 2 (NDRG2) expression in colorectal carcinoma (CRC) is due to loss of the NDRG2 allele(s). METHODS The following were investigated in the human colorectal cancer cell lines DLD-1, LoVo and SW-480: NDRG2 mRNA expression...... numbers of NDRG2 and MYC genes in the above three cell lines, 8 normal colorectal tissue samples and 40 CRC tissue samples. RESULTS As expected, NDRG2 mRNA levels were low in the three colorectal cancer cell lines, compared to normal colon. Endogenous MYC protein interacted with the NDRG2 core promoter...

  4. Establishment of TaqMan Real-time Quantitative PCR Assay for Foreign Gene Copy Numbers in Transgenic Soybean

    Institute of Scientific and Technical Information of China (English)

    Qiu You-wen; Gao Xue-jun; Qi Bang-ruo; Li Lu; Zhen Zhen

    2012-01-01

    TaqMan quantitative PCR technique was used to detect the copies of exogenous CaMV35S flanks sequence in transgenic soybean. With soybean lectin as the endogenous reference gene, and gene complex DNA in non-GMO soybeans as the endogenous reference standard, the gradient dilution method was used to separately calculate Ct value of endogenous reference gene and plasmid DNA and correlation standard curve equation of logarithm of copies, and then to calculate the copies of samples through substituting thus-obtained Ct into the standard curve equation. The standard curve equation of endogenous reference gene was y =–3.422x+35.201, R2=0.998; the standard curve equation of exogenous gene was y =–3.495x+35.303, R2=0.999. The sample copies was got by putting Ct value into the standard curve equation, and it was the ratio of exogenous gene and reference gene. We found that CaMV35S gene in transgenic soy was single copy.

  5. Development of universal genetic markers based on single-copy orthologous (COSII) genes in Poaceae.

    Science.gov (United States)

    Liu, Hailan; Guo, Xiaoqin; Wu, Jiasheng; Chen, Guo-Bo; Ying, Yeqing

    2013-03-01

    KEY MESSAGE : We develop a set of universal genetic markers based on single-copy orthologous (COSII) genes in Poaceae. Being evolutionary conserved, single-copy orthologous (COSII) genes are particularly useful in comparative mapping and phylogenetic investigation among species. In this study, we identified 2,684 COSII genes based on five sequenced Poaceae genomes including rice, maize, sorghum, foxtail millet, and brachypodium, and then developed 1,072 COSII markers whose transferability and polymorphism among five bamboo species were further evaluated with 46 pairs of randomly selected primers. 91.3 % of the 46 primers obtained clear amplification in at least one bamboo species, and 65.2 % of them produced polymorphism in more than one species. We also used 42 of them to construct the phylogeny for the five bamboo species, and it might reflect more precise evolutionary relationship than the one based on the vegetative morphology. The results indicated a promising prospect of applying these markers to the investigation of genetic diversity and the classification of Poaceae. To ease and facilitate access of the information of common interest to readers, a web-based database of the COSII markers is provided ( http://www.sicau.edu.cn/web/yms/PCOSWeb/PCOS.html ).

  6. Oligonucleotide primers for targeted amplification of single-copy nuclear genes in apocritan Hymenoptera.

    Directory of Open Access Journals (Sweden)

    Gerrit Hartig

    Full Text Available BACKGROUND: Published nucleotide sequence data from the mega-diverse insect order Hymenoptera (sawflies, bees, wasps, and ants are taxonomically scattered and still inadequate for reconstructing a well-supported phylogenetic tree for the order. The analysis of comprehensive multiple gene data sets obtained via targeted PCR could provide a cost-effective solution to this problem. However, oligonucleotide primers for PCR amplification of nuclear genes across a wide range of hymenopteran species are still scarce. FINDINGS: Here we present a suite of degenerate oligonucleotide primer pairs for PCR amplification of 154 single-copy nuclear protein-coding genes from Hymenoptera. These primers were inferred from genome sequence data from nine Hymenoptera (seven species of ants, the honeybee, and the parasitoid wasp Nasonia vitripennis. We empirically tested a randomly chosen subset of these primer pairs for amplifying target genes from six Hymenoptera, representing the families Chrysididae, Crabronidae, Gasteruptiidae, Leucospidae, Pompilidae, and Stephanidae. Based on our results, we estimate that these primers are suitable for studying a large number of nuclear genes across a wide range of apocritan Hymenoptera (i.e., all hymenopterans with a wasp-waist and of aculeate Hymenoptera in particular (i.e., apocritan wasps with stingers. CONCLUSIONS: The amplified nucleotide sequences are (a with high probability from single-copy genes, (b easily generated at low financial costs, especially when compared to phylogenomic approaches, (c easily sequenced by means of an additionally provided set of sequencing primers, and (d suitable to address a wide range of phylogenetic questions and to aid rapid species identification via barcoding, as many amplicons contain both exonic and fast-evolving intronic nucleotides.

  7. Epidermal growth factor receptor and AKT1 gene copy numbers by multi-gene fluorescence in situ hybridization impact on prognosis in breast cancer.

    Science.gov (United States)

    Li, Jiao; Su, Wei; Zhang, Sheng; Hu, Yunhui; Liu, Jingjing; Zhang, Xiaobei; Bai, Jingchao; Yuan, Weiping; Hu, Linping; Cheng, Tao; Zetterberg, Anders; Lei, Zhenmin; Zhang, Jin

    2015-05-01

    The epidermal growth factor receptor (EGFR)/PI3K/AKT signaling pathway aberrations play significant roles in breast cancer occurrence and development. However, the status of EGFR and AKT1 gene copy numbers remains unclear. In this study, we showed that the rates of EGFR and AKT1 gene copy number alterations were associated with the prognosis of breast cancer. Among 205 patients, high EGFR and AKT1 gene copy numbers were observed in 34.6% and 27.8% of cases by multi-gene fluorescence in situ hybridization, respectively. Co-heightened EGFR/AKT1 gene copy numbers were identified in 11.7% cases. No changes were found in 49.3% of patients. Although changes in EGFR and AKT1 gene copy numbers had no correlation with patients' age, tumor stage, histological grade and the expression status of other molecular makers, high EGFR (P = 0.0002) but not AKT1 (P = 0.1177) gene copy numbers correlated with poor 5-year overall survival. The patients with co-heightened EGFR/AKT1 gene copy numbers displayed a poorer prognosis than those with tumors with only high EGFR gene copy numbers (P = 0.0383). Both Univariate (U) and COX multivariate (C) analyses revealed that high EGFR and AKT1 gene copy numbers (P = 0.000 [U], P = 0.0001 [C]), similar to histological grade (P = 0.001 [U], P = 0.012 [C]) and lymph node metastasis (P = 0.046 [U], P = 0.158 [C]), were independent prognostic indicators of 5-year overall survival. These results indicate that high EGFR and AKT1 gene copy numbers were relatively frequent in breast cancer. Co-heightened EGFR/AKT1 gene copy numbers had a worse outcome than those with only high EGFR gene copy numbers, suggesting that evaluation of these two genes together may be useful for selecting patients for anti-EGFR-targeted therapy or anti-EGFR/AKT1-targeted therapy and for predicting outcomes. © 2015 The Authors. Cancer Science published by Wiley Publishing Asia Pty Ltd on behalf of Japanese Cancer Association.

  8. Gene Duplication of the zebrafish kit ligand and partitioning of melanocyte development functions to kit ligand a.

    Directory of Open Access Journals (Sweden)

    Keith A Hultman

    2007-01-01

    Full Text Available The retention of particular genes after the whole genome duplication in zebrafish has given insights into how genes may evolve through partitioning of ancestral functions. We examine the partitioning of expression patterns and functions of two zebrafish kit ligands, kit ligand a (kitla and kit ligand b (kitlb, and discuss their possible coevolution with the duplicated zebrafish kit receptors (kita and kitb. In situ hybridizations show that kitla mRNA is expressed in the trunk adjacent to the notochord in the middle of each somite during stages of melanocyte migration and later expressed in the skin, when the receptor is required for melanocyte survival. kitla is also expressed in other regions complementary to kita receptor expression, including the pineal gland, tail bud, and ear. In contrast, kitlb mRNA is expressed in brain ventricles, ear, and cardinal vein plexus, in regions generally not complementary to either zebrafish kit receptor ortholog. However, like kitla, kitlb is expressed in the skin during stages consistent with melanocyte survival. Thus, it appears that kita and kitla have maintained congruent expression patterns, while kitb and kitlb have evolved divergent expression patterns. We demonstrate the interaction of kita and kitla by morpholino knockdown analysis. kitla morphants, but not kitlb morphants, phenocopy the null allele of kita, with defects for both melanocyte migration and survival. Furthermore, kitla morpholino, but not kitlb morpholino, interacts genetically with a sensitized allele of kita, confirming that kitla is the functional ligand to kita. Last, we examine kitla overexpression in embryos, which results in hyperpigmentation caused by an increase in the number and size of melanocytes. This hyperpigmentation is dependent on kita function. We conclude that following genome duplication, kita and kitla have maintained their receptor-ligand relationship, coevolved complementary expression patterns, and that

  9. An integrated analysis of miRNA and gene copy numbers in xenografts of Ewing's sarcoma

    Directory of Open Access Journals (Sweden)

    Mosakhani Neda

    2012-03-01

    Full Text Available Abstract Background Xenografts have been shown to provide a suitable source of tumor tissue for molecular analysis in the absence of primary tumor material. We utilized ES xenograft series for integrated microarray analyses to identify novel biomarkers. Method Microarray technology (array comparative genomic hybridization (aCGH and micro RNA arrays was used to screen and identify copy number changes and differentially expressed miRNAs of 34 and 14 passages, respectively. Incubated cells used for xenografting (Passage 0 were considered to represent the primary tumor. Four important differentially expressed miRNAs (miR-31, miR-31*, miR-145, miR-106 were selected for further validation by real time polymerase chain reaction (RT-PCR. Integrated analysis of aCGH and miRNA data was performed on 14 xenograft passages by bioinformatic methods. Results The most frequent losses and gains of DNA copy number were detected at 9p21.3, 16q and at 8, 15, 17q21.32-qter, 1q21.1-qter, respectively. The presence of these alterations was consistent in all tumor passages. aCGH profiles of xenograft passages of each series resembled their corresponding primary tumors (passage 0. MiR-21, miR-31, miR-31*, miR-106b, miR-145, miR-150*, miR-371-5p, miR-557 and miR-598 showed recurrently altered expression. These miRNAS were predicted to regulate many ES-associated genes, such as genes of the IGF1 pathway, EWSR1, FLI1 and their fusion gene (EWS-FLI1. Twenty differentially expressed miRNAs were pinpointed in regions carrying altered copy numbers. Conclusion In the present study, ES xenografts were successfully applied for integrated microarray analyses. Our findings showed expression changes of miRNAs that were predicted to regulate many ES associated genes, such as IGF1 pathway genes, FLI1, EWSR1, and the EWS-FLI1 fusion genes.

  10. Between-species differences in gene copy number are enriched among functions critical for adaptive evolution in Arabidopsis halleri.

    Science.gov (United States)

    Suryawanshi, Vasantika; Talke, Ina N; Weber, Michael; Eils, Roland; Brors, Benedikt; Clemens, Stephan; Krämer, Ute

    2016-12-22

    Gene copy number divergence between species is a form of genetic polymorphism that contributes significantly to both genome size and phenotypic variation. In plants, copy number expansions of single genes were implicated in cultivar- or species-specific tolerance of high levels of soil boron, aluminium or calamine-type heavy metals, respectively. Arabidopsis halleri is a zinc- and cadmium-hyperaccumulating extremophile species capable of growing on heavy-metal contaminated, toxic soils. In contrast, its non-accumulating sister species A. lyrata and the closely related reference model species A. thaliana exhibit merely basal metal tolerance. For a genome-wide assessment of the role of copy number divergence (CND) in lineage-specific environmental adaptation, we conducted cross-species array comparative genome hybridizations of three plant species and developed a global signal scaling procedure to adjust for sequence divergence. In A. halleri, transition metal homeostasis functions are enriched twofold among the genes detected as copy number expanded. Moreover, biotic stress functions including mostly disease Resistance (R) gene-related genes are enriched twofold among genes detected as copy number reduced, when compared to the abundance of these functions among all genes. Our results provide genome-wide support for a link between evolutionary adaptation and CND in A. halleri as shown previously for Heavy metal ATPase4. Moreover our results support the hypothesis that elemental defences, which result from the hyperaccumulation of toxic metals, allow the reduction of classical defences against biotic stress as a trade-off.

  11. Balanced gene losses, duplications and intensive rearrangements led to an unusual regularly sized genome in Arbutus unedo chloroplasts.

    Directory of Open Access Journals (Sweden)

    Fernando Martínez-Alberola

    Full Text Available Completely sequenced plastomes provide a valuable source of information about the duplication, loss, and transfer events of chloroplast genes and phylogenetic data for resolving relationships among major groups of plants. Moreover, they can also be useful for exploiting chloroplast genetic engineering technology. Ericales account for approximately six per cent of eudicot diversity with 11,545 species from which only three complete plastome sequences are currently available. With the aim of increasing the number of ericalean complete plastome sequences, and to open new perspectives in understanding Mediterranean plant adaptations, a genomic study on the basis of the complete chloroplast genome sequencing of Arbutus unedo and an updated phylogenomic analysis of Asteridae was implemented. The chloroplast genome of A. unedo shows extensive rearrangements but a medium size (150,897 nt in comparison to most of angiosperms. A number of remarkable distinct features characterize the plastome of A. unedo: five-fold dismissing of the SSC region in relation to most angiosperms; complete loss or pseudogenization of a number of essential genes; duplication of the ndhH-D operon and its location within the two IRs; presence of large tandem repeats located near highly re-arranged regions and pseudogenes. All these features outline the primary evolutionary split between Ericaceae and other ericalean families. The newly sequenced plastome of A. unedo with the available asterid sequences allowed the resolution of some uncertainties in previous phylogenies of Asteridae.

  12. Intron analyses reveal multiple calmodulin copies in Littorina.

    Science.gov (United States)

    Simpson, R J; Wilding, C S; Grahame, J

    2005-04-01

    Intron 3 and the flanking exons of the calmodulin gene have been amplified, cloned, and sequenced from 18 members of the gastropod genus Littorina. From the 48 sequences, at least five different gene copies have been identified and their functionality characterized using a strategy based upon the potential protein product predicted from flanking exon data. The functionality analyses suggest that four of the genes code for functional copies of calmodulin. All five copies have been identified across a wide range of littorinid species although not ubiquitously. Using this novel approach based on intron sequences, we have identified an unprecedented number of potential calmodulin copies in Littorina, exceeding that reported for any other invertebrate. This suggests a higher number of, and more ancient, gene duplications than previously detected in a single genus.

  13. Duplications and positive selection drive the evolution of parasitism associated gene families in the nematode Strongyloides papillosus.

    Science.gov (United States)

    Baskaran, Praveen; Jaleta, Tegegn G; Streit, Adrian; Rödelsperger, Christian

    2017-03-02

    Gene duplication is one major mechanism playing a role in the evolution of phenotypic complexity and in the generation of novel traits. By comparing parasitic and nonparasitic nematodes, a recent study found that the evolution of parasitism in Strongyloididae is associated with a large expansion in the Astacin and CAP gene families.To gain novel insights into the developmental processes in the sheep parasite Strongyloides papillosus, we sequenced transcriptomes of different developmental stages and sexes. Overall, we found that the majority of genes are developmentally regulated and have one-to-one orthologs in the diverged S. ratti genome. Together with the finding of similar expression profiles between S. papillosus and S. ratti, these results indicate a strong evolutionary constraint acting against change at sequence and expression levels. However, the comparison between parasitic and free-living females demonstrates a quite divergent pattern that is mostly due to the previously mentioned expansion in the Astacin and CAP gene families. More detailed phylogenetic analysis of both gene families shows that most members date back to single expansion events early in the Strongyloides lineage and have undergone subfunctionalization resulting in clusters that are highly expressed either in infective larvae or in parasitic females. Finally, we found increased evidence for positive selection in both gene families relative to the genome-wide expectation.In summary, our study reveals first insights into the developmental transcriptomes of S. papillosus and provides a detailed analysis of sequence and expression evolution in parasitism associated gene families.

  14. The DUB/USP17 deubiquitinating enzymes: A gene family within a tandemly repeated sequence, is also embedded within the copy number variable Beta-defensin cluster

    Directory of Open Access Journals (Sweden)

    Scott Christopher J

    2010-04-01

    Full Text Available Abstract Background The DUB/USP17 subfamily of deubiquitinating enzymes were originally identified as immediate early genes induced in response to cytokine stimulation in mice (DUB-1, DUB-1A, DUB-2, DUB-2A. Subsequently we have identified a number of human family members and shown that one of these (DUB-3 is also cytokine inducible. We originally showed that constitutive expression of DUB-3 can block cell proliferation and more recently we have demonstrated that this is due to its regulation of the ubiquitination and activity of the 'CAAX' box protease RCE1. Results Here we demonstrate that the human DUB/USP17 family members are found on both chromosome 4p16.1, within a block of tandem repeats, and on chromosome 8p23.1, embedded within the copy number variable beta-defensin cluster. In addition, we show that the multiple genes observed in humans and other distantly related mammals have arisen due to the independent expansion of an ancestral sequence within each species. However, it is also apparent when sequences from humans and the more closely related chimpanzee are compared, that duplication events have taken place prior to these species separating. Conclusions The observation that the DUB/USP17 genes, which can influence cell growth and survival, have evolved from an unstable ancestral sequence which has undergone multiple and varied duplications in the species examined marks this as a unique family. In addition, their presence within the beta-defensin repeat raises the question whether they may contribute to the influence of this repeat on immune related conditions.

  15. Copy number variation analysis implicates the cell polarity gene glypican 5 as a human spina bifida candidate gene

    Science.gov (United States)

    Bassuk, Alexander G.; Muthuswamy, Lakshmi B.; Boland, Riley; Smith, Tiffany L.; Hulstrand, Alissa M.; Northrup, Hope; Hakeman, Matthew; Dierdorff, Jason M.; Yung, Christina K.; Long, Abby; Brouillette, Rachel B.; Au, Kit Sing; Gurnett, Christina; Houston, Douglas W.; Cornell, Robert A.; Manak, J. Robert

    2013-01-01

    Neural tube defects (NTDs) are common birth defects of complex etiology. Family and population-based studies have confirmed a genetic component to NTDs. However, despite more than three decades of research, the genes involved in human NTDs remain largely unknown. We tested the hypothesis that rare copy number variants (CNVs), especially de novo germline CNVs, are a significant risk factor for NTDs. We used array-based comparative genomic hybridization (aCGH) to identify rare CNVs in 128 Caucasian and 61 Hispanic patients with non-syndromic lumbar-sacral myelomeningocele. We also performed aCGH analysis on the parents of affected individuals with rare CNVs where parental DNA was available (42 sets). Among the eight de novo CNVs that we identified, three generated copy number changes of entire genes. One large heterozygous deletion removed 27 genes, including PAX3, a known spina bifida-associated gene. A second CNV altered genes (PGPD8, ZC3H6) for which little is known regarding function or expression. A third heterozygous deletion removed GPC5 and part of GPC6, genes encoding glypicans. Glypicans are proteoglycans that modulate the activity of morphogens such as Sonic Hedgehog (SHH) and bone morphogenetic proteins (BMPs), both of which have been implicated in NTDs. Additionally, glypicans function in the planar cell polarity (PCP) pathway, and several PCP genes have been associated with NTDs. Here, we show that GPC5 orthologs are expressed in the neural tube, and that inhibiting their expression in frog and fish embryos results in NTDs. These results implicate GPC5 as a gene required for normal neural tube development. PMID:23223018

  16. Dietary starch intake modifies the relation between copy number variation in the salivary amylase gene and BMI.

    Science.gov (United States)

    Rukh, Gull; Ericson, Ulrika; Andersson-Assarsson, Johanna; Orho-Melander, Marju; Sonestedt, Emily

    2017-07-01

    Background: Studies have shown conflicting associations between the salivary amylase gene (AMY1) copy number and obesity. Salivary amylase initiates starch digestion in the oral cavity; starch is a major source of energy in the diet.Objective: We investigated the association between AMY1 copy number and obesity traits, and the effect of the interaction between AMY1 copy number and starch intake on these obesity traits.Design: We first assessed the association between AMY1 copy number (genotyped by digital droplet polymerase chain reaction) and obesity traits in 4800 individuals without diabetes (mean age: 57 y; 60% female) from the Malmö Diet and Cancer Cohort. Then we analyzed interactions between AMY1 copy number and energy-adjusted starch intake (obtained by a modified diet history method) on body mass index (BMI) and body fat percentage.Results:AMY1 copy number was not associated with BMI (P = 0.80) or body fat percentage (P = 0.38). We observed a significant effect of the interaction between AMY1 copy number and starch intake on BMI (P-interaction = 0.007) and body fat percentage (P-interaction = 0.03). Upon stratification by dietary starch intake, BMI tended to decrease with increasing AMY1 copy numbers in the low-starch intake group (P = 0.07) and tended to increase with increasing AMY1 copy numbers in the high-starch intake group (P = 0.08). The lowest mean BMI was observed in the group of participants with a low AMY1 copy number and a high dietary intake of starch.Conclusions: Our findings suggest an effect of the interaction between starch intake and AMY1 copy number on obesity. Individuals with high starch intake but low genetic capacity to digest starch had the lowest BMI, potentially because larger amounts of undigested starch are transported through the gastrointestinal tract, contributing to fewer calories extracted from ingested starch. © 2017 American Society for Nutrition.

  17. Analysis of Copy Number Variations in Patients with Autism Using Cytogenetic and MLPA Techniques: Report of 16p13.1p13.3 and 10q26.3 Duplications

    Science.gov (United States)

    Ghasemi Firouzabadi, Saghar; Vameghi, Roshanak; Kariminejad, Roxana; Darvish, Hossein; Banihashemi, Susan; Firouzkouhi Moghaddam, Mahboubeh; Jamali, Peyman; Farbod Mofidi Tehrani, Hassan; Dehghani, Hossein; Raeisoon, Mohammad Reza; Narooie-Nejad, Mehrnaz; Jamshidi, Javad; Tafakhori, Abbas; Sadabadi, Saeid; Behjati, Farkhondeh

    2016-01-01

    Autism is a common neuropsychiatric disorder affecting 1 in 68 children. Copy number variations (CNVs) are known to be major contributors of autism spectrum disorder (ASD). There are different whole genome or targeted techniques to identify CNVs in the patients including karyotyping, multiplex ligation-dependent probe amplification (MLPA) and array CGH. In this study, we used karyotyping and MLPA to detect CNVs in 50 Iranian patients with autism. GTG banding and 4 different MLPA kits (2 subtelomeric and 2 autism kits) were utilized. To elevate our detection rate, we selected the sporadic patients who had additional clinical features including intellectual disability, seizure, attention deficit hyperactivity disorder, and abnormal head circumference. Two out of 50 patients (4%) showed microscopic chromosome abnormalities and 5 out of 50 (10%) demonstrated copy number gains or losses using MLPA kits. Including one overlapping result between karyotype and MLPA techniques, our overall detection rate was 6 out of 50 (12%). Three out of 6 CNVs were de novo and three others were paternally inherited. Two of CNVs detected by karyotyping and MLPA tests were 16p13.1q13.3 and 10q26.3 duplications, respectively. For these two CNVs genotype and phenotype of the patients were compared with other studies. Although the pathogenicity of cytogenetic results was certain, most of MLPA results needed to be better refined using other more accurate techniques such as array CGH. Our findings suggest that it might be possible to obtain some useful information using MLPA technique but it cannot be used as a single diagnostic tool for the autism. PMID:28357200

  18. Deletion/duplication mutation screening of TP53 gene in patients with transitional cell carcinoma of urinary bladder using multiplex ligation-dependent probe amplification.

    Science.gov (United States)

    Bazrafshani, Mohammad Reza R; Nowshadi, Pouriaali A; Shirian, Sadegh; Daneshbod, Yahya; Nabipour, Fatemeh; Mokhtari, Maral; Hosseini, Fatemehsadat; Dehghan, Somayeh; Saeedzadeh, Abolfazl; Mosayebi, Ziba

    2016-02-01

    Bladder cancer is a molecular disease driven by the accumulation of genetic, epigenetic, and environmental factors. The aim of this study was to detect the deletions/duplication mutations in TP53 gene exons using multiplex ligation-dependent probe amplification (MLPA) method in the patients with transitional cell carcinoma (TCC). The achieved formalin-fixed paraffin-embedded tissues from 60 patients with TCC of bladder were screened for exonal deletions or duplications of every 12 TP53 gene exons using MLPA. The pathological sections were examined by three pathologists and categorized according to the WHO scoring guideline as 18 (30%) grade I, 22 (37%) grade II, 13 (22%) grade III, and 7 (11%) grade IV cases of TCC. None mutation changes of TP53 gene were detected in 24 (40%) of the patients. Furthermore, mutation changes including, 15 (25%) deletion, 17 (28%) duplication, and 4 (7%) both deletion and duplication cases were observed among 60 samples. From 12 exons of TP53 gene, exon 1 was more subjected to exonal deletion. Deletion of exon 1 of TP53 gene has occurred in 11 (35.4%) patients with TCC. In general, most mutations of TP53, either deletion or duplication, were found in exon 1, which was statistically significant. In addition, no relation between the TCC tumor grade and any type of mutation were observed in this research. MLPA is a simple and efficient method to analyze genomic deletions and duplications of all 12 exons of TP53 gene. The finding of this report that most of the mutations of TP53 occur in exon 1 is in contrast to that of the other reports suggesting that exons 5-8 are the most (frequently) mutated exons of TP53 gene. The mutations of exon 1 of TP53 gene may play an important role in the tumorogenesis of TCC.

  19. Expression of epithelial-mesenchymal transition-related genes increases with copy number in multiple cancer types.

    Science.gov (United States)

    Zhao, Min; Liu, Yining; Qu, Hong

    2016-04-26

    Epithelial-mesenchymal transition (EMT) is a cellular process through which epithelial cells transform into mesenchymal cells. EMT-implicated genes initiate and promote cancer metastasis because mesenchymal cells have greater invasive and migration capacities than epithelial cells. In this pan-cancer analysis, we explored the relationship between gene expression changes and copy number variations (CNVs) for EMT-implicated genes. Based on curated 377 EMT-implicated genes from the literature, we identified 212 EMT-implicated genes associated with more frequent copy number gains (CNGs) than copy number losses (CNLs) using data from The Cancer Genome Atlas (TCGA). Then by correlating these CNV data with TCGA gene expression data, we identified 71 EMT-implicated genes with concordant CNGs and gene up-regulation in 20 or more tumor samples. Of those, 14 exhibited such concordance in over 110 tumor samples. These 14 genes were predominantly apoptosis regulators, which may implies that apoptosis is critical during EMT. Moreover, the 71 genes with concordant CNG and up-regulation were largely involved in cellular functions such as phosphorylation cascade signaling. This is the first observation of concordance between CNG and up-regulation of specific genes in hundreds of samples, which may indicate that somatic CNGs activate gene expression by increasing the gene dosage.

  20. Genomic Copy Number Dictates a Gene-Independent Cell Response to CRISPR/Cas9 Targeting | Office of Cancer Genomics

    Science.gov (United States)

    The CRISPR/Cas9 system enables genome editing and somatic cell genetic screens in mammalian cells. We performed genome-scale loss-of-function screens in 33 cancer cell lines to identify genes essential for proliferation/survival and found a strong correlation between increased gene copy number and decreased cell viability after genome editing. Within regions of copy-number gain, CRISPR/Cas9 targeting of both expressed and unexpressed genes, as well as intergenic loci, led to significantly decreased cell proliferation through induction of a G2 cell-cycle arrest.

  1. The nuclear OXPHOS genes in insecta: a common evolutionary origin, a common cis-regulatory motif, a common destiny for gene duplicates

    Directory of Open Access Journals (Sweden)

    Pesole Graziano

    2007-11-01

    Full Text Available Abstract Background When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG has played a key role in the evolution of the insect OXPHOS genes; it is constantly conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich gene, and suggest that changes in the pattern of expression of the gene facilitate the fixation of duplications in the genome and the development of novel genetic functions.

  2. Genomic Copy Number Variations of the Complement Component C4B Gene Are Associated With Chronic Central Serous Chorioretinopathy

    NARCIS (Netherlands)

    Breukink, M.B.; Schellevis, R.L.; Boon, C.J.F.; Fauser, S.; Hoyng, C.B.; Hollander, A.I. den; Jong, E.K.

    2015-01-01

    PURPOSE: Chronic central serous chorioretinopathy (cCSC) has recently been associated to variants in the complement factor H gene. To further investigate the role of the complement system in cCSC, the genomic copy number variations in the complement component 4 gene (C4) were studied. METHODS: C4A

  3. Genomic Copy Number Variations of the Complement Component C4B Gene Are Associated With Chronic Central Serous Chorioretinopathy

    NARCIS (Netherlands)

    Breukink, M.B.; Schellevis, R.L.; Boon, C.J.F.; Fauser, S.; Hoyng, C.B.; Hollander, A.I. den; Jong, E.K.

    2015-01-01

    PURPOSE: Chronic central serous chorioretinopathy (cCSC) has recently been associated to variants in the complement factor H gene. To further investigate the role of the complement system in cCSC, the genomic copy number variations in the complement component 4 gene (C4) were studied. METHODS: C4A a

  4. The fate of the duplicated androgen receptor in fishes: a late neofunctionalization event?

    Directory of Open Access Journals (Sweden)

    Haendler Bernard

    2008-12-01

    Full Text Available Abstract Background Based on the observation of an increased number of paralogous genes in teleost fishes compared with other vertebrates and on the conserved synteny between duplicated copies, it has been shown that a whole genome duplication (WGD occurred during the evolution of Actinopterygian fish. Comparative phylogenetic dating of this duplication event suggests that it occurred early on, specifically in teleosts. It has been proposed that this event might have facilitated the evolutionary radiation and the phenotypic diversification of the teleost fish, notably by allowing the sub- or neo-functionalization of many duplicated genes. Results In this paper, we studied in a wide range of Actinopterygians the duplication and fate of the androgen receptor (AR, NR3C4, a nuclear receptor known to play a key role in sex-determination in vertebrates. The pattern of AR gene duplication is consistent with an early WGD event: it has been duplicated into two genes AR-A and AR-B after the split of the Acipenseriformes from the lineage leading to teleost fish but before the divergence of Osteoglossiformes. Genomic and syntenic analyses in addition to lack of PCR amplification show that one of the duplicated copies, AR-B, was lost in several basal Clupeocephala such as Cypriniformes (including the model species zebrafish, Siluriformes, Characiformes and Salmoniformes. Interestingly, we also found that, in basal teleost fish (Osteoglossiformes and Anguilliformes, the two copies remain very similar, whereas, specifically in Percomorphs, one of the copies, AR-B, has accumulated substitutions in both the ligand binding domain (LBD and the DNA binding domain (DBD. Conclusion The comparison of the mutations present in these divergent AR-B with those known in human to be implicated in complete, partial or mild androgen insensitivity syndrome suggests that the existence of two distinct AR duplicates may be correlated to specific functional differences that may be

  5. Duplication at Xq13.3-q21.1 with syndromic intellectual disability, a probable role for the ATRX gene.

    Science.gov (United States)

    Martínez, Francisco; Roselló, Mónica; Mayo, Sonia; Monfort, Sandra; Oltra, Silvestre; Orellana, Carmen

    2014-04-01

    Here we report on two unrelated male patients with syndromic intellectual disability (ID) due to duplication at Xq13.3-q21.1, a region of about 6 Mb and 25 genes. Among these, the most outstanding is ATRX, the causative gene of X-linked alpha-thalassemia/mental retardation. ATRX belongs to the growing list of genes implied in chromatin remodeling causing ID. Many these genes, such as MECP2, are dose-sensitive so that not only deletions and point mutations, but also duplications cause ID. Both patients have severe ID, absent expressive speech, early hypotonia, behavior problems (hyperactivity, repetitive self-stimulatory behavior), postnatal growth deficiency, microcephaly, micrognathia, cryptorchidism, low-set, posteriorly angulated ears, and downslanting palpebral fissures. These findings are also usually present among patients with loss-of-function mutations of the ATRX gene. Completely skewed X inactivation was observed in the only informative carrier mother, a constant finding among female carriers of inactivating point mutations of this gene. Participation of other duplicated genes cannot be excluded; nevertheless we propose that the increased dosage of ATRX is the major pathogenic mechanism of this X-linked disorder, a syndrome reminiscent of MECP2 duplication.

  6. Translocations used to generate chromosome segment duplications in Neurospora can disrupt genes and create novel open reading frames

    Indian Academy of Sciences (India)

    Parmit K Singh; Srividhya V Iyer; T Naga Sowjanya; B Kranthi Raj; Durgadas P Kasbekar

    2010-12-01

    In Neurospora crassa, crosses between normal sequence strains and strains bearing some translocations can yield progeny bearing a duplication (Dp) of the translocated chromosome segment. Here, 30 breakpoint junction sequences of 12 Dp-generating translocations were determined. The breakpoints disrupted 13 genes (including predicted genes), and created 10 novel open reading frames. Insertion of sequences from LG III into LG I as translocation T(UK818) disrupts the eat-3 gene, which is the ortholog of the Podospora anserine gene ami1. Since ami1-homozygous Podospora crosses were reported to increase the frequency of repeat-induced point mutation (RIP), we performed crosses homozygous for a deficiency in eat-3 to test for a corresponding increase in RIP frequency. However, our results suggested that, unlike in Podospora, the eat-3 gene might be essential for ascus development in Neurospora. Duplication–heterozygous crosses are generally barren in Neurospora; however, by using molecular probes developed in this study, we could identify Dp segregants from two different translocation–heterozygous crosses, and using these we found that the barren phenotype of at least some duplication–heterozygous crosses was incompletely penetrant.

  7. Selective regain of egfr gene copies in CD44+/CD24-/low breast cancer cellular model MDA-MB-468

    Directory of Open Access Journals (Sweden)

    Andreas Antje

    2010-03-01

    Full Text Available Abstract Background Increased transcription of oncogenes like the epidermal growth factor receptor (EGFR is frequently caused by amplification of the whole gene or at least of regulatory sequences. Aim of this study was to pinpoint mechanistic parameters occurring during egfr copy number gains leading to a stable EGFR overexpression and high sensitivity to extracellular signalling. A deeper understanding of those marker events might improve early diagnosis of cancer in suspect lesions, early detection of cancer progression and the prediction of egfr targeted therapies. Methods The basal-like/stemness type breast cancer cell line subpopulation MDA-MB-468 CD44high/CD24-/low, carrying high egfr amplifications, was chosen as a model system in this study. Subclones of the heterogeneous cell line expressing low and high EGF receptor densities were isolated by cell sorting. Genomic profiling was carried out for these by means of SNP array profiling, qPCR and FISH. Cell cycle analysis was performed using the BrdU quenching technique. Results Low and high EGFR expressing MDA-MB-468 CD44+/CD24-/low subpopulations separated by cell sorting showed intermediate and high copy numbers of egfr, respectively. However, during cell culture an increase solely for egfr gene copy numbers in the intermediate subpopulation occurred. This shift was based on the formation of new cells which regained egfr gene copies. By two parametric cell cycle analysis clonal effects mediated through growth advantage of cells bearing higher egfr gene copy numbers could most likely be excluded for being the driving force. Subsequently, the detection of a fragile site distal to the egfr gene, sustaining uncapped telomere-less chromosomal ends, the ladder-like structure of the intrachromosomal egfr amplification and a broader range of egfr copy numbers support the assumption that dynamic chromosomal rearrangements, like breakage-fusion-bridge-cycles other than proliferation drive the gain

  8. Association between the SMN2 gene copy number and clinical characteristics of patients with spinal muscular atrophy with homozygous deletion of exon 7 of the SMN1 gene

    Directory of Open Access Journals (Sweden)

    Žarkov Marija

    2015-01-01

    Full Text Available Background/Aim. Spinal muscular atrophy (SMA is an autosomal recessive disease characterized by degeneration of alpha motor neurons in the spinal cord and the medulla oblongata, causing progressive muscle weakness and atrophy. The aim of this study was to determine association between the SMN2 gene copy number and disease phenotype in Serbian patients with SMA with homozygous deletion of exon 7 of the SMN1 gene. Methods. The patients were identified using regional Serbian hospital databases. Investigated clinical characteristics of the disease were: patients’ gender, age at disease onset, achieved and current developmental milestones, disease duration, current age, and the presence of the spinal deformities and joint contractures. The number of SMN1 and SMN2 gene copies was determined using real-time polymerase chain reaction (PCR. Results. Among 43 identified patients, 37 (86.0% showed homozygous deletion of SMN1 exon 7. One (2.7% of 37 patients had SMA type I with 3 SMN2 copies, 11 (29.7% patients had SMA type II with 3.1 ± 0.7 copies, 17 (45.9% patients had SMA type III with 3.7 ± 0.9 copies, while 8 (21.6% patients had SMA type IV with 4.2 ± 0.9 copies. There was a progressive increase in the SMN2 gene copy number from type II towards type IV (p < 0.05. A higher SMN2 gene copy number was associated with better current motor performance (p < 0.05. Conclusion. In the Serbian patients with SMA, a higher SMN2 gene copy number correlated with less severe disease phenotype. A possible effect of other phenotype modifiers should not be neglected.

  9. Autopolyploidy genome duplication preserves other ancient genome duplications in Atlantic salmon (Salmo salar)

    Science.gov (United States)

    Davidson, William S.

    2017-01-01

    Salmonids (e.g. Atlantic salmon, Pacific salmon, and trouts) have a long legacy of genome duplication. In addition to three ancient genome duplications that all teleosts are thought to share, salmonids have had one additional genome duplication. We explored a methodology for untangling these duplications from each other to better understand them in Atlantic salmon. In this methodology, homeologous regions (paralogous/duplicated genomic regions originating from a whole genome duplication) from the most recent genome duplication were assumed to have duplicated genes at greater density and have greater sequence similarity. This assumption was used to differentiate duplicated gene pairs in Atlantic salmon that are either from the most recent genome duplication or from earlier duplications. From a comparison with multiple vertebrate species, it is clear that Atlantic salmon have retained more duplicated genes from ancient genome duplications than other vertebrates--often at higher density in the genome and containing fewer synonymous mutations. It may be that polysomic inheritance is the mechanism responsible for maintaining ancient gene duplicates in salmonids. Polysomic inheritance (when multiple chromosomes pair during meiosis) is thought to be relatively common in salmonids compared to other vertebrate species. These findings illuminate how genome duplications may not only increase the number of duplicated genes, but may also be involved in the maintenance of them from previous genome duplications as well. PMID:28241055

  10. Yersinia spp. Identification Using Copy Diversity in the Chromosomal 16S rRNA Gene Sequence.

    Science.gov (United States)

    Hao, Huijing; Liang, Junrong; Duan, Ran; Chen, Yuhuang; Liu, Chang; Xiao, Yuchun; Li, Xu; Su, Mingming; Jing, Huaiqi; Wang, Xin

    2016-01-01

    API 20E strip test, the standard for Enterobacteriaceae identification, is not sufficient to discriminate some Yersinia species for some unstable biochemical reactions and the same biochemical profile presented in some species, e.g. Yersinia ferderiksenii and Yersinia intermedia, which need a variety of molecular biology methods as auxiliaries for identification. The 16S rRNA gene is considered a valuable tool for assigning bacterial strains to species. However, the resolution of the 16S rRNA gene may be insufficient for discrimination because of the high similarity of sequences between some specie