WorldWideScience

Sample records for gene family evolution

  1. Evolution of the vertebrate insulin receptor substrate (Irs) gene family.

    Science.gov (United States)

    Al-Salam, Ahmad; Irwin, David M

    2017-06-23

    Insulin receptor substrate (Irs) proteins are essential for insulin signaling as they allow downstream effectors to dock with, and be activated by, the insulin receptor. A family of four Irs proteins have been identified in mice, however the gene for one of these, IRS3, has been pseudogenized in humans. While it is known that the Irs gene family originated in vertebrates, it is not known when it originated and which members are most closely related to each other. A better understanding of the evolution of Irs genes and proteins should provide insight into the regulation of metabolism by insulin. Multiple genes for Irs proteins were identified in a wide variety of vertebrate species. Phylogenetic and genomic neighborhood analyses indicate that this gene family originated very early in vertebrae evolution. Most Irs genes were duplicated and retained in fish after the fish-specific genome duplication. Irs genes have been lost of various lineages, including Irs3 in primates and birds and Irs1 in most fish. Irs3 and Irs4 experienced an episode of more rapid protein sequence evolution on the ancestral mammalian lineage. Comparisons of the conservation of the proteins sequences among Irs paralogs show that domains involved in binding to the plasma membrane and insulin receptors are most strongly conserved, while divergence has occurred in sequences involved in interacting with downstream effector proteins. The Irs gene family originated very early in vertebrate evolution, likely through genome duplications, and in parallel with duplications of other components of the insulin signaling pathway, including insulin and the insulin receptor. While the N-terminal sequences of these proteins are conserved among the paralogs, changes in the C-terminal sequences likely allowed changes in biological function.

  2. Molecular cloning of RBCS genes in Selaginella and the evolution of the rbcS gene family

    Directory of Open Access Journals (Sweden)

    Wang Bo

    2015-01-01

    Full Text Available Rubisco small subunits (RBCS are encoded by a nuclear rbcS multigene family in higher plants and green algae. However, owing to the lack of rbcS sequences in lycophytes, the characteristics of rbcS genes in lycophytes is unclear. Recently, the complete genome sequence of the lycophyte Selaginella moellendorffii provided the first insight into the rbcS gene family in lycophytes. To understand further the characteristics of rbcS genes in other Selaginella, the full length of rbcS genes (rbcS1 and rbcS2 from two other Selaginella species were isolated. Both rbcS1 and rbcS2 genes shared more than 97% identity among three Selaginella species. RBCS proteins from Selaginella contained the Pfam RBCS domain F00101, which was a major domain of other plant RBCS proteins. To explore the evolution of the rbcS gene family across Selaginella and other plants, we identified and performed comparative analysis of the rbcS gene family among 16 model plants based on a genome-wide analysis. The results showed that (i two rbcS genes were obtained in Selaginella, which is the second fewest number of rbcS genes among the 16 representative plants; (ii an expansion of rbcS genes occurred in the moss Physcomitrella patens; (iii only RBCS proteins from angiosperms contained the Pfam PF12338 domains, and (iv a pattern of concerted evolution existed in the rbcS gene family. Our study provides new insights into the evolution of the rbcS gene family in Selaginella and other plants.

  3. Molecular evolution of the major chemosensory gene families in insects.

    Science.gov (United States)

    Sánchez-Gracia, A; Vieira, F G; Rozas, J

    2009-09-01

    Chemoreception is a crucial biological process that is essential for the survival of animals. In insects, olfaction allows the organism to recognise volatile cues that allow the detection of food, predators and mates, whereas the sense of taste commonly allows the discrimination of soluble stimulants that elicit feeding behaviours and can also initiate innate sexual and reproductive responses. The most important proteins involved in the recognition of chemical cues comprise moderately sized multigene families. These families include odorant-binding proteins (OBPs) and chemosensory proteins (CSPs), which are involved in peripheral olfactory processing, and the chemoreceptor superfamily formed by the olfactory receptor (OR) and gustatory receptor (GR) families. Here, we review some recent evolutionary genomic studies of chemosensory gene families using the data from fully sequenced insect genomes, especially from the 12 newly available Drosophila genomes. Overall, the results clearly support the birth-and-death model as the major mechanism of evolution in these gene families. Namely, new members arise by tandem gene duplication, progressively diverge in sequence and function, and can eventually be lost from the genome by a deletion or pseudogenisation event. Adaptive changes fostered by environmental shifts are also observed in the evolution of chemosensory families in insects and likely involve reproductive, ecological or behavioural traits. Consequently, the current size of these gene families is mainly a result of random gene gain and loss events. This dynamic process may represent a major source of genetic variation, providing opportunities for FUTURE specific adaptations.

  4. Saltatory Evolution of the Ectodermal Neural Cortex Gene Family at the Vertebrate Origin

    Science.gov (United States)

    Feiner, Nathalie; Murakami, Yasunori; Breithut, Lisa; Mazan, Sylvie; Meyer, Axel; Kuraku, Shigehiro

    2013-01-01

    The ectodermal neural cortex (ENC) gene family, whose members are implicated in neurogenesis, is part of the kelch repeat superfamily. To date, ENC genes have been identified only in osteichthyans, although other kelch repeat-containing genes are prevalent throughout bilaterians. The lack of elaborate molecular phylogenetic analysis with exhaustive taxon sampling has obscured the possible link of the establishment of this gene family with vertebrate novelties. In this study, we identified ENC homologs in diverse vertebrates by means of database mining and polymerase chain reaction screens. Our analysis revealed that the ENC3 ortholog was lost in the basal eutherian lineage through single-gene deletion and that the triplication between ENC1, -2, and -3 occurred early in vertebrate evolution. Including our original data on the catshark and the zebrafish, our comparison revealed high conservation of the pleiotropic expression pattern of ENC1 and shuffling of expression domains between ENC1, -2, and -3. Compared with many other gene families including developmental key regulators, the ENC gene family is unique in that conventional molecular phylogenetic inference could identify no obvious invertebrate ortholog. This suggests a composite nature of the vertebrate-specific gene repertoire, consisting not only of de novo genes introduced at the vertebrate origin but also of long-standing genes with no apparent invertebrate orthologs. Some of the latter, including the ENC gene family, may be too rapidly evolving to provide sufficient phylogenetic signals marking orthology to their invertebrate counterparts. Such gene families that experienced saltatory evolution likely remain to be explored and might also have contributed to phenotypic evolution of vertebrates. PMID:23843192

  5. Extensive lineage-specific gene duplication and evolution of the spiggin multi-gene family in stickleback

    Directory of Open Access Journals (Sweden)

    Nishida Mutsumi

    2007-11-01

    Full Text Available Abstract Background The threespine stickleback (Gasterosteus aculeatus has a characteristic reproductive mode; mature males build nests using a secreted glue-like protein called spiggin. Although recent studies reported multiple occurrences of genes that encode this glue-like protein spiggin in threespine and ninespine sticklebacks, it is still unclear how many genes compose the spiggin multi-gene family. Results Genome sequence analysis of threespine stickleback showed that there are at least five spiggin genes and two pseudogenes, whereas a single spiggin homolog occurs in the genomes of other fishes. Comparative genome sequence analysis demonstrated that Muc19, a single-copy mucous gene in human and mouse, is an ortholog of spiggin. Phylogenetic and molecular evolutionary analyses of these sequences suggested that an ancestral spiggin gene originated from a member of the mucin gene family as a single gene in the common ancestor of teleosts, and gene duplications of spiggin have occurred in the stickleback lineage. There was inter-population variation in the copy number of spiggin genes and positive selection on some codons, indicating that additional gene duplication/deletion events and adaptive evolution at some amino acid sites may have occurred in each stickleback population. Conclusion A number of spiggin genes exist in the threespine stickleback genome. Our results provide insight into the origin and dynamic evolutionary process of the spiggin multi-gene family in the threespine stickleback lineage. The dramatic evolution of genes for mucous substrates may have contributed to the generation of distinct characteristics such as "bio-glue" in vertebrates.

  6. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Baumgarten Andrew

    2004-06-01

    Full Text Available Abstract Background Most genes in Arabidopsis thaliana are members of gene families. How do the members of gene families arise, and how are gene family copy numbers maintained? Some gene families may evolve primarily through tandem duplication and high rates of birth and death in clusters, and others through infrequent polyploidy or large-scale segmental duplications and subsequent losses. Results Our approach to understanding the mechanisms of gene family evolution was to construct phylogenies for 50 large gene families in Arabidopsis thaliana, identify large internal segmental duplications in Arabidopsis, map gene duplications onto the segmental duplications, and use this information to identify which nodes in each phylogeny arose due to segmental or tandem duplication. Examples of six gene families exemplifying characteristic modes are described. Distributions of gene family sizes and patterns of duplication by genomic distance are also described in order to characterize patterns of local duplication and copy number for large gene families. Both gene family size and duplication by distance closely follow power-law distributions. Conclusions Combining information about genomic segmental duplications, gene family phylogenies, and gene positions provides a method to evaluate contributions of tandem duplication and segmental genome duplication in the generation and maintenance of gene families. These differences appear to correspond meaningfully to differences in functional roles of the members of the gene families.

  7. The IQD gene family in soybean: structure, phylogeny, evolution and expression.

    Directory of Open Access Journals (Sweden)

    Lin Feng

    Full Text Available Members of the plant-specific IQ67-domain (IQD protein family are involved in plant development and the basal defense response. Although systematic characterization of this family has been carried out in Arabidopsis, tomato (Solanum lycopersicum, Brachypodium distachyon and rice (Oryza sativa, systematic analysis and expression profiling of this gene family in soybean (Glycine max have not previously been reported. In this study, we identified and structurally characterized IQD genes in the soybean genome. A complete set of 67 soybean IQD genes (GmIQD1-67 was identified using Blast search tools, and the genes were clustered into four subfamilies (IQD I-IV based on phylogeny. These soybean IQD genes are distributed unevenly across all 20 chromosomes, with 30 segmental duplication events, suggesting that segmental duplication has played a major role in the expansion of the soybean IQD gene family. Analysis of the Ka/Ks ratios showed that the duplicated genes of the GmIQD family primarily underwent purifying selection. Microsynteny was detected in most pairs: genes in clade 1-3 might be present in genome regions that were inverted, expanded or contracted after the divergence; most gene pairs in clade 4 showed high conservation with little rearrangement among these gene-residing regions. Of the soybean IQD genes examined, six were most highly expressed in young leaves, six in flowers, one in roots and two in nodules. Our qRT-PCR analysis of 24 soybean IQD III genes confirmed that these genes are regulated by MeJA stress. Our findings present a comprehensive overview of the soybean IQD gene family and provide insights into the evolution of this family. In addition, this work lays a solid foundation for further experiments aimed at determining the biological functions of soybean IQD genes in growth and development.

  8. Early evolution of the LIM homeobox gene family

    Energy Technology Data Exchange (ETDEWEB)

    Srivastava, Mansi; Larroux, Claire; Lu, Daniel R; Mohanty, Kareshma; Chapman, Jarrod; Degnan, Bernard M; Rokhsar, Daniel S

    2010-01-01

    LIM homeobox (Lhx) transcription factors are unique to the animal lineage and have patterning roles during embryonic development in flies, nematodes and vertebrates, with a conserved role in specifying neuronal identity. Though genes of this family have been reported in a sponge and a cnidarian, the expression patterns and functions of the Lhx family during development in non-bilaterian phyla are not known. We identified Lhx genes in two cnidarians and a placozoan and report the expression of Lhx genes during embryonic development in Nematostella and the demosponge Amphimedon. Members of the six major LIM homeobox subfamilies are represented in the genomes of the starlet sea anemone, Nematostella vectensis, and the placozoan Trichoplax adhaerens. The hydrozoan cnidarian, Hydra magnipapillata, has retained four of the six Lhx subfamilies, but apparently lost two others. Only three subfamilies are represented in the haplosclerid demosponge Amphimedon queenslandica. A tandem cluster of three Lhx genes of different subfamilies and a gene containing two LIM domains in the genome of T. adhaerens (an animal without any neurons) indicates that Lhx subfamilies were generated by tandem duplication. This tandem cluster in Trichoplax is likely a remnant of the original chromosomal context in which Lhx subfamilies first appeared. Three of the six Trichoplax Lhx genes are expressed in animals in laboratory culture, as are all Lhx genes in Hydra. Expression patterns of Nematostella Lhx genes correlate with neural territories in larval and juvenile polyp stages. In the aneural demosponge, A. queenslandica, the three Lhx genes are expressed widely during development, including in cells that are associated with the larval photosensory ring. The Lhx family expanded and diversified early in animal evolution, with all six subfamilies already diverged prior to the cnidarian-placozoan-bilaterian last common ancestor. In Nematostella, Lhx gene expression is correlated with neural

  9. Early evolution of the LIM homeobox gene family

    Directory of Open Access Journals (Sweden)

    Degnan Bernard M

    2010-01-01

    Full Text Available Abstract Background LIM homeobox (Lhx transcription factors are unique to the animal lineage and have patterning roles during embryonic development in flies, nematodes and vertebrates, with a conserved role in specifying neuronal identity. Though genes of this family have been reported in a sponge and a cnidarian, the expression patterns and functions of the Lhx family during development in non-bilaterian phyla are not known. Results We identified Lhx genes in two cnidarians and a placozoan and report the expression of Lhx genes during embryonic development in Nematostella and the demosponge Amphimedon. Members of the six major LIM homeobox subfamilies are represented in the genomes of the starlet sea anemone, Nematostella vectensis, and the placozoan Trichoplax adhaerens. The hydrozoan cnidarian, Hydra magnipapillata, has retained four of the six Lhx subfamilies, but apparently lost two others. Only three subfamilies are represented in the haplosclerid demosponge Amphimedon queenslandica. A tandem cluster of three Lhx genes of different subfamilies and a gene containing two LIM domains in the genome of T. adhaerens (an animal without any neurons indicates that Lhx subfamilies were generated by tandem duplication. This tandem cluster in Trichoplax is likely a remnant of the original chromosomal context in which Lhx subfamilies first appeared. Three of the six Trichoplax Lhx genes are expressed in animals in laboratory culture, as are all Lhx genes in Hydra. Expression patterns of Nematostella Lhx genes correlate with neural territories in larval and juvenile polyp stages. In the aneural demosponge, A. queenslandica, the three Lhx genes are expressed widely during development, including in cells that are associated with the larval photosensory ring. Conclusions The Lhx family expanded and diversified early in animal evolution, with all six subfamilies already diverged prior to the cnidarian-placozoan-bilaterian last common ancestor. In

  10. Chromosomal evolution of the PKD1 gene family in primates

    Directory of Open Access Journals (Sweden)

    Krawczak Michael

    2008-09-01

    Full Text Available Abstract Background The autosomal dominant polycystic kidney disease (ADPKD is mostly caused by mutations in the PKD1 (polycystic kidney disease 1 gene located in 16p13.3. Moreover, there are six pseudogenes of PKD1 that are located proximal to the master gene in 16p13.1. In contrast, no pseudogene could be detected in the mouse genome, only a single copy gene on chromosome 17. The question arises how the human situation originated phylogenetically. To address this question we applied comparative FISH-mapping of a human PKD1-containing genomic BAC clone and a PKD1-cDNA clone to chromosomes of a variety of primate species and the dog as a non-primate outgroup species. Results Comparative FISH with the PKD1-cDNA clone clearly shows that in all primate species studied distinct single signals map in subtelomeric chromosomal positions orthologous to the short arm of human chromosome 16 harbouring the master PKD1 gene. Only in human and African great apes, but not in orangutan, FISH with both BAC and cDNA clones reveals additional signal clusters located proximal of and clearly separated from the PKD1 master genes indicating the chromosomal position of PKD1 pseudogenes in 16p of these species, respectively. Indeed, this is in accordance with sequencing data in human, chimpanzee and orangutan. Apart from the master PKD1 gene, six pseudogenes are identified in both, human and chimpanzee, while only a single-copy gene is present in the whole-genome sequence of orangutan. The phylogenetic reconstruction of the PKD1-tree reveals that all human pseudogenes are closely related to the human PKD1 gene, and all chimpanzee pseudogenes are closely related to the chimpanzee PKD1 gene. However, our statistical analyses provide strong indication that gene conversion events may have occurred within the PKD1 family members of human and chimpanzee, respectively. Conclusion PKD1 must have undergone amplification very recently in hominid evolution. Duplicative

  11. Molecular Evolution and Expression Divergence of HMT Gene Family in Plants

    Directory of Open Access Journals (Sweden)

    Man Zhao

    2018-04-01

    Full Text Available Homocysteine methyltransferase (HMT converts homocysteine to methionine using S-methylmethionine (SMM or S-adenosylmethionine (SAM as methyl donors in organisms, playing an important role in supplying methionine for the growth and the development of plants. To better understand the functions of the HMT genes in plants, we conducted a wide evolution and expression analysis of these genes. Reconstruction of the phylogenetic relationship showed that the HMT gene family was divided into Class 1 and Class 2. In Class 1, HMTs were only found in seed plants, while Class 2 presented in all land plants, which hinted that the HMT genes might have diverged in seed plants. The analysis of gene structures and selection pressures showed that they were relatively conserved during evolution. However, type I functional divergence had been detected in the HMTs. Furthermore, the expression profiles of HMTs showed their distinct expression patterns in different tissues, in which some HMTs were widely expressed in various organs, whereas the others were highly expressed in some specific organs, such as seeds or leaves. Therefore, according to our results in the evolution, functional divergence, and expression, the HMT genes might have diverged during evolution. Further analysis in the expression patterns of AthHMTs with their methyl donors suggested that the diverged HMTs might be related to supply methionine for the development of plant seeds.

  12. Molecular evolution of the keratin associated protein gene family in mammals, role in the evolution of mammalian hair.

    Science.gov (United States)

    Wu, Dong-Dong; Irwin, David M; Zhang, Ya-Ping

    2008-08-23

    Hair is unique to mammals. Keratin associated proteins (KRTAPs), which contain two major groups: high/ultrahigh cysteine and high glycine-tyrosine, are one of the major components of hair and play essential roles in the formation of rigid and resistant hair shafts. The KRTAP family was identified as being unique to mammals, and near-complete KRTAP gene repertoires for eight mammalian genomes were characterized in this study. An expanded KRTAP gene repertoire was found in rodents. Surprisingly, humans have a similar number of genes as other primates despite the relative hairlessness of humans. We identified several new subfamilies not previously reported in the high/ultrahigh cysteine KRTAP genes. Genes in many subfamilies of the high/ultrahigh cysteine KRTAP genes have evolved by concerted evolution with frequent gene conversion events, yielding a higher GC base content for these gene sequences. In contrast, the high glycine-tyrosine KRTAP genes have evolved more dynamically, with fewer gene conversion events and thus have a lower GC base content, possibly due to positive selection. Most of the subfamilies emerged early in the evolution of mammals, thus we propose that the mammalian ancestor should have a diverse KRTAP gene repertoire. We propose that hair content characteristics have evolved and diverged rapidly among mammals because of rapid divergent evolution of KRTAPs between species. In contrast, subfamilies of KRTAP genes have been homogenized within each species due to concerted evolution.

  13. Evolutionary mechanisms driving the evolution of a large polydnavirus gene family coding for protein tyrosine phosphatases

    Directory of Open Access Journals (Sweden)

    Serbielle Céline

    2012-12-01

    Full Text Available Abstract Background Gene duplications have been proposed to be the main mechanism involved in genome evolution and in acquisition of new functions. Polydnaviruses (PDVs, symbiotic viruses associated with parasitoid wasps, are ideal model systems to study mechanisms of gene duplications given that PDV genomes consist of virulence genes organized into multigene families. In these systems the viral genome is integrated in a wasp chromosome as a provirus and virus particles containing circular double-stranded DNA are injected into the parasitoids’ hosts and are essential for parasitism success. The viral virulence factors, organized in gene families, are required collectively to induce host immune suppression and developmental arrest. The gene family which encodes protein tyrosine phosphatases (PTPs has undergone spectacular expansion in several PDV genomes with up to 42 genes. Results Here, we present strong indications that PTP gene family expansion occurred via classical mechanisms: by duplication of large segments of the chromosomally integrated form of the virus sequences (segmental duplication, by tandem duplications within this form and by dispersed duplications. We also propose a novel duplication mechanism specific to PDVs that involves viral circle reintegration into the wasp genome. The PTP copies produced were shown to undergo conservative evolution along with episodes of adaptive evolution. In particular recently produced copies have undergone positive selection in sites most likely involved in defining substrate selectivity. Conclusion The results provide evidence about the dynamic nature of polydnavirus proviral genomes. Classical and PDV-specific duplication mechanisms have been involved in the production of new gene copies. Selection pressures associated with antagonistic interactions with parasitized hosts have shaped these genes used to manipulate lepidopteran physiology with evidence for positive selection involved in

  14. Evolution of the MAGUK protein gene family in premetazoan lineages

    Directory of Open Access Journals (Sweden)

    Ruiz-Trillo Iñaki

    2010-04-01

    Full Text Available Abstract Background Cell-to-cell communication is a key process in multicellular organisms. In multicellular animals, scaffolding proteins belonging to the family of membrane-associated guanylate kinases (MAGUK are involved in the regulation and formation of cell junctions. These MAGUK proteins were believed to be exclusive to Metazoa. However, a MAGUK gene was recently identified in an EST survey of Capsaspora owczarzaki, an unicellular organism that branches off near the metazoan clade. To further investigate the evolutionary history of MAGUK, we have undertook a broader search for this gene family using available genomic sequences of different opisthokont taxa. Results Our survey and phylogenetic analyses show that MAGUK proteins are present not only in Metazoa, but also in the choanoflagellate Monosiga brevicollis and in the protist Capsaspora owczarzaki. However, MAGUKs are absent from fungi, amoebozoans or any other eukaryote. The repertoire of MAGUKs in Placozoa and eumetazoan taxa (Cnidaria + Bilateria is quite similar, except for one class that is missing in Trichoplax, while Porifera have a simpler MAGUK repertoire. However, Vertebrata have undergone several independent duplications and exhibit two exclusive MAGUK classes. Three different MAGUK types are found in both M. brevicollis and C. owczarzaki: DLG, MPP and MAGI. Furthermore, M. brevicollis has suffered a lineage-specific diversification. Conclusions The diversification of the MAGUK protein gene family occurred, most probably, prior to the divergence between Metazoa+choanoflagellates and the Capsaspora+Ministeria clade. A MAGI-like, a DLG-like, and a MPP-like ancestral genes were already present in the unicellular ancestor of Metazoa, and new gene members have been incorporated through metazoan evolution within two major periods, one before the sponge-eumetazoan split and another within the vertebrate lineage. Moreover, choanoflagellates have suffered an independent MAGUK

  15. Gene Structures, Evolution and Transcriptional Profiling of the WRKY Gene Family in Castor Bean (Ricinus communis L.).

    Science.gov (United States)

    Zou, Zhi; Yang, Lifu; Wang, Danhua; Huang, Qixing; Mo, Yeyong; Xie, Guishui

    2016-01-01

    WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceus plants.

  16. Evolution, functional differentiation, and co-expression of the RLK gene family revealed in Jilin ginseng, Panax ginseng C.A. Meyer.

    Science.gov (United States)

    Lin, Yanping; Wang, Kangyu; Li, Xiangyu; Sun, Chunyu; Yin, Rui; Wang, Yanfang; Wang, Yi; Zhang, Meiping

    2018-02-21

    Most genes in a genome exist in the form of a gene family; therefore, it is necessary to have knowledge of how a gene family functions to comprehensively understand organismal biology. The receptor-like kinase (RLK)-encoding gene family is one of the most important gene families in plants. It plays important roles in biotic and abiotic stress tolerances, and growth and development. However, little is known about the functional differentiation and relationships among the gene members within a gene family in plants. This study has isolated 563 RLK genes (designated as PgRLK genes) expressed in Jilin ginseng (Panax ginseng C.A. Meyer), investigated their evolution, and deciphered their functional diversification and relationships. The PgRLK gene family is highly diverged and formed into eight types. The LRR type is the earliest and most prevalent, while only the Lec type originated after P. ginseng evolved. Furthermore, although the members of the PgRLK gene family all encode receptor-like protein kinases and share conservative domains, they are functionally very diverse, participating in numerous biological processes. The expressions of different members of the PgRLK gene family are extremely variable within a tissue, at a developmental stage and in the same cultivar, but most of the genes tend to express correlatively, forming a co-expression network. These results not only provide a deeper and comprehensive understanding of the evolution, functional differentiation and correlation of a gene family in plants, but also an RLK genic resource useful for enhanced ginseng genetic improvement.

  17. Tubulin evolution in insects: gene duplication and subfunctionalization provide specialized isoforms in a functionally constrained gene family

    Directory of Open Access Journals (Sweden)

    Gadagkar Sudhindra R

    2010-04-01

    Full Text Available Abstract Background The completion of 19 insect genome sequencing projects spanning six insect orders provides the opportunity to investigate the evolution of important gene families, here tubulins. Tubulins are a family of eukaryotic structural genes that form microtubules, fundamental components of the cytoskeleton that mediate cell division, shape, motility, and intracellular trafficking. Previous in vivo studies in Drosophila find a stringent relationship between tubulin structure and function; small, biochemically similar changes in the major alpha 1 or testis-specific beta 2 tubulin protein render each unable to generate a motile spermtail axoneme. This has evolutionary implications, not a single non-synonymous substitution is found in beta 2 among 17 species of Drosophila and Hirtodrosophila flies spanning 60 Myr of evolution. This raises an important question, How do tubulins evolve while maintaining their function? To answer, we use molecular evolutionary analyses to characterize the evolution of insect tubulins. Results Sixty-six alpha tubulins and eighty-six beta tubulin gene copies were retrieved and subjected to molecular evolutionary analyses. Four ancient clades of alpha and beta tubulins are found in insects, a major isoform clade (alpha 1, beta 1 and three minor, tissue-specific clades (alpha 2-4, beta 2-4. Based on a Homarus americanus (lobster outgroup, these were generated through gene duplication events on major beta and alpha tubulin ancestors, followed by subfunctionalization in expression domain. Strong purifying selection acts on all tubulins, yet maximum pairwise amino acid distances between tubulin paralogs are large (0.464 substitutions/site beta tubulins, 0.707 alpha tubulins. Conversely orthologs, with the exception of reproductive tissue isoforms, show little sequence variation except in the last 15 carboxy terminus tail (CTT residues, which serve as sites for post-translational modifications (PTMs and interactions

  18. Gene family size conservation is a good indicator of evolutionary rates.

    Science.gov (United States)

    Chen, Feng-Chi; Chen, Chiuan-Jung; Li, Wen-Hsiung; Chuang, Trees-Juen

    2010-08-01

    The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.

  19. Analysis of the WUSCHEL-RELATED HOMEOBOX gene family in Pinus pinaster: New insights into the gene family evolution.

    Science.gov (United States)

    Alvarez, José M; Bueno, Natalia; Cañas, Rafael A; Avila, Concepción; Cánovas, Francisco M; Ordás, Ricardo J

    2018-02-01

    WUSCHEL-RELATED HOMEOBOX (WOX) genes are key players controlling stem cells in plants and can be divided into three clades according to the time of their appearance during plant evolution. Our knowledge of stem cell function in vascular plants other than angiosperms is limited, they separated from gymnosperms ca 300 million years ago and their patterning during embryogenesis differs significantly. For this reason, we have used the model gymnosperm Pinus pinaster to identify WOX genes and perform a thorough analysis of their gene expression patterns. Using transcriptomic data from a comprehensive range of tissues and stages of development we have shown three major outcomes: that the P. pinaster genome encodes at least fourteen members of the WOX family spanning all the major clades, that the genome of gymnosperms contains a WOX gene with no homologues in angiosperms representing a transitional stage between intermediate- and WUS-clade proteins, and that we can detect discrete WUS and WOX5 transcripts for the first time in a gymnosperm. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  20. Diversification and evolution of the SDG gene family in Brassica rapa after the whole genome triplication.

    Science.gov (United States)

    Dong, Heng; Liu, Dandan; Han, Tianyu; Zhao, Yuxue; Sun, Ji; Lin, Sue; Cao, Jiashu; Chen, Zhong-Hua; Huang, Li

    2015-11-24

    Histone lysine methylation, controlled by the SET Domain Group (SDG) gene family, is part of the histone code that regulates chromatin function and epigenetic control of gene expression. Analyzing the SDG gene family in Brassica rapa for their gene structure, domain architecture, subcellular localization, rate of molecular evolution and gene expression pattern revealed common occurrences of subfunctionalization and neofunctionalization in BrSDGs. In comparison with Arabidopsis thaliana, the BrSDG gene family was found to be more divergent than AtSDGs, which might partly explain the rich variety of morphotypes in B. rapa. In addition, a new evolutionary pattern of the four main groups of SDGs was presented, in which the Trx group and the SUVR subgroup evolved faster than the E(z), Ash groups and the SUVH subgroup. These differences in evolutionary rate among the four main groups of SDGs are perhaps due to the complexity and variability of the regions that bind with biomacromolecules, which guide SDGs to their target loci.

  1. Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

    Science.gov (United States)

    Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

    2010-10-07

    PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out

  2. Functional evolution in the plant SQUAMOSA-PROMOTER BINDING PROTEIN-LIKE (SPL gene family

    Directory of Open Access Journals (Sweden)

    Jill Christine Preston

    2013-04-01

    Full Text Available The SQUAMOSA-PROMOTER BINDING PROTEIN-LIKE (SPL family of transcription factors is functionally diverse, controlling a number of fundamental aspects of plant growth and development, including vegetative phase change, flowering time, branching, and leaf initiation rate. In natural plant populations, variation in flowering time and shoot architecture have major consequences for fitness. Likewise, in crop species, variation in branching and developmental rate impact biomass and yield. Thus, studies aimed at dissecting how the various functions are partitioned among different SPL genes in diverse plant lineages are key to providing insight into the genetic basis of local adaptation and have already garnered attention by crop breeders. Here we use phylogenetic reconstruction to reveal nine major SPL gene lineages, each of which is described in terms of function and diversification. To assess evidence for ancestral and derived functions within each SPL gene lineage, we use ancestral character state reconstructions. Our analyses suggest an emerging pattern of sub-functionalization, neo-functionalization, and possible convergent evolution following both ancient and recent gene duplication. Based on these analyses we suggest future avenues of research that may prove fruitful for elucidating the importance of SPL gene evolution in plant growth and development.

  3. Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution.

    Science.gov (United States)

    Gu, Xun; Wang, Yufeng; Gu, Jianying

    2002-06-01

    The classical (two-round) hypothesis of vertebrate genome duplication proposes two successive whole-genome duplication(s) (polyploidizations) predating the origin of fishes, a view now being seriously challenged. As the debate largely concerns the relative merits of the 'big-bang mode' theory (large-scale duplication) and the 'continuous mode' theory (constant creation by small-scale duplications), we tested whether a significant proportion of paralogous genes in the contemporary human genome was indeed generated in the early stage of vertebrate evolution. After an extensive search of major databases, we dated 1,739 gene duplication events from the phylogenetic analysis of 749 vertebrate gene families. We found a pattern characterized by two waves (I, II) and an ancient component. Wave I represents a recent gene family expansion by tandem or segmental duplications, whereas wave II, a rapid paralogous gene increase in the early stage of vertebrate evolution, supports the idea of genome duplication(s) (the big-bang mode). Further analysis indicated that large- and small-scale gene duplications both make a significant contribution during the early stage of vertebrate evolution to build the current hierarchy of the human proteome.

  4. Evolution of the vertebrate claudin gene family: insights from a basal vertebrate, the sea lamprey.

    Science.gov (United States)

    Mukendi, Christian; Dean, Nicholas; Lala, Rushil; Smith, Jeramiah; Bronner, Marianne E; Nikitina, Natalya V

    2016-01-01

    Claudins are major constituents of tight junctions, contributing both to their intercellular sealing and selective permeability properties. While claudins and claudin-like molecules are present in some invertebrates, the association of claudins with tight junctions has been conclusively documented only in vertebrates. Here we report the sequencing, phylogenetic analysis and comprehensive spatiotemporal expression analysis of the entire claudin gene family in the basal extant vertebrate, the sea lamprey. Our results demonstrate that clear orthologues to about half of all mammalian claudins are present in the lamprey, suggesting that at least one round of whole genome duplication contributed to the diversification of this gene family. Expression analysis revealed that claudins are expressed in discrete and specific domains, many of which represent vertebrate-specific innovations, such as in cranial ectodermal placodes and the neural crest; whereas others represent structures characteristic of chordates, e.g. pronephros, notochord, somites, endostyle and pharyngeal arches. By comparing the embryonic expression of claudins in the lamprey to that of other vertebrates, we found that ancestral expression patterns were often preserved in higher vertebrates. Morpholino mediated loss of Cldn3b demonstrated a functional role for this protein in placode and pharyngeal arch morphogenesis. Taken together, our data provide novel insights into the origins and evolution of the claudin gene family and the significance of claudin proteins in the evolution of vertebrates.

  5. Evolution of glutamate dehydrogenase genes: evidence for lateral gene transfer within and between prokaryotes and eukaryotes

    Directory of Open Access Journals (Sweden)

    Roger Andrew J

    2003-06-01

    Full Text Available Abstract Background Lateral gene transfer can introduce genes with novel functions into genomes or replace genes with functionally similar orthologs or paralogs. Here we present a study of the occurrence of the latter gene replacement phenomenon in the four gene families encoding different classes of glutamate dehydrogenase (GDH, to evaluate and compare the patterns and rates of lateral gene transfer (LGT in prokaryotes and eukaryotes. Results We extend the taxon sampling of gdh genes with nine new eukaryotic sequences and examine the phylogenetic distribution pattern of the various GDH classes in combination with maximum likelihood phylogenetic analyses. The distribution pattern analyses indicate that LGT has played a significant role in the evolution of the four gdh gene families. Indeed, a number of gene transfer events are identified by phylogenetic analyses, including numerous prokaryotic intra-domain transfers, some prokaryotic inter-domain transfers and several inter-domain transfers between prokaryotes and microbial eukaryotes (protists. Conclusion LGT has apparently affected eukaryotes and prokaryotes to a similar extent within the gdh gene families. In the absence of indications that the evolution of the gdh gene families is radically different from other families, these results suggest that gene transfer might be an important evolutionary mechanism in microbial eukaryote genome evolution.

  6. Genome-Wide Comparative Gene Family Classification

    Science.gov (United States)

    Frech, Christian; Chen, Nansheng

    2010-01-01

    Correct classification of genes into gene families is important for understanding gene function and evolution. Although gene families of many species have been resolved both computationally and experimentally with high accuracy, gene family classification in most newly sequenced genomes has not been done with the same high standard. This project has been designed to develop a strategy to effectively and accurately classify gene families across genomes. We first examine and compare the performance of computer programs developed for automated gene family classification. We demonstrate that some programs, including the hierarchical average-linkage clustering algorithm MC-UPGMA and the popular Markov clustering algorithm TRIBE-MCL, can reconstruct manual curation of gene families accurately. However, their performance is highly sensitive to parameter setting, i.e. different gene families require different program parameters for correct resolution. To circumvent the problem of parameterization, we have developed a comparative strategy for gene family classification. This strategy takes advantage of existing curated gene families of reference species to find suitable parameters for classifying genes in related genomes. To demonstrate the effectiveness of this novel strategy, we use TRIBE-MCL to classify chemosensory and ABC transporter gene families in C. elegans and its four sister species. We conclude that fully automated programs can establish biologically accurate gene families if parameterized accordingly. Comparative gene family classification finds optimal parameters automatically, thus allowing rapid insights into gene families of newly sequenced species. PMID:20976221

  7. Using paleogenomics to study the evolution of gene families: origin and duplication history of the relaxin family hormones and their receptors.

    Directory of Open Access Journals (Sweden)

    Sergey Yegorov

    Full Text Available Recent progress in the analysis of whole genome sequencing data has resulted in the emergence of paleogenomics, a field devoted to the reconstruction of ancestral genomes. Ancestral karyotype reconstructions have been used primarily to illustrate the dynamic nature of genome evolution. In this paper, we demonstrate how they can also be used to study individual gene families by examining the evolutionary history of relaxin hormones (RLN/INSL and relaxin family peptide receptors (RXFP. Relaxin family hormones are members of the insulin superfamily, and are implicated in the regulation of a variety of primarily reproductive and neuroendocrine processes. Their receptors are G-protein coupled receptors (GPCR's and include members of two distinct evolutionary groups, an unusual characteristic. Although several studies have tried to elucidate the origins of the relaxin peptide family, the evolutionary origin of their receptors and the mechanisms driving the diversification of the RLN/INSL-RXFP signaling systems in non-placental vertebrates has remained elusive. Here we show that the numerous vertebrate RLN/INSL and RXFP genes are products of an ancestral receptor-ligand system that originally consisted of three genes, two of which apparently trace their origins to invertebrates. Subsequently, diversification of the system was driven primarily by whole genome duplications (WGD, 2R and 3R followed by almost complete retention of the ligand duplicates in most vertebrates but massive loss of receptor genes in tetrapods. Interestingly, the majority of 3R duplicates retained in teleosts are potentially involved in neuroendocrine regulation. Furthermore, we infer that the ancestral AncRxfp3/4 receptor may have been syntenically linked to the AncRln-like ligand in the pre-2R genome, and show that syntenic linkages among ligands and receptors have changed dynamically in different lineages. This study ultimately shows the broad utility, with some caveats, of

  8. Molecular evolution of the actin-like MreB protein gene family in wall-less bacteria.

    Science.gov (United States)

    Ku, Chuan; Lo, Wen-Sui; Kuo, Chih-Horng

    2014-04-18

    The mreB gene family encodes actin-like proteins that determine cell shape by directing cell wall synthesis and often exists in one to three copies in the genomes of non-spherical bacteria. Intriguingly, while most wall-less bacteria do not have this gene, five to seven mreB homologs are found in Spiroplasma and Haloplasma, which are both characterized by cell contractility. To investigate the molecular evolution of this gene family in wall-less bacteria, we sampled the available genome sequences from these two genera and other related lineages for comparative analysis. The gene phylogenies indicated that the mreB homologs in Haloplasma are more closely related to those in Firmicutes, whereas those in Spiroplasma form a separate clade. This finding suggests that the gene family expansions in these two lineages are the results of independent ancient duplications. Moreover, the Spiroplasma mreB homologs can be classified into five clades, of which the genomic positions are largely conserved. The inference of gene gains and losses suggests that there has been an overall trend to retain only one homolog from each of the five mreB clades in the evolutionary history of Spiroplasma. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  9. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress.

    Science.gov (United States)

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K; Asif, Mehar H

    2016-01-01

    The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively.

  10. Genome-wide analysis of the Musa WRKY gene family: evolution and differential expression during development and stress

    Directory of Open Access Journals (Sweden)

    Ridhi eGoel

    2016-03-01

    Full Text Available The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/ development including fruit ripening process respectively.

  11. Gene cluster statistics with gene families.

    Science.gov (United States)

    Raghupathy, Narayanan; Durand, Dannie

    2009-05-01

    Identifying genomic regions that descended from a common ancestor is important for understanding the function and evolution of genomes. In distantly related genomes, clusters of homologous gene pairs are evidence of candidate homologous regions. Demonstrating the statistical significance of such "gene clusters" is an essential component of comparative genomic analyses. However, currently there are no practical statistical tests for gene clusters that model the influence of the number of homologs in each gene family on cluster significance. In this work, we demonstrate empirically that failure to incorporate gene family size in gene cluster statistics results in overestimation of significance, leading to incorrect conclusions. We further present novel analytical methods for estimating gene cluster significance that take gene family size into account. Our methods do not require complete genome data and are suitable for testing individual clusters found in local regions, such as contigs in an unfinished assembly. We consider pairs of regions drawn from the same genome (paralogous clusters), as well as regions drawn from two different genomes (orthologous clusters). Determining cluster significance under general models of gene family size is computationally intractable. By assuming that all gene families are of equal size, we obtain analytical expressions that allow fast approximation of cluster probabilities. We evaluate the accuracy of this approximation by comparing the resulting gene cluster probabilities with cluster probabilities obtained by simulating a realistic, power-law distributed model of gene family size, with parameters inferred from genomic data. Surprisingly, despite the simplicity of the underlying assumption, our method accurately approximates the true cluster probabilities. It slightly overestimates these probabilities, yielding a conservative test. We present additional simulation results indicating the best choice of parameter values for data

  12. Molecular evolution of the polyamine oxidase gene family in Metazoa

    Directory of Open Access Journals (Sweden)

    Polticelli Fabio

    2012-06-01

    monophyletic clades including, respectively, all the SMOs and APAOs from vertebrates. The two vertebrate monophyletic clades clustered strictly mirroring the organismal phylogeny of fishes, amphibians, reptiles, birds, and mammals. Evidences from comparative genomic analysis, structural evolution and functional divergence in a phylogenetic framework across Metazoa suggested an evolutionary scenario where the ancestor PAO coding sequence, present in invertebrates as an orthologous gene, has been duplicated in the vertebrate branch to originate the paralogous SMO and APAO genes. A further genome evolution event concerns the SMO gene of placental, but not marsupial and monotremate, mammals which increased its functional variation following an alternative splicing (AS mechanism. Conclusions In this study the explicit integration in a phylogenomic framework of phylogenetic tree construction, structure prediction, and biochemical function data/prediction, allowed inferring the molecular evolutionary history of the PAO gene family and to disambiguate paralogous genes related by duplication event (SMO and APAO and orthologous genes related by speciation events (PAOs, SMOs/APAOs. Further, while in vertebrates experimental data corroborate SMO and APAO molecular function predictions, in invertebrates the finding of a supported phylogenetic clusters of insect PAOs and the co-occurrence of two PAO variants in the amphioxus urgently claim the need for future structure-function studies.

  13. Differential evolution of members of the rhomboid gene family with conservative and divergent patterns.

    Science.gov (United States)

    Li, Qi; Zhang, Ning; Zhang, Liangsheng; Ma, Hong

    2015-04-01

    Rhomboid proteins are intramembrane serine proteases that are involved in a plethora of biological functions, but the evolutionary history of the rhomboid gene family is not clear. We performed a comprehensive molecular evolutionary analysis of the rhomboid gene family and also investigated the organization and sequence features of plant rhomboids in different subfamilies. Our results showed that eukaryotic rhomboids could be divided into five subfamilies (RhoA-RhoD and PARL). Most orthology groups appeared to be conserved only as single or low-copy genes in all lineages in RhoB-RhoD and PARL, whereas RhoA genes underwent several duplication events, resulting in multiple gene copies. These duplication events were due to whole genome duplications in plants and animals and the duplicates might have experienced functional divergence. We also identified a novel group of plant rhomboid (RhoB1) that might have lost their enzymatic activity; their existence suggests that they might have evolved new mechanisms. Plant and animal rhomboids have similar evolutionary patterns. In addition, there are mutations affecting key active sites in RBL8, RBL9 and one of the Brassicaceae PARL duplicates. This study delineates a possible evolutionary scheme for intramembrane proteins and illustrates distinct fates and a mechanism of evolution of gene duplicates. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.

  14. Evolution of the YABBY gene family in seed plants.

    Science.gov (United States)

    Finet, Cédric; Floyd, Sandra K; Conway, Stephanie J; Zhong, Bojian; Scutt, Charles P; Bowman, John L

    2016-01-01

    Members of the YABBY gene family of transcription factors in angiosperms have been shown to be involved in the initiation of outgrowth of the lamina, the maintenance of polarity, and establishment of the leaf margin. Although most of the dorsal-ventral polarity genes in seed plants have homologs in non-spermatophyte lineages, the presence of YABBY genes is restricted to seed plants. To gain insight into the origin and diversification of this gene family, we reconstructed the evolutionary history of YABBY gene lineages in seed plants. Our findings suggest that either one or two YABBY genes were present in the last common ancestor of extant seed plants. We also examined the expression of YABBY genes in the gymnosperms Ephedra distachya (Gnetales), Ginkgo biloba (Ginkgoales), and Pseudotsuga menziesii (Coniferales). Our data indicate that some YABBY genes are expressed in a polar (abaxial) manner in leaves and female cones in gymnosperms. We propose that YABBY genes already acted as polarity genes in the last common ancestor of extant seed plants. © 2016 Wiley Periodicals, Inc.

  15. Insights into the Evolution of a Snake Venom Multi-Gene Family from the Genomic Organization of Echis ocellatus SVMP Genes

    Directory of Open Access Journals (Sweden)

    Libia Sanz

    2016-07-01

    Full Text Available The molecular events underlying the evolution of the Snake Venom Metalloproteinase (SVMP family from an A Disintegrin And Metalloproteinase (ADAM ancestor remain poorly understood. Comparative genomics may provide decisive information to reconstruct the evolutionary history of this multi-locus toxin family. Here, we report the genomic organization of Echis ocellatus genes encoding SVMPs from the PII and PI classes. Comparisons between them and between these genes and the genomic structures of Anolis carolinensis ADAM28 and E. ocellatus PIII-SVMP EOC00089 suggest that insertions and deletions of intronic regions played key roles along the evolutionary pathway that shaped the current diversity within the multi-locus SVMP gene family. In particular, our data suggest that emergence of EOC00028-like PI-SVMP from an ancestral PII(e/d-type SVMP involved splicing site mutations that abolished both the 3′ splice AG acceptor site of intron 12* and the 5′ splice GT donor site of intron 13*, and resulted in the intronization of exon 13* and the consequent destruction of the structural integrity of the PII-SVMP characteristic disintegrin domain.

  16. Evolution of homeobox genes.

    Science.gov (United States)

    Holland, Peter W H

    2013-01-01

    Many homeobox genes encode transcription factors with regulatory roles in animal and plant development. Homeobox genes are found in almost all eukaryotes, and have diversified into 11 gene classes and over 100 gene families in animal evolution, and 10 to 14 gene classes in plants. The largest group in animals is the ANTP class which includes the well-known Hox genes, plus other genes implicated in development including ParaHox (Cdx, Xlox, Gsx), Evx, Dlx, En, NK4, NK3, Msx, and Nanog. Genomic data suggest that the ANTP class diversified by extensive tandem duplication to generate a large array of genes, including an NK gene cluster and a hypothetical ProtoHox gene cluster that duplicated to generate Hox and ParaHox genes. Expression and functional data suggest that NK, Hox, and ParaHox gene clusters acquired distinct roles in patterning the mesoderm, nervous system, and gut. The PRD class is also diverse and includes Pax2/5/8, Pax3/7, Pax4/6, Gsc, Hesx, Otx, Otp, and Pitx genes. PRD genes are not generally arranged in ancient genomic clusters, although the Dux, Obox, and Rhox gene clusters arose in mammalian evolution as did several non-clustered PRD genes. Tandem duplication and genome duplication expanded the number of homeobox genes, possibly contributing to the evolution of developmental complexity, but homeobox gene loss must not be ignored. Evolutionary changes to homeobox gene expression have also been documented, including Hox gene expression patterns shifting in concert with segmental diversification in vertebrates and crustaceans, and deletion of a Pitx1 gene enhancer in pelvic-reduced sticklebacks. WIREs Dev Biol 2013, 2:31-45. doi: 10.1002/wdev.78 For further resources related to this article, please visit the WIREs website. The author declares that he has no conflicts of interest. Copyright © 2012 Wiley Periodicals, Inc.

  17. Analysis of four achaete-scute homologs in Bombyx mori reveals new viewpoints of the evolution and functions of this gene family

    OpenAIRE

    Zhou, Qingxiang; Zhang, Tianyi; Xu, Weihua; Yu, Linlin; Yi, Yongzhu; Zhang, Zhifang

    2008-01-01

    Abstract Background achaete-scute complexe (AS-C) has been widely studied at genetic, developmental and evolutional levels. Genes of this family encode proteins containing a highly conserved bHLH domain, which take part in the regulation of the development of central nervous system and peripheral nervous system. Many AS-C homologs have been isolated from various vertebrates and invertebrates. Also, AS-C genes are duplicated during the evolution of Diptera. Functions besides neural development...

  18. Genome-wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants

    Directory of Open Access Journals (Sweden)

    Eunyoung Seo

    2016-08-01

    Full Text Available Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR immune receptors are known play critical roles in effector-triggered immunity (ETI plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified intact 267, 443, and 755 NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analyses and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analyses of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding.

  19. Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease

    Directory of Open Access Journals (Sweden)

    Maria V. Fernández

    2018-04-01

    Full Text Available Gene-based tests to study the combined effect of rare variants on a particular phenotype have been widely developed for case-control studies, but their evolution and adaptation for family-based studies, especially studies of complex incomplete families, has been slower. In this study, we have performed a practical examination of all the latest gene-based methods available for family-based study designs using both simulated and real datasets. We examined the performance of several collapsing, variance-component, and transmission disequilibrium tests across eight different software packages and 22 models utilizing a cohort of 285 families (N = 1,235 with late-onset Alzheimer disease (LOAD. After a thorough examination of each of these tests, we propose a methodological approach to identify, with high confidence, genes associated with the tested phenotype and we provide recommendations to select the best software and model for family-based gene-based analyses. Additionally, in our dataset, we identified PTK2B, a GWAS candidate gene for sporadic AD, along with six novel genes (CHRD, CLCN2, HDLBP, CPAMD8, NLRP9, and MAS1L as candidate genes for familial LOAD.

  20. RANGER-DTL 2.0: Rigorous Reconstruction of Gene-Family Evolution by Duplication, Transfer, and Loss.

    Science.gov (United States)

    Bansal, Mukul S; Kellis, Manolis; Kordi, Misagh; Kundu, Soumya

    2018-04-24

    RANGER-DTL 2.0 is a software program for inferring gene family evolution using Duplication-Transfer-Loss reconciliation. This new software is highly scalable and easy to use, and offers many new features not currently available in any other reconciliation program. RANGER-DTL 2.0 has a particular focus on reconciliation accuracy and can account for many sources of reconciliation uncertainty including uncertain gene tree rooting, gene tree topological uncertainty, multiple optimal reconciliations, and alternative event cost assignments. RANGER-DTL 2.0 is open-source and written in C ++ and Python. Pre-compiled executables, source code (open-source under GNU GPL), and a detailed manual are freely available from http://compbio.engr.uconn.edu/software/RANGER-DTL/. mukul.bansal@uconn.edu.

  1. Invasion fitness for gene-culture co-evolution in family-structured populations and an application to cumulative culture under vertical transmission.

    Science.gov (United States)

    Mullon, Charles; Lehmann, Laurent

    2017-08-01

    Human evolution depends on the co-evolution between genetically determined behaviors and socially transmitted information. Although vertical transmission of cultural information from parent to offspring is common in hominins, its effects on cumulative cultural evolution are not fully understood. Here, we investigate gene-culture co-evolution in a family-structured population by studying the invasion fitness of a mutant allele that influences a deterministic level of cultural information (e.g., amount of knowledge or skill) to which diploid carriers of the mutant are exposed in subsequent generations. We show that the selection gradient on such a mutant, and the concomitant level of cultural information it generates, can be evaluated analytically under the assumption that the cultural dynamic has a single attractor point, thereby making gene-culture co-evolution in family-structured populations with multigenerational effects mathematically tractable. We apply our result to study how genetically determined phenotypes of individual and social learning co-evolve with the level of adaptive information they generate under vertical transmission. We find that vertical transmission increases adaptive information due to kin selection effects, but when information is transmitted as efficiently between family members as between unrelated individuals, this increase is moderate in diploids. By contrast, we show that the way resource allocation into learning trades off with allocation into reproduction (the "learning-reproduction trade-off") significantly influences levels of adaptive information. We also show that vertical transmission prevents evolutionary branching and may therefore play a qualitative role in gene-culture co-evolutionary dynamics. More generally, our analysis of selection suggests that vertical transmission can significantly increase levels of adaptive information under the biologically plausible condition that information transmission between relatives is

  2. Predictions of Gene Family Distributions in Microbial Genomes: Evolution by Gene Duplication and Modification

    International Nuclear Information System (INIS)

    Yanai, Itai; Camacho, Carlos J.; DeLisi, Charles

    2000-01-01

    A universal property of microbial genomes is the considerable fraction of genes that are homologous to other genes within the same genome. The process by which these homologues are generated is not well understood, but sequence analysis of 20 microbial genomes unveils a recurrent distribution of gene family sizes. We show that a simple evolutionary model based on random gene duplication and point mutations fully accounts for these distributions and permits predictions for the number of gene families in genomes not yet complete. Our findings are consistent with the notion that a genome evolves from a set of precursor genes to a mature size by gene duplications and increasing modifications. (c) 2000 The American Physical Society

  3. Predictions of Gene Family Distributions in Microbial Genomes: Evolution by Gene Duplication and Modification

    Energy Technology Data Exchange (ETDEWEB)

    Yanai, Itai; Camacho, Carlos J.; DeLisi, Charles

    2000-09-18

    A universal property of microbial genomes is the considerable fraction of genes that are homologous to other genes within the same genome. The process by which these homologues are generated is not well understood, but sequence analysis of 20 microbial genomes unveils a recurrent distribution of gene family sizes. We show that a simple evolutionary model based on random gene duplication and point mutations fully accounts for these distributions and permits predictions for the number of gene families in genomes not yet complete. Our findings are consistent with the notion that a genome evolves from a set of precursor genes to a mature size by gene duplications and increasing modifications. (c) 2000 The American Physical Society.

  4. Evolution Analysis of the Aux/IAA Gene Family in Plants Shows Dual Origins and Variable Nuclear Localization Signals

    Directory of Open Access Journals (Sweden)

    Wentao Wu

    2017-10-01

    Full Text Available The plant hormone auxin plays pivotal roles in many aspects of plant growth and development. The auxin/indole-3-acetic acid (Aux/IAA gene family encodes short-lived nuclear proteins acting on auxin perception and signaling, but the evolutionary history of this gene family remains to be elucidated. In this study, the Aux/IAA gene family in 17 plant species covering all major lineages of plants is identified and analyzed by using multiple bioinformatics methods. A total of 434 Aux/IAA genes was found among these plant species, and the gene copy number ranges from three (Physcomitrella patens to 63 (Glycine max. The phylogenetic analysis shows that the canonical Aux/IAA proteins can be generally divided into five major clades, and the origin of Aux/IAA proteins could be traced back to the common ancestor of land plants and green algae. Many truncated Aux/IAA proteins were found, and some of these truncated Aux/IAA proteins may be generated from the C-terminal truncation of auxin response factor (ARF proteins. Our results indicate that tandem and segmental duplications play dominant roles for the expansion of the Aux/IAA gene family mainly under purifying selection. The putative nuclear localization signals (NLSs in Aux/IAA proteins are conservative, and two kinds of new primordial bipartite NLSs in P. patens and Selaginella moellendorffii were discovered. Our findings not only give insights into the origin and expansion of the Aux/IAA gene family, but also provide a basis for understanding their functions during the course of evolution.

  5. Evolution Analysis of the Aux/IAA Gene Family in Plants Shows Dual Origins and Variable Nuclear Localization Signals.

    Science.gov (United States)

    Wu, Wentao; Liu, Yaxue; Wang, Yuqian; Li, Huimin; Liu, Jiaxi; Tan, Jiaxin; He, Jiadai; Bai, Jingwen; Ma, Haoli

    2017-10-08

    The plant hormone auxin plays pivotal roles in many aspects of plant growth and development. The auxin/indole-3-acetic acid (Aux/IAA) gene family encodes short-lived nuclear proteins acting on auxin perception and signaling, but the evolutionary history of this gene family remains to be elucidated. In this study, the Aux/IAA gene family in 17 plant species covering all major lineages of plants is identified and analyzed by using multiple bioinformatics methods. A total of 434 Aux/IAA genes was found among these plant species, and the gene copy number ranges from three ( Physcomitrella patens ) to 63 ( Glycine max ). The phylogenetic analysis shows that the canonical Aux/IAA proteins can be generally divided into five major clades, and the origin of Aux/IAA proteins could be traced back to the common ancestor of land plants and green algae. Many truncated Aux/IAA proteins were found, and some of these truncated Aux/IAA proteins may be generated from the C-terminal truncation of auxin response factor (ARF) proteins. Our results indicate that tandem and segmental duplications play dominant roles for the expansion of the Aux/IAA gene family mainly under purifying selection. The putative nuclear localization signals (NLSs) in Aux/IAA proteins are conservative, and two kinds of new primordial bipartite NLSs in P. patens and Selaginella moellendorffii were discovered. Our findings not only give insights into the origin and expansion of the Aux/IAA gene family, but also provide a basis for understanding their functions during the course of evolution.

  6. A Genomic Survey of SCPP Family Genes in Fishes Provides Novel Insights into the Evolution of Fish Scales.

    Science.gov (United States)

    Lv, Yunyun; Kawasaki, Kazuhiko; Li, Jia; Li, Yanping; Bian, Chao; Huang, Yu; You, Xinxin; Shi, Qiong

    2017-11-16

    The family of secretory calcium-binding phosphoproteins (SCPPs) have been considered vital to skeletal tissue mineralization. However, most previous SCPP studies focused on phylogenetically distant animals but not on those closely related species. Here we provide novel insights into the coevolution of SCPP genes and fish scales in 10 species from Otophysi . According to their scale phenotypes, these fishes can be divided into three groups, i.e., scaled, sparsely scaled, and scaleless. We identified homologous SCPP genes in the genomes of these species and revealed an absence of some SCPP members in some genomes, suggesting an uneven evolutionary history of SCPP genes in fishes. In addition, most of these SCPP genes, with the exception of SPP1 , individually form one or two gene cluster(s) on each corresponding genome. Furthermore, we constructed phylogenetic trees using maximum likelihood method to estimate their evolution. The phylogenetic topology mostly supports two subclasses in some species, such as Cyprinus carpio , Sinocyclocheilus anshuiensis , S. grahamin , and S. rhinocerous , but not in the other examined fishes. By comparing the gene structures of recently reported candidate genes, SCPP1 and SCPP5 , for determining scale phenotypes, we found that the hypothesis is suitable for Astyanax mexicanus , but denied by S. anshuiensis , even though they are both sparsely scaled for cave adaptation. Thus, we conclude that, although different fish species display similar scale phenotypes, the underlying genetic changes however might be diverse. In summary, this paper accelerates the recognition of the SCPP family in teleosts for potential scale evolution.

  7. Genome-wide analysis of the Solanum tuberosum (potato) trehalose-6-phosphate synthase (TPS) gene family: evolution and differential expression during development and stress.

    Science.gov (United States)

    Xu, Yingchun; Wang, Yanjie; Mattson, Neil; Yang, Liu; Jin, Qijiang

    2017-12-01

    Trehalose-6-phosphate synthase (TPS) serves important functions in plant desiccation tolerance and response to environmental stimuli. At present, a comprehensive analysis, i.e. functional classification, molecular evolution, and expression patterns of this gene family are still lacking in Solanum tuberosum (potato). In this study, a comprehensive analysis of the TPS gene family was conducted in potato. A total of eight putative potato TPS genes (StTPSs) were identified by searching the latest potato genome sequence. The amino acid identity among eight StTPSs varied from 59.91 to 89.54%. Analysis of d N /d S ratios suggested that regions in the TPP (trehalose-6-phosphate phosphatase) domains evolved faster than the TPS domains. Although the sequence of the eight StTPSs showed high similarity (2571-2796 bp), their gene length is highly differentiated (3189-8406 bp). Many of the regulatory elements possibly related to phytohormones, abiotic stress and development were identified in different TPS genes. Based on the phylogenetic tree constructed using TPS genes of potato, and four other Solanaceae plants, TPS genes could be categorized into 6 distinct groups. Analysis revealed that purifying selection most likely played a major role during the evolution of this family. Amino acid changes detected in specific branches of the phylogenetic tree suggests relaxed constraints might have contributed to functional divergence among groups. Moreover, StTPSs were found to exhibit tissue and treatment specific expression patterns upon analysis of transcriptome data, and performing qRT-PCR. This study provides a reference for genome-wide identification of the potato TPS gene family and sets a framework for further functional studies of this important gene family in development and stress response.

  8. Genome-wide identification and evolution of the PIN-FORMED (PIN) gene family in Glycine max.

    Science.gov (United States)

    Liu, Yuan; Wei, Haichao

    2017-07-01

    Soybean (Glycine max) is one of the most important crop plants. Wild and cultivated soybean varieties have significant differences worth further investigation, such as plant morphology, seed size, and seed coat development; these characters may be related to auxin biology. The PIN gene family encodes essential transport proteins in cell-to-cell auxin transport, but little research on soybean PIN genes (GmPIN genes) has been done, especially with respect to the evolution and differences between wild and cultivated soybean. In this study, we retrieved 23 GmPIN genes from the latest updated G. max genome database; six GmPIN protein sequences were changed compared with the previous database. Based on the Plant Genome Duplication Database, 18 GmPIN genes have been involved in segment duplication. Three pairs of GmPIN genes arose after the second soybean genome duplication, and six occurred after the first genome duplication. The duplicated GmPIN genes retained similar expression patterns. All the duplicated GmPIN genes experienced purifying selection (K a /K s genome sequence of 17 wild and 14 cultivated soybean varieties. Our research provides useful and comprehensive basic information for understanding GmPIN genes.

  9. Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales).

    Science.gov (United States)

    Moore, Abigail J; Vos, Jurriaan M De; Hancock, Lillian P; Goolsby, Eric; Edwards, Erika J

    2018-05-01

    Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the "portullugo" (Caryophyllales), a moderately sized lineage of flowering plants (~ 2200 species) that includes the cacti and harbors many evolutionary transitions to C$_{\\mathrm{4}}$ and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C$_{\\mathrm{4}}$ and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C$_{\\mathrm{4}}$ and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75-218 loci across 74 taxa, with ~ 50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae $+$ Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.

  10. A shared promoter region suggests a common ancestor for the human VCX/Y, SPANX, and CSAG gene families and the murine CYPT family

    DEFF Research Database (Denmark)

    Hansen, Martin A; Nielsen, John E; Retelska, Dorota

    2008-01-01

    , sequences corresponding to the shared promoter region of the CYPT family were identified at 39 loci. Most loci were located immediately upstream of genes belonging to the VCX/Y, SPANX, or CSAG gene families. Sequence comparison of the loci revealed a conserved CYPT promoter-like (CPL) element featuring TATA...... cell types. The genomic regions harboring the gene families were rich in direct and inverted segmental duplications (SD), which may facilitate gene conversion and rapid evolution. The conserved CPL and the common expression profiles suggest that the human VCX/Y, SPANX, and CSAG2 gene families together......Many testis-specific genes from the sex chromosomes are subject to rapid evolution, which can make it difficult to identify murine genes in the human genome. The murine CYPT gene family includes 15 members, but orthologs were undetectable in the human genome. However, using refined homology search...

  11. Genome-wide identification of Jatropha curcas aquaporin genes and the comparative analysis provides insights into the gene family expansion and evolution in Hevea brasiliensis

    Directory of Open Access Journals (Sweden)

    Zhi eZou

    2016-03-01

    Full Text Available Aquaporins (AQPs are channel-forming integral membrane proteins that transport water and other small solutes across biological membranes. Despite the vital role of AQPs, to date, little is known in physic nut (Jatropha curcas L., Euphorbiaceae, an important non-edible oilseed crop with great potential for the production of biodiesel. In this study, 32 AQP genes were identified from the physic nut genome and the family number is relatively small in comparison to 51 in another Euphorbiaceae plant, rubber tree (Hevea brasiliensis Muell. Arg.. Based on the phylogenetic analysis, the JcAQPs were assigned to five subfamilies, i.e., 9 plasma membrane intrinsic proteins (PIPs, 9 tonoplast intrinsic proteins (TIPs, 8 NOD26-like intrinsic proteins (NIPs, 2 X intrinsic proteins (XIPs and 4 small basic intrinsic proteins (SIPs. Like rubber tree and other plant species, functional prediction based on the aromatic/arginine selectivity filter, Froger’s positions and specificity-determining positions showed a remarkable difference in substrate specificity among subfamilies of JcAQPs. Genome-wide comparative analysis revealed the specific expansion of PIP and TIP subfamilies in rubber tree and the specific gene loss of the XIP subfamily in physic nut. Furthermore, by analyzing deep transcriptome sequencing data, the expression evolution especially the expression divergence of duplicated HbAQP genes was also investigated and discussed. Results obtained from this study not only provide valuable information for future functional analysis and utilization of Jc/HbAQP genes, but also provide a useful reference to survey the gene family expansion and evolution in Euphorbiaceae plants and other plant species.

  12. Adaptive Evolution of Gene Expression in Drosophila.

    Science.gov (United States)

    Nourmohammad, Armita; Rambeau, Joachim; Held, Torsten; Kovacova, Viera; Berg, Johannes; Lässig, Michael

    2017-08-08

    Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Adaptive Evolution of Gene Expression in Drosophila

    Directory of Open Access Journals (Sweden)

    Armita Nourmohammad

    2017-08-01

    Full Text Available Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis.

  14. Molecular evolution of a novel family of putative calcium transporters.

    Directory of Open Access Journals (Sweden)

    Didier Demaegd

    Full Text Available The UPF0016 family is a group of uncharacterized membrane proteins, well conserved through evolution and defined by the presence of one or two copies of an E-Φ-G-D-(KR-(ST consensus motif. Our previous results have shown that two members of this family, the human TMEM165 and the budding yeast Gdt1p, are functionally related and might form a new group of cation/Ca2+ exchangers. Most members of the family are made of two homologous clusters of three transmembrane spans, separated by a central loop and assembled with an opposite orientation in the membrane. However, some bacterial members of the family have only one cluster of transmembrane domains. Among these 'single-domain membrane proteins' some cyanobacterial members were found as pairs of adjacent genes within the genome, but each gene was slightly different. We performed a bioinformatic analysis to propose the molecular evolution of the UPF0016 family and the emergence of the antiparallel topology. Our hypotheses were confirmed experimentally using functional complementation in yeast. This suggests an important and conserved function for UPF0016 proteins in a fundamental cellular process. We also show that members of the UPF0016 family share striking similarities, but no primary sequence homology, with members of the cation/Ca2+ exchangers (CaCA superfamily. Such similarities could be an example of convergent evolution, supporting the previous hypothesis that members of the UPF0016 family are cation/Ca2+ exchangers.

  15. Concerted evolution of sea anemone neurotoxin genes is revealed through analysis of the Nematostella vectensis genome.

    Science.gov (United States)

    Moran, Yehu; Weinberger, Hagar; Sullivan, James C; Reitzel, Adam M; Finnerty, John R; Gurevitz, Michael

    2008-04-01

    Gene families, which encode toxins, are found in many poisonous animals, yet there is limited understanding of their evolution at the nucleotide level. The release of the genome draft sequence for the sea anemone Nematostella vectensis enabled a comprehensive study of a gene family whose neurotoxin products affect voltage-gated sodium channels. All gene family members are clustered in a highly repetitive approximately 30-kb genomic region and encode a single toxin, Nv1. These genes exhibit extreme conservation at the nucleotide level which cannot be explained by purifying selection. This conservation greatly differs from the toxin gene families of other animals (e.g., snakes, scorpions, and cone snails), whose evolution was driven by diversifying selection, thereby generating a high degree of genetic diversity. The low nucleotide diversity at the Nv1 genes is reminiscent of that reported for DNA encoding ribosomal RNA (rDNA) and 2 hsp70 genes from Drosophila, which have evolved via concerted evolution. This evolutionary pattern was experimentally demonstrated in yeast rDNA and was shown to involve unequal crossing-over. Through sequence analysis of toxin genes from multiple N. vectensis populations and 2 other anemone species, Anemonia viridis and Actinia equina, we observed that the toxin genes for each sea anemone species are more similar to one another than to those of other species, suggesting they evolved by manner of concerted evolution. Furthermore, in 2 of the species (A. viridis and A. equina) we found genes that evolved under diversifying selection, suggesting that concerted evolution and accelerated evolution may occur simultaneously.

  16. Evolution and diversification of the CYC/TB1 gene family in Asteraceae--a comparative study in Gerbera (Mutisieae) and sunflower (Heliantheae).

    Science.gov (United States)

    Tähtiharju, Sari; Rijpkema, Anneke S; Vetterli, Adrien; Albert, Victor A; Teeri, Teemu H; Elomaa, Paula

    2012-04-01

    Plant-specific TCP domain transcription factors have been shown to regulate morphological novelties during plant evolution, including the complex architecture of the Asteraceae inflorescence that involves different types of flowers. We conducted comparative analysis of the CYCLOIDEA/TEOSINTE BRANCHED1 (CYC/TB1) gene family in Gerbera hybrida (gerbera) and Helianthus annuus (sunflower), two species that represent distant tribes within Asteraceae. Our data confirm that the CYC/TB1 gene family has expanded in Asteraceae, a condition that appears to be connected with the increased developmental complexity and evolutionary success of this large plant family. Phylogenetic analysis of the CYC/TB1 gene family revealed both shared and lineage-specific duplications in gerbera and sunflower, corresponding to the three gene lineages previously identified as specific to core eudicots: CYC1, CYC2, and CYC3. Expression analyses of early stages of flower primordia development indicated that especially within the CYC2 clade, with the greatest number of secondary gene duplications, gene expression patterns are conserved between the species and associated with flower and inflorescence development. All sunflower and gerbera CYC2 clade genes showed differential expression between developing flower types, being upregulated in marginal ray (and trans) flowers. One gene in gerbera (GhCYC3) and two in sunflower (HaCYC2d and HaCYC2c) were indicated to be strong candidates as regulators of ray flower identity, a function that is specific for Asteraceae. Our data further showed that other CYC2 clade genes are likely to have more specialized functions at the level of single flowers, including the late functions in floral reproductive organs that may be more conserved across plant families. The expression patterns of CYC1 and CYC3 clade genes showed more differences between the two species but still pointed to possible conserved functions during vegetative plant development. Pairwise protein

  17. The Basic/Helix-Loop-Helix Protein Family in Gossypium: Reference Genes and Their Evolution during Tetraploidization.

    Directory of Open Access Journals (Sweden)

    Qian Yan

    Full Text Available Basic/helix-loop-helix (bHLH proteins comprise one of the largest transcription factor families and play important roles in diverse cellular and molecular processes. Comprehensive analyses of the composition and evolution of the bHLH family in cotton are essential to elucidate their functions and the molecular basis of cotton development. By searching bHLH homologous genes in sequenced diploid cotton genomes (Gossypium raimondii and G. arboreum, a set of cotton bHLH reference genes containing 289 paralogs were identified and named as GobHLH001-289. Based on their phylogenetic relationships, these cotton bHLH proteins were clustered into 27 subfamilies. Compared to those in Arabidopsis and cacao, cotton bHLH proteins generally increased in number, but unevenly in different subfamilies. To further uncover evolutionary changes of bHLH genes during tetraploidization of cotton, all genes of S5a and S5b subfamilies in upland cotton and its diploid progenitors were cloned and compared, and their transcript profiles were determined in upland cotton. A total of 10 genes of S5a and S5b subfamilies (doubled from A- and D-genome progenitors maintained in tetraploid cottons. The major sequence changes in upland cotton included a 15-bp in-frame deletion in GhbHLH130D and a long terminal repeat retrotransposon inserted in GhbHLH062A, which eliminated GhbHLH062A expression in various tissues. The S5a and S5b bHLH genes of A and D genomes (except GobHLH062 showed similar transcription patterns in various tissues including roots, stems, leaves, petals, ovules, and fibers, while the A- and D-genome genes of GobHLH110 and GobHLH130 displayed clearly different transcript profiles during fiber development. In total, this study represented a genome-wide analysis of cotton bHLH family, and revealed significant changes in sequence and expression of these genes in tetraploid cottons, which paved the way for further functional analyses of bHLH genes in the cotton genus.

  18. Structure, evolution and functional inference on the Mildew Locus O (MLO) gene family in three cultivated Cucurbitaceae spp.

    Science.gov (United States)

    Iovieno, Paolo; Andolfo, Giuseppe; Schiavulli, Adalgisa; Catalano, Domenico; Ricciardi, Luigi; Frusciante, Luigi; Ercolano, Maria Raffaella; Pavan, Stefano

    2015-12-29

    The powdery mildew disease affects thousands of plant species and arguably represents the major fungal threat for many Cucurbitaceae crops, including melon (Cucumis melo L.), watermelon (Citrullus lanatus L.) and zucchini (Cucurbita pepo L.). Several studies revealed that specific members of the Mildew Locus O (MLO) gene family act as powdery mildew susceptibility factors. Indeed, their inactivation, as the result of gene knock-out or knock-down, is associated with a peculiar form of resistance, referred to as mlo resistance. We exploited recently available genomic information to provide a comprehensive overview of the MLO gene family in Cucurbitaceae. We report the identification of 16 MLO homologs in C. melo, 14 in C. lanatus and 18 in C. pepo genomes. Bioinformatic treatment of data allowed phylogenetic inference and the prediction of several ortholog pairs and groups. Comparison with functionally characterized MLO genes and, in C. lanatus, gene expression analysis, resulted in the detection of candidate powdery mildew susceptibility factors. We identified a series of conserved amino acid residues and motifs that are likely to play a major role for the function of MLO proteins. Finally, we performed a codon-based evolutionary analysis indicating a general high level of purifying selection in the three Cucurbitaceae MLO gene families, and the occurrence of regions under diversifying selection in candidate susceptibility factors. Results of this study may help to address further biological questions concerning the evolution and function of MLO genes. Moreover, data reported here could be conveniently used by breeding research, aiming to select powdery mildew resistant cultivars in Cucurbitaceae.

  19. Evolutionary genomics and adaptive evolution of the Hedgehog gene family (Shh, Ihh and Dhh in vertebrates.

    Directory of Open Access Journals (Sweden)

    Joana Pereira

    Full Text Available The Hedgehog (Hh gene family codes for a class of secreted proteins composed of two active domains that act as signalling molecules during embryo development, namely for the development of the nervous and skeletal systems and the formation of the testis cord. While only one Hh gene is found typically in invertebrate genomes, most vertebrates species have three (Sonic hedgehog--Shh; Indian hedgehog--Ihh; and Desert hedgehog--Dhh, each with different expression patterns and functions, which likely helped promote the increasing complexity of vertebrates and their successful diversification. In this study, we used comparative genomic and adaptive evolutionary analyses to characterize the evolution of the Hh genes in vertebrates following the two major whole genome duplication (WGD events. To overcome the lack of Hh-coding sequences on avian publicly available databases, we used an extensive dataset of 45 avian and three non-avian reptilian genomes to show that birds have all three Hh paralogs. We find suggestions that following the WGD events, vertebrate Hh paralogous genes evolved independently within similar linkage groups and under different evolutionary rates, especially within the catalytic domain. The structural regions around the ion-binding site were identified to be under positive selection in the signaling domain. These findings contrast with those observed in invertebrates, where different lineages that experienced gene duplication retained similar selective constraints in the Hh orthologs. Our results provide new insights on the evolutionary history of the Hh gene family, the functional roles of these paralogs in vertebrate species, and on the location of mutational hotspots.

  20. Evolutionary genomics and adaptive evolution of the Hedgehog gene family (Shh, Ihh and Dhh) in vertebrates.

    Science.gov (United States)

    Pereira, Joana; Johnson, Warren E; O'Brien, Stephen J; Jarvis, Erich D; Zhang, Guojie; Gilbert, M Thomas P; Vasconcelos, Vitor; Antunes, Agostinho

    2014-01-01

    The Hedgehog (Hh) gene family codes for a class of secreted proteins composed of two active domains that act as signalling molecules during embryo development, namely for the development of the nervous and skeletal systems and the formation of the testis cord. While only one Hh gene is found typically in invertebrate genomes, most vertebrates species have three (Sonic hedgehog--Shh; Indian hedgehog--Ihh; and Desert hedgehog--Dhh), each with different expression patterns and functions, which likely helped promote the increasing complexity of vertebrates and their successful diversification. In this study, we used comparative genomic and adaptive evolutionary analyses to characterize the evolution of the Hh genes in vertebrates following the two major whole genome duplication (WGD) events. To overcome the lack of Hh-coding sequences on avian publicly available databases, we used an extensive dataset of 45 avian and three non-avian reptilian genomes to show that birds have all three Hh paralogs. We find suggestions that following the WGD events, vertebrate Hh paralogous genes evolved independently within similar linkage groups and under different evolutionary rates, especially within the catalytic domain. The structural regions around the ion-binding site were identified to be under positive selection in the signaling domain. These findings contrast with those observed in invertebrates, where different lineages that experienced gene duplication retained similar selective constraints in the Hh orthologs. Our results provide new insights on the evolutionary history of the Hh gene family, the functional roles of these paralogs in vertebrate species, and on the location of mutational hotspots.

  1. Analysis of four achaete-scute homologs in Bombyx mori reveals new viewpoints of the evolution and functions of this gene family.

    Science.gov (United States)

    Zhou, Qingxiang; Zhang, Tianyi; Xu, Weihua; Yu, Linlin; Yi, Yongzhu; Zhang, Zhifang

    2008-03-06

    achaete-scute complexe (AS-C) has been widely studied at genetic, developmental and evolutional levels. Genes of this family encode proteins containing a highly conserved bHLH domain, which take part in the regulation of the development of central nervous system and peripheral nervous system. Many AS-C homologs have been isolated from various vertebrates and invertebrates. Also, AS-C genes are duplicated during the evolution of Diptera. Functions besides neural development controlling have also been found in Drosophila AS-C genes. We cloned four achaete-scute homologs (ASH) from the lepidopteran model organism Bombyx mori, including three proneural genes and one neural precursor gene. Proteins encoded by them contained the characteristic bHLH domain and the three proneural ones were also found to have the C-terminal conserved motif. These genes regulated promoter activity through the Class A E-boxes in vitro. Though both Bm-ASH and Drosophila AS-C have four members, they are not in one by one corresponding relationships. Results of RT-PCR and real-time PCR showed that Bm-ASH genes were expressed in different larval tissues, and had well-regulated expressional profiles during the development of embryo and wing/wing disc. There are four achaete-scute homologs in Bombyx mori, the second insect having four AS-C genes so far, and these genes have multiple functions in silkworm life cycle. AS-C gene duplication in insects occurs after or parallel to, but not before the taxonomic order formation during evolution.

  2. Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster

    Science.gov (United States)

    Robertson, Hugh M.; Warr, Coral G.; Carlson, John R.

    2003-01-01

    The insect chemoreceptor superfamily in Drosophila melanogaster is predicted to consist of 62 odorant receptor (Or) and 68 gustatory receptor (Gr) proteins, encoded by families of 60 Or and 60 Gr genes through alternative splicing. We include two previously undescribed Or genes and two previously undescribed Gr genes; two previously predicted Or genes are shown to be alternative splice forms. Three polymorphic pseudogenes and one highly defective pseudogene are recognized. Phylogenetic analysis reveals deep branches connecting multiple highly divergent clades within the Gr family, and the Or family appears to be a single highly expanded lineage within the superfamily. The genes are spread throughout the Drosophila genome, with some relatively recently diverged genes still clustered in the genome. The Gr5a gene on the X chromosome, which encodes a receptor for the sugar trehalose, has transposed from one such tandem cluster of six genes at cytological location 64, as has Gr61a, and all eight of these receptors might bind sugars. Analysis of intron evolution suggests that the common ancestor consisted of a long N-terminal exon encoding transmembrane domains 1-5 followed by three exons encoding transmembrane domains 6-7. As many as 57 additional introns have been acquired idiosyncratically during the evolution of the superfamily, whereas the ancestral introns and some of the older idiosyncratic introns have been lost at least 48 times independently. Altogether, these patterns of molecular evolution suggest that this is an ancient superfamily of chemoreceptors, probably dating back at least to the origin of the arthropods. PMID:14608037

  3. The evolution of CHROMOMETHYLASES and gene body DNA methylation in plants.

    Science.gov (United States)

    Bewick, Adam J; Niederhuth, Chad E; Ji, Lexiang; Rohr, Nicholas A; Griffin, Patrick T; Leebens-Mack, Jim; Schmitz, Robert J

    2017-05-01

    The evolution of gene body methylation (gbM), its origins, and its functional consequences are poorly understood. By pairing the largest collection of transcriptomes (>1000) and methylomes (77) across Viridiplantae, we provide novel insights into the evolution of gbM and its relationship to CHROMOMETHYLASE (CMT) proteins. CMTs are evolutionary conserved DNA methyltransferases in Viridiplantae. Duplication events gave rise to what are now referred to as CMT1, 2 and 3. Independent losses of CMT1, 2, and 3 in eudicots, CMT2 and ZMET in monocots and monocots/commelinids, variation in copy number, and non-neutral evolution suggests overlapping or fluid functional evolution of this gene family. DNA methylation within genes is widespread and is found in all major taxonomic groups of Viridiplantae investigated. Genes enriched with methylated CGs (mCG) were also identified in species sister to angiosperms. The proportion of genes and DNA methylation patterns associated with gbM are restricted to angiosperms with a functional CMT3 or ortholog. However, mCG-enriched genes in the gymnosperm Pinus taeda shared some similarities with gbM genes in Amborella trichopoda. Additionally, gymnosperms and ferns share a CMT homolog closely related to CMT2 and 3. Hence, the dependency of gbM on a CMT most likely extends to all angiosperms and possibly gymnosperms and ferns. The resulting gene family phylogeny of CMT transcripts from the most diverse sampling of plants to date redefines our understanding of CMT evolution and its evolutionary consequences on DNA methylation. Future, functional tests of homologous and paralogous CMTs will uncover novel roles and consequences to the epigenome.

  4. Diversification, phylogeny and evolution of auxin response factor (ARF) family: insights gained from analyzing maize ARF genes.

    Science.gov (United States)

    Wang, Yijun; Deng, Dexiang; Shi, Yating; Miao, Nan; Bian, Yunlong; Yin, Zhitong

    2012-03-01

    Auxin response factors (ARFs), member of the plant-specific B3 DNA binding superfamily, target specifically to auxin response elements (AuxREs) in promoters of primary auxin-responsive genes and heterodimerize with Aux/IAA proteins in auxin signaling transduction cascade. In previous research, we have isolated and characterized maize Aux/IAA genes in whole-genome scale. Here, we report the comprehensive analysis of ARF genes in maize. A total of 36 ARF genes were identified and validated from the B73 maize genome through an iterative strategy. Thirty-six maize ARF genes are distributed in all maize chromosomes except chromosome 7. Maize ARF genes expansion is mainly due to recent segmental duplications. Maize ARF proteins share one B3 DNA binding domain which consists of seven-stranded β sheets and two short α helixes. Twelve maize ARFs with glutamine-rich middle regions could be as activators in modulating expression of auxin-responsive genes. Eleven maize ARF proteins are lack of homo- and heterodimerization domains. Putative cis-elements involved in phytohormones and light signaling responses, biotic and abiotic stress adaption locate in promoters of maize ARF genes. Expression patterns vary greatly between clades and sister pairs of maize ARF genes. The B3 DNA binding and auxin response factor domains of maize ARF proteins are primarily subjected to negative selection during selective sweep. The mixed selective forces drive the diversification and evolution of genomic regions outside of B3 and ARF domains. Additionally, the dicot-specific proliferation of ARF genes was detected. Comparative genomics analysis indicated that maize, sorghum and rice duplicate chromosomal blocks containing ARF homologs are highly syntenic. This study provides insights into the distribution, phylogeny and evolution of ARF gene family.

  5. FGF: A web tool for Fishing Gene Family in a whole genome database

    DEFF Research Database (Denmark)

    Zheng, Hongkun; Shi, Junjie; Fang, Xiaodong

    2007-01-01

    Gene duplication is an important process in evolution. The availability of genome sequences of a number of organisms has made it possible to conduct comprehensive searches for duplicated genes enabling informative studies of their evolution. We have established the FGF (Fishing Gene Family) progr...... is freely available on a web server at http://fgf.genomics.org.cn/...

  6. Analysis of four achaete-scute homologs in Bombyx mori reveals new viewpoints of the evolution and functions of this gene family

    Directory of Open Access Journals (Sweden)

    Yi Yongzhu

    2008-03-01

    Full Text Available Abstract Background achaete-scute complexe (AS-C has been widely studied at genetic, developmental and evolutional levels. Genes of this family encode proteins containing a highly conserved bHLH domain, which take part in the regulation of the development of central nervous system and peripheral nervous system. Many AS-C homologs have been isolated from various vertebrates and invertebrates. Also, AS-C genes are duplicated during the evolution of Diptera. Functions besides neural development controlling have also been found in Drosophila AS-C genes. Results We cloned four achaete-scute homologs (ASH from the lepidopteran model organism Bombyx mori, including three proneural genes and one neural precursor gene. Proteins encoded by them contained the characteristic bHLH domain and the three proneural ones were also found to have the C-terminal conserved motif. These genes regulated promoter activity through the Class A E-boxes in vitro. Though both Bm-ASH and Drosophila AS-C have four members, they are not in one by one corresponding relationships. Results of RT-PCR and real-time PCR showed that Bm-ASH genes were expressed in different larval tissues, and had well-regulated expressional profiles during the development of embryo and wing/wing disc. Conclusion There are four achaete-scute homologs in Bombyx mori, the second insect having four AS-C genes so far, and these genes have multiple functions in silkworm life cycle. AS-C gene duplication in insects occurs after or parallel to, but not before the taxonomic order formation during evolution.

  7. Evolution of the Yellow/Major Royal Jelly Protein family and the emergence of social behavior in honey bees.

    Science.gov (United States)

    Drapeau, Mark David; Albert, Stefan; Kucharski, Robert; Prusko, Carsten; Maleszka, Ryszard

    2006-11-01

    The genomic architecture underlying the evolution of insect social behavior is largely a mystery. Eusociality, defined by overlapping generations, parental brood care, and reproductive division of labor, has most commonly evolved in the Hymenopteran insects, including the honey bee Apis mellifera. In this species, the Major Royal Jelly Protein (MRJP) family is required for all major aspects of eusocial behavior. Here, using data obtained from the A. mellifera genome sequencing project, we demonstrate that the MRJP family is encoded by nine genes arranged in an approximately 60-kb tandem array. Furthermore, the MRJP protein family appears to have evolved from a single progenitor gene that encodes a member of the ancient Yellow protein family. Five genes encoding Yellow-family proteins flank the genomic region containing the genes encoding MRJPs. We describe the molecular evolution of these protein families. We then characterize developmental-stage-specific, sex-specific, and caste-specific expression patterns of the mrjp and yellow genes in the honey bee. We review empirical evidence concerning the functions of Yellow proteins in fruit flies and social ants, in order to shed light on the roles of both Yellow and MRJP proteins in A. mellifera. In total, the available evidence suggests that Yellows and MRJPs are multifunctional proteins with diverse, context-dependent physiological and developmental roles. However, many members of the Yellow/MRJP family act as facilitators of reproductive maturation. Finally, it appears that MRJP protein subfamily evolution from the Yellow protein family may have coincided with the evolution of honey bee eusociality.

  8. Identification and expression profiling analysis of TCP family genes involved in growth and development in maize.

    Science.gov (United States)

    Chai, Wenbo; Jiang, Pengfei; Huang, Guoyu; Jiang, Haiyang; Li, Xiaoyu

    2017-10-01

    The TCP family is a group of plant-specific transcription factors. TCP genes encode proteins harboring bHLH structure, which is implicated in DNA binding and protein-protein interactions and known as the TCP domain. TCP genes play important roles in plant development and have been evolutionarily and functionally elaborated in various plants, however, no overall phylogenetic analysis or expression profiling of TCP genes in Zea mays has been reported. In the present study, a systematic analysis of molecular evolution and functional prediction of TCP family genes in maize ( Z . mays L.) has been conducted. We performed a genome-wide survey of TCP genes in maize, revealing the gene structure, chromosomal location and phylogenetic relationship of family members. Microsynteny between grass species and tissue-specific expression profiles were also investigated. In total, 29 TCP genes were identified in the maize genome, unevenly distributed on the 10 maize chromosomes. Additionally, ZmTCP genes were categorized into nine classes based on phylogeny and purifying selection may largely be responsible for maintaining the functions of maize TCP genes. What's more, microsynteny analysis suggested that TCP genes have been conserved during evolution. Finally, expression analysis revealed that most TCP genes are expressed in the stem and ear, which suggests that ZmTCP genes influence stem and ear growth. This result is consistent with the previous finding that maize TCP genes represses the growth of axillary organs and enables the formation of female inflorescences. Altogether, this study presents a thorough overview of TCP family in maize and provides a new perspective on the evolution of this gene family. The results also indicate that TCP family genes may be involved in development stage in plant growing conditions. Additionally, our results will be useful for further functional analysis of the TCP gene family in maize.

  9. Adaptive evolution of the Hox gene family for development in bats and dolphins.

    Directory of Open Access Journals (Sweden)

    Lu Liang

    Full Text Available Bats and cetaceans (i.e., whales, dolphins, porpoises are two kinds of mammals with unique locomotive styles and occupy novel niches. Bats are the only mammals capable of sustained flight in the sky, while cetaceans have returned to the aquatic environment and are specialized for swimming. Associated with these novel adaptations to their environment, various development changes have occurred to their body plans and associated structures. Given the importance of Hox genes in many aspects of embryonic development, we conducted an analysis of the coding regions of all Hox gene family members from bats (represented by Pteropus vampyrus, Pteropus alecto, Myotis lucifugus and Myotis davidii and cetaceans (represented by Tursiops truncatus for adaptive evolution using the available draft genome sequences. Differences in the selective pressures acting on many Hox genes in bats and cetaceans were found compared to other mammals. Positive selection, however, was not found to act on any of the Hox genes in the common ancestor of bats and only upon Hoxb9 in cetaceans. PCR amplification data from additional bat and cetacean species, and application of the branch-site test 2, showed that the Hoxb2 gene within bats had significant evidence of positive selection. Thus, our study, with genomic and newly sequenced Hox genes, identifies two candidate Hox genes that may be closely linked with developmental changes in bats and cetaceans, such as those associated with the pancreatic, neuronal, thymus shape and forelimb. In addition, the difference in our results from the genome-wide scan and newly sequenced data reveals that great care must be taken in interpreting results from draft genome data from a limited number of species, and deep genetic sampling of a particular clade is a powerful tool for generating complementary data to address this limitation.

  10. The evolution and expression of panarthropod frizzled genes

    Directory of Open Access Journals (Sweden)

    Ralf eJanssen

    2015-08-01

    Full Text Available Wnt signaling regulates many important processes during metazoan development. It has been shown that Wnt ligands represent an ancient and diverse family of proteins that likely function in complex signaling landscapes to induce target cells via receptors including those of the Frizzled (Fz family. The four subfamilies of Fz receptors also evolved early in metazoan evolution. To date, Fz receptors have been characterised mainly in mammals, the nematode Caenorhabditis elegans and insects such as Drosophila melanogaster. To compare these findings with other metazoans, we explored the repertoire of fz genes in three panarthropod species: Parasteatoda tepidariorum, Glomeris marginata and Euperipatoides kanangrensis, representing the Chelicerata, Myriapoda and Onychophora respectively. We found that these three diverse panarthropods each have four fz genes, with representatives of all four metazoan fz subfamilies found in Glomeris and Euperipatoides, while Parasteatoda does not have a fz3 gene, but has two fz4 paralogues. Furthermore we characterized the expression patterns of all the fz genes among these animals. Our results exemplify the evolutionary diversity of Fz receptors and reveals conserved and divergent aspects of their protein sequences and expression patterns among panarthropods; thus providing new insights into the evolution of Wnt signaling more generally.

  11. Ultra Large Gene Families: A Matter of Adaptation or Genomic Parasites?

    Directory of Open Access Journals (Sweden)

    Philipp H. Schiffer

    2016-08-01

    Full Text Available Gene duplication is an important mechanism of molecular evolution. It offers a fast track to modification, diversification, redundancy or rescue of gene function. However, duplication may also be neutral or (slightly deleterious, and often ends in pseudo-geneisation. Here, we investigate the phylogenetic distribution of ultra large gene families on long and short evolutionary time scales. In particular, we focus on a family of NACHT-domain and leucine-rich-repeat-containing (NLR-genes, which we previously found in large numbers to occupy one chromosome arm of the zebrafish genome. We were interested to see whether such a tight clustering is characteristic for ultra large gene families. Our data reconfirm that most gene family inflations are lineage-specific, but we can only identify very few gene clusters. Based on our observations we hypothesise that, beyond a certain size threshold, ultra large gene families continue to proliferate in a mechanism we term “run-away evolution”. This process might ultimately lead to the failure of genomic integrity and drive species to extinction.

  12. Evolution of the defensin-like gene family in grass genomes

    Indian Academy of Sciences (India)

    that the DEFL gene family is subjected to purifying selection. However, sliding window analysis .... sorghum from DOE-JGI Community Sequencing Program ..... This work was supported by the National Key Technologies Re- search and ...

  13. Dynamic evolution of bitter taste receptor genes in vertebrates

    Directory of Open Access Journals (Sweden)

    Jones Gareth

    2009-01-01

    Full Text Available Abstract Background Sensing bitter tastes is crucial for many animals because it can prevent them from ingesting harmful foods. This process is mainly mediated by the bitter taste receptors (T2R, which are largely expressed in the taste buds. Previous studies have identified some T2R gene repertoires, and marked variation in repertoire size has been noted among species. However, the mechanisms underlying the evolution of vertebrate T2R genes remain poorly understood. Results To better understand the evolutionary pattern of these genes, we identified 16 T2R gene repertoires based on the high coverage genome sequences of vertebrates and studied the evolutionary changes in the number of T2R genes during birth-and-death evolution using the reconciled-tree method. We found that the number of T2R genes and the fraction of pseudogenes vary extensively among species. Based on the results of phylogenetic analysis, we showed that T2R gene families in teleost fishes are more diverse than those in tetrapods. In addition to the independent gene expansions in teleost fishes, frogs and mammals, lineage-specific gene duplications were also detected in lizards. Furthermore, extensive gains and losses of T2R genes were detected in each lineage during their evolution, resulting in widely differing T2R gene repertoires. Conclusion These results further support the hypotheses that T2R gene repertoires are closely related to the dietary habits of different species and that birth-and-death evolution is associated with adaptations to dietary changes.

  14. The evolutionary history of the SAL1 gene family in eutherian mammals

    Directory of Open Access Journals (Sweden)

    Callebaut Isabelle

    2011-05-01

    Full Text Available Abstract Background SAL1 (salivary lipocalin is a member of the OBP (Odorant Binding Protein family and is involved in chemical sexual communication in pig. SAL1 and its relatives may be involved in pheromone and olfactory receptor binding and in pre-mating behaviour. The evolutionary history and the selective pressures acting on SAL1 and its orthologous genes have not yet been exhaustively described. The aim of the present work was to study the evolution of these genes, to elucidate the role of selective pressures in their evolution and the consequences for their functions. Results Here, we present the evolutionary history of SAL1 gene and its orthologous genes in mammals. We found that (1 SAL1 and its related genes arose in eutherian mammals with lineage-specific duplications in rodents, horse and cow and are lost in human, mouse lemur, bushbaby and orangutan, (2 the evolution of duplicated genes of horse, rat, mouse and guinea pig is driven by concerted evolution with extensive gene conversion events in mouse and guinea pig and by positive selection mainly acting on paralogous genes in horse and guinea pig, (3 positive selection was detected for amino acids involved in pheromone binding and amino acids putatively involved in olfactory receptor binding, (4 positive selection was also found for lineage, indicating a species-specific strategy for amino acid selection. Conclusions This work provides new insights into the evolutionary history of SAL1 and its orthologs. On one hand, some genes are subject to concerted evolution and to an increase in dosage, suggesting the need for homogeneity of sequence and function in certain species. On the other hand, positive selection plays a role in the diversification of the functions of the family and in lineage, suggesting adaptive evolution, with possible consequences for speciation and for the reinforcement of prezygotic barriers.

  15. Gain, loss and divergence in primate zinc-finger genes: a rich resource for evolution of gene regulatory differences between species.

    Directory of Open Access Journals (Sweden)

    Katja Nowick

    Full Text Available The molecular changes underlying major phenotypic differences between humans and other primates are not well understood, but alterations in gene regulation are likely to play a major role. Here we performed a thorough evolutionary analysis of the largest family of primate transcription factors, the Krüppel-type zinc finger (KZNF gene family. We identified and curated gene and pseudogene models for KZNFs in three primate species, chimpanzee, orangutan and rhesus macaque, to allow for a comparison with the curated set of human KZNFs. We show that the recent evolutionary history of primate KZNFs has been complex, including many lineage-specific duplications and deletions. We found 213 species-specific KZNFs, among them 7 human-specific and 23 chimpanzee-specific genes. Two human-specific genes were validated experimentally. Ten genes have been lost in humans and 13 in chimpanzees, either through deletion or pseudogenization. We also identified 30 KZNF orthologs with human-specific and 42 with chimpanzee-specific sequence changes that are predicted to affect DNA binding properties of the proteins. Eleven of these genes show signatures of accelerated evolution, suggesting positive selection between humans and chimpanzees. During primate evolution the most extensive re-shaping of the KZNF repertoire, including most gene additions, pseudogenizations, and structural changes occurred within the subfamily homininae. Using zinc finger (ZNF binding predictions, we suggest potential impact these changes have had on human gene regulatory networks. The large species differences in this family of TFs stands in stark contrast to the overall high conservation of primate genomes and potentially represents a potent driver of primate evolution.

  16. Molecular evolution of the crustacean hyperglycemic hormone family in ecdysozoans

    Directory of Open Access Journals (Sweden)

    Soyez Daniel

    2010-02-01

    Full Text Available Abstract Background Crustacean Hyperglycemic Hormone (CHH family peptides are neurohormones known to regulate several important functions in decapod crustaceans such as ionic and energetic metabolism, molting and reproduction. The structural conservation of these peptides, together with the variety of functions they display, led us to investigate their evolutionary history. CHH family peptides exist in insects (Ion Transport Peptides and may be present in all ecdysozoans as well. In order to extend the evolutionary study to the entire family, CHH family peptides were thus searched in taxa outside decapods, where they have been, to date, poorly investigated. Results CHH family peptides were characterized by molecular cloning in a branchiopod crustacean, Daphnia magna, and in a collembolan, Folsomia candida. Genes encoding such peptides were also rebuilt in silico from genomic sequences of another branchiopod, a chelicerate and two nematodes. These sequences were included in updated datasets to build phylogenies of the CHH family in pancrustaceans. These phylogenies suggest that peptides found in Branchiopoda and Collembola are more closely related to insect ITPs than to crustacean CHHs. Datasets were also used to support a phylogenetic hypothesis about pancrustacean relationships, which, in addition to gene structures, allowed us to propose two evolutionary scenarios of this multigenic family in ecdysozoans. Conclusions Evolutionary scenarios suggest that CHH family genes of ecdysozoans originate from an ancestral two-exon gene, and genes of arthropods from a three-exon one. In malacostracans, the evolution of the CHH family has involved several duplication, insertion or deletion events, leading to neuropeptides with a wide variety of functions, as observed in decapods. This family could thus constitute a promising model to investigate the links between gene duplications and functional divergence.

  17. Relaxin gene family in teleosts: phylogeny, syntenic mapping, selective constraint, andexpression analysis

    Directory of Open Access Journals (Sweden)

    Glen Peter

    2009-12-01

    Full Text Available Abstract Background In recent years, the relaxin family of signaling molecules has been shown to play diverse roles in mammalian physiology, but little is known about its diversity or physiology in teleosts, an infraclass of the bony fishes comprising ~ 50% of all extant vertebrates. In this paper, 32 relaxin family sequences were obtained by searching genomic and cDNA databases from eight teleost species; phylogenetic, molecular evolutionary, and syntenic data analyses were conducted to understand the relationship and differential patterns of evolution of relaxin family genes in teleosts compared with mammals. Additionally, real-time quantitative PCR was used to confirm and assess the tissues of expression of five relaxin family genes in Danio rerio and in situ hybridization used to assess the site-specific expression of the insulin 3-like gene in D. rerio testis. Results Up to six relaxin family genes were identified in each teleost species. Comparative syntenic mapping revealed that fish possess two paralogous copies of human RLN3, which we call rln3a and rln3b, an orthologue of human RLN2, rln, two paralogous copies of human INSL5, insl5a and insl5b, and an orthologue of human INSL3, insl3. Molecular evolutionary analyses indicated that: rln3a, rln3b and rln are under strong evolutionary constraint, that insl3 has been subject to moderate rates of sequence evolution with two amino acids in insl3/INSL3 showing evidence of positively selection, and that insl5b exhibits a higher rate of sequence evolution than its paralogue insl5a suggesting that it may have been neo-functionalized after the teleost whole genome duplication. Quantitative PCR analyses in D. rerio indicated that rln3a and rln3b are expressed in brain, insl3 is highly expressed in gonads, and that there was low expression of both insl5 genes in adult zebrafish. Finally, in situ hybridization of insl3 in D. rerio testes showed highly specific hybridization to interstitial Leydig

  18. Early vertebrate chromosome duplications and the evolution of the neuropeptide Y receptor gene regions

    Directory of Open Access Journals (Sweden)

    Brenner Sydney

    2008-06-01

    Full Text Available Abstract Background One of the many gene families that expanded in early vertebrate evolution is the neuropeptide (NPY receptor family of G-protein coupled receptors. Earlier work by our lab suggested that several of the NPY receptor genes found in extant vertebrates resulted from two genome duplications before the origin of jawed vertebrates (gnathostomes and one additional genome duplication in the actinopterygian lineage, based on their location on chromosomes sharing several gene families. In this study we have investigated, in five vertebrate genomes, 45 gene families with members close to the NPY receptor genes in the compact genomes of the teleost fishes Tetraodon nigroviridis and Takifugu rubripes. These correspond to Homo sapiens chromosomes 4, 5, 8 and 10. Results Chromosome regions with conserved synteny were identified and confirmed by phylogenetic analyses in H. sapiens, M. musculus, D. rerio, T. rubripes and T. nigroviridis. 26 gene families, including the NPY receptor genes, (plus 3 described recently by other labs showed a tree topology consistent with duplications in early vertebrate evolution and in the actinopterygian lineage, thereby supporting expansion through block duplications. Eight gene families had complications that precluded analysis (such as short sequence length or variable number of repeated domains and another eight families did not support block duplications (because the paralogs in these families seem to have originated in another time window than the proposed genome duplication events. RT-PCR carried out with several tissues in T. rubripes revealed that all five NPY receptors were expressed in the brain and subtypes Y2, Y4 and Y8 were also expressed in peripheral organs. Conclusion We conclude that the phylogenetic analyses and chromosomal locations of these gene families support duplications of large blocks of genes or even entire chromosomes. Thus, these results are consistent with two early vertebrate

  19. Evolution and Stress Responses of Gossypium hirsutum SWEET Genes.

    Science.gov (United States)

    Li, Wei; Ren, Zhongying; Wang, Zhenyu; Sun, Kuan; Pei, Xiaoyu; Liu, Yangai; He, Kunlun; Zhang, Fei; Song, Chengxiang; Zhou, Xiaojian; Zhang, Wensheng; Ma, Xiongfeng; Yang, Daigang

    2018-03-08

    The SWEET (sugars will eventually be exported transporters) proteins are sugar efflux transporters containing the MtN3_saliva domain, which affects plant development as well as responses to biotic and abiotic stresses. These proteins have not been functionally characterized in the tetraploid cotton, Gossypium hirsutum , which is a widely cultivated cotton species. In this study, we comprehensively analyzed the cotton SWEET gene family. A total of 55 putative G. hirsutum SWEET genes were identified. The GhSWEET genes were classified into four clades based on a phylogenetic analysis and on the examination of gene structural features. Moreover, chromosomal localization and an analysis of homologous genes in Gossypium arboreum , Gossypium raimondii , and G. hirsutum suggested that a whole-genome duplication, several tandem duplications, and a polyploidy event contributed to the expansion of the cotton SWEET gene family, especially in Clade III and IV. Analyses of cis -acting regulatory elements in the promoter regions, expression profiles, and artificial selection revealed that the GhSWEET genes were likely involved in cotton developmental processes and responses to diverse stresses. These findings may clarify the evolution of G. hirsutum SWEET gene family and may provide a foundation for future functional studies of SWEET proteins regarding cotton development and responses to abiotic stresses.

  20. Molecular Evolution of the Glycosyltransferase 6 Gene Family in Primates

    Directory of Open Access Journals (Sweden)

    Eliane Evanovich

    2016-01-01

    Full Text Available Glycosyltransferase 6 gene family includes ABO, Ggta1, iGb3S, and GBGT1 genes and by three putative genes restricted to mammals, GT6m6, GTm6, and GT6m7, only the latter is found in primates. GT6 genes may encode functional and nonfunctional proteins. Ggta1 and GBGT1 genes, for instance, are pseudogenes in catarrhine primates, while iGb3S gene is only inactive in human, bonobo, and chimpanzee. Even inactivated, these genes tend to be conversed in primates. As some of the GT6 genes are related to the susceptibility or resistance to parasites, we investigated (i the selective pressure on the GT6 paralogs genes in primates; (ii the basis of the conservation of iGb3S in human, chimpanzee, and bonobo; and (iii the functional potential of the GBGT1 and GT6m7 in catarrhines. We observed that the purifying selection is prevalent and these genes have a low diversity, though ABO and Ggta1 genes have some sites under positive selection. GT6m7, a putative gene associated with aggressive periodontitis, may have regulatory function, but experimental studies are needed to assess its function. The evolutionary conservation of iGb3S in humans, chimpanzee, and bonobo seems to be the result of proximity to genes with important biological functions.

  1. A first-principles model of early evolution: emergence of gene families, species, and preferred protein folds.

    Directory of Open Access Journals (Sweden)

    Konstantin B Zeldovich

    2007-07-01

    Full Text Available In this work we develop a microscopic physical model of early evolution where phenotype--organism life expectancy--is directly related to genotype--the stability of its proteins in their native conformations-which can be determined exactly in the model. Simulating the model on a computer, we consistently observe the "Big Bang" scenario whereby exponential population growth ensues as soon as favorable sequence-structure combinations (precursors of stable proteins are discovered. Upon that, random diversity of the structural space abruptly collapses into a small set of preferred proteins. We observe that protein folds remain stable and abundant in the population at timescales much greater than mutation or organism lifetime, and the distribution of the lifetimes of dominant folds in a population approximately follows a power law. The separation of evolutionary timescales between discovery of new folds and generation of new sequences gives rise to emergence of protein families and superfamilies whose sizes are power-law distributed, closely matching the same distributions for real proteins. On the population level we observe emergence of species--subpopulations that carry similar genomes. Further, we present a simple theory that relates stability of evolving proteins to the sizes of emerging genomes. Together, these results provide a microscopic first-principles picture of how first-gene families developed in the course of early evolution.

  2. Evolutionary history of chordate PAX genes: dynamics of change in a complex gene family.

    Directory of Open Access Journals (Sweden)

    Vanessa Rodrigues Paixão-Côrtes

    Full Text Available Paired box (PAX genes are transcription factors that play important roles in embryonic development. Although the PAX gene family occurs in animals only, it is widely distributed. Among the vertebrates, its 9 genes appear to be the product of complete duplication of an original set of 4 genes, followed by an additional partial duplication. Although some studies of PAX genes have been conducted, no comprehensive survey of these genes across the entire taxonomic unit has yet been attempted. In this study, we conducted a detailed comparison of PAX sequences from 188 chordates, which revealed restricted variation. The absence of PAX4 and PAX8 among some species of reptiles and birds was notable; however, all 9 genes were present in all 74 mammalian genomes investigated. A search for signatures of selection indicated that all genes are subject to purifying selection, with a possible constraint relaxation in PAX4, PAX7, and PAX8. This result indicates asymmetric evolution of PAX family genes, which can be associated with the emergence of adaptive novelties in the chordate evolutionary trajectory.

  3. The roles of gene duplication, gene conversion and positive selection in rodent Esp and Mup pheromone gene families with comparison to the Abp family.

    Science.gov (United States)

    Karn, Robert C; Laukaitis, Christina M

    2012-01-01

    Three proteinaceous pheromone families, the androgen-binding proteins (ABPs), the exocrine-gland secreting peptides (ESPs) and the major urinary proteins (MUPs) are encoded by large gene families in the genomes of Mus musculus and Rattus norvegicus. We studied the evolutionary histories of the Mup and Esp genes and compared them with what is known about the Abp genes. Apparently gene conversion has played little if any role in the expansion of the mouse Class A and Class B Mup genes and pseudogenes, and the rat Mups. By contrast, we found evidence of extensive gene conversion in many Esp genes although not in all of them. Our studies of selection identified at least two amino acid sites in β-sheets as having evolved under positive selection in the mouse Class A and Class B MUPs and in rat MUPs. We show that selection may have acted on the ESPs by determining K(a)/K(s) for Exon 3 sequences with and without the converted sequence segment. While it appears that purifying selection acted on the ESP signal peptides, the secreted portions of the ESPs probably have undergone much more rapid evolution. When the inner gene converted fragment sequences were removed, eleven Esp paralogs were present in two or more pairs with K(a)/K(s) >1.0 and thus we propose that positive selection is detectable by this means in at least some mouse Esp paralogs. We compare and contrast the evolutionary histories of all three mouse pheromone gene families in light of their proposed functions in mouse communication.

  4. Processes of fungal proteome evolution and gain of function: gene duplication and domain rearrangement

    International Nuclear Information System (INIS)

    Cohen-Gihon, Inbar; Nussinov, Ruth; Sharan, Roded

    2011-01-01

    During evolution, organisms have gained functional complexity mainly by modifying and improving existing functioning systems rather than creating new ones ab initio. Here we explore the interplay between two processes which during evolution have had major roles in the acquisition of new functions: gene duplication and protein domain rearrangements. We consider four possible evolutionary scenarios: gene families that have undergone none of these event types; only gene duplication; only domain rearrangement, or both events. We characterize each of the four evolutionary scenarios by functional attributes. Our analysis of ten fungal genomes indicates that at least for the fungi clade, species significantly appear to gain complexity by gene duplication accompanied by the expansion of existing domain architectures via rearrangements. We show that paralogs gaining new domain architectures via duplication tend to adopt new functions compared to paralogs that preserve their domain architectures. We conclude that evolution of protein families through gene duplication and domain rearrangement is correlated with their functional properties. We suggest that in general, new functions are acquired via the integration of gene duplication and domain rearrangements rather than each process acting independently

  5. Improvisation in evolution of genes and genomes: whose structure is it anyway?

    Science.gov (United States)

    Shakhnovich, Boris E; Shakhnovich, Eugene I

    2008-06-01

    Significant progress has been made in recent years in a variety of seemingly unrelated fields such as sequencing, protein structure prediction, and high-throughput transcriptomics and metabolomics. At the same time, new microscopic models have been developed that made it possible to analyze the evolution of genes and genomes from first principles. The results from these efforts enable, for the first time, a comprehensive insight into the evolution of complex systems and organisms on all scales--from sequences to organisms and populations. Every newly sequenced genome uncovers new genes, families, and folds. Where do these new genes come from? How do gene duplication and subsequent divergence of sequence and structure affect the fitness of the organism? What role does regulation play in the evolution of proteins and folds? Emerging synergism between data and modeling provides first robust answers to these questions.

  6. Neighboring Genes Show Correlated Evolution in Gene Expression

    Science.gov (United States)

    Ghanbarian, Avazeh T.; Hurst, Laurence D.

    2015-01-01

    When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. PMID:25743543

  7. A genome-wide analysis of the flax (Linum usitatissimum L.) dirigent protein family: from gene identification and evolution to differential regulation.

    Energy Technology Data Exchange (ETDEWEB)

    Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Laine, Eric; Davin, Laurence B.; Cort, John R.; Lewis, Norman G.; Hano, Christophe

    2018-04-30

    Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved

  8. A genome-wide analysis of the flax (Linum usitatissimum L.) dirigent protein family: from gene identification and evolution to differential regulation.

    Science.gov (United States)

    Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Lainé, Éric; Davin, Laurence B; Cort, John R; Lewis, Norman G; Hano, Christophe

    2018-05-01

    Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in

  9. Evolution of the duplicated intracellular lipid-binding protein genes of teleost fishes.

    Science.gov (United States)

    Venkatachalam, Ananda B; Parmar, Manoj B; Wright, Jonathan M

    2017-08-01

    Increasing organismal complexity during the evolution of life has been attributed to the duplication of genes and entire genomes. More recently, theoretical models have been proposed that postulate the fate of duplicated genes, among them the duplication-degeneration-complementation (DDC) model. In the DDC model, the common fate of a duplicated gene is lost from the genome owing to nonfunctionalization. Duplicated genes are retained in the genome either by subfunctionalization, where the functions of the ancestral gene are sub-divided between the sister duplicate genes, or by neofunctionalization, where one of the duplicate genes acquires a new function. Both processes occur either by loss or gain of regulatory elements in the promoters of duplicated genes. Here, we review the genomic organization, evolution, and transcriptional regulation of the multigene family of intracellular lipid-binding protein (iLBP) genes from teleost fishes. Teleost fishes possess many copies of iLBP genes owing to a whole genome duplication (WGD) early in the teleost fish radiation. Moreover, the retention of duplicated iLBP genes is substantially higher than the retention of all other genes duplicated in the teleost genome. The fatty acid-binding protein genes, a subfamily of the iLBP multigene family in zebrafish, are differentially regulated by peroxisome proliferator-activated receptor (PPAR) isoforms, which may account for the retention of iLBP genes in the zebrafish genome by the process of subfunctionalization of cis-acting regulatory elements in iLBP gene promoters.

  10. Genome-wide identification, characterisation and expression analysis of the MADS-box gene family in Prunus mume.

    Science.gov (United States)

    Xu, Zongda; Zhang, Qixiang; Sun, Lidan; Du, Dongliang; Cheng, Tangren; Pan, Huitang; Yang, Weiru; Wang, Jia

    2014-10-01

    MADS-box genes encode transcription factors that play crucial roles in plant development, especially in flower and fruit development. To gain insight into this gene family in Prunus mume, an important ornamental and fruit plant in East Asia, and to elucidate their roles in flower organ determination and fruit development, we performed a genome-wide identification, characterisation and expression analysis of MADS-box genes in this Rosaceae tree. In this study, 80 MADS-box genes were identified in P. mume and categorised into MIKC, Mα, Mβ, Mγ and Mδ groups based on gene structures and phylogenetic relationships. The MIKC group could be further classified into 12 subfamilies. The FLC subfamily was absent in P. mume and the six tandemly arranged DAM genes might experience a species-specific evolution process in P. mume. The MADS-box gene family might experience an evolution process from MIKC genes to Mδ genes to Mα, Mβ and Mγ genes. The expression analysis suggests that P. mume MADS-box genes have diverse functions in P. mume development and the functions of duplicated genes diverged after the duplication events. In addition to its involvement in the development of female gametophytes, type I genes also play roles in male gametophytes development. In conclusion, this study adds to our understanding of the roles that the MADS-box genes played in flower and fruit development and lays a foundation for selecting candidate genes for functional studies in P. mume and other species. Furthermore, this study also provides a basis to study the evolution of the MADS-box family.

  11. The Caenorhabditis chemoreceptor gene families

    Directory of Open Access Journals (Sweden)

    Robertson Hugh M

    2008-10-01

    Full Text Available Abstract Background Chemoreceptor proteins mediate the first step in the transduction of environmental chemical stimuli, defining the breadth of detection and conferring stimulus specificity. Animal genomes contain families of genes encoding chemoreceptors that mediate taste, olfaction, and pheromone responses. The size and diversity of these families reflect the biology of chemoperception in specific species. Results Based on manual curation and sequence comparisons among putative G-protein-coupled chemoreceptor genes in the nematode Caenorhabditis elegans, we identified approximately 1300 genes and 400 pseudogenes in the 19 largest gene families, most of which fall into larger superfamilies. In the related species C. briggsae and C. remanei, we identified most or all genes in each of the 19 families. For most families, C. elegans has the largest number of genes and C. briggsae the smallest number, suggesting changes in the importance of chemoperception among the species. Protein trees reveal family-specific and species-specific patterns of gene duplication and gene loss. The frequency of strict orthologs varies among the families, from just over 50% in two families to less than 5% in three families. Several families include large species-specific expansions, mostly in C. elegans and C. remanei. Conclusion Chemoreceptor gene families in Caenorhabditis species are large and evolutionarily dynamic as a result of gene duplication and gene loss. These dynamics shape the chemoreceptor gene complements in Caenorhabditis species and define the receptor space available for chemosensory responses. To explain these patterns, we propose the gray pawn hypothesis: individual genes are of little significance, but the aggregate of a large number of diverse genes is required to cover a large phenotype space.

  12. The Caenorhabditis chemoreceptor gene families.

    Science.gov (United States)

    Thomas, James H; Robertson, Hugh M

    2008-10-06

    Chemoreceptor proteins mediate the first step in the transduction of environmental chemical stimuli, defining the breadth of detection and conferring stimulus specificity. Animal genomes contain families of genes encoding chemoreceptors that mediate taste, olfaction, and pheromone responses. The size and diversity of these families reflect the biology of chemoperception in specific species. Based on manual curation and sequence comparisons among putative G-protein-coupled chemoreceptor genes in the nematode Caenorhabditis elegans, we identified approximately 1300 genes and 400 pseudogenes in the 19 largest gene families, most of which fall into larger superfamilies. In the related species C. briggsae and C. remanei, we identified most or all genes in each of the 19 families. For most families, C. elegans has the largest number of genes and C. briggsae the smallest number, suggesting changes in the importance of chemoperception among the species. Protein trees reveal family-specific and species-specific patterns of gene duplication and gene loss. The frequency of strict orthologs varies among the families, from just over 50% in two families to less than 5% in three families. Several families include large species-specific expansions, mostly in C. elegans and C. remanei. Chemoreceptor gene families in Caenorhabditis species are large and evolutionarily dynamic as a result of gene duplication and gene loss. These dynamics shape the chemoreceptor gene complements in Caenorhabditis species and define the receptor space available for chemosensory responses. To explain these patterns, we propose the gray pawn hypothesis: individual genes are of little significance, but the aggregate of a large number of diverse genes is required to cover a large phenotype space.

  13. Genome-wide characterization, evolution, and expression analysis of the leucine-rich repeat receptor-like protein kinase (LRR-RLK) gene family in Rosaceae genomes.

    Science.gov (United States)

    Sun, Jiangmei; Li, Leiting; Wang, Peng; Zhang, Shaoling; Wu, Juyou

    2017-10-10

    Leucine-rich repeat receptor-like protein kinase (LRR-RLK) is the largest gene family of receptor-like protein kinases (RLKs) and actively participates in regulating the growth, development, signal transduction, immunity, and stress responses of plants. However, the patterns of LRR-RLK gene family evolution in the five main Rosaceae species for which genome sequences are available have not yet been reported. In this study, we performed a comprehensive analysis of LRR-RLK genes for five Rosaceae species: Fragaria vesca (strawberry), Malus domestica (apple), Pyrus bretschneideri (Chinese white pear), Prunus mume (mei), and Prunus persica (peach), which contained 201, 244, 427, 267, and 258 LRR-RLK genes, respectively. All LRR-RLK genes were further grouped into 23 subfamilies based on the hidden Markov models approach. RLK-Pelle_LRR-XII-1, RLK-Pelle_LRR-XI-1, and RLK-Pelle_LRR-III were the three largest subfamilies. Synteny analysis indicated that there were 236 tandem duplicated genes in the five Rosaceae species, among which subfamilies XII-1 (82 genes) and XI-1 (80 genes) comprised 68.6%. Our results indicate that tandem duplication made a large contribution to the expansion of the subfamilies. The gene expression, tissue-specific expression, and subcellular localization data revealed that LRR-RLK genes were differentially expressed in various organs and tissues, and the largest subfamily XI-1 was highly expressed in all five Rosaceae species, suggesting that LRR-RLKs play important roles in each stage of plant growth and development. Taken together, our results provide an overview of the LRR-RLK family in Rosaceae genomes and the basis for further functional studies.

  14. Molecular evolution of the Paramyxoviridae and Rhabdoviridae multiple-protein-encoding P gene.

    Science.gov (United States)

    Jordan, I K; Sutter, B A; McClure, M A

    2000-01-01

    Presented here is an analysis of the molecular evolutionary dynamics of the P gene among 76 representative sequences of the Paramyxoviridae and Rhabdoviridae RNA virus families. In a number of Paramyxoviridae taxa, as well as in vesicular stomatitis viruses of the Rhabdoviridae, the P gene encodes multiple proteins from a single genomic RNA sequence. These products include the phosphoprotein (P), as well as the C and V proteins. The complexity of the P gene makes it an intriguing locus to study from an evolutionary perspective. Amino acid sequence alignments of the proteins encoded at the P and N loci were used in independent phylogenetic reconstructions of the Paramyxoviridae and Rhabdoviridae families. P-gene-coding capacities were mapped onto the Paramyxoviridae phylogeny, and the most parsimonious path of multiple-coding-capacity evolution was determined. Levels of amino acid variation for Paramyxoviridae and Rhabdoviridae P-gene-encoded products were also analyzed. Proteins encoded in overlapping reading frames from the same nucleotides have different levels of amino acid variation. The nucleotide architecture that underlies the amino acid variation was determined in order to evaluate the role of selection in the evolution of the P gene overlapping reading frames. In every case, the evolution of one of the proteins encoded in the overlapping reading frames has been constrained by negative selection while the other has evolved more rapidly. The integrity of the overlapping reading frame that represents a derived state is generally maintained at the expense of the ancestral reading frame encoded by the same nucleotides. The evolution of such multicoding sequences is likely a response by RNA viruses to selective pressure to maximize genomic information content while maintaining small genome size. The ability to evolve such a complex genomic strategy is intimately related to the dynamics of the viral quasispecies, which allow enhanced exploration of the adaptive

  15. Simulating evolution of protein complexes through gene duplication and co-option.

    Science.gov (United States)

    Haarsma, Loren; Nelesen, Serita; VanAndel, Ethan; Lamine, James; VandeHaar, Peter

    2016-06-21

    We present a model of the evolution of protein complexes with novel functions through gene duplication, mutation, and co-option. Under a wide variety of input parameters, digital organisms evolve complexes of 2-5 bound proteins which have novel functions but whose component proteins are not independently functional. Evolution of complexes with novel functions happens more quickly as gene duplication rates increase, point mutation rates increase, protein complex functional probability increases, protein complex functional strength increases, and protein family size decreases. Evolution of complexity is inhibited when the metabolic costs of making proteins exceeds the fitness gain of having functional proteins, or when point mutation rates get so large the functional proteins undergo deleterious mutations faster than new functional complexes can evolve. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Massive expansion of the calpain gene family in unicellular eukaryotes

    Directory of Open Access Journals (Sweden)

    Zhao Sen

    2012-09-01

    Full Text Available Abstract Background Calpains are Ca2+-dependent cysteine proteases that participate in a range of crucial cellular processes. Dysfunction of these enzymes may cause, for instance, life-threatening diseases in humans, the loss of sex determination in nematodes and embryo lethality in plants. Although the calpain family is well characterized in animal and plant model organisms, there is a great lack of knowledge about these genes in unicellular eukaryote species (i.e. protists. Here, we study the distribution and evolution of calpain genes in a wide range of eukaryote genomes from major branches in the tree of life. Results Our investigations reveal 24 types of protein domains that are combined with the calpain-specific catalytic domain CysPc. In total we identify 41 different calpain domain architectures, 28 of these domain combinations have not been previously described. Based on our phylogenetic inferences, we propose that at least four calpain variants were established in the early evolution of eukaryotes, most likely before the radiation of all the major supergroups of eukaryotes. Many domains associated with eukaryotic calpain genes can be found among eubacteria or archaebacteria but never in combination with the CysPc domain. Conclusions The analyses presented here show that ancient modules present in prokaryotes, and a few de novo eukaryote domains, have been assembled into many novel domain combinations along the evolutionary history of eukaryotes. Some of the new calpain genes show a narrow distribution in a few branches in the tree of life, likely representing lineage-specific innovations. Hence, the functionally important classical calpain genes found among humans and vertebrates make up only a tiny fraction of the calpain family. In fact, a massive expansion of the calpain family occurred by domain shuffling among unicellular eukaryotes and contributed to a wealth of functionally different genes.

  17. Massive expansion of the calpain gene family in unicellular eukaryotes.

    Science.gov (United States)

    Zhao, Sen; Liang, Zhe; Demko, Viktor; Wilson, Robert; Johansen, Wenche; Olsen, Odd-Arne; Shalchian-Tabrizi, Kamran

    2012-09-29

    Calpains are Ca2+-dependent cysteine proteases that participate in a range of crucial cellular processes. Dysfunction of these enzymes may cause, for instance, life-threatening diseases in humans, the loss of sex determination in nematodes and embryo lethality in plants. Although the calpain family is well characterized in animal and plant model organisms, there is a great lack of knowledge about these genes in unicellular eukaryote species (i.e. protists). Here, we study the distribution and evolution of calpain genes in a wide range of eukaryote genomes from major branches in the tree of life. Our investigations reveal 24 types of protein domains that are combined with the calpain-specific catalytic domain CysPc. In total we identify 41 different calpain domain architectures, 28 of these domain combinations have not been previously described. Based on our phylogenetic inferences, we propose that at least four calpain variants were established in the early evolution of eukaryotes, most likely before the radiation of all the major supergroups of eukaryotes. Many domains associated with eukaryotic calpain genes can be found among eubacteria or archaebacteria but never in combination with the CysPc domain. The analyses presented here show that ancient modules present in prokaryotes, and a few de novo eukaryote domains, have been assembled into many novel domain combinations along the evolutionary history of eukaryotes. Some of the new calpain genes show a narrow distribution in a few branches in the tree of life, likely representing lineage-specific innovations. Hence, the functionally important classical calpain genes found among humans and vertebrates make up only a tiny fraction of the calpain family. In fact, a massive expansion of the calpain family occurred by domain shuffling among unicellular eukaryotes and contributed to a wealth of functionally different genes.

  18. Divergence and Conservative Evolution of XTNX Genes in Land Plants

    Directory of Open Access Journals (Sweden)

    Yan-Mei Zhang

    2017-10-01

    Full Text Available The Toll-interleukin-1 receptor (TIR and Nucleotide-binding site (NBS domains are two major components of the TIR-NBS-leucine-rich repeat family plant disease resistance genes. Extensive functional and evolutionary studies have been performed on these genes; however, the characterization of a small group of genes that are composed of atypical TIR and NBS domains, namely XTNX genes, is limited. The present study investigated this specific gene family by conducting genome-wide analyses of 59 green plant genomes. A total of 143 XTNX genes were identified in 51 of the 52 land plant genomes, whereas no XTNX gene was detected in any green algae genomes, which indicated that XTNX genes originated upon emergence of land plants. Phylogenetic analysis revealed that the ancestral XTNX gene underwent two rounds of ancient duplications in land plants, which resulted in the formation of clades I/II and clades IIa/IIb successively. Although clades I and IIb have evolved conservatively in angiosperms, the motif composition difference and sequence divergence at the amino acid level suggest that functional divergence may have occurred since the separation of the two clades. In contrast, several features of the clade IIa genes, including the absence in the majority of dicots, the long branches in the tree, the frequent loss of ancestral motifs, and the loss of expression in all detected tissues of Zea mays, all suggest that the genes in this lineage might have undergone pseudogenization. This study highlights that XTNX genes are a gene family originated anciently in land plants and underwent specific conservative pattern in evolution.

  19. Transcriptome analysis of WRKY gene family in Oryza officinalis Wall ex Watt and WRKY genes involved in responses to Xanthomonas oryzae pv. oryzae stress.

    Science.gov (United States)

    Jiang, Chunmiao; Shen, Qingxi J; Wang, Bo; He, Bin; Xiao, Suqin; Chen, Ling; Yu, Tengqiong; Ke, Xue; Zhong, Qiaofang; Fu, Jian; Chen, Yue; Wang, Lingxian; Yin, Fuyou; Zhang, Dunyu; Ghidan, Walid; Huang, Xingqi; Cheng, Zaiquan

    2017-01-01

    Oryza officinalis Wall ex Watt, a very important and special wild rice species, shows abundant genetic diversity and disease resistance features, especially high resistance to bacterial blight. The molecular mechanisms of bacterial blight resistance in O. officinalis have not yet been elucidated. The WRKY transcription factor family is one of the largest gene families involved in plant growth, development and stress response. However, little is known about the numbers, structure, molecular phylogenetics, and expression of the WRKY genes under Xanthomonas oryzae pv. oryzae (Xoo) stress in O. officinalis due to lacking of O. officinalis genome. Therefore, based on the RNA-sequencing data of O. officinalis, we performed a comprehensive study of WRKY genes in O. officinalis and identified 89 OoWRKY genes. Then 89 OoWRKY genes were classified into three groups based on the WRKY domains and zinc finger motifs. Phylogenetic analysis strongly supported that the evolution of OoWRKY genes were consistent with previous studies of WRKYs, and subgroup IIc OoWRKY genes were the original ancestors of some group II and group III OoWRKYs. Among the 89 OoWRKY genes, eight OoWRKYs displayed significantly different expression (>2-fold, pWRKY family of transcription factors in O.officinalis. Insight was gained into the classification, evolution, and function of the OoWRKY genes, revealing the putative roles of eight significantly different expression OoWRKYs in Xoo strains PXO99 and C5 stress responses in O.officinalis. This study provided a better understanding of the evolution and functions of O. officinalis WRKY genes, and suggested that manipulating eight significantly different expression OoWRKYs would enhance resistance to bacterial blight.

  20. The relationship among gene expression, the evolution of gene dosage, and the rate of protein evolution.

    Directory of Open Access Journals (Sweden)

    Jean-François Gout

    2010-05-01

    Full Text Available The understanding of selective constraints affecting genes is a major issue in biology. It is well established that gene expression level is a major determinant of the rate of protein evolution, but the reasons for this relationship remain highly debated. Here we demonstrate that gene expression is also a major determinant of the evolution of gene dosage: the rate of gene losses after whole genome duplications in the Paramecium lineage is negatively correlated to the level of gene expression, and this relationship is not a byproduct of other factors known to affect the fate of gene duplicates. This indicates that changes in gene dosage are generally more deleterious for highly expressed genes. This rule also holds for other taxa: in yeast, we find a clear relationship between gene expression level and the fitness impact of reduction in gene dosage. To explain these observations, we propose a model based on the fact that the optimal expression level of a gene corresponds to a trade-off between the benefit and cost of its expression. This COSTEX model predicts that selective pressure against mutations changing gene expression level or affecting the encoded protein should on average be stronger in highly expressed genes and hence that both the frequency of gene loss and the rate of protein evolution should correlate negatively with gene expression. Thus, the COSTEX model provides a simple and common explanation for the general relationship observed between the level of gene expression and the different facets of gene evolution.

  1. Isolation of Hox cluster genes from insects reveals an accelerated sequence evolution rate.

    Directory of Open Access Journals (Sweden)

    Heike Hadrys

    Full Text Available Among gene families it is the Hox genes and among metazoan animals it is the insects (Hexapoda that have attracted particular attention for studying the evolution of development. Surprisingly though, no Hox genes have been isolated from 26 out of 35 insect orders yet, and the existing sequences derive mainly from only two orders (61% from Hymenoptera and 22% from Diptera. We have designed insect specific primers and isolated 37 new partial homeobox sequences of Hox cluster genes (lab, pb, Hox3, ftz, Antp, Scr, abd-a, Abd-B, Dfd, and Ubx from six insect orders, which are crucial to insect phylogenetics. These new gene sequences provide a first step towards comparative Hox gene studies in insects. Furthermore, comparative distance analyses of homeobox sequences reveal a correlation between gene divergence rate and species radiation success with insects showing the highest rate of homeobox sequence evolution.

  2. New Gene Evolution: Little Did We Know

    Science.gov (United States)

    Long, Manyuan; VanKuren, Nicholas W.; Chen, Sidi; Vibranovski, Maria D.

    2014-01-01

    Genes are perpetually added to and deleted from genomes during evolution. Thus, it is important to understand how new genes are formed and evolve as critical components of the genetic systems determining the biological diversity of life. Two decades of effort have shed light on the process of new gene origination, and have contributed to an emerging comprehensive picture of how new genes are added to genomes, ranging from the mechanisms that generate new gene structures to the presence of new genes in different organisms to the rates and patterns of new gene origination and the roles of new genes in phenotypic evolution. We review each of these aspects of new gene evolution, summarizing the main evidence for the origination and importance of new genes in evolution. We highlight findings showing that new genes rapidly change existing genetic systems that govern various molecular, cellular and phenotypic functions. PMID:24050177

  3. Unequal rates of Y chromosome gene divergence during speciation of the family Ursidae.

    Science.gov (United States)

    Nakagome, Shigeki; Pecon-Slattery, Jill; Masuda, Ryuichi

    2008-07-01

    Evolution of the bear family Ursidae is well investigated in terms of morphological, paleontological, and genetic features. However, several phylogenetic ambiguities occur within the subfamily Ursinae (the family Ursidae excluding the giant panda and spectacled bear), which may correlate with behavioral traits of female philopatry and male-biased dispersal which form the basis of the observed matriarchal population structure in these species. In the process of bear evolution, we investigate the premise that such behavioral traits may be reflected in patterns of variation among genes with different modes of inheritance: matrilineal mitochondrial DNA (mtDNA), patrilineal Y chromosome, biparentally inherited autosomes, and the X chromosome. In the present study, we sequenced 3 Y-linked genes (3,453 bp) and 4 X-linked genes (4,960 bp) and reanalyzed previously published sequences from autosome genes (2,347 bp) in ursid species to investigate differences in evolutionary rates associated with patterns of inheritance. The results describe topological incongruence between sex-linked genes and autosome genes and between nuclear DNA and mtDNA. In more ancestral branches within the bear phylogeny, Y-linked genes evolved faster than autosome and X-linked genes, consistent with expectations based on male-driven evolution. However, this pattern changes among branches leading to each species within the lineage of Ursinae whereby the evolutionary rates of Y-linked genes have fewer than expected substitutions. This inconsistency between more recent nodes of the bear phylogeny with more ancestral nodes may reflect the influences of sex-biased dispersal as well as molecular evolutionary characteristics of the Y chromosome, and stochastic events in species natural history, and phylogeography unique to ursine bears.

  4. Positive selection in the SLC11A1 gene in the family Equidae

    DEFF Research Database (Denmark)

    Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan

    2016-01-01

    Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes...... a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans...... and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence...

  5. Conservation, Divergence, and Genome-Wide Distribution of PAL and POX A Gene Families in Plants.

    Science.gov (United States)

    Rawal, H C; Singh, N K; Sharma, T R

    2013-01-01

    Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL) and peroxidase A (POX A) enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula), fruits (Vitis vinifera), cereals (Sorghum bicolor, Zea mays, and Oryza sativa), trees (Populus trichocarpa), and model dicot (Arabidopsis thaliana) and monocot (Brachypodium distachyon) species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future.

  6. Conservation, Divergence, and Genome-Wide Distribution of PAL and POX A Gene Families in Plants

    Directory of Open Access Journals (Sweden)

    H. C. Rawal

    2013-01-01

    Full Text Available Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL and peroxidase A (POX A enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula, fruits (Vitis vinifera, cereals (Sorghum bicolor, Zea mays, and Oryza sativa, trees (Populus trichocarpa, and model dicot (Arabidopsis thaliana and monocot (Brachypodium distachyon species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future.

  7. Molecular Characterization, Evolution, and Expression Profiling of the Dirigent (DIR Family Genes in Chinese White Pear (Pyrus bretschneideri

    Directory of Open Access Journals (Sweden)

    Xi Cheng

    2018-04-01

    Full Text Available Stone cells content and size are the key factors determining the internal quality of the pear fruit. Synthesis of lignin and thickening of secondary cell wall are the keys to the development of stone cells. The polymerization of monolignols and secondary cell wall formation requires the participation of dirigent proteins (DIRs. In recent years, DIR family have been studied in higher plants, but lack of comprehensive study in the pear DIR (PbDIR family. This study focuses on the identification and analysis of PbDIR family for the first time. We identified 35 PbDIRs from the pear genome, 89% of which are intronless genes. Phylogenetic tree and chromosome localization analysis showed that 35 PbDIRs were divided into four subfamilies (DIR-a, -b/d, -e, and -g and irregularly distributed among 10 chromosomes. In addition, we identified 29, 26, and 14 DIRs from the other three Rosids (peach, Mei, and grape, respectively. Interspecies microsynteny analysis revealed the collinear gene pairs between pear and peach are the most. Temporal expression analysis showed that the expression changes of seven PbDIRs (DIR-a subfamily: PbDIR4 and PbDIR5; DIR-b/d subfamily: PbDIR11; DIR-g subfamily: PbDIR19; DIR-e subfamily: PbDIR23, 25 and 26 in fruits were consistent with the changes of fruit lignin and stone cells contents. In addition, the subfamily of PbDIRs in fruits showed significant responses after treatment with ABA, SA, and MeJA. According to the protein tertiary structure, key amino acid residues and expression patterns analysis found that PbDIR4 might be involved in the metabolism of lignin and related to stone cells contents in pear fruits. In this study, we systematically analyzed the structure, evolution, function and expression of PbDIR family, which not only confirmed the characteristics of PbDIR family, but also laid the foundation for revealing the role of DIR in pear stone cell development and lignin polymerization.

  8. Adaptive Evolution of the Myo6 Gene in Old World Fruit Bats (Family: Pteropodidae)

    Science.gov (United States)

    Shen, Bin; Han, Xiuqun; Jones, Gareth; Rossiter, Stephen J.; Zhang, Shuyi

    2013-01-01

    Myosin VI (encoded by the Myo6 gene) is highly expressed in the inner and outer hair cells of the ear, retina, and polarized epithelial cells such as kidney proximal tubule cells and intestinal enterocytes. The Myo6 gene is thought to be involved in a wide range of physiological functions such as hearing, vision, and clathrin-mediated endocytosis. Bats (Chiroptera) represent one of the most fascinating mammal groups for molecular evolutionary studies of the Myo6 gene. A diversity of specialized adaptations occur among different bat lineages, such as echolocation and associated high-frequency hearing in laryngeal echolocating bats, large eyes and a strong dependence on vision in Old World fruit bats (Pteropodidae), and specialized high-carbohydrate but low-nitrogen diets in both Old World and New World fruit bats (Phyllostomidae). To investigate what role(s) the Myo6 gene might fulfill in bats, we sequenced the coding region of the Myo6 gene in 15 bat species and used molecular evolutionary analyses to detect evidence of positive selection in different bat lineages. We also conducted real-time PCR assays to explore the expression levels of Myo6 in a range of tissues from three representative bat species. Molecular evolutionary analyses revealed that the Myo6 gene, which was widely considered as a hearing gene, has undergone adaptive evolution in the Old World fruit bats which lack laryngeal echolocation and associated high-frequency hearing. Real-time PCR showed the highest expression level of the Myo6 gene in the kidney among ten tissues examined in three bat species, indicating an important role for this gene in kidney function. We suggest that Myo6 has undergone adaptive evolution in Old World fruit bats in relation to receptor-mediated endocytosis for the preservation of protein and essential nutrients. PMID:23620821

  9. Genome-wide Identification of WRKY Genes in the Desert Poplar Populus euphratica and Adaptive Evolution of the Genes in Response to Salt Stress.

    Science.gov (United States)

    Ma, Jianchao; Lu, Jing; Xu, Jianmei; Duan, Bingbing; He, Xiaodong; Liu, Jianquan

    2015-01-01

    WRKY transcription factors play important roles in plant development and responses to various stresses in plants. However, little is known about the evolution of the WRKY genes in the desert poplar species Populus euphratica, which is highly tolerant of salt stress. In this study, we identified 107 PeWRKY genes from the P. euphratica genome and examined their evolutionary relationships with the WRKY genes of the salt-sensitive congener Populus trichocarpa. Ten PeWRKY genes are specific to P. euphratica, and five of these showed altered expression under salt stress. Furthermore, we found that two pairs of orthologs between the two species showed evidence of positive evolution, with dN/dS ratios>1 (nonsynonymous/synonymous substitutions), and both of them altered their expression in response to salinity stress. These findings suggested that both the development of new genes and positive evolution in some orthologs of the WRKY gene family may have played an important role in the acquisition of high salt tolerance by P. euphratica.

  10. Evolution and diversity of secretome genes in the apicomplexan parasite Theileria annulata

    Directory of Open Access Journals (Sweden)

    Shiels Brian R

    2010-01-01

    Full Text Available Abstract Background Little is known about how apicomplexan parasites have evolved to infect different host species and cell types. Theileria annulata and Theileria parva invade and transform bovine leukocytes but each species favours a different host cell lineage. Parasite-encoded proteins secreted from the intracellular macroschizont stage within the leukocyte represent a critical interface between host and pathogen systems. Genome sequencing has revealed that several Theileria-specific gene families encoding secreted proteins are positively selected at the inter-species level, indicating diversification between the species. We extend this analysis to the intra-species level, focusing on allelic diversity of two major secretome families. These families represent a well-characterised group of genes implicated in control of the host cell phenotype and a gene family of unknown function. To gain further insight into their evolution and function, this study investigates whether representative genes of these two families are diversifying or constrained within the T. annulata population. Results Strong evidence is provided that the sub-telomerically encoded SVSP family and the host-nucleus targeted TashAT family have evolved under contrasting pressures within natural T. annulata populations. SVSP genes were found to possess atypical codon usage and be evolving neutrally, with high levels of nucleotide substitutions and multiple indels. No evidence of geographical sub-structuring of allelic sequences was found. In contrast, TashAT family genes, implicated in control of host cell gene expression, are strongly conserved at the protein level and geographically sub-structured allelic sequences were identified among Tunisian and Turkish isolates. Although different copy numbers of DNA binding motifs were identified in alleles of TashAT proteins, motif periodicity was strongly maintained, implying conserved functional activity of these sites. Conclusions

  11. Genomic characterization, phylogenetic comparison and differential expression of the cyclic nucleotide-gated channels gene family in pear (Pyrus bretchneideri Rehd.).

    Science.gov (United States)

    Chen, Jianqing; Yin, Hao; Gu, Jinping; Li, Leiting; Liu, Zhe; Jiang, Xueting; Zhou, Hongsheng; Wei, Shuwei; Zhang, Shaoling; Wu, Juyou

    2015-01-01

    The cyclic nucleotide-gated channel (CNGC) family is involved in the uptake of various cations, such as Ca(2+), to regulate plant growth and respond to biotic and abiotic stresses. However, there is far less information about this family in woody plants such as pear. Here, we provided a genome-wide identification and analysis of the CNGC gene family in pear. Phylogenetic analysis showed that the 21 pear CNGC genes could be divided into five groups (I, II, III, IVA and IVB). The majority of gene duplications in pear appeared to have been caused by segmental duplication and occurred 32.94-39.14 million years ago. Evolutionary analysis showed that positive selection had driven the evolution of pear CNGCs. Motif analyses showed that Group I CNGCs generally contained 26 motifs, which was the greatest number of motifs in all CNGC groups. Among these, eight motifs were shared by each group, suggesting that these domains play a conservative role in CNGC activity. Tissue-specific expression analysis indicated that functional diversification of the duplicated CNGC genes was a major feature of long-term evolution. Our results also suggested that the P-S6 and PBC & hinge domains had co-evolved during the evolution. These results provide valuable information to increase our understanding of the function, evolution and expression analyses of the CNGC gene family in higher plants. Copyright © 2014 Elsevier Inc. All rights reserved.

  12. Distinctive patterns of evolution of the δ-globin gene (HBD in primates.

    Directory of Open Access Journals (Sweden)

    Ana Moleirinho

    Full Text Available In most vertebrates, hemoglobin (Hb is a heterotetramer composed of two dissimilar globin chains, which change during development according to the patterns of expression of α- and β-globin family members. In placental mammals, the β-globin cluster includes three early-expressed genes, ε(HBE-γ(HBG-ψβ(HBBP1, and the late expressed genes, δ (HBD and β (HBB. While HBB encodes the major adult β-globin chain, HBD is weakly expressed or totally silent. Paradoxically, in human populations HBD shows high levels of conservation typical of genes under strong evolutionary constraints, possibly due to a regulatory role in the fetal-to-adult switch unique of Anthropoid primates. In this study, we have performed a comprehensive phylogenetic and comparative analysis of the two adult β-like globin genes in a set of diverse mammalian taxa, focusing on the evolution and functional divergence of HBD in primates. Our analysis revealed that anthropoids are an exception to a general pattern of concerted evolution in placental mammals, showing a high level of sequence conservation at HBD, less frequent and shorter gene conversion events. Moreover, this lineage is unique in the retention of a functional GATA-1 motif, known to be involved in the control of the developmental expression of the β-like globin genes. We further show that not only the mode but also the rate of evolution of the δ-globin gene in higher primates are strictly associated with the fetal/adult β-cluster developmental switch. To gain further insight into the possible functional constraints that have been shaping the evolutionary history of HBD in primates, we calculated dN/dS (ω ratios under alternative models of gene evolution. Although our results indicate that HBD might have experienced different selective pressures throughout primate evolution, as shown by different ω values between apes and Old World Monkeys + New World Monkeys (0.06 versus 0.43, respectively, these estimates

  13. Targeted Sequencing of Venom Genes from Cone Snail Genomes Improves Understanding of Conotoxin Molecular Evolution.

    Science.gov (United States)

    Phuong, Mark A; Mahardika, Gusti N

    2018-05-01

    To expand our capacity to discover venom sequences from the genomes of venomous organisms, we applied targeted sequencing techniques to selectively recover venom gene superfamilies and nontoxin loci from the genomes of 32 cone snail species (family, Conidae), a diverse group of marine gastropods that capture their prey using a cocktail of neurotoxic peptides (conotoxins). We were able to successfully recover conotoxin gene superfamilies across all species with high confidence (> 100× coverage) and used these data to provide new insights into conotoxin evolution. First, we found that conotoxin gene superfamilies are composed of one to six exons and are typically short in length (mean = ∼85 bp). Second, we expanded our understanding of the following genetic features of conotoxin evolution: 1) positive selection, where exons coding the mature toxin region were often three times more divergent than their adjacent noncoding regions, 2) expression regulation, with comparisons to transcriptome data showing that cone snails only express a fraction of the genes available in their genome (24-63%), and 3) extensive gene turnover, where Conidae species varied from 120 to 859 conotoxin gene copies. Finally, using comparative phylogenetic methods, we found that while diet specificity did not predict patterns of conotoxin evolution, dietary breadth was positively correlated with total conotoxin gene diversity. Overall, the targeted sequencing technique demonstrated here has the potential to radically increase the pace at which venom gene families are sequenced and studied, reshaping our ability to understand the impact of genetic changes on ecologically relevant phenotypes and subsequent diversification.

  14. Sexy gene conversions: locating gene conversions on the X-chromosome.

    Science.gov (United States)

    Lawson, Mark J; Zhang, Liqing

    2009-08-01

    Gene conversion can have a profound impact on both the short- and long-term evolution of genes and genomes. Here, we examined the gene families that are located on the X-chromosomes of human (Homo sapiens), chimpanzee (Pan troglodytes), mouse (Mus musculus) and rat (Rattus norvegicus) for evidence of gene conversion. We identified seven gene families (WD repeat protein family, Ferritin Heavy Chain family, RAS-related Protein RAB-40 family, Diphosphoinositol polyphosphate phosphohydrolase family, Transcription Elongation Factor A family, LDOC1-related family, Zinc Finger Protein ZIC, and GLI family) that show evidence of gene conversion. Through phylogenetic analyses and synteny evidence, we show that gene conversion has played an important role in the evolution of these gene families and that gene conversion has occurred independently in both primates and rodents. Comparing the results with those of two gene conversion prediction programs (GENECONV and Partimatrix), we found that both GENECONV and Partimatrix have very high false negative rates (i.e. failed to predict gene conversions), which leads to many undetected gene conversions. The combination of phylogenetic analyses with physical synteny evidence exhibits high resolution in the detection of gene conversions.

  15. MADS-box gene evolution - structure and transcription patterns

    DEFF Research Database (Denmark)

    Johansen, Bo; Pedersen, Louise Buchholt; Skipper, Martin

    2002-01-01

    Mads-box genes, ABC model, Evolution, Phylogeny, Transcription patterns, Gene structure, Conserved motifs......Mads-box genes, ABC model, Evolution, Phylogeny, Transcription patterns, Gene structure, Conserved motifs...

  16. The sieve element occlusion gene family in dicotyledonous plants.

    Science.gov (United States)

    Ernst, Antonia M; Rüping, Boris; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A

    2011-01-01

    Sieve element occlusion (SEO) genes encoding forisome subunits have been identified in Medicago truncatula and other legumes. Forisomes are structural phloem proteins uniquely found in Fabaceae sieve elements. They undergo a reversible conformational change after wounding, from a condensed to a dispersed state, thereby blocking sieve tube translocation and preventing the loss of photoassimilates. Recently, we identified SEO genes in several non-Fabaceae plants (lacking forisomes) and concluded that they most probably encode conventional non-forisome P-proteins. Molecular and phylogenetic analysis of the SEO gene family has identified domains that are characteristic for SEO proteins. Here, we extended our phylogenetic analysis by including additional SEO genes from several diverse species based on recently published genomic data. Our results strengthen the original assumption that SEO genes seem to be widespread in dicotyledonous angiosperms, and further underline the divergent evolution of SEO genes within the Fabaceae.

  17. Identification and analysis of YELLOW protein family genes in the silkworm, Bombyx mori

    Directory of Open Access Journals (Sweden)

    Yi Yong-Zhu

    2006-08-01

    Full Text Available Abstract Background The major royal jelly proteins/yellow (MRJP/YELLOW family possesses several physiological and chemical functions in the development of Apis mellifera and Drosophila melanogaster. Each protein of the family has a conserved domain named MRJP. However, there is no report of MRJP/YELLOW family proteins in the Lepidoptera. Results Using the YELLOW protein sequence in Drosophila melanogaster to BLAST silkworm EST database, we found a gene family composed of seven members with a conserved MRJP domain each and named it YELLOW protein family of Bombyx mori. We completed the cDNA sequences with RACE method. The protein of each member possesses a MRJP domain and a putative cleavable signal peptide consisting of a hydrophobic sequence. In view of genetic evolution, the whole Bm YELLOW protein family composes a monophyletic group, which is distinctly separate from Drosophila melanogaster and Apis mellifera. We then showed the tissue expression profiles of Bm YELLOW protein family genes by RT-PCR. Conclusion A Bombyx mori YELLOW protein family is found to be composed of at least seven members. The low homogeneity and unique pattern of gene expression by each member among the family ensure us to prophesy that the members of Bm YELLOW protein family would play some important physiological functions in silkworm development.

  18. Duplications and losses in gene families of rust pathogens highlight putative effectors.

    Science.gov (United States)

    Pendleton, Amanda L; Smith, Katherine E; Feau, Nicolas; Martin, Francis M; Grigoriev, Igor V; Hamelin, Richard; Nelson, C Dana; Burleigh, J Gordon; Davis, John M

    2014-01-01

    Rust fungi are a group of fungal pathogens that cause some of the world's most destructive diseases of trees and crops. A shared characteristic among rust fungi is obligate biotrophy, the inability to complete a lifecycle without a host. This dependence on a host species likely affects patterns of gene expansion, contraction, and innovation within rust pathogen genomes. The establishment of disease by biotrophic pathogens is reliant upon effector proteins that are encoded in the fungal genome and secreted from the pathogen into the host's cell apoplast or within the cells. This study uses a comparative genomic approach to elucidate putative effectors and determine their evolutionary histories. We used OrthoMCL to identify nearly 20,000 gene families in proteomes of 16 diverse fungal species, which include 15 basidiomycetes and one ascomycete. We inferred patterns of duplication and loss for each gene family and identified families with distinctive patterns of expansion/contraction associated with the evolution of rust fungal genomes. To recognize potential contributors for the unique features of rust pathogens, we identified families harboring secreted proteins that: (i) arose or expanded in rust pathogens relative to other fungi, or (ii) contracted or were lost in rust fungal genomes. While the origin of rust fungi appears to be associated with considerable gene loss, there are many gene duplications associated with each sampled rust fungal genome. We also highlight two putative effector gene families that have expanded in Cqf that we hypothesize have roles in pathogenicity.

  19. Phylogenetics and evolution of Trx SET genes in fully sequenced land plants.

    Science.gov (United States)

    Zhu, Xinyu; Chen, Caoyi; Wang, Baohua

    2012-04-01

    Plant Trx SET proteins are involved in H3K4 methylation and play a key role in plant floral development. Genes encoding Trx SET proteins constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. To investigate the evolutionary history of the Trx SET gene family, we made a comprehensive evolutionary analysis on this gene family from 13 major representatives of green plants. A novel clustering (here named as cpTrx clade), which included the III-1, III-2, and III-4 orthologous groups, previously resolved was identified. Our analysis showed that plant Trx proteins possessed a variety of domain organizations and gene structures among paralogs. Additional domains such as PHD, PWWP, and FYR were early integrated into primordial SET-PostSET domain organization of cpTrx clade. We suggested that the PostSET domain was lost in some members of III-4 orthologous group during the evolution of land plants. At least four classes of gene structures had been formed at the early evolutionary stage of land plants. Three intronless orphan Trx SET genes from the Physcomitrella patens (moss) were identified, and supposedly, their parental genes have been eliminated from the genome. The structural differences among evolutionary groups of plant Trx SET genes with different functions were described, contributing to the design of further experimental studies.

  20. Identification of a novel gene family that includes the interferon-inducible human genes 6–16 and ISG12

    Directory of Open Access Journals (Sweden)

    Parker Nadeene

    2004-01-01

    Full Text Available Abstract Background The human 6–16 and ISG12 genes are transcriptionally upregulated in a variety of cell types in response to type I interferon (IFN. The predicted products of these genes are small (12.9 and 11.5 kDa respectively, hydrophobic proteins that share 36% overall amino acid identity. Gene disruption and over-expression studies have so far failed to reveal any biochemical or cellular roles for these proteins. Results We have used in silico analyses to identify a novel family of genes (the ISG12 gene family related to both the human 6–16 and ISG12 genes. Each ISG12 family member codes for a small hydrophobic protein containing a conserved ~80 amino-acid motif (the ISG12 motif. So far we have detected 46 family members in 25 organisms, ranging from unicellular eukaryotes to humans. Humans have four ISG12 genes: the 6–16 gene at chromosome 1p35 and three genes (ISG12(a, ISG12(b and ISG12(c clustered at chromosome 14q32. Mice have three family members (ISG12(a, ISG12(b1 and ISG12(b2 clustered at chromosome 12F1 (syntenic with human chromosome 14q32. There does not appear to be a murine 6–16 gene. On the basis of phylogenetic analyses, genomic organisation and intron-alignments we suggest that this family has arisen through divergent inter- and intra-chromosomal gene duplication events. The transcripts from human and mouse genes are detectable, all but two (human ISG12(b and ISG12(c being upregulated in response to type I IFN in the cell lines tested. Conclusions Members of the eukaryotic ISG12 gene family encode a small hydrophobic protein with at least one copy of a newly defined motif of ~80 amino-acids (the ISG12 motif. In higher eukaryotes, many of the genes have acquired a responsiveness to type I IFN during evolution suggesting that a role in resisting cellular or environmental stress may be a unifying property of all family members. Analysis of gene-function in higher eukaryotes is complicated by the possibility of

  1. The SWEET gene family in Hevea brasiliensis - its evolution and expression compared with four other plant species.

    Science.gov (United States)

    Sui, Jin-Lei; Xiao, Xiao-Hu; Qi, Ji-Yan; Fang, Yong-Jun; Tang, Chao-Rong

    2017-12-01

    SWEET proteins play an indispensable role as a sugar efflux transporter in plant development and stress responses. The SWEET genes have previously been characterized in several plants. Here, we present a comprehensive analysis of this gene family in the rubber tree, Hevea brasiliensis . There are 36 members of the SWEET gene family in this species, making it one of the largest families in plant genomes sequenced so far. Structure and phylogeny analyses of these genes in Hevea and in other species demonstrated broad evolutionary conservation. RNA-seq analyses revealed that SWEET2, 16, and 17 might represent the main evolutionary direction of SWEET genes in plants. Our results in Hevea suggested the involvement of HbSWEET1a , 2e , 2f , and 3b in phloem loading, HbSWEET10a and 16b in laticifer sugar transport , and HbSWEET9a in nectary-specific sugar transport. Parallel studies of RNA-seq analyses extended to three other plant species ( Manihot esculenta , Populus trichocarpa , and Arabidopsis thaliana ) produced findings which implicated MeSWEET10a, 3a, and 15b in M. esculenta storage root development, and the involvement of PtSWEET16b and PtSWEET16d in P. trichocarpa xylem development. RT-qPCR results further revealed that HbSWEET10a, 16b, and 1a play important roles in phloem sugar transport. The results from this study provide a foundation not only for further investigation into the functionality of the SWEET gene family in Hevea, especially in its sugar transport for latex production, but also for related studies of this gene family in the plant kingdom.

  2. Embryonic expression of zebrafish MiT family genes tfe3b, tfeb, and tfec.

    Science.gov (United States)

    Lister, James A; Lane, Brandon M; Nguyen, Anhthu; Lunney, Katherine

    2011-11-01

    The MiT family comprises four genes in mammals: Mitf, Tfe3, Tfeb, and Tfec, which encode transcription factors of the basic-helix-loop-helix/leucine zipper class. Mitf is well-known for its essential role in the development of melanocytes, however the functions of the other members of this family, and of interactions between them, are less well understood. We have now characterized the complete set of MiT genes from zebrafish, which totals six instead of four. The zebrafish genome contain two mitf (mitfa and mitfb), two tfe3 (tfe3a and tfe3b), and single tfeb and tfec genes; this distribution is shared with other teleosts. We present here the sequence and embryonic expression patterns for the zebrafish tfe3b, tfeb, and tfec genes, and identify a new isoform of tfe3a. These findings will assist in elucidating the roles of the MiT gene family over the course of vertebrate evolution. Copyright © 2011 Wiley-Liss, Inc.

  3. Transcriptome analyses of the Dof-like gene family in grapevine reveal its involvement in berry, flower and seed development.

    Science.gov (United States)

    da Silva, Danielle Costenaro; da Silveira Falavigna, Vítor; Fasoli, Marianna; Buffon, Vanessa; Porto, Diogo Denardi; Pappas, Georgios Joannis; Pezzotti, Mario; Pasquali, Giancarlo; Revers, Luís Fernando

    2016-01-01

    The Dof (DNA-binding with one finger) protein family spans a group of plant transcription factors involved in the regulation of several functions, such as plant responses to stress, hormones and light, phytochrome signaling and seed germination. Here we describe the Dof-like gene family in grapevine (Vitis vinifera L.), which consists of 25 genes coding for Dof. An extensive in silico characterization of the VviDofL gene family was performed. Additionally, the expression of the entire gene family was assessed in 54 grapevine tissues and organs using an integrated approach with microarray (cv Corvina) and real-time PCR (cv Pinot Noir) analyses. The phylogenetic analysis comparing grapevine sequences with those of Arabidopsis, tomato, poplar and already described Dof genes in other species allowed us to identify several duplicated genes. The diversification of grapevine DofL genes during evolution likely resulted in a broader range of biological roles. Furthermore, distinct expression patterns were identified between samples analyzed, corroborating such hypothesis. Our expression results indicate that several VviDofL genes perform their functional roles mainly during flower, berry and seed development, highlighting their importance for grapevine growth and production. The identification of similar expression profiles between both approaches strongly suggests that these genes have important regulatory roles that are evolutionally conserved between grapevine cvs Corvina and Pinot Noir.

  4. Molecular evolution of a chordate specific family of G protein-coupled receptors

    Directory of Open Access Journals (Sweden)

    Leese Florian

    2011-08-01

    Full Text Available Abstract Background Chordate evolution is a history of innovations that is marked by physical and behavioral specializations, which led to the development of a variety of forms from a single ancestral group. Among other important characteristics, vertebrates obtained a well developed brain, anterior sensory structures, a closed circulatory system and gills or lungs as blood oxygenation systems. The duplication of pre-existing genes had profound evolutionary implications for the developmental complexity in vertebrates, since mutations modifying the function of a duplicated protein can lead to novel functions, improving the evolutionary success. Results We analyzed here the evolution of the GPRC5 family of G protein-coupled receptors by comprehensive similarity searches and found that the receptors are only present in chordates and that the size of the receptor family expanded, likely due to genome duplication events in the early history of vertebrate evolution. We propose that a single GPRC5 receptor coding gene originated in a stem chordate ancestor and gave rise by duplication events to a gene family comprising three receptor types (GPRC5A-C in vertebrates, and a fourth homologue present only in mammals (GPRC5D. Additional duplications of GPRC5B and GPRC5C sequences occurred in teleost fishes. The finding that the expression patterns of the receptors are evolutionarily conserved indicates an important biological function of these receptors. Moreover, we found that expression of GPRC5B is regulated by vitamin A in vivo, confirming previous findings that linked receptor expression to retinoic acid levels in tumor cell lines and strengthening the link between the receptor expression and the development of a complex nervous system in chordates, known to be dependent on retinoic acid signaling. Conclusions GPRC5 receptors, a class of G protein-coupled receptors with unique sequence characteristics, may represent a molecular novelty that helped non

  5. The Eucalyptus terpene synthase gene family.

    Science.gov (United States)

    Külheim, Carsten; Padovan, Amanda; Hefer, Charles; Krause, Sandra T; Köllner, Tobias G; Myburg, Alexander A; Degenhardt, Jörg; Foley, William J

    2015-06-11

    Terpenoids are abundant in the foliage of Eucalyptus, providing the characteristic smell as well as being valuable economically and influencing ecological interactions. Quantitative and qualitative inter- and intra- specific variation of terpenes is common in eucalypts. The genome sequences of Eucalyptus grandis and E. globulus were mined for terpene synthase genes (TPS) and compared to other plant species. We investigated the relative expression of TPS in seven plant tissues and functionally characterized five TPS genes from E. grandis. Compared to other sequenced plant genomes, Eucalyptus grandis has the largest number of putative functional TPS genes of any sequenced plant. We discovered 113 and 106 putative functional TPS genes in E. grandis and E. globulus, respectively. All but one TPS from E. grandis were expressed in at least one of seven plant tissues examined. Genomic clusters of up to 20 genes were identified. Many TPS are expressed in tissues other than leaves which invites a re-evaluation of the function of terpenes in Eucalyptus. Our data indicate that terpenes in Eucalyptus may play a wider role in biotic and abiotic interactions than previously thought. Tissue specific expression is common and the possibility of stress induction needs further investigation. Phylogenetic comparison of the two investigated Eucalyptus species gives insight about recent evolution of different clades within the TPS gene family. While the majority of TPS genes occur in orthologous pairs some clades show evidence of recent gene duplication, as well as loss of function.

  6. Expression of venom gene homologs in diverse python tissues suggests a new model for the evolution of snake venom.

    Science.gov (United States)

    Reyes-Velasco, Jacobo; Card, Daren C; Andrew, Audra L; Shaney, Kyle J; Adams, Richard H; Schield, Drew R; Casewell, Nicholas R; Mackessy, Stephen P; Castoe, Todd A

    2015-01-01

    Snake venom gene evolution has been studied intensively over the past several decades, yet most previous studies have lacked the context of complete snake genomes and the full context of gene expression across diverse snake tissues. We took a novel approach to studying snake venom evolution by leveraging the complete genome of the Burmese python, including information from tissue-specific patterns of gene expression. We identified the orthologs of snake venom genes in the python genome, and conducted detailed analysis of gene expression of these venom homologs to identify patterns that differ between snake venom gene families and all other genes. We found that venom gene homologs in the python are expressed in many different tissues outside of oral glands, which illustrates the pitfalls of using transcriptomic data alone to define "venom toxins." We hypothesize that the python may represent an ancestral state prior to major venom development, which is supported by our finding that the expansion of venom gene families is largely restricted to highly venomous caenophidian snakes. Therefore, the python provides insight into biases in which genes were recruited for snake venom systems. Python venom homologs are generally expressed at lower levels, have higher variance among tissues, and are expressed in fewer organs compared with all other python genes. We propose a model for the evolution of snake venoms in which venom genes are recruited preferentially from genes with particular expression profile characteristics, which facilitate a nearly neutral transition toward specialized venom system expression. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. Gene structure, phylogeny and expression profile of the sucrose synthase gene family in cacao (Theobroma cacao L.).

    Science.gov (United States)

    Li, Fupeng; Hao, Chaoyun; Yan, Lin; Wu, Baoduo; Qin, Xiaowei; Lai, Jianxiong; Song, Yinghui

    2015-09-01

    In higher plants, sucrose synthase (Sus, EC 2.4.1.13) is widely considered as a key enzyme involved in sucrose metabolism. Although, several paralogous genes encoding different isozymes of Sus have been identified and characterized in multiple plant genomes, to date detailed information about the Sus genes is lacking for cacao. This study reports the identification of six novel Sus genes from economically important cacao tree. Analyses of the gene structure and phylogeny of the Sus genes demonstrated evolutionary conservation in the Sus family across cacao and other plant species. The expression of cacao Sus genes was investigated via real-time PCR in various tissues, different developmental phases of leaf, flower bud and pod. The Sus genes exhibited distinct but partially redundant expression profiles in cacao, with TcSus1, TcSus5 and TcSus6, being the predominant genes in the bark with phloem, TcSus2 predominantly expressing in the seed during the stereotype stage. TcSus3 and TcSus4 were significantly detected more in the pod husk and seed coat along the pod development, and showed development dependent expression profiles in the cacao pod. These results provide new insights into the evolution, and basic information that will assist in elucidating the functions of cacao Sus gene family.

  8. Rapid evolution of cancer/testis genes on the X chromosome

    Directory of Open Access Journals (Sweden)

    Simpson Andrew J

    2007-05-01

    Full Text Available Abstract Background Cancer/testis (CT genes are normally expressed only in germ cells, but can be activated in the cancer state. This unusual property, together with the finding that many CT proteins elicit an antigenic response in cancer patients, has established a role for this class of genes as targets in immunotherapy regimes. Many families of CT genes have been identified in the human genome, but their biological function for the most part remains unclear. While it has been shown that some CT genes are under diversifying selection, this question has not been addressed before for the class as a whole. Results To shed more light on this interesting group of genes, we exploited the generation of a draft chimpanzee (Pan troglodytes genomic sequence to examine CT genes in an organism that is closely related to human, and generated a high-quality, manually curated set of human:chimpanzee CT gene alignments. We find that the chimpanzee genome contains homologues to most of the human CT families, and that the genes are located on the same chromosome and at a similar copy number to those in human. Comparison of putative human:chimpanzee orthologues indicates that CT genes located on chromosome X are diverging faster and are undergoing stronger diversifying selection than those on the autosomes or than a set of control genes on either chromosome X or autosomes. Conclusion Given their high level of diversifying selection, we suggest that CT genes are primarily responsible for the observed rapid evolution of protein-coding genes on the X chromosome.

  9. Segmental duplications and evolutionary acquisition of UV damage response in the SPATA31 gene family of primates and humans.

    Science.gov (United States)

    Bekpen, Cemalettin; Künzel, Sven; Xie, Chen; Eaaswarkhanth, Muthukrishnan; Lin, Yen-Lung; Gokcumen, Omer; Akdis, Cezmi A; Tautz, Diethard

    2017-03-06

    Segmental duplications are an abundant source for novel gene functions and evolutionary adaptations. This mechanism of generating novelty was very active during the evolution of primates particularly in the human lineage. Here, we characterize the evolution and function of the SPATA31 gene family (former designation FAM75A), which was previously shown to be among the gene families with the strongest signal of positive selection in hominoids. The mouse homologue for this gene family is a single copy gene expressed during spermatogenesis. We show that in primates, the SPATA31 gene duplicated into SPATA31A and SPATA31C types and broadened the expression into many tissues. Each type became further segmentally duplicated in the line towards humans with the largest number of full-length copies found for SPATA31A in humans. Copy number estimates of SPATA31A based on digital PCR show an average of 7.5 with a range of 5-11 copies per diploid genome among human individuals. The primate SPATA31 genes also acquired new protein domains that suggest an involvement in UV response and DNA repair. We generated antibodies and show that the protein is re-localized from the nucleolus to the whole nucleus upon UV-irradiation suggesting a UV damage response. We used CRISPR/Cas mediated mutagenesis to knockout copies of the gene in human primary fibroblast cells. We find that cell lines with reduced functional copies as well as naturally occurring low copy number HFF cells show enhanced sensitivity towards UV-irradiation. The acquisition of new SPATA31 protein functions and its broadening of expression may be related to the evolution of the diurnal life style in primates that required a higher UV tolerance. The increased segmental duplications in hominoids as well as its fast evolution suggest the acquisition of further specific functions particularly in humans.

  10. Duplications and losses in gene families of rust pathogens highlight putative effectors

    Directory of Open Access Journals (Sweden)

    Amanda L. Pendleton

    2014-06-01

    Full Text Available Rust fungi are a group of fungal pathogens that cause some of the world’s most destructive diseases of trees and crops. A shared characteristic among rust fungi is obligate biotrophy, the inability to complete a lifecycle without a host. This dependence on a host species likely affects patterns of gene expansion, contraction, and innovation within rust pathogen genomes. The establishment of disease by biotrophic pathogens is reliant upon effector proteins that are encoded in the fungal genome and secreted from the pathogen into the host’s cell apoplast or within the cells. This study uses a comparative genomic approach to elucidate putative effectors and determine their evolutionary histories. We used OrthoMCL to identify nearly 20,000 gene families in proteomes of sixteen diverse fungal species, which include fifteen basidiomycetes and one ascomycete. We inferred patterns of duplication and loss for each gene family and identified families with distinctive patterns of expansion/contraction associated with the evolution of rust fungal genomes. To recognize potential contributors for the unique features of rust pathogens, we identified families harboring secreted proteins that: i arose or expanded in rust pathogens relative to other fungi, or ii contracted or were lost in rust fungal genomes. While the origin of rust fungi appears to be associated with considerable gene loss, there are many gene duplications associated with each sampled rust fungal genome. We also highlight two putative effector gene families that have expanded in Cqf that we hypothesize have roles in pathogenicity.

  11. Neutral and Non-Neutral Evolution of Duplicated Genes with Gene Conversion

    Directory of Open Access Journals (Sweden)

    Jeffrey A. Fawcett

    2011-02-01

    Full Text Available Gene conversion is one of the major mutational mechanisms involved in the DNA sequence evolution of duplicated genes. It contributes to create unique patters of DNA polymorphism within species and divergence between species. A typical pattern is so-called concerted evolution, in which the divergence between duplicates is maintained low for a long time because of frequent exchanges of DNA fragments. In addition, gene conversion affects the DNA evolution of duplicates in various ways especially when selection operates. Here, we review theoretical models to understand the evolution of duplicates in both neutral and non-neutral cases. We also explain how these theories contribute to interpreting real polymorphism and divergence data by using some intriguing examples.

  12. Inventing an arsenal: adaptive evolution and neofunctionalization of snake venom phospholipase A2 genes

    Directory of Open Access Journals (Sweden)

    Lynch Vincent J

    2007-01-01

    Full Text Available Abstract Background Gene duplication followed by functional divergence has long been hypothesized to be the main source of molecular novelty. Convincing examples of neofunctionalization, however, remain rare. Snake venom phospholipase A2 genes are members of large multigene families with many diverse functions, thus they are excellent models to study the emergence of novel functions after gene duplications. Results Here, I show that positive Darwinian selection and neofunctionalization is common in snake venom phospholipase A2 genes. The pattern of gene duplication and positive selection indicates that adaptive molecular evolution occurs immediately after duplication events as novel functions emerge and continues as gene families diversify and are refined. Surprisingly, adaptive evolution of group-I phospholipases in elapids is also associated with speciation events, suggesting adaptation of the phospholipase arsenal to novel prey species after niche shifts. Mapping the location of sites under positive selection onto the crystal structure of phospholipase A2 identified regions evolving under diversifying selection are located on the molecular surface and are likely protein-protein interactions sites essential for toxin functions. Conclusion These data show that increases in genomic complexity (through gene duplications can lead to phenotypic complexity (venom composition and that positive Darwinian selection is a common evolutionary force in snake venoms. Finally, regions identified under selection on the surface of phospholipase A2 enzymes are potential candidate sites for structure based antivenin design.

  13. Evolution of vertebrate central nervous system is accompanied by novel expression changes of duplicate genes.

    Science.gov (United States)

    Chen, Yuan; Ding, Yun; Zhang, Zuming; Wang, Wen; Chen, Jun-Yuan; Ueno, Naoto; Mao, Bingyu

    2011-12-20

    The evolution of the central nervous system (CNS) is one of the most striking changes during the transition from invertebrates to vertebrates. As a major source of genetic novelties, gene duplication might play an important role in the functional innovation of vertebrate CNS. In this study, we focused on a group of CNS-biased genes that duplicated during early vertebrate evolution. We investigated the tempo-spatial expression patterns of 33 duplicate gene families and their orthologs during the embryonic development of the vertebrate Xenopus laevis and the cephalochordate Brachiostoma belcheri. Almost all the identified duplicate genes are differentially expressed in the CNS in Xenopus embryos, and more than 50% and 30% duplicate genes are expressed in the telencephalon and mid-hindbrain boundary, respectively, which are mostly considered as two innovations in the vertebrate CNS. Interestingly, more than 50% of the amphioxus orthologs do not show apparent expression in the CNS in amphioxus embryos as detected by in situ hybridization, indicating that some of the vertebrate CNS-biased duplicate genes might arise from non-CNS genes in invertebrates. Our data accentuate the functional contribution of gene duplication in the CNS evolution of vertebrate and uncover an invertebrate non-CNS history for some vertebrate CNS-biased duplicate genes. Copyright © 2011. Published by Elsevier Ltd.

  14. WD-repeat instability and diversification of the Podospora anserina hnwd non-self recognition gene family.

    Science.gov (United States)

    Chevanne, Damien; Saupe, Sven J; Clavé, Corinne; Paoletti, Mathieu

    2010-05-06

    Genes involved in non-self recognition and host defence are typically capable of rapid diversification and exploit specialized genetic mechanism to that end. Fungi display a non-self recognition phenomenon termed heterokaryon incompatibility that operates when cells of unlike genotype fuse and leads to the cell death of the fusion cell. In the fungus Podospora anserina, three genes controlling this allorecognition process het-d, het-e and het-r are paralogs belonging to the same hnwd gene family. HNWD proteins are STAND proteins (signal transduction NTPase with multiple domains) that display a WD-repeat domain controlling recognition specificity. Based on genomic sequence analysis of different P. anserina isolates, it was established that repeat regions of all members of the gene family are extremely polymorphic and undergoing concerted evolution arguing for frequent recombination within and between family members. Herein, we directly analyzed the genetic instability and diversification of this allorecognition gene family. We have constituted a collection of 143 spontaneous mutants of the het-R (HNWD2) and het-E (hnwd5) genes with altered recognition specificities. The vast majority of the mutants present rearrangements in the repeat arrays with deletions, duplications and other modifications as well as creation of novel repeat unit variants. We investigate the extreme genetic instability of these genes and provide a direct illustration of the diversification strategy of this eukaryotic allorecognition gene family.

  15. Gene Structures, Evolution, Classification and Expression Profiles of the Aquaporin Gene Family in Castor Bean (Ricinus communis L..

    Directory of Open Access Journals (Sweden)

    Zhi Zou

    Full Text Available Aquaporins (AQPs are a class of integral membrane proteins that facilitate the passive transport of water and other small solutes across biological membranes. Castor bean (Ricinus communis L., Euphobiaceae, an important non-edible oilseed crop, is widely cultivated for industrial, medicinal and cosmetic purposes. Its recently available genome provides an opportunity to analyze specific gene families. In this study, a total of 37 full-length AQP genes were identified from the castor bean genome, which were assigned to five subfamilies, including 10 plasma membrane intrinsic proteins (PIPs, 9 tonoplast intrinsic proteins (TIPs, 8 NOD26-like intrinsic proteins (NIPs, 6 X intrinsic proteins (XIPs and 4 small basic intrinsic proteins (SIPs on the basis of sequence similarities. Functional prediction based on the analysis of the aromatic/arginine (ar/R selectivity filter, Froger's positions and specificity-determining positions (SDPs showed a remarkable difference in substrate specificity among subfamilies. Homology analysis supported the expression of all 37 RcAQP genes in at least one of examined tissues, e.g., root, leaf, flower, seed and endosperm. Furthermore, global expression profiles with deep transcriptome sequencing data revealed diverse expression patterns among various tissues. The current study presents the first genome-wide analysis of the AQP gene family in castor bean. Results obtained from this study provide valuable information for future functional analysis and utilization.

  16. Insights into the molecular evolution of the PDZ/LIM family and identification of a novel conserved protein motif.

    Directory of Open Access Journals (Sweden)

    Aartjan J W Te Velthuis

    Full Text Available The PDZ and LIM domain-containing protein family is encoded by a diverse group of genes whose phylogeny has currently not been analyzed. In mammals, ten genes are found that encode both a PDZ- and one or several LIM-domains. These genes are: ALP, RIL, Elfin (CLP36, Mystique, Enigma (LMP-1, Enigma homologue (ENH, ZASP (Cypher, Oracle, LMO7 and the two LIM domain kinases (LIMK1 and LIMK2. As conventional alignment and phylogenetic procedures of full-length sequences fell short of elucidating the evolutionary history of these genes, we started to analyze the PDZ and LIM domain sequences themselves. Using information from most sequenced eukaryotic lineages, our phylogenetic analysis is based on full-length cDNA-, EST-derived- and genomic- PDZ and LIM domain sequences of over 25 species, ranging from yeast to humans. Plant and protozoan homologs were not found. Our phylogenetic analysis identifies a number of domain duplication and rearrangement events, and shows a single convergent event during evolution of the PDZ/LIM family. Further, we describe the separation of the ALP and Enigma subfamilies in lower vertebrates and identify a novel consensus motif, which we call 'ALP-like motif' (AM. This motif is highly-conserved between ALP subfamily proteins of diverse organisms. We used here a combinatorial approach to define the relation of the PDZ and LIM domain encoding genes and to reconstruct their phylogeny. This analysis allowed us to classify the PDZ/LIM family and to suggest a meaningful model for the molecular evolution of the diverse gene architectures found in this multi-domain family.

  17. Early events in the evolution of spider silk genes.

    Directory of Open Access Journals (Sweden)

    James Starrett

    Full Text Available Silk spinning is essential to spider ecology and has had a key role in the expansive diversification of spiders. Silk is composed primarily of proteins called spidroins, which are encoded by a multi-gene family. Spidroins have been studied extensively in the derived clade, Orbiculariae (orb-weavers, from the suborder Araneomorphae ('true spiders'. Orbicularians produce a suite of different silks, and underlying this repertoire is a history of duplication and spidroin gene divergence. A second class of silk proteins, Egg Case Proteins (ECPs, is known only from the orbicularian species, Lactrodectus hesperus (Western black widow. In L. hesperus, ECPs bond with tubuliform spidroins to form egg case silk fibers. Because most of the phylogenetic diversity of spiders has not been sampled for their silk genes, there is limited understanding of spidroin gene family history and the prevalence of ECPs. Silk genes have not been reported from the suborder Mesothelae (segmented spiders, which diverged from all other spiders >380 million years ago, and sampling from Mygalomorphae (tarantulas, trapdoor spiders and basal araneomorph lineages is sparse. In comparison to orbicularians, mesotheles and mygalomorphs have a simpler silk biology and thus are hypothesized to have less diversity of silk genes. Here, we present cDNAs synthesized from the silk glands of six mygalomorph species, a mesothele, and a non-orbicularian araneomorph, and uncover a surprisingly rich silk gene diversity. In particular, we find ECP homologs in the mesothele, suggesting that ECPs were present in the common ancestor of extant spiders, and originally were not specialized to complex with tubuliform spidroins. Furthermore, gene-tree/species-tree reconciliation analysis reveals that numerous spidroin gene duplications occurred after the split between Mesothelae and Opisthothelae (Mygalomorphae plus Araneomorphae. We use the spidroin gene tree to reconstruct the evolution of amino acid

  18. Evolution and functional insights of different ancestral orthologous clades of chitin synthase genes in the fungal tree of life

    Directory of Open Access Journals (Sweden)

    Mu eLi

    2016-02-01

    Full Text Available Chitin synthases (CHSs are key enzymes in the biosynthesis of chitin, an important structural component of fungal cell walls that can trigger innate immune responses in host plants and animals. Members of CHS gene family perform various functions in fungal cellular processes. Previous studies focused primarily on classifying diverse CHSs into different classes, regardless of their functional diversification, or on characterizing their functions in individual fungal species. A complete and systematic comparative analysis of CHS genes based on their orthologous relationships will be valuable for elucidating the evolution and functions of different CHS genes in fungi. Here, we identified and compared members of the CHS gene family across the fungal tree of life, including 18 divergent fungal lineages. Phylogenetic analysis revealed that the fungal CHS gene family is comprised of at least 10 ancestral orthologous clades, which have undergone multiple independent duplications and losses in different fungal lineages during evolution. Interestingly, one of these CHS clades (class III was expanded in plant or animal pathogenic fungi belonging to different fungal lineages. Two clades (classes VIb and VIc identified for the first time in this study occurred mainly in plant pathogenic fungi from Sordariomycetes and Dothideomycetes. Moreover, members of classes III and VIb were specifically up-regulated during plant infection, suggesting important roles in pathogenesis. In addition, CHS-associated networks conserved among plant pathogenic fungi are involved in various biological processes, including sexual reproduction and plant infection. We also identified specificity-determining sites, many of which are located at or adjacent to important structural and functional sites that are potentially responsible for functional divergence of different CHS classes. Overall, our results provide new insights into the evolution and function of members of CHS gene

  19. New genes as drivers of phenotypic evolution

    Science.gov (United States)

    Chen, Sidi; Krinsky, Benjamin H.; Long, Manyuan

    2014-01-01

    During the course of evolution, genomes acquire novel genetic elements as sources of functional and phenotypic diversity, including new genes that originated in recent evolution. In the past few years, substantial progress has been made in understanding the evolution and phenotypic effects of new genes. In particular, an emerging picture is that new genes, despite being present in the genomes of only a subset of species, can rapidly evolve indispensable roles in fundamental biological processes, including development, reproduction, brain function and behaviour. The molecular underpinnings of how new genes can develop these roles are starting to be characterized. These recent discoveries yield fresh insights into our broad understanding of biological diversity at refined resolution. PMID:23949544

  20. Positive selection in the SLC11A1 gene in the family Equidae.

    Science.gov (United States)

    Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan; Orlando, Ludovic; Horin, Petr

    2016-05-01

    Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence identity across the family. Single nucleotide polymorphisms (SNPs) were found in the coding and noncoding regions of the gene. Seven codon sites were identified to be under strong purifying selection. Codons located in three regions, including the glycosylated extracellular loop, were shown to be under diversifying selection. A 3-bp indel resulting in a deletion of the amino acid 321 in the predicted protein was observed in all horses, while it has been maintained in all other equid species. This codon comprised in an N-glycosylation site was found to be under positive selection. Interspecific variation in the presence of predicted N-glycosylation sites was observed.

  1. Genome-Wide Identification, Molecular Evolution, and Expression Profiling Analysis of Pectin Methylesterase Inhibitor Genes in Brassica campestris ssp. chinensis

    Directory of Open Access Journals (Sweden)

    Tingting Liu

    2018-05-01

    Full Text Available Pectin methylesterase inhibitor genes (PMEIs are a large multigene family and play crucial roles in cell wall modifications in plant growth and development. Here, a comprehensive analysis of the PMEI gene family in Brassica campestris, an important leaf vegetable, was performed. We identified 100 Brassica campestris PMEI genes (BcPMEIs, among which 96 BcPMEIs were unevenly distributed on 10 chromosomes and nine tandem arrays containing 20 BcPMEIs were found. We also detected 80 pairs of syntenic PMEI orthologs. These findings indicated that whole-genome triplication (WGT and tandem duplication (TD were the main mechanisms accounting for the current number of BcPMEIs. In evolution, BcPMEIs were retained preferentially and biasedly, consistent with the gene balance hypothesis and two-step theory, respectively. The molecular evolution analysis of BcPMEIs manifested that they evolved through purifying selection and the divergence time is in accordance with the WGT data of B. campestris. To obtain the functional information of BcPMEIs, the expression patterns in five tissues and the cis-elements distributed in promoter regions were investigated. This work can provide a better understanding of the molecular evolution and biological function of PMEIs in B. campestris.

  2. Comparative genome sequencing of drosophila pseudoobscura: Chromosomal, gene and cis-element evolution

    Energy Technology Data Exchange (ETDEWEB)

    Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Todd, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catherine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenee; Verduzco, Daniel; Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

    2004-04-01

    The genome sequence of a second fruit fly, D. pseudoobscura, presents an opportunity for comparative analysis of a primary model organism D. melanogaster. The vast majority of Drosophila genes have remained on the same arm, but within each arm gene order has been extensively reshuffled leading to the identification of approximately 1300 syntenic blocks. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 35 My since divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome wide average consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than control sequences between the species but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a picture of repeat mediated chromosomal rearrangement, and high co-adaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila.

  3. Genome-wide analysis of the ATP-binding cassette (ABC) transporter gene family in sea lamprey and Japanese lamprey.

    Science.gov (United States)

    Ren, Jianfeng; Chung-Davidson, Yu-Wen; Yeh, Chu-Yin; Scott, Camille; Brown, Titus; Li, Weiming

    2015-06-06

    Lampreys are extant representatives of the jawless vertebrate lineage that diverged from jawed vertebrates around 500 million years ago. Lamprey genomes contain information crucial for understanding the evolution of gene families in vertebrates. The ATP-binding cassette (ABC) gene family is found from prokaryotes to eukaryotes. The recent availability of two lamprey draft genomes from sea lamprey Petromyzon marinus and Japanese lamprey Lethenteron japonicum presents an opportunity to infer early evolutionary events of ABC genes in vertebrates. We conducted a genome-wide survey of the ABC gene family in two lamprey draft genomes. A total of 37 ABC transporters were identified and classified into seven subfamilies; namely seven ABCA genes, 10 ABCB genes, 10 ABCC genes, three ABCD genes, one ABCE gene, three ABCF genes, and three ABCG genes. The ABCA subfamily has expanded from three genes in sea squirts, seven and nine in lampreys and zebrafish, to 13 and 16 in human and mouse. Conversely, the multiple copies of ABCB1-, ABCG1-, and ABCG2-like genes found in sea squirts have contracted in the other species examined. ABCB2 and ABCB3 seem to be new additions in gnathostomes (not in sea squirts or lampreys), which coincides with the emergence of the gnathostome-specific adaptive immune system. All the genes in the ABCD, ABCE and ABCF subfamilies were conserved and had undergone limited duplication and loss events. In the sea lamprey transcriptomes, the ABCE and ABCF gene subfamilies were ubiquitously and highly expressed in all tissues while the members in other gene subfamilies were differentially expressed. Thirteen more lamprey ABC transporter genes were identified in this study compared with a previous study. By concatenating the same gene sequences from the two lampreys, more full length sequences were obtained, which significantly improved both the assignment of gene names and the phylogenetic trees compared with a previous analysis using partial sequences. The ABC

  4. The map-1 gene family in root-knot nematodes, Meloidogyne spp.: a set of taxonomically restricted genes specific to clonal species.

    Directory of Open Access Journals (Sweden)

    Iva Tomalova

    Full Text Available Taxonomically restricted genes (TRGs, i.e., genes that are restricted to a limited subset of phylogenetically related organisms, may be important in adaptation. In parasitic organisms, TRG-encoded proteins are possible determinants of the specificity of host-parasite interactions. In the root-knot nematode (RKN Meloidogyne incognita, the map-1 gene family encodes expansin-like proteins that are secreted into plant tissues during parasitism, thought to act as effectors to promote successful root infection. MAP-1 proteins exhibit a modular architecture, with variable number and arrangement of 58 and 13-aa domains in their central part. Here, we address the evolutionary origins of this gene family using a combination of bioinformatics and molecular biology approaches. Map-1 genes were solely identified in one single member of the phylum Nematoda, i.e., the genus Meloidogyne, and not detected in any other nematode, thus indicating that the map-1 gene family is indeed a TRG family. A phylogenetic analysis of the distribution of map-1 genes in RKNs further showed that these genes are specifically present in species that reproduce by mitotic parthenogenesis, with the exception of M. floridensis, and could not be detected in RKNs reproducing by either meiotic parthenogenesis or amphimixis. These results highlight the divergence between mitotic and meiotic RKN species as a critical transition in the evolutionary history of these parasites. Analysis of the sequence conservation and organization of repeated domains in map-1 genes suggests that gene duplication(s together with domain loss/duplication have contributed to the evolution of the map-1 family, and that some strong selection mechanism may be acting upon these genes to maintain their functional role(s in the specificity of the plant-RKN interactions.

  5. Rapid evolution of Beta-keratin genes contribute to phenotypic differences that distinguish turtles and birds from other reptiles.

    Science.gov (United States)

    Li, Yang I; Kong, Lesheng; Ponting, Chris P; Haerty, Wilfried

    2013-01-01

    Sequencing of vertebrate genomes permits changes in distinct protein families, including gene gains and losses, to be ascribed to lineage-specific phenotypes. A prominent example of this is the large-scale duplication of beta-keratin genes in the ancestors of birds, which was crucial to the subsequent evolution of their beaks, claws, and feathers. Evidence suggests that the shell of Pseudomys nelsoni contains at least 16 beta-keratins proteins, but it is unknown whether this is a complete set and whether their corresponding genes are orthologous to avian beak, claw, or feather beta-keratin genes. To address these issues and to better understand the evolution of the turtle shell at a molecular level, we surveyed the diversity of beta-keratin genes from the genome assemblies of three turtles, Chrysemys picta, Pelodiscus sinensis, and Chelonia mydas, which together represent over 160 Myr of chelonian evolution. For these three turtles, we found 200 beta-keratins, which indicate that, as for birds, a large expansion of beta-keratin genes in turtles occurred concomitantly with the evolution of a unique phenotype, namely, their plastron and carapace. Phylogenetic reconstruction of beta-keratin gene evolution suggests that separate waves of gene duplication within a single genomic location gave rise to scales, claws, and feathers in birds, and independently the scutes of the shell in turtles.

  6. Characterization of the bovine pregnancy-associated glycoprotein gene family – analysis of gene sequences, regulatory regions within the promoter and expression of selected genes

    Directory of Open Access Journals (Sweden)

    Walker Angela M

    2009-04-01

    Full Text Available Abstract Background The Pregnancy-associated glycoproteins (PAGs belong to a large family of aspartic peptidases expressed exclusively in the placenta of species in the Artiodactyla order. In cattle, the PAG gene family is comprised of at least 22 transcribed genes, as well as some variants. Phylogenetic analyses have shown that the PAG family segregates into 'ancient' and 'modern' groupings. Along with sequence differences between family members, there are clear distinctions in their spatio-temporal distribution and in their relative level of expression. In this report, 1 we performed an in silico analysis of the bovine genome to further characterize the PAG gene family, 2 we scrutinized proximal promoter sequences of the PAG genes to evaluate the evolution pressures operating on them and to identify putative regulatory regions, 3 we determined relative transcript abundance of selected PAGs during pregnancy and, 4 we performed preliminary characterization of the putative regulatory elements for one of the candidate PAGs, bovine (bo PAG-2. Results From our analysis of the bovine genome, we identified 18 distinct PAG genes and 14 pseudogenes. We observed that the first 500 base pairs upstream of the translational start site contained multiple regions that are conserved among all boPAGs. However, a preponderance of conserved regions, that harbor recognition sites for putative transcriptional factors (TFs, were found to be unique to the modern boPAG grouping, but not the ancient boPAGs. We gathered evidence by means of Q-PCR and screening of EST databases to show that boPAG-2 is the most abundant of all boPAG transcripts. Finally, we provided preliminary evidence for the role of ETS- and DDVL-related TFs in the regulation of the boPAG-2 gene. Conclusion PAGs represent a relatively large gene family in the bovine genome. The proximal promoter regions of these genes display differences in putative TF binding sites, likely contributing to observed

  7. Parallel evolution of TCP and B-class genes in Commelinaceae flower bilateral symmetry

    Directory of Open Access Journals (Sweden)

    Preston Jill C

    2012-03-01

    Full Text Available Abstract Background Flower bilateral symmetry (zygomorphy has evolved multiple times independently across angiosperms and is correlated with increased pollinator specialization and speciation rates. Functional and expression analyses in distantly related core eudicots and monocots implicate independent recruitment of class II TCP genes in the evolution of flower bilateral symmetry. Furthermore, available evidence suggests that monocot flower bilateral symmetry might also have evolved through changes in B-class homeotic MADS-box gene function. Methods In order to test the non-exclusive hypotheses that changes in TCP and B-class gene developmental function underlie flower symmetry evolution in the monocot family Commelinaceae, we compared expression patterns of teosinte branched1 (TB1-like, DEFICIENS (DEF-like, and GLOBOSA (GLO-like genes in morphologically distinct bilaterally symmetrical flowers of Commelina communis and Commelina dianthifolia, and radially symmetrical flowers of Tradescantia pallida. Results Expression data demonstrate that TB1-like genes are asymmetrically expressed in tepals of bilaterally symmetrical Commelina, but not radially symmetrical Tradescantia, flowers. Furthermore, DEF-like genes are expressed in showy inner tepals, staminodes and stamens of all three species, but not in the distinct outer tepal-like ventral inner tepals of C. communis. Conclusions Together with other studies, these data suggest parallel recruitment of TB1-like genes in the independent evolution of flower bilateral symmetry at early stages of Commelina flower development, and the later stage homeotic transformation of C. communis inner tepals into outer tepals through the loss of DEF-like gene expression.

  8. Asteroid families - Physical properties and evolution

    International Nuclear Information System (INIS)

    Chapman, C.R.; Paolicchi, P.; Zappala, V.; Binzel, R.P.; Bell, J.F.

    1989-01-01

    Asteroid families are considered to be fragments from collisional destruction of precursor bodies. However, results available on the inferred mineralogy, size distributions, and spins of family members do not confirm the expectations of the traditional model. Only a handful of nearly 100 proposed families, most of them populous, have distributions of inferred mineralogies consistent with simple cosmochemical models for parent bodies. It is suggested that most catastrophic collisions may not result in observable families, but rather in a spray of smaller particles, thus accounting for the small number of confirmed and consistent families, despite evidence for extensive collisional evolution of asteroids. 52 refs

  9. Gene family expansions and contractions are associated with host range in plant pathogens of the genus Colletotrichum.

    Science.gov (United States)

    Baroncelli, Riccardo; Amby, Daniel Buchvaldt; Zapparata, Antonio; Sarrocco, Sabrina; Vannacci, Giovanni; Le Floch, Gaétan; Harrison, Richard J; Holub, Eric; Sukno, Serenella A; Sreenivasaprasad, Surapareddy; Thon, Michael R

    2016-08-05

    Many species belonging to the genus Colletotrichum cause anthracnose disease on a wide range of plant species. In addition to their economic impact, the genus Colletotrichum is a useful model for the study of the evolution of host specificity, speciation and reproductive behaviors. Genome projects of Colletotrichum species have already opened a new era for studying the evolution of pathogenesis in fungi. We sequenced and annotated the genomes of four strains in the Colletotrichum acutatum species complex (CAsc), a clade of broad host range pathogens within the genus. The four CAsc proteomes and secretomes along with those representing an additional 13 species (six Colletotrichum spp. and seven other Sordariomycetes) were classified into protein families using a variety of tools. Hierarchical clustering of gene family and functional domain assignments, and phylogenetic analyses revealed lineage specific losses of carbohydrate-active enzymes (CAZymes) and proteases encoding genes in Colletotrichum species that have narrow host range as well as duplications of these families in the CAsc. We also found a lineage specific expansion of necrosis and ethylene-inducing peptide 1 (Nep1)-like protein (NLPs) families within the CAsc. This study illustrates the plasticity of Colletotrichum genomes, and shows that major changes in host range are associated with relatively recent changes in gene content.

  10. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis.

    Science.gov (United States)

    Bi, Changwei; Xu, Yiqing; Ye, Qiaolin; Yin, Tongming; Ye, Ning

    2016-01-01

    WRKY proteins are the zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expressions of downstream target genes. They also participate in diverse physiological and growing processes in plants. Prior to this study, a plenty of WRKY genes have been identified and characterized in herbaceous species, but there is no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct the genome-wide research for willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I-III), with five subgroups (IIa-IIe) in group II. With the multiple sequence alignment and the manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share the similar exon-intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Distinct expression profiles of SsWRKY genes with RNA sequencing data revealed that diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks. With the analyses of WRKY gene family in willow, it is not only beneficial to complete the functional and annotation information of WRKY genes family in woody plants, but also provide important references to investigate the expansion and evolution of

  11. Evolution of endogenous non-retroviral genes integrated into plant genomes

    Directory of Open Access Journals (Sweden)

    Hyosub Chu

    2014-08-01

    Full Text Available Numerous comparative genome analyses have revealed the wide extent of horizontal gene transfer (HGT in living organisms, which contributes to their evolution and genetic diversity. Viruses play important roles in HGT. Endogenous viral elements (EVEs are defined as viral DNA sequences present within the genomes of non-viral organisms. In eukaryotic cells, the majority of EVEs are derived from RNA viruses using reverse transcription. In contrast, endogenous non-retroviral elements (ENREs are poorly studied. However, the increasing availability of genomic data and the rapid development of bioinformatics tools have enabled the identification of several ENREs in various eukaryotic organisms. To date, a small number of ENREs integrated into plant genomes have been identified. Of the known non-retroviruses, most identified ENREs are derived from double-strand (ds RNA viruses, followed by single-strand (ss DNA and ssRNA viruses. At least eight virus families have been identified. Of these, viruses in the family Partitiviridae are dominant, followed by viruses of the families Chrysoviridae and Geminiviridae. The identified ENREs have been primarily identified in eudicots, followed by monocots. In this review, we briefly discuss the current view on non-retroviral sequences integrated into plant genomes that are associated with plant-virus evolution and their possible roles in antiviral resistance.

  12. Co-evolution of secondary metabolite gene clusters and their host

    DEFF Research Database (Denmark)

    Kjærbølling, Inge; Vesth, Tammi Camilla; Frisvad, Jens Christian

    Secondary metabolite gene cluster evolution is mainly driven by two events: gene duplication and annexation and horizontal gene transfer. Here we use comparative genomics of Aspergillus species to investigate the evolution of secondary metabolite (SM) gene clusters across a wide spectrum of speci....... We investigate the dynamic evolutionary relationship between the cluster and the host by examining the genes within the cluster and the number of homologous genes found within the host and in closely related species.......Secondary metabolite gene cluster evolution is mainly driven by two events: gene duplication and annexation and horizontal gene transfer. Here we use comparative genomics of Aspergillus species to investigate the evolution of secondary metabolite (SM) gene clusters across a wide spectrum of species...

  13. Genome-wide analysis of WRKY gene family in Cucumis sativus.

    Science.gov (United States)

    Ling, Jian; Jiang, Weijie; Zhang, Ying; Yu, Hongjun; Mao, Zhenchuan; Gu, Xingfang; Huang, Sanwen; Xie, Bingyan

    2011-09-28

    WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes.

  14. A remarkably stable TipE gene cluster: evolution of insect Para sodium channel auxiliary subunits

    Directory of Open Access Journals (Sweden)

    Li Jia

    2011-11-01

    -specific characteristics. Conclusions TipE-like genes form a remarkably conserved genomic cluster across all examined insect genomes. This study reveals likely structural and functional constraints on the genomic evolution of insect TipE gene family members maintained in synteny over hundreds of millions of years of evolution. The likely common origin of these NaV channel regulators with BKCa auxiliary subunits highlights the evolutionary plasticity of ion channel regulatory mechanisms.

  15. Immune genes undergo more adaptive evolution than non-immune system genes in Daphnia pulex

    Directory of Open Access Journals (Sweden)

    McTaggart Seanna J

    2012-05-01

    Full Text Available Abstract Background Understanding which parts of the genome have been most influenced by adaptive evolution remains an unsolved puzzle. Some evidence suggests that selection has the greatest impact on regions of the genome that interact with other evolving genomes, including loci that are involved in host-parasite co-evolutionary processes. In this study, we used a population genetic approach to test this hypothesis by comparing DNA sequences of 30 putative immune system genes in the crustacean Daphnia pulex with 24 non-immune system genes. Results In support of the hypothesis, results from a multilocus extension of the McDonald-Kreitman (MK test indicate that immune system genes as a class have experienced more adaptive evolution than non-immune system genes. However, not all immune system genes show evidence of adaptive evolution. Additionally, we apply single locus MK tests and calculate population genetic parameters at all loci in order to characterize the mode of selection (directional versus balancing in the genes that show the greatest deviation from neutral evolution. Conclusions Our data are consistent with the hypothesis that immune system genes undergo more adaptive evolution than non-immune system genes, possibly as a result of host-parasite arms races. The results of these analyses highlight several candidate loci undergoing adaptive evolution that could be targeted in future studies.

  16. Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family.

    Science.gov (United States)

    Guo, Chunlei; Guo, Rongrong; Xu, Xiaozhao; Gao, Min; Li, Xiaoqin; Song, Junyang; Zheng, Yi; Wang, Xiping

    2014-04-01

    WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY genes coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I-III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 were altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments.

  17. Natural killer cell receptor genes in the family Equidae: not only Ly49.

    Directory of Open Access Journals (Sweden)

    Jan Futas

    Full Text Available Natural killer (NK cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for

  18. Natural Killer Cell Receptor Genes in the Family Equidae: Not only Ly49

    Science.gov (United States)

    Futas, Jan; Horin, Petr

    2013-01-01

    Natural killer (NK) cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR) and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR) represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA) and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM) domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for evolutionary biology of

  19. Evolution of the P-type II ATPase gene family in the fungi and presence of structural genomic changes among isolates of Glomus intraradices

    Directory of Open Access Journals (Sweden)

    Sanders Ian R

    2006-03-01

    Full Text Available Abstract Background The P-type II ATPase gene family encodes proteins with an important role in adaptation of the cell to variation in external K+, Ca2+ and Na2+ concentrations. The presence of P-type II gene subfamilies that are specific for certain kingdoms has been reported but was sometimes contradicted by discovery of previously unknown homologous sequences in newly sequenced genomes. Members of this gene family have been sampled in all of the fungal phyla except the arbuscular mycorrhizal fungi (AMF; phylum Glomeromycota, which are known to play a key-role in terrestrial ecosystems and to be genetically highly variable within populations. Here we used highly degenerate primers on AMF genomic DNA to increase the sampling of fungal P-Type II ATPases and to test previous predictions about their evolution. In parallel, homologous sequences of the P-type II ATPases have been used to determine the nature and amount of polymorphism that is present at these loci among isolates of Glomus intraradices harvested from the same field. Results In this study, four P-type II ATPase sub-families have been isolated from three AMF species. We show that, contrary to previous predictions, P-type IIC ATPases are present in all basal fungal taxa. Additionally, P-Type IIE ATPases should no longer be considered as exclusive to the Ascomycota and the Basidiomycota, since we also demonstrate their presence in the Zygomycota. Finally, a comparison of homologous sequences encoding P-type IID ATPases showed unexpectedly that indel mutations among coding regions, as well as specific gene duplications occur among AMF individuals within the same field. Conclusion On the basis of these results we suggest that the diversification of P-Type IIC and E ATPases followed the diversification of the extant fungal phyla with independent events of gene gains and losses. Consistent with recent findings on the human genome, but at a much smaller geographic scale, we provided evidence

  20. Genome-wide evolutionary characterization and expression analyses of WRKY family genes in Brachypodium distachyon.

    Science.gov (United States)

    Wen, Feng; Zhu, Hong; Li, Peng; Jiang, Min; Mao, Wenqing; Ong, Chermaine; Chu, Zhaoqing

    2014-06-01

    Members of plant WRKY gene family are ancient transcription factors that function in plant growth and development and respond to biotic and abiotic stresses. In our present study, we have investigated WRKY family genes in Brachypodium distachyon, a new model plant of family Poaceae. We identified a total of 86 WRKY genes from B. distachyon and explored their chromosomal distribution and evolution, domain alignment, promoter cis-elements, and expression profiles. Combining the analysis of phylogenetic tree of BdWRKY genes and the result of expression profiling, results showed that most of clustered gene pairs had higher similarities in the WRKY domain, suggesting that they might be functionally redundant. Neighbour-joining analysis of 301 WRKY domains from Oryza sativa, Arabidopsis thaliana, and B. distachyon suggested that BdWRKY domains are evolutionarily more closely related to O. sativa WRKY domains than those of A. thaliana. Moreover, tissue-specific expression profile of BdWRKY genes and their responses to phytohormones and several biotic or abiotic stresses were analysed by quantitative real-time PCR. The results showed that the expression of BdWRKY genes was rapidly regulated by stresses and phytohormones, and there was a strong correlation between promoter cis-elements and the phytohormones-induced BdWRKY gene expression. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  1. Reconstruction of Oomycete Genome Evolution Identifies Differences in Evolutionary Trajectories Leading to Present-Day Large Gene Families

    NARCIS (Netherlands)

    Seidl, M.F.; Ackerveken, van den G.; Govers, F.; Snel, B.

    2012-01-01

    The taxonomic class of oomycetes contains numerous pathogens of plants and animals but is related to nonpathogenic diatoms and brown algae. Oomycetes have flexible genomes comprising large gene families that play roles in pathogenicity. The evolutionary processes that shaped the gene content have

  2. CLE peptide-encoding gene families in Medicago truncatula and Lotus japonicus, compared with those of soybean, common bean and Arabidopsis

    DEFF Research Database (Denmark)

    Hastwell, April H; de Bang, Thomas Christian; Gresshoff, Peter M

    2017-01-01

    these complete CLE peptide-encoding gene families with those of fellow legumes, Glycine max and Phaseolus vulgaris, in addition to the model plant Arabidopsis thaliana. This approach provided insight into the evolution of CLE peptide families and enabled us to establish putative M. truncatula and L. japonicus...

  3. Diversification of CYCLOIDEA-like genes in Dipsacaceae (Dipsacales: implications for the evolution of capitulum inflorescences

    Directory of Open Access Journals (Sweden)

    Carlson Sara E

    2011-11-01

    ray florets. The similarity in CYC-like gene diversification seen in Dipsacaceae and some members of the Asteraceae sets the stage to investigate whether the convergent evolution of capitulum inflorescences in both groups may have been underlain by convergent evolution in the same gene family.

  4. Evolution and Diversity of Biosynthetic Gene Clusters in Fusarium

    Directory of Open Access Journals (Sweden)

    Koen Hoogendoorn

    2018-06-01

    Full Text Available Plant pathogenic fungi in the Fusarium genus cause severe damage to crops, resulting in great financial losses and health hazards. Specialized metabolites synthesized by these fungi are known to play key roles in the infection process, and to provide survival advantages inside and outside the host. However, systematic studies of the evolution of specialized metabolite-coding potential across Fusarium have been scarce. Here, we apply a combination of bioinformatic approaches to identify biosynthetic gene clusters (BGCs across publicly available genomes from Fusarium, to group them into annotated families and to study gain/loss events of BGC families throughout the history of the genus. Comparison with MIBiG reference BGCs allowed assignment of 29 gene cluster families (GCFs to pathways responsible for the production of known compounds, while for 57 GCFs, the molecular products remain unknown. Comparative analysis of BGC repertoires using ancestral state reconstruction raised several new hypotheses on how BGCs contribute to Fusarium pathogenicity or host specificity, sometimes surprisingly so: for example, a gene cluster for the biosynthesis of hexadehydro-astechrome was identified in the genome of the biocontrol strain Fusarium oxysporum Fo47, while being absent in that of the tomato pathogen F. oxysporum f.sp. lycopersici. Several BGCs were also identified on supernumerary chromosomes; heterologous expression of genes for three terpene synthases encoded on the Fusarium poae supernumerary chromosome and subsequent GC/MS analysis showed that these genes are functional and encode enzymes that each are able to synthesize koraiol; this observed functional redundancy supports the hypothesis that localization of copies of BGCs on supernumerary chromosomes provides freedom for evolutionary innovations to occur, while the original function remains conserved. Altogether, this systematic overview of biosynthetic diversity in Fusarium paves the way for

  5. Evolution of High Cellulolytic Activity in Symbiotic Streptomyces through Selection of Expanded Gene Content and Coordinated Gene Expression

    Science.gov (United States)

    McDonald, Bradon R.; Takasuka, Taichi E.; Wendt-Pienkowski, Evelyn; Doering, Drew T.; Raffa, Kenneth F.; Fox, Brian G.; Currie, Cameron R.

    2016-01-01

    The evolution of cellulose degradation was a defining event in the history of life. Without efficient decomposition and recycling, dead plant biomass would quickly accumulate and become inaccessible to terrestrial food webs and the global carbon cycle. On land, the primary drivers of plant biomass deconstruction are fungi and bacteria in the soil or associated with herbivorous eukaryotes. While the ecological importance of plant-decomposing microbes is well established, little is known about the distribution or evolution of cellulolytic activity in any bacterial genus. Here we show that in Streptomyces, a genus of Actinobacteria abundant in soil and symbiotic niches, the ability to rapidly degrade cellulose is largely restricted to two clades of host-associated strains and is not a conserved characteristic of the Streptomyces genus or host-associated strains. Our comparative genomics identify that while plant biomass degrading genes (CAZy) are widespread in Streptomyces, key enzyme families are enriched in highly cellulolytic strains. Transcriptomic analyses demonstrate that cellulolytic strains express a suite of multi-domain CAZy enzymes that are coregulated by the CebR transcriptional regulator. Using targeted gene deletions, we verify the importance of a highly expressed cellulase (GH6 family cellobiohydrolase) and the CebR transcriptional repressor to the cellulolytic phenotype. Evolutionary analyses identify complex genomic modifications that drive plant biomass deconstruction in Streptomyces, including acquisition and selective retention of CAZy genes and transcriptional regulators. Our results suggest that host-associated niches have selected some symbiotic Streptomyces for increased cellulose degrading activity and that symbiotic bacteria are a rich biochemical and enzymatic resource for biotechnology. PMID:27276034

  6. Evolution and variation of multigene families

    CERN Document Server

    Ohta, Tomoko

    1980-01-01

    During the last decade and a half, studies of evolution and variation have been revolutionized by the introduction of the methods and concepts of molecular genetics. We can now construct reliable phylogenetic trees, even when fossil records are missing, by compara­ tive studies of protein or mRNA sequences. If, in addition, paleon­ tological information is available, we can estimate the rate at which genes are substituted in the species in the course of evolution. Through the application of electrophoretic methods, it has become possible to study intraspecific variation in molecular terms. We now know that an immense genetic variability exists in a sexually repro­ ducing species, and our human species is no exception. The mathematical theory of population genetics (particularly its stochastic aspects) in conjunction with these new developments led us to formulate the "neutral theory" of molecular evolution, pointing out that chance, in the form of random gene frequency drift, is playing a much more importa...

  7. Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution

    Energy Technology Data Exchange (ETDEWEB)

    Kass, D.H. [Louisiana State Univ. Medical Center, New Orleans, LA (United States). Dept. of Biochemistry and Molecular Biology; Batzer, M.A. [Lawrence Livermore National Lab., CA (United States); Deininger, P.L. [Louisiana State Univ. Medical Center, New Orleans, LA (United States). Dept. of Biochemistry and Molecular Biology]|[Alton Ochsner Medical Foundation, New Orleans, LA (United States). Lab. of Molecular Genetics

    1995-01-01

    The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolutions. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome. However, sequence analysis of these genomes revealed a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.

  8. Venom Evolution

    Indian Academy of Sciences (India)

    IAS Admin

    Therefore, the platypus sequence was studied to quantify the role of gene duplication in the evolution of venom. ... Platypus venom is present only in males and is used for asserting dominance over com- petitors during the ... Certain toxin gene families are known to re- peatedly evolve through gene duplications. The rapidly ...

  9. A contribution to the study of plant development evolution based on gene co-expression networks

    Directory of Open Access Journals (Sweden)

    Francisco J. Romero-Campero

    2013-08-01

    Full Text Available Phototrophic eukaryotes are among the most successful organisms on Earth due to their unparalleled efficiency at capturing light energy and fixing carbon dioxide to produce organic molecules. A conserved and efficient network of light-dependent regulatory modules could be at the bases of this success. This regulatory system conferred early advantages to phototrophic eukaryotes that allowed for specialization, complex developmental processes and modern plant characteristics. We have studied light-dependent gene regulatory modules from algae to plants employing integrative-omics approaches based on gene co-expression networks. Our study reveals some remarkably conserved ways in which eukaryotic phototrophs deal with day length and light signaling. Here we describe how a family of Arabidopsis transcription factors involved in photoperiod response has evolved from a single algal gene according to the innovation, amplification and divergence theory of gene evolution by duplication. These modifications of the gene co-expression networks from the ancient unicellular green algae Chlamydomonas reinhardtii to the modern brassica Arabidopsis thaliana may hint on the evolution and specialization of plants and other organisms.

  10. Genome-Wide Identification and Evolution of HECT Genes in Soybean

    Directory of Open Access Journals (Sweden)

    Xianwen Meng

    2015-04-01

    Full Text Available Proteins containing domains homologous to the E6-associated protein (E6-AP carboxyl terminus (HECT are an important class of E3 ubiquitin ligases involved in the ubiquitin proteasome pathway. HECT-type E3s play crucial roles in plant growth and development. However, current understanding of plant HECT genes and their evolution is very limited. In this study, we performed a genome-wide analysis of the HECT domain-containing genes in soybean. Using high-quality genome sequences, we identified 19 soybean HECT genes. The predicted HECT genes were distributed unevenly across 15 of 20 chromosomes. Nineteen of these genes were inferred to be segmentally duplicated gene pairs, suggesting that in soybean, segmental duplications have made a significant contribution to the expansion of the HECT gene family. Phylogenetic analysis showed that these HECT genes can be divided into seven groups, among which gene structure and domain architecture was relatively well-conserved. The Ka/Ks ratios show that after the duplication events, duplicated HECT genes underwent purifying selection. Moreover, expression analysis reveals that 15 of the HECT genes in soybean are differentially expressed in 14 tissues, and are often highly expressed in the flowers and roots. In summary, this work provides useful information on which further functional studies of soybean HECT genes can be based.

  11. Genome-wide analysis of WRKY gene family in the sesame genome and identification of the WRKY genes involved in responses to abiotic stresses.

    Science.gov (United States)

    Li, Donghua; Liu, Pan; Yu, Jingyin; Wang, Linhai; Dossa, Komivi; Zhang, Yanxin; Zhou, Rong; Wei, Xin; Zhang, Xiurong

    2017-09-11

    Sesame (Sesamum indicum L.) is one of the world's most important oil crops. However, it is susceptible to abiotic stresses in general, and to waterlogging and drought stresses in particular. The molecular mechanisms of abiotic stress tolerance in sesame have not yet been elucidated. The WRKY domain transcription factors play significant roles in plant growth, development, and responses to stresses. However, little is known about the number, location, structure, molecular phylogenetics, and expression of the WRKY genes in sesame. We performed a comprehensive study of the WRKY gene family in sesame and identified 71 SiWRKYs. In total, 65 of these genes were mapped to 15 linkage groups within the sesame genome. A phylogenetic analysis was performed using a related species (Arabidopsis thaliana) to investigate the evolution of the sesame WRKY genes. Tissue expression profiles of the WRKY genes demonstrated that six SiWRKY genes were highly expressed in all organs, suggesting that these genes may be important for plant growth and organ development in sesame. Analysis of the SiWRKY gene expression patterns revealed that 33 and 26 SiWRKYs respond strongly to waterlogging and drought stresses, respectively. Changes in the expression of 12 SiWRKY genes were observed at different times after the waterlogging and drought treatments had begun, demonstrating that sesame gene expression patterns vary in response to abiotic stresses. In this study, we analyzed the WRKY family of transcription factors encoded by the sesame genome. Insight was gained into the classification, evolution, and function of the SiWRKY genes, revealing their putative roles in a variety of tissues. Responses to abiotic stresses in different sesame cultivars were also investigated. The results of our study provide a better understanding of the structures and functions of sesame WRKY genes and suggest that manipulating these WRKYs could enhance resistance to waterlogging and drought.

  12. Foxtail millet NF-Y families: genome-wide survey and evolution analyses identified two functional genes important in abiotic stresses

    Directory of Open Access Journals (Sweden)

    Zhi-Juan eFeng

    2015-12-01

    Full Text Available It was reported that Nuclear Factor Y (NF-Y genes were involved in abiotic stress in plants. Foxtail millet (Setaria italica, an elite stress tolerant crop, provided an impetus for the investigation of the NF-Y families in abiotic responses. In the present study, a total of 39 NF-Y genes were identified in foxtail millet. Synteny analyses suggested that foxtail millet NF-Y genes had experienced rapid expansion and strong purifying selection during the process of plant evolution. De novo transcriptome assembly of foxtail millet revealed 11 drought up-regulated NF-Y genes. SiNF-YA1 and SiNF-YB8 were highly activated in leaves and/or roots by drought and salt stresses. Abscisic acid (ABA and H2O2 played positive roles in the induction of SiNF-YA1 and SiNF-YB8 under stress treatments. Transient luciferase (LUC expression assays revealed that SiNF-YA1 and SiNF-YB8 could activate the LUC gene driven by the tobacco (Nicotiana tobacam NtERD10, NtLEA5, NtCAT, NtSOD or NtPOD promoter under normal or stress conditions. Overexpression of SiNF-YA1 enhanced drought and salt tolerance by activating stress-related genes NtERD10 and NtCAT1 and by maintaining relatively stable relative water content (RWC and contents of chlorophyll, superoxide dismutase (SOD, peroxidase (POD, catalase (CAT and malondialdehyde (MDA in transgenic lines under stresses. SiNF-YB8 regulated expression of NtSOD, NtPOD, NtLEA5 and NtERD10 and conferred relatively high RWC and chlorophyll contents and low MDA content, resulting in drought and osmotic tolerance in transgenic lines under stresses. Therefore, SiNF-YA1 and SiNF-YB8 could activate stress-related genes and improve physiological traits, resulting in tolerance to abiotic stresses in plants. All these results will facilitate functional characterization of foxtail millet NF-Ys in future studies.

  13. Evolutionary and polymorphism analyses reveal the central role of BTN3A2 in the concerted evolution of the BTN3 gene family.

    Science.gov (United States)

    Afrache, Hassnae; Pontarotti, Pierre; Abi-Rached, Laurent; Olive, Daniel

    2017-06-01

    The butyrophilin 3 (BTN3) receptors are implicated in the T lymphocytes regulation and present a wide plasticity in mammals. In order to understand how these genes have been diversified, we studied their evolution and show that the three human BTN3 are the result of two successive duplications in Primates and that the three genes are present in Hominoids and the Old World Monkey groups. A thorough phylogenetic analysis reveals a concerted evolution of BTN3 characterized by a strong and recurrent homogenization of the region encoding the signal peptide and the immunoglobulin variable (IgV) domain in Hominoids, where the sequences of BTN3A1 or BTN3A3 are replaced by BTN3A2 sequence. In human, the analysis of the diversity of these genes in 1683 individuals representing 26 worldwide populations shows that the three genes are polymorphic, with more than 46 alleles for each gene, and marked by extreme homogenization of the IgV sequences. The same analysis performed for the BTN2 genes shows also a concerted evolution; however, it is not as strong and recurrent as for BTN3. This study shows that BTN3 receptors are marked by extreme concerted evolution at the IgV domain and that BTN3A2 plays a central role in this evolution.

  14. Genome-wide analysis suggests high level of microsynteny and purifying selection affect the evolution of EIN3/EIL family in Rosaceae.

    Science.gov (United States)

    Cao, Yunpeng; Han, Yahui; Meng, Dandan; Li, Dahui; Jin, Qing; Lin, Yi; Cai, Yongping

    2017-01-01

    The ethylene-insensitive3/ethylene-insensitive3-like ( EIN3/EIL ) proteins are a type of nuclear-localized protein with DNA-binding activity in plants. Although the EIN3/EIL gene family has been studied in several plant species, little is known about comprehensive study of the EIN3/EIL gene family in Rosaceae. In this study, ten, five, four, and five EIN3/EIL genes were identified in the genomes of pear ( Pyrus bretschneideri ), mei ( Prunus mume ), peach ( Prunus persica ) and strawberry ( Fragaria vesca ), respectively. Twenty-eight chromosomal segments of EIL/EIN3 gene family were found in four Rosaceae species, and these segments could form seven orthologous or paralogous groups based on interspecies or intraspecies gene colinearity (microsynteny) analysis. Moreover, the highly conserved regions of microsynteny were found in four Rosaceae species. Subsequently it was found that both whole genome duplication and tandem duplication events significantly contributed to the EIL/EIN3 gene family expansion. Gene expression analysis of the EIL/EIN3 genes in the pear revealed subfunctionalization for several PbEIL genes derived from whole genome duplication. It is noteworthy that according to environmental selection pressure analysis, the strong purifying selection should dominate the maintenance of the EIL/EIN3 gene family in four Rosaceae species. These results provided useful information on Rosaceae EIL/EIN3 genes, as well as insights into the evolution of this gene family in four Rosaceae species. Furthermore, high level of microsynteny in the four Rosaceae plants suggested that a large-scale genome duplication event in the EIL/EIN3 gene family was predated to speciation.

  15. Genome-wide analysis suggests high level of microsynteny and purifying selection affect the evolution of EIN3/EIL family in Rosaceae

    Directory of Open Access Journals (Sweden)

    Yunpeng Cao

    2017-05-01

    Full Text Available The ethylene-insensitive3/ethylene-insensitive3-like (EIN3/EIL proteins are a type of nuclear-localized protein with DNA-binding activity in plants. Although the EIN3/EIL gene family has been studied in several plant species, little is known about comprehensive study of the EIN3/EIL gene family in Rosaceae. In this study, ten, five, four, and five EIN3/EIL genes were identified in the genomes of pear (Pyrus bretschneideri, mei (Prunus mume, peach (Prunus persica and strawberry (Fragaria vesca, respectively. Twenty-eight chromosomal segments of EIL/EIN3 gene family were found in four Rosaceae species, and these segments could form seven orthologous or paralogous groups based on interspecies or intraspecies gene colinearity (microsynteny analysis. Moreover, the highly conserved regions of microsynteny were found in four Rosaceae species. Subsequently it was found that both whole genome duplication and tandem duplication events significantly contributed to the EIL/EIN3 gene family expansion. Gene expression analysis of the EIL/EIN3 genes in the pear revealed subfunctionalization for several PbEIL genes derived from whole genome duplication. It is noteworthy that according to environmental selection pressure analysis, the strong purifying selection should dominate the maintenance of the EIL/EIN3 gene family in four Rosaceae species. These results provided useful information on Rosaceae EIL/EIN3 genes, as well as insights into the evolution of this gene family in four Rosaceae species. Furthermore, high level of microsynteny in the four Rosaceae plants suggested that a large-scale genome duplication event in the EIL/EIN3 gene family was predated to speciation.

  16. Genome-wide comparative analysis of papain-like cysteine protease family genes in castor bean and physic nut.

    Science.gov (United States)

    Zou, Zhi; Huang, Qixing; Xie, Guishui; Yang, Lifu

    2018-01-10

    Papain-like cysteine proteases (PLCPs) are a class of proteolytic enzymes involved in many plant processes. Compared with the extensive research in Arabidopsis thaliana, little is known in castor bean (Ricinus communis) and physic nut (Jatropha curcas), two Euphorbiaceous plants without any recent whole-genome duplication. In this study, a total of 26 or 23 PLCP genes were identified from the genomes of castor bean and physic nut respectively, which can be divided into nine subfamilies based on the phylogenetic analysis: RD21, CEP, XCP, XBCP3, THI, SAG12, RD19, ALP and CTB. Although most of them harbor orthologs in Arabidopsis, several members in subfamilies RD21, CEP, XBCP3 and SAG12 form new groups or subgroups as observed in other species, suggesting specific gene loss occurred in Arabidopsis. Recent gene duplicates were also identified in these two species, but they are limited to the SAG12 subfamily and were all derived from local duplication. Expression profiling revealed diverse patterns of different family members over various tissues. Furthermore, the evolution characteristics of PLCP genes were also compared and discussed. Our findings provide a useful reference to characterize PLCP genes and investigate the family evolution in Euphorbiaceae and species beyond.

  17. The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family.

    Science.gov (United States)

    Janoušek, Václav; Karn, Robert C; Laukaitis, Christina M

    2013-05-29

    Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in

  18. Phylogenetic relationships of the Fox (Forkhead) gene family in the Bilateria

    Science.gov (United States)

    Mazet, Francoise; Yu, Jr Kai; Liberles, David A.; Holland, Linda Z.; Shimeld, Sebastian M.

    2003-01-01

    The Forkhead or Fox gene family encodes putative transcription factors. There are at least four Fox genes in yeast, 16 in Drosophila melanogaster (Dm) and 42 in humans. Recently, vertebrate Fox genes have been classified into 17 groups named FoxA to FoxQ. Here, we extend this analysis to invertebrates, using available sequences from D. melanogaster, Anopheles gambiae (Ag), Caenorhabditis elegans (Ce), the sea squirt Ciona intestinalis (Ci) and amphioxus Branchiostoma floridae (Bf), from which we also cloned several Fox genes. Phylogenetic analyses lend support to the previous overall subclassification of vertebrate genes, but suggest that four subclasses (FoxJ, L, N and Q) could be further subdivided to reflect their relationships to invertebrate genes. We were unable to identify orthologs of Fox subclasses E, H, I, J, M and Q1 in D. melanogaster, A. gambiae or C. elegans, suggesting either considerable loss in ecdysozoans or the evolution of these subclasses in the deuterostome lineage. Our analyses suggest that the common ancestor of protostomes and deuterostomes had a minimum complement of 14 Fox genes.

  19. Transcriptomic and phylogenetic analysis of Culex pipiens quinquefasciatus for three detoxification gene families

    Directory of Open Access Journals (Sweden)

    Yan Liangzhen

    2012-11-01

    susceptible strain. Fifteen detoxification genes, including 2 CCEs, 6 GSTs, and 7 P450s, were expressed at higher levels in the resistant strain. Conclusions The results of the present study provide new insights into the functions and evolution of three detoxification gene families in mosquitoes and comprehensive transcriptomic resources for C. p. quinquefasciatus, which will facilitate the elucidation of molecular mechanisms underlying the different biological characteristics of the three major mosquito vectors.

  20. Structure and evolution of the plant cation diffusion facilitator family of ion transporters

    Directory of Open Access Journals (Sweden)

    Zanis Michael J

    2011-03-01

    Full Text Available Abstract Background Members of the cation diffusion facilitator (CDF family are integral membrane divalent cation transporters that transport metal ions out of the cytoplasm either into the extracellular space or into internal compartments such as the vacuole. The spectrum of cations known to be transported by proteins of the CDF family include Zn, Fe, Co, Cd, and Mn. Members of this family have been identified in prokaryotes, eukaryotes, and archaea, and in sequenced plant genomes. CDF families range in size from nine members in Selaginella moellendorffii to 19 members in Populus trichocarpa. Phylogenetic analysis suggests that the CDF family has expanded within plants, but a definitive plant CDF family phylogeny has not been constructed. Results Representative CDF members were annotated from diverse genomes across the Viridiplantae and Rhodophyta lineages and used to identify phylogenetic relationships within the CDF family. Bayesian phylogenetic analysis of CDF amino acid sequence data supports organizing land plant CDF family sequences into 7 groups. The origin of the 7 groups predates the emergence of land plants. Among these, 5 of the 7 groups are likely to have originated at the base of the tree of life, and 2 of 7 groups appear to be derived from a duplication event prior to or coincident with land plant evolution. Within land plants, local expansion continues within select groups, while several groups are strictly maintained as one gene copy per genome. Conclusions Defining the CDF gene family phylogeny contributes to our understanding of this family in several ways. First, when embarking upon functional studies of the members, defining primary groups improves the predictive power of functional assignment of orthologous/paralogous genes and aids in hypothesis generation. Second, defining groups will allow a group-specific sequence motif to be generated that will help define future CDF family sequences and aid in functional motif

  1. Three neuropeptide Y receptor genes in the spiny dogfish, Squalus acanthias, support en bloc duplications in early vertebrate evolution.

    Science.gov (United States)

    Salaneck, Erik; Ardell, David H; Larson, Earl T; Larhammar, Dan

    2003-08-01

    It has been debated whether the increase in gene number during early vertebrate evolution was due to multiple independent gene duplications or synchronous duplications of many genes. We describe here the cloning of three neuropeptide Y (NPY) receptor genes belonging to the Y1 subfamily in the spiny dogfish, Squalus acanthias, a cartilaginous fish. The three genes are orthologs of the mammalian subtypes Y1, Y4, and Y6, which are located in paralogous gene regions on different chromosomes in mammals. Thus, these genes arose by duplications of a chromosome region before the radiation of gnathostomes (jawed vertebrates). Estimates of duplication times from linearized trees together with evidence from other gene families supports two rounds of chromosome duplications or tetraploidizations early in vertebrate evolution. The anatomical distribution of mRNA was determined by reverse-transcriptase PCR and was found to differ from mammals, suggesting differential functional diversification of the new gene copies during the radiation of the vertebrate classes.

  2. Comprehensive Genomic Identification and Expression Analysis of the Phosphate Transporter (PHT) Gene Family in Apple.

    Science.gov (United States)

    Sun, Tingting; Li, Mingjun; Shao, Yun; Yu, Lingyan; Ma, Fengwang

    2017-01-01

    Elemental phosphorus (Pi) is essential to plant growth and development. The family of phosphate transporters (PHTs) mediates the uptake and translocation of Pi inside the plants. Members include five sub-cellular phosphate transporters that play different roles in Pi uptake and transport. We searched the Genome Database for Rosaceae and identified five clusters of phosphate transporters in apple ( Malus domestica ), including 37 putative genes. The MdPHT1 family contains 14 genes while MdPHT2 has two, MdPHT3 has seven, MdPHT4 has 11, and MdPHT5 has three. Our overview of this gene family focused on structure, chromosomal distribution and localization, phylogenies, and motifs. These genes displayed differential expression patterns in various tissues. For example, expression was high for MdPHT1;12, MdPHT3;6 , and MdPHT3;7 in the roots, and was also increased in response to low-phosphorus conditions. In contrast, MdPHT4;1, MdPHT4;4 , and MdPHT4;10 were expressed only in the leaves while transcript levels of MdPHT1;4, MdPHT1;12 , and MdPHT5;3 were highest in flowers. In general, these 37 genes were regulated significantly in either roots or leaves in response to the imposition of phosphorus and/or drought stress. The results suggest that members of the PHT family function in plant adaptations to adverse growing environments. Our study will lay a foundation for better understanding the PHT family evolution and exploring genes of interest for genetic improvement in apple.

  3. The evolution of gene expression levels in mammalian organs

    DEFF Research Database (Denmark)

    Brawand, David; Soumillon, Magali; Necsulea, Anamaria

    2011-01-01

    and chromosomes, owing to differences in selective pressures: transcriptome change was slow in nervous tissues and rapid in testes, slower in rodents than in apes and monotremes, and rapid for the X chromosome right after its formation. Although gene expression evolution in mammals was strongly shaped......Changes in gene expression are thought to underlie many of the phenotypic differences between species. However, large-scale analyses of gene expression evolution were until recently prevented by technological limitations. Here we report the sequencing of polyadenylated RNA from six organs across...... ten species that represent all major mammalian lineages (placentals, marsupials and monotremes) and birds (the evolutionary outgroup), with the goal of understanding the dynamics of mammalian transcriptome evolution. We show that the rate of gene expression evolution varies among organs, lineages...

  4. Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.).

    Science.gov (United States)

    Deokar, Amit A; Tar'an, Bunyamin

    2016-01-01

    Aquaporins (AQPs) are essential membrane proteins that play critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea ( Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP's evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis -acting regulatory elements revealed enrichment of cis -elements involved in circadian control, light response, defense and stress responsiveness

  5. Systematic Identification, Evolution and Expression Analysis of the Zea mays PHT1 Gene Family Reveals Several New Members Involved in Root Colonization by Arbuscular Mycorrhizal Fungi.

    Science.gov (United States)

    Liu, Fang; Xu, Yunjian; Jiang, Huanhuan; Jiang, Chaosheng; Du, Yibin; Gong, Cheng; Wang, Wei; Zhu, Suwen; Han, Guomin; Cheng, Beijiu

    2016-06-13

    The Phosphate Transporter1 (PHT1) family of genes plays pivotal roles in the uptake of inorganic phosphate from soils. However, there is no comprehensive report on the PHT1 family in Zea mays based on the whole genome. In the present study, a total of 13 putative PHT1 genes (ZmPHT1;1 to 13) were identified in the inbred line B73 genome by bioinformatics methods. Then, their function was investigated by a yeast PHO84 mutant complementary experiment and qRT-PCR. Thirteen ZmPHT1 genes distributed on six chromosomes (1, 2, 5, 7, 8 and 10) were divided into two paralogues (Class A and Class B). ZmPHT1;1/ZmPHT1;9 and ZmPHT1;9/ZmPHT1;13 are produced from recent segmental duplication events. ZmPHT1;1/ZmPHT1;13 and ZmPHT1;8/ZmPHT1;10 are produced from early segmental duplication events. All 13 putative ZmPHT1s can completely or partly complement the yeast Pi-uptake mutant, and they were obviously induced in maize under low Pi conditions, except for ZmPHT1;1 (p < 0.01), indicating that the overwhelming majority of ZmPHT1 genes can respond to a low Pi condition. ZmPHT1;2, ZmPHT1;4, ZmPHT1;6, ZmPHT1;7, ZmPHT1;9 and ZmPHT1;11 were up-regulated by arbuscular mycorrhizal fungi (AMF), implying that these genes might participate in mediating Pi absorption and/or transport. Analysis of the promoters revealed that the MYCS and P1BS element are widely distributed on the region of different AMF-inducible ZmPHT1 promoters. In light of the above results, five of 13 ZmPHT1 genes were newly-identified AMF-inducible high-affinity phosphate transporters in the maize genome. Our results will lay a foundation for better understanding the PHT1 family evolution and the molecular mechanisms of inorganic phosphate transport under AMF inoculation.

  6. The evolution of gene expression QTL in Saccharomyces cerevisiae.

    Directory of Open Access Journals (Sweden)

    James Ronald

    2007-08-01

    Full Text Available Understanding the evolutionary forces that influence patterns of gene expression variation will provide insights into the mechanisms of evolutionary change and the molecular basis of phenotypic diversity. To date, studies of gene expression evolution have primarily been made by analyzing how gene expression levels vary within and between species. However, the fundamental unit of heritable variation in transcript abundance is the underlying regulatory allele, and as a result it is necessary to understand gene expression evolution at the level of DNA sequence variation. Here we describe the evolutionary forces shaping patterns of genetic variation for 1206 cis-regulatory QTL identified in a cross between two divergent strains of Saccharomyces cerevisiae. We demonstrate that purifying selection against mildly deleterious alleles is the dominant force governing cis-regulatory evolution in S. cerevisiae and estimate the strength of selection. We also find that essential genes and genes with larger codon bias are subject to slightly stronger cis-regulatory constraint and that positive selection has played a role in the evolution of major trans-acting QTL.

  7. Evolutionary Pattern and Regulation Analysis to Support Why Diversity Functions Existed within PPAR Gene Family Members

    Directory of Open Access Journals (Sweden)

    Tianyu Zhou

    2015-01-01

    Full Text Available Peroxisome proliferators-activated receptor (PPAR gene family members exhibit distinct patterns of distribution in tissues and differ in functions. The purpose of this study is to investigate the evolutionary impacts on diversity functions of PPAR members and the regulatory differences on gene expression patterns. 63 homology sequences of PPAR genes from 31 species were collected and analyzed. The results showed that three isolated types of PPAR gene family may emerge from twice times of gene duplication events. The conserved domains of HOLI (ligand binding domain of hormone receptors domain and ZnF_C4 (C4 zinc finger in nuclear in hormone receptors are essential for keeping basic roles of PPAR gene family, and the variant domains of LCRs may be responsible for their divergence in functions. The positive selection sites in HOLI domain are benefit for PPARs to evolve towards diversity functions. The evolutionary variants in the promoter regions and 3′ UTR regions of PPARs result into differential transcription factors and miRNAs involved in regulating PPAR members, which may eventually affect their expressions and tissues distributions. These results indicate that gene duplication event, selection pressure on HOLI domain, and the variants on promoter and 3′ UTR are essential for PPARs evolution and diversity functions acquired.

  8. Evolutionary Pattern and Regulation Analysis to Support Why Diversity Functions Existed within PPAR Gene Family Members.

    Science.gov (United States)

    Zhou, Tianyu; Yan, Xiping; Wang, Guosong; Liu, Hehe; Gan, Xiang; Zhang, Tao; Wang, Jiwen; Li, Liang

    2015-01-01

    Peroxisome proliferators-activated receptor (PPAR) gene family members exhibit distinct patterns of distribution in tissues and differ in functions. The purpose of this study is to investigate the evolutionary impacts on diversity functions of PPAR members and the regulatory differences on gene expression patterns. 63 homology sequences of PPAR genes from 31 species were collected and analyzed. The results showed that three isolated types of PPAR gene family may emerge from twice times of gene duplication events. The conserved domains of HOLI (ligand binding domain of hormone receptors) domain and ZnF_C4 (C4 zinc finger in nuclear in hormone receptors) are essential for keeping basic roles of PPAR gene family, and the variant domains of LCRs may be responsible for their divergence in functions. The positive selection sites in HOLI domain are benefit for PPARs to evolve towards diversity functions. The evolutionary variants in the promoter regions and 3' UTR regions of PPARs result into differential transcription factors and miRNAs involved in regulating PPAR members, which may eventually affect their expressions and tissues distributions. These results indicate that gene duplication event, selection pressure on HOLI domain, and the variants on promoter and 3' UTR are essential for PPARs evolution and diversity functions acquired.

  9. Stepwise Functional Evolution in a Fungal Sugar Transporter Family.

    Science.gov (United States)

    Gonçalves, Carla; Coelho, Marco A; Salema-Oom, Madalena; Gonçalves, Paula

    2016-02-01

    Sugar transport is of the utmost importance for most cells and is central to a wide range of applied fields. However, despite the straightforward in silico assignment of many novel transporters, including sugar porters, to existing families, their exact biological role and evolutionary trajectory often remain unclear, mainly because biochemical characterization of membrane proteins is inherently challenging, but also owing to their uncommonly turbulent evolutionary histories. In addition, many important shifts in membrane carrier function are apparently ancient, which further limits our ability to reconstruct evolutionary trajectories in a reliable manner. Here, we circumvented some of these obstacles by examining the relatively recent emergence of a unique family of fungal sugar facilitators, related to drug antiporters. The former transporters, named Ffz, were previously shown to be required for fructophilic metabolism in yeasts. We first exploited the wealth of fungal genomic data available to define a comprehensive but well-delimited family of Ffz-like transporters, showing that they are only present in Dikarya. Subsequently, a combination of phylogenetic analyses and in vivo functional characterization was used to retrace important changes in function, while highlighting the evolutionary events that are most likely to have determined extant distribution of the gene, such as horizontal gene transfers (HGTs). One such HGT event is proposed to have set the stage for the onset of fructophilic metabolism in yeasts, a trait that according to our results may be the metabolic hallmark of close to 100 yeast species that thrive in sugar rich environments. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  10. Repeated reunions and splits feature the highly dynamic evolution of 5S and 35S ribosomal RNA genes (rDNA) in the Asteraceae family.

    Science.gov (United States)

    Garcia, Sònia; Panero, José L; Siroky, Jiri; Kovarik, Ales

    2010-08-16

    In flowering plants and animals the most common ribosomal RNA genes (rDNA) organisation is that in which 35S (encoding 18S-5.8S-26S rRNA) and 5S genes are physically separated occupying different chromosomal loci. However, recent observations established that both genes have been unified to a single 35S-5S unit in the genus Artemisia (Asteraceae), a genomic arrangement typical of primitive eukaryotes such as yeast, among others. Here we aim to reveal the origin, distribution and mechanisms leading to the linked organisation of rDNA in the Asteraceae by analysing unit structure (PCR, Southern blot, sequencing), gene copy number (quantitative PCR) and chromosomal position (FISH) of 5S and 35S rRNA genes in approximately 200 species representing the family diversity and other closely related groups. Dominant linked rDNA genotype was found within three large groups in subfamily Asteroideae: tribe Anthemideae (93% of the studied cases), tribe Gnaphalieae (100%) and in the "Heliantheae alliance" (23%). The remaining five tribes of the Asteroideae displayed canonical non linked arrangement of rDNA, as did the other groups in the Asteraceae. Nevertheless, low copy linked genes were identified among several species that amplified unlinked units. The conserved position of functional 5S insertions downstream from the 26S gene suggests a unique, perhaps retrotransposon-mediated integration event at the base of subfamily Asteroideae. Further evolution likely involved divergence of 26S-5S intergenic spacers, amplification and homogenisation of units across the chromosomes and concomitant elimination of unlinked arrays. However, the opposite trend, from linked towards unlinked arrangement was also surmised in few species indicating possible reversibility of these processes. Our results indicate that nearly 25% of Asteraceae species may have evolved unusual linked arrangement of rRNA genes. Thus, in plants, fundamental changes in intrinsic structure of rDNA units, their copy

  11. Lineage-specific evolution of the vertebrate Otopetrin gene family revealed by comparative genomic analyses

    Directory of Open Access Journals (Sweden)

    Ryan Joseph F

    2011-01-01

    Full Text Available Abstract Background Mutations in the Otopetrin 1 gene (Otop1 in mice and fish produce an unusual bilateral vestibular pathology that involves the absence of otoconia without hearing impairment. The encoded protein, Otop1, is the only functionally characterized member of the Otopetrin Domain Protein (ODP family; the extended sequence and structural preservation of ODP proteins in metazoans suggest a conserved functional role. Here, we use the tools of sequence- and cytogenetic-based comparative genomics to study the Otop1 and the Otop2-Otop3 genes and to establish their genomic context in 25 vertebrates. We extend our evolutionary study to include the gene mutated in Usher syndrome (USH subtype 1G (Ush1g, both because of the head-to-tail clustering of Ush1g with Otop2 and because Otop1 and Ush1g mutations result in inner ear phenotypes. Results We established that OTOP1 is the boundary gene of an inversion polymorphism on human chromosome 4p16 that originated in the common human-chimpanzee lineage more than 6 million years ago. Other lineage-specific evolutionary events included a three-fold expansion of the Otop genes in Xenopus tropicalis and of Ush1g in teleostei fish. The tight physical linkage between Otop2 and Ush1g is conserved in all vertebrates. To further understand the functional organization of the Ushg1-Otop2 locus, we deduced a putative map of binding sites for CCCTC-binding factor (CTCF, a mammalian insulator transcription factor, from genome-wide chromatin immunoprecipitation-sequencing (ChIP-seq data in mouse and human embryonic stem (ES cells combined with detection of CTCF-binding motifs. Conclusions The results presented here clarify the evolutionary history of the vertebrate Otop and Ush1g families, and establish a framework for studying the possible interaction(s of Ush1g and Otop in developmental pathways.

  12. The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system.

    Science.gov (United States)

    Vonk, Freek J; Casewell, Nicholas R; Henkel, Christiaan V; Heimberg, Alysha M; Jansen, Hans J; McCleary, Ryan J R; Kerkkamp, Harald M E; Vos, Rutger A; Guerreiro, Isabel; Calvete, Juan J; Wüster, Wolfgang; Woods, Anthony E; Logan, Jessica M; Harrison, Robert A; Castoe, Todd A; de Koning, A P Jason; Pollock, David D; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S; Ribeiro, José M C; Arntzen, Jan W; van den Thillart, Guido E E J M; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P; Spaink, Herman P; Duboule, Denis; McGlinn, Edwina; Kini, R Manjunatha; Richardson, Michael K

    2013-12-17

    Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection.

  13. Rapid isolation of gene homologs across taxa: Efficient identification and isolation of gene orthologs from non-model organism genomes, a technical report

    Directory of Open Access Journals (Sweden)

    Heffer Alison

    2011-03-01

    Full Text Available Abstract Background Tremendous progress has been made in the field of evo-devo through comparisons of related genes from diverse taxa. While the vast number of species in nature precludes a complete analysis of the molecular evolution of even one single gene family, this would not be necessary to understand fundamental mechanisms underlying gene evolution if experiments could be designed to systematically sample representative points along the path of established phylogenies to trace changes in regulatory and coding gene sequence. This isolation of homologous genes from phylogenetically diverse, representative species can be challenging, especially if the gene is under weak selective pressure and evolving rapidly. Results Here we present an approach - Rapid Isolation of Gene Homologs across Taxa (RIGHT - to efficiently isolate specific members of gene families. RIGHT is based upon modification and a combination of degenerate polymerase chain reaction (PCR and gene-specific amplified fragment length polymorphism (AFLP. It allows targeted isolation of specific gene family members from any organism, only requiring genomic DNA. We describe this approach and how we used it to isolate members of several different gene families from diverse arthropods spanning millions of years of evolution. Conclusions RIGHT facilitates systematic isolation of one gene from large gene families. It allows for efficient gene isolation without whole genome sequencing, RNA extraction, or culturing of non-model organisms. RIGHT will be a generally useful method for isolation of orthologs from both distant and closely related species, increasing sample size and facilitating the tracking of molecular evolution of gene families and regulatory networks across the tree of life.

  14. Expansion of banana (Musa acuminata) gene families involved in ethylene biosynthesis and signalling after lineage-specific whole-genome duplications.

    Science.gov (United States)

    Jourda, Cyril; Cardi, Céline; Mbéguié-A-Mbéguié, Didier; Bocs, Stéphanie; Garsmeur, Olivier; D'Hont, Angélique; Yahiaoui, Nabila

    2014-05-01

    Whole-genome duplications (WGDs) are widespread in plants, and three lineage-specific WGDs occurred in the banana (Musa acuminata) genome. Here, we analysed the impact of WGDs on the evolution of banana gene families involved in ethylene biosynthesis and signalling, a key pathway for banana fruit ripening. Banana ethylene pathway genes were identified using comparative genomics approaches and their duplication modes and expression profiles were analysed. Seven out of 10 banana ethylene gene families evolved through WGD and four of them (1-aminocyclopropane-1-carboxylate synthase (ACS), ethylene-insensitive 3-like (EIL), ethylene-insensitive 3-binding F-box (EBF) and ethylene response factor (ERF)) were preferentially retained. Banana orthologues of AtEIN3 and AtEIL1, two major genes for ethylene signalling in Arabidopsis, were particularly expanded. This expansion was paralleled by that of EBF genes which are responsible for control of EIL protein levels. Gene expression profiles in banana fruits suggested functional redundancy for several MaEBF and MaEIL genes derived from WGD and subfunctionalization for some of them. We propose that EIL and EBF genes were co-retained after WGD in banana to maintain balanced control of EIL protein levels and thus avoid detrimental effects of constitutive ethylene signalling. In the course of evolution, subfunctionalization was favoured to promote finer control of ethylene signalling. © 2014 CIRAD New Phytologist © 2014 New Phytologist Trust.

  15. Genome-Wide Analysis, Classification, Evolution, and Expression Analysis of the Cytochrome P450 93 Family in Land Plants

    OpenAIRE

    Du, Hai; Ran, Feng; Dong, Hong-Li; Wen, Jing; Li, Jia-Na; Liang, Zhe

    2016-01-01

    Cytochrome P450 93 family (CYP93) belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies?CYP93A?K, with t...

  16. Gene loss, adaptive evolution and the co-evolution of plumage coloration genes with opsins in birds.

    Science.gov (United States)

    Borges, Rui; Khan, Imran; Johnson, Warren E; Gilbert, M Thomas P; Zhang, Guojie; Jarvis, Erich D; O'Brien, Stephen J; Antunes, Agostinho

    2015-10-06

    The wide range of complex photic systems observed in birds exemplifies one of their key evolutionary adaptions, a well-developed visual system. However, genomic approaches have yet to be used to disentangle the evolutionary mechanisms that govern evolution of avian visual systems. We performed comparative genomic analyses across 48 avian genomes that span extant bird phylogenetic diversity to assess evolutionary changes in the 17 representatives of the opsin gene family and five plumage coloration genes. Our analyses suggest modern birds have maintained a repertoire of up to 15 opsins. Synteny analyses indicate that PARA and PARIE pineal opsins were lost, probably in conjunction with the degeneration of the parietal organ. Eleven of the 15 avian opsins evolved in a non-neutral pattern, confirming the adaptive importance of vision in birds. Visual conopsins sw1, sw2 and lw evolved under negative selection, while the dim-light RH1 photopigment diversified. The evolutionary patterns of sw1 and of violet/ultraviolet sensitivity in birds suggest that avian ancestors had violet-sensitive vision. Additionally, we demonstrate an adaptive association between the RH2 opsin and the MC1R plumage color gene, suggesting that plumage coloration has been photic mediated. At the intra-avian level we observed some unique adaptive patterns. For example, barn owl showed early signs of pseudogenization in RH2, perhaps in response to nocturnal behavior, and penguins had amino acid deletions in RH2 sites responsible for the red shift and retinal binding. These patterns in the barn owl and penguins were convergent with adaptive strategies in nocturnal and aquatic mammals, respectively. We conclude that birds have evolved diverse opsin adaptations through gene loss, adaptive selection and coevolution with plumage coloration, and that differentiated selective patterns at the species level suggest novel photic pressures to influence evolutionary patterns of more-recent lineages.

  17. Genome wide identification and expression analysis of Homeodomain leucine zipper subfamily IV (HDZ IV gene family from Musa accuminata

    Directory of Open Access Journals (Sweden)

    Ashutosh ePandey

    2016-02-01

    Full Text Available The homedodomain zipper family (HD-ZIP of transcription factors is present only in plants and plays important role in the regulation of plant-specific processes. The subfamily IV of HDZ transcription factors (HD-ZIP IV has primarily been implicated in the regulation of epidermal structure development. Though this gene family is present in all lineages of land plants, members of this gene family have not been identified in banana, which is one of the major staple fruit crops. In the present work, we identified 21 HDZIV genes in banana by the computational analysis of banana genome resource. Our analysis suggested that these genes putatively encode proteins having all the characteristic domains of HDZIV transcription factors. The phylogenetic analysis of the banana HDZIV family genes further confirmed that after separation from a common ancestor, the banana and poales lineages might have followed distinct evolutionary paths. Further, we conclude that segmental duplication played a major role in the evolution of banana HDZIV genes. All the identified banana HDZIV genes expresses in different banana tissue, however at varying levels. The transcript levels of some of the banana HDZIV genes were also detected in banana fruit pulp, suggesting their putative role in fruit attributes. A large number of genes of this family showed modulated expression under drought and salinity stress. Taken together, the present work lays a foundation for elucidation of functional aspects of the banana HDZIV genes and for their possible use in the banana improvement programs.

  18. Genomic Survey and Expression Profiling of the MYB Gene Family in Watermelon

    Directory of Open Access Journals (Sweden)

    Qing XU

    2018-01-01

    Full Text Available Myeloblastosis (MYB proteins constitute one of the largest transcription factor (TF families in plants. They are functionally diverse in regulating plant development, metabolism, and multiple stress responses. However, the function of watermelon MYB proteins remains elusive to date. Here, a genome-wide identification of watermelon MYB TFs was performed by bioinformatics analysis. A total of 162 MYB genes were identified from watermelon (ClaMYB. A comprehensive overview of the ClaMYB genes was undertaken, including the gene structures, chromosomal distribution, gene duplication, conserved protein motif, and phylogenetic relationship. According to the analyses, the watermelon MYB genes were categorized into three groups (R1R2R3-MYB, R2R3-MYB, and MYB-related. Amino acid alignments for all MYB motifs of ClaMYBs demonstrated high conservation. Investigation of their chromosomal localization revealed that these ClaMYB genes distributed across the 11 watermelon chromosomes. Gene duplication analyses showed that tandem duplication events contributed predominantly to the expansion of the MYB gene family in the watermelon genome. Phylogenetic comparison of the ClaMYB proteins with Arabidopsis MYB proteins revealed that watermelon MYB proteins underwent a more diverse evolution after divergence from Arabidopsis. Some watermelon MYBs were found to cluster into the functional clades of Arabidopsis MYB proteins. Expression analysis under different stress conditions identified a group of watermelon MYB proteins implicated in the plant stress responses. The comprehensive investigation of watermelon MYB genes in this study provides a useful reference for future cloning and functional analysis of watermelon MYB proteins. Keywords: watermelon, MYB transcription factor, abiotic stress, phylogenetic analysis

  19. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    Directory of Open Access Journals (Sweden)

    Yong Guo

    Full Text Available The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max. In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  20. Differential roles of TGIF family genes in mammalian reproduction

    Directory of Open Access Journals (Sweden)

    Renfree Marilyn B

    2011-09-01

    Full Text Available Abstract Background TG-interacting factors (TGIFs belong to a family of TALE-homeodomain proteins including TGIF1, TGIF2 and TGIFLX/Y in human. Both TGIF1 and TGIF2 act as transcription factors repressing TGF-β signalling. Human TGIFLX and its orthologue, Tex1 in the mouse, are X-linked genes that are only expressed in the adult testis. TGIF2 arose from TGIF1 by duplication, whereas TGIFLX arose by retrotransposition to the X-chromosome. These genes have not been characterised in any non-eutherian mammals. We therefore studied the TGIF family in the tammar wallaby (a marsupial mammal to investigate their roles in reproduction and how and when these genes may have evolved their functions and chromosomal locations. Results Both TGIF1 and TGIF2 were present in the tammar genome on autosomes but TGIFLX was absent. Tammar TGIF1 shared a similar expression pattern during embryogenesis, sexual differentiation and in adult tissues to that of TGIF1 in eutherian mammals, suggesting it has been functionally conserved. Tammar TGIF2 was ubiquitously expressed throughout early development as in the human and mouse, but in the adult, it was expressed only in the gonads and spleen, more like the expression pattern of human TGIFLX and mouse Tex1. Tammar TGIF2 mRNA was specifically detected in round and elongated spermatids. There was no mRNA detected in mature spermatozoa. TGIF2 protein was specifically located in the cytoplasm of spermatids, and in the residual body and the mid-piece of the mature sperm tail. These data suggest that tammar TGIF2 may participate in spermiogenesis, like TGIFLX does in eutherians. TGIF2 was detected for the first time in the ovary with mRNA produced in the granulosa and theca cells, suggesting it may also play a role in folliculogenesis. Conclusions The restricted and very similar expression of tammar TGIF2 to X-linked paralogues in eutherians suggests that the evolution of TGIF1, TGIF2 and TGIFLX in eutherians was accompanied by

  1. The Toll-like receptor gene family is integrated into human DNA damage and p53 networks.

    Directory of Open Access Journals (Sweden)

    Daniel Menendez

    2011-03-01

    Full Text Available In recent years the functions that the p53 tumor suppressor plays in human biology have been greatly extended beyond "guardian of the genome." Our studies of promoter response element sequences targeted by the p53 master regulatory transcription factor suggest a general role for this DNA damage and stress-responsive regulator in the control of human Toll-like receptor (TLR gene expression. The TLR gene family mediates innate immunity to a wide variety of pathogenic threats through recognition of conserved pathogen-associated molecular motifs. Using primary human immune cells, we have examined expression of the entire TLR gene family following exposure to anti-cancer agents that induce the p53 network. Expression of all TLR genes, TLR1 to TLR10, in blood lymphocytes and alveolar macrophages from healthy volunteers can be induced by DNA metabolic stressors. However, there is considerable inter-individual variability. Most of the TLR genes respond to p53 via canonical as well as noncanonical promoter binding sites. Importantly, the integration of the TLR gene family into the p53 network is unique to primates, a recurrent theme raised for other gene families in our previous studies. Furthermore, a polymorphism in a TLR8 response element provides the first human example of a p53 target sequence specifically responsible for endogenous gene induction. These findings-demonstrating that the human innate immune system, including downstream induction of cytokines, can be modulated by DNA metabolic stress-have many implications for health and disease, as well as for understanding the evolution of damage and p53 responsive networks.

  2. Identification of WOX family genes in Selaginella kraussiana for studies on stem cells and regeneration in lycophytes

    Directory of Open Access Journals (Sweden)

    Yachao eGe

    2016-02-01

    Full Text Available Plant stem cells give rise to all tissues and organs and also serve as the source for plant regeneration. The organization of plant stem cells has undergone a progressive change from simple to complex during the evolution of vascular plants. Most studies on plant stem cells have focused on model angiosperms, the most recently diverged branch of vascular plants. However, our knowledge of stem cell function in other vascular plants is limited. Lycophytes and euphyllophytes (ferns, gymnosperms, and angiosperms are two existing branches of vascular plants that separated more than 400 million years ago. Lycophytes retain many of the features of early vascular plants. Based on genome and transcriptome data, we identified WUSCHEL-RELATED HOMEOBOX (WOX genes in Selaginella kraussiana, a model lycophyte that is convenient for in vitro culture and observations of organ formation and regeneration. WOX genes are key players controlling stem cells in plants. Our results showed that the S. kraussiana genome encodes at least eight members of the WOX family, which represent an early stage of WOX family evolution. Identification of WOX genes in S. kraussiana could be a useful tool for molecular studies on the function of stem cells in lycophytes.

  3. Lineage-Specific Expansion of the Chalcone Synthase Gene Family in Rosids.

    Directory of Open Access Journals (Sweden)

    Kattina Zavala

    Full Text Available Rosids are a monophyletic group that includes approximately 70,000 species in 140 families, and they are found in a variety of habitats and life forms. Many important crops such as fruit trees and legumes are rosids. The evolutionary success of this group may have been influenced by their ability to produce flavonoids, secondary metabolites that are synthetized through a branch of the phenylpropanoid pathway where chalcone synthase is a key enzyme. In this work, we studied the evolution of the chalcone synthase gene family in 12 species belonging to the rosid clade. Our results show that the last common ancestor of the rosid clade possessed six chalcone synthase gene lineages that were differentially retained during the evolutionary history of the group. In fact, of the six gene lineages that were present in the last common ancestor, 7 species retained 2 of them, whereas the other 5 only retained one gene lineage. We also show that one of the gene lineages was disproportionately expanded in species that belonged to the order Fabales (soybean, barrel medic and Lotus japonicas. Based on the available literature, we suggest that this gene lineage possesses stress-related biological functions (e.g., response to UV light, pathogen defense. We propose that the observed expansion of this clade was a result of a selective pressure to increase the amount of enzymes involved in the production of phenylpropanoid pathway-derived secondary metabolites, which is consistent with the hypothesis that suggested that lineage-specific expansions fuel plant adaptation.

  4. Characterization and Comparison of the CPK Gene Family in the Apple (Malus × domestica) and Other Rosaceae Species and Its Response to Alternaria alternata Infection.

    Science.gov (United States)

    Wei, Menghan; Wang, Sanhong; Dong, Hui; Cai, Binhua; Tao, Jianmin

    2016-01-01

    As one of the Ca2+ sensors, calcium-dependent protein kinase (CPK) plays vital roles in immune and stress signaling, growth and development, and hormone responses, etc. Recently, the whole genome of apple (Malus × domestica), pear (Pyrus communis), peach (Prunus persica), plum (Prunus mume) and strawberry (Fragaria vesca) in Rosaceae family has been fully sequenced. However, little is known about the CPK gene family in these Rosaceae species. In this study, 123 CPK genes were identified from five Rosaceae species, including 37 apple CPKs, 37 pear CPKs, 17 peach CPKs, 16 strawberry CPKs, and 16 plum CPKs. Based on the phylogenetic tree topology and structural characteristics, we divided the CPK gene family into 4 distinct subfamilies: Group I, II, III, and IV. Whole-genome duplication (WGD) or segmental duplication played vital roles in the expansion of the CPK in these Rosaceae species. Most of segmental duplication pairs in peach and plum may have arisen from the γ triplication (~140 million years ago [MYA]), while in apple genome, many duplicated genes may have been derived from a recent WGD (30~45 MYA). Purifying selection also played a critical role in the function evolution of CPK family genes. Expression of apple CPK genes in response to apple pathotype of Alternaria alternata was verified by analysis of quantitative real-time RT-PCR (qPCR). Expression data demonstrated that CPK genes in apple might have evolved independently in different biological contexts. The analysis of evolution history and expression profile laid a foundation for further examining the function and complexity of the CPK gene family in Rosaceae.

  5. The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system

    Science.gov (United States)

    Vonk, Freek J.; Casewell, Nicholas R.; Henkel, Christiaan V.; Heimberg, Alysha M.; Jansen, Hans J.; McCleary, Ryan J. R.; Kerkkamp, Harald M. E.; Vos, Rutger A.; Guerreiro, Isabel; Calvete, Juan J.; Wüster, Wolfgang; Woods, Anthony E.; Logan, Jessica M.; Harrison, Robert A.; Castoe, Todd A.; de Koning, A. P. Jason; Pollock, David D.; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B.; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S.; Ribeiro, José M. C.; Arntzen, Jan W.; van den Thillart, Guido E. E. J. M.; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P.; Spaink, Herman P.; Duboule, Denis; McGlinn, Edwina; Kini, R. Manjunatha; Richardson, Michael K.

    2013-01-01

    Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection. PMID:24297900

  6. On critical values concerning the evolution of the long period families

    International Nuclear Information System (INIS)

    Hou Xiyun

    2009-01-01

    In a previous paper, we proposed another special critical value concerning the evolution of the long period family around the equilateral equilibrium points, besides the two values given by Henrard. Are there any other special critical values? After studying the stability curves of the long period family carefully, we gave a negative answer. During the study, we found an interesting family of periodic orbits which we called the homo family. We studied the evolution of this family following the increase of μ. With these findings, we were able to explain the origin of the four branches of periodic families emanating from L 4 and the stability results of the equilateral equilibrium points.

  7. Functional evolution of a multigene family: orthologous and paralogous pheromone receptor genes in the turnip moth, Agrotis segetum.

    Directory of Open Access Journals (Sweden)

    Dan-Dan Zhang

    Full Text Available Lepidopteran pheromone receptors (PRs, for which orthologies are evident among closely related species, provide an intriguing example of gene family evolution in terms of how new functions may arise. However, only a limited number of PRs have been functionally characterized so far and thus evolutionary scenarios suffer from elements of speculation. In this study we investigated the turnip moth Agrotis segetum, in which female moths produce a mixture of chemically related pheromone components that elicit specific responses from receptor cells on male antennae. We cloned nine A. segetum PR genes and the Orco gene by degenerate primer based RT-PCR. The nine PR genes, named as AsegOR1 and AsegOR3-10, fall into four distinct orthologous clusters of known lepidopteran PRs, of which one contains six paralogues. The paralogues are under relaxed selective pressure, contrasting with the purifying selection on other clusters. We identified the receptors AsegOR9, AsegOR4 and AsegOR5, specific for the respective homologous pheromone components (Z-5-decenyl, (Z-7-dodecenyl and (Z-9-tetradecenyl acetates, by two-electrode voltage clamp recording from Xenopus laevis oocytes co-expressing Orco and each PR candidate. These receptors occur in three different orthologous clusters. We also found that the six paralogues with high sequence similarity vary dramatically in ligand selectivity and sensitivity. Different from AsegOR9, AsegOR6 showed a relatively large response to the behavioural antagonist (Z-5-decenol, and a small response to (Z-5-decenyl acetate. AsegOR1 was broadly tuned, but most responsive to (Z-5-decenyl acetate, (Z-7-dodecenyl acetate and the behavioural antagonist (Z-8-dodecenyl acetate. AsegOR8 and AsegOR7, which differ from AsegOR6 and AsegOR1 by 7 and 10 aa respectively, showed much lower sensitivities. AsegOR10 showed only small responses to all the tested compounds. These results suggest that new receptors arise through gene duplication, and

  8. Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium.

    Science.gov (United States)

    Ding, Mingquan; Chen, Jiadong; Jiang, Yurong; Lin, Lifeng; Cao, YueFen; Wang, Minhua; Zhang, Yuting; Rong, Junkang; Ye, Wuwei

    2015-02-01

    WRKY transcription factors play important roles in various stress responses in diverse plant species. In cotton, this family has not been well studied, especially in relation to fiber development. Here, the genomes and transcriptomes of Gossypium raimondii and Gossypium arboreum were investigated to identify fiber development related WRKY genes. This represents the first comprehensive comparative study of WRKY transcription factors in both diploid A and D cotton species. In total, 112 G. raimondii and 109 G. arboreum WRKY genes were identified. No significant gene structure or domain alterations were detected between the two species, but many SNPs distributed unequally in exon and intron regions. Physical mapping revealed that the WRKY genes in G. arboreum were not located in the corresponding chromosomes of G. raimondii, suggesting great chromosome rearrangement in the diploid cotton genomes. The cotton WRKY genes, especially subgroups I and II, have expanded through multiple whole genome duplications and tandem duplications compared with other plant species. Sequence comparison showed many functionally divergent sites between WRKY subgroups, while the genes within each group are under strong purifying selection. Transcriptome analysis suggested that many WRKY genes participate in specific fiber development processes such as fiber initiation, elongation and maturation with different expression patterns between species. Complex WRKY gene expression such as differential Dt and At allelic gene expression in G. hirsutum and alternative splicing events were also observed in both diploid and tetraploid cottons during fiber development process. In conclusion, this study provides important information on the evolution and function of WRKY gene family in cotton species.

  9. Identification and Expression Profiling of the BTB Domain-Containing Protein Gene Family in the Silkworm, Bombyx mori

    Directory of Open Access Journals (Sweden)

    Daojun Cheng

    2014-01-01

    Full Text Available The BTB domain is a conserved protein-protein interaction motif. In this study, we identified 56 BTB domain-containing protein genes in the silkworm, in addition to 46 in the honey bee, 55 in the red flour beetle, and 53 in the monarch butterfly. Silkworm BTB protein genes were classified into nine subfamilies according to their domain architecture, and most of them could be mapped on the different chromosomes. Phylogenetic analysis suggests that silkworm BTB protein genes may have undergone a duplication event in three subfamilies: BTB-BACK-Kelch, BTB-BACK-PHR, and BTB-FLYWCH. Comparative analysis demonstrated that the orthologs of each of 13 BTB protein genes present a rigorous orthologous relationship in the silkworm and other surveyed insects, indicating conserved functions of these genes during insect evolution. Furthermore, several silkworm BTB protein genes exhibited sex-specific expression in larval tissues or at different stages during metamorphosis. These findings not only contribute to a better understanding of the evolution of insect BTB protein gene families but also provide a basis for further investigation of the functions of BTB protein genes in the silkworm.

  10. Genome-wide identification and tissue-specific expression analysis of nucleotide binding site-leucine rich repeat gene family in Cicer arietinum (kabuli chickpea).

    Science.gov (United States)

    Sharma, Ranu; Rawat, Vimal; Suresh, C G

    2017-12-01

    The nucleotide binding site-leucine rich repeat (NBS-LRR) proteins play an important role in the defense mechanisms against pathogens. Using bioinformatics approach, we identified and annotated 104 NBS-LRR genes in chickpea. Phylogenetic analysis points to their diversification into two families namely TIR-NBS-LRR and non-TIR-NBS-LRR. Gene architecture revealed intron gain/loss events in this resistance gene family during their independent evolution into two families. Comparative genomics analysis elucidated its evolutionary relationship with other fabaceae species. Around 50% NBS-LRRs reside in macro-syntenic blocks underlining positional conservation along with sequence conservation of NBS-LRR genes in chickpea. Transcriptome sequencing data provided evidence for their transcription and tissue-specific expression. Four cis -regulatory elements namely WBOX, DRE, CBF, and GCC boxes, that commonly occur in resistance genes, were present in the promoter regions of these genes. Further, the findings will provide a strong background to use candidate disease resistance NBS-encoding genes and identify their specific roles in chickpea.

  11. Characterization of resistance gene analogues (RGAs) in apple (Malus × domestica Borkh.) and their evolutionary history of the Rosaceae family.

    Science.gov (United States)

    Perazzolli, Michele; Malacarne, Giulia; Baldo, Angela; Righetti, Laura; Bailey, Aubrey; Fontana, Paolo; Velasco, Riccardo; Malnoy, Mickael

    2014-01-01

    The family of resistance gene analogues (RGAs) with a nucleotide-binding site (NBS) domain accounts for the largest number of disease resistance genes and is one of the largest gene families in plants. We have identified 868 RGAs in the genome of the apple (Malus × domestica Borkh.) cultivar 'Golden Delicious'. This represents 1.51% of the total number of predicted genes for this cultivar. Several evolutionary features are pronounced in M. domestica, including a high fraction (80%) of RGAs occurring in clusters. This suggests frequent tandem duplication and ectopic translocation events. Of the identified RGAs, 56% are located preferentially on six chromosomes (Chr 2, 7, 8, 10, 11, and 15), and 25% are located on Chr 2. TIR-NBS and non-TIR-NBS classes of RGAs are primarily exclusive of different chromosomes, and 99% of non-TIR-NBS RGAs are located on Chr 11. A phylogenetic reconstruction was conducted to study the evolution of RGAs in the Rosaceae family. More than 1400 RGAs were identified in six species based on their NBS domain, and a neighbor-joining analysis was used to reconstruct the phylogenetic relationships among the protein sequences. Specific phylogenetic clades were found for RGAs of Malus, Fragaria, and Rosa, indicating genus-specific evolution of resistance genes. However, strikingly similar RGAs were shared in Malus, Pyrus, and Prunus, indicating high conservation of specific RGAs and suggesting a monophyletic origin of these three genera.

  12. Characterization of resistance gene analogues (RGAs in apple (Malus × domestica Borkh. and their evolutionary history of the Rosaceae family.

    Directory of Open Access Journals (Sweden)

    Michele Perazzolli

    Full Text Available The family of resistance gene analogues (RGAs with a nucleotide-binding site (NBS domain accounts for the largest number of disease resistance genes and is one of the largest gene families in plants. We have identified 868 RGAs in the genome of the apple (Malus × domestica Borkh. cultivar 'Golden Delicious'. This represents 1.51% of the total number of predicted genes for this cultivar. Several evolutionary features are pronounced in M. domestica, including a high fraction (80% of RGAs occurring in clusters. This suggests frequent tandem duplication and ectopic translocation events. Of the identified RGAs, 56% are located preferentially on six chromosomes (Chr 2, 7, 8, 10, 11, and 15, and 25% are located on Chr 2. TIR-NBS and non-TIR-NBS classes of RGAs are primarily exclusive of different chromosomes, and 99% of non-TIR-NBS RGAs are located on Chr 11. A phylogenetic reconstruction was conducted to study the evolution of RGAs in the Rosaceae family. More than 1400 RGAs were identified in six species based on their NBS domain, and a neighbor-joining analysis was used to reconstruct the phylogenetic relationships among the protein sequences. Specific phylogenetic clades were found for RGAs of Malus, Fragaria, and Rosa, indicating genus-specific evolution of resistance genes. However, strikingly similar RGAs were shared in Malus, Pyrus, and Prunus, indicating high conservation of specific RGAs and suggesting a monophyletic origin of these three genera.

  13. Characterization of Resistance Gene Analogues (RGAs) in Apple (Malus × domestica Borkh.) and Their Evolutionary History of the Rosaceae Family

    Science.gov (United States)

    Baldo, Angela; Righetti, Laura; Bailey, Aubrey; Fontana, Paolo; Velasco, Riccardo; Malnoy, Mickael

    2014-01-01

    The family of resistance gene analogues (RGAs) with a nucleotide-binding site (NBS) domain accounts for the largest number of disease resistance genes and is one of the largest gene families in plants. We have identified 868 RGAs in the genome of the apple (Malus × domestica Borkh.) cultivar ‘Golden Delicious’. This represents 1.51% of the total number of predicted genes for this cultivar. Several evolutionary features are pronounced in M. domestica, including a high fraction (80%) of RGAs occurring in clusters. This suggests frequent tandem duplication and ectopic translocation events. Of the identified RGAs, 56% are located preferentially on six chromosomes (Chr 2, 7, 8, 10, 11, and 15), and 25% are located on Chr 2. TIR-NBS and non-TIR-NBS classes of RGAs are primarily exclusive of different chromosomes, and 99% of non-TIR-NBS RGAs are located on Chr 11. A phylogenetic reconstruction was conducted to study the evolution of RGAs in the Rosaceae family. More than 1400 RGAs were identified in six species based on their NBS domain, and a neighbor-joining analysis was used to reconstruct the phylogenetic relationships among the protein sequences. Specific phylogenetic clades were found for RGAs of Malus, Fragaria, and Rosa, indicating genus-specific evolution of resistance genes. However, strikingly similar RGAs were shared in Malus, Pyrus, and Prunus, indicating high conservation of specific RGAs and suggesting a monophyletic origin of these three genera. PMID:24505246

  14. Genome-wide analysis and heavy metal-induced expression profiling of the HMA gene family in Populus trichocarpa

    Directory of Open Access Journals (Sweden)

    Dandan eLi

    2015-12-01

    Full Text Available The heavy metal ATPase (HMA family plays an important role in transition metal transport in plants. However, this gene family has not been extensively studied in Populus trichocarpa. We identified 17 HMA genes in P. trichocarpa (PtHMAs, of which PtHMA1–PtHMA4 belonged to the zinc (Zn/cobalt (Co/cadmium (Cd/lead (Pb subgroup, and PtHMA5–PtHMA8 were members of the copper (Cu/silver (Ag subgroup. Most of the genes were localized to chromosomes I and III. Gene structure, gene chromosomal location, and synteny analyses of PtHMAs indicated that tandem and segmental duplications likely contributed to the expansion and evolution of the PtHMAs. Most of the HMA genes contained abiotic stress-related cis-elements. Tissue-specific expression of PtHMA genes showed that PtHMA1 and PtHMA4 had relatively high expression levels in the leaves, whereas Cu/Ag subgroup (PtHMA5.1- PtHMA8 genes were upregulated in the roots. High concentrations of Cu, Ag, Zn, Cd, Co, Pb and Mn differentially regulated the expression of PtHMAs in various tissues. The preliminary results of the present study generated basic information on the HMA family of Populus that may serve as foundation for future functional studies.

  15. Evolutionary genomics of plant genes encoding N-terminal-TM-C2 domain proteins and the similar FAM62 genes and synaptotagmin genes of metazoans

    Directory of Open Access Journals (Sweden)

    Craxton Molly

    2007-07-01

    Full Text Available Abstract Background Synaptotagmin genes are found in animal genomes and are known to function in the nervous system. Genes with a similar domain architecture as well as sequence similarity to synaptotagmin C2 domains have also been found in plant genomes. The plant genes share an additional region of sequence similarity with a group of animal genes named FAM62. FAM62 genes also have a similar domain architecture. Little is known about the functions of the plant genes and animal FAM62 genes. Indeed, many members of the large and diverse Syt gene family await functional characterization. Understanding the evolutionary relationships among these genes will help to realize the full implications of functional studies and lead to improved genome annotation. Results I collected and compared plant Syt-like sequences from the primary nucleotide sequence databases at NCBI. The collection comprises six groups of plant genes conserved in embryophytes: NTMC2Type1 to NTMC2Type6. I collected and compared metazoan FAM62 sequences and identified some similar sequences from other eukaryotic lineages. I found evidence of RNA editing and alternative splicing. I compared the intron patterns of Syt genes. I also compared Rabphilin and Doc2 genes. Conclusion Genes encoding proteins with N-terminal-transmembrane-C2 domain architectures resembling synaptotagmins, are widespread in eukaryotes. A collection of these genes is presented here. The collection provides a resource for studies of intron evolution. I have classified the collection into homologous gene families according to distinctive patterns of sequence conservation and intron position. The evolutionary histories of these gene families are traceable through the appearance of family members in different eukaryotic lineages. Assuming an intron-rich eukaryotic ancestor, the conserved intron patterns distinctive of individual gene families, indicate independent origins of Syt, FAM62 and NTMC2 genes. Resemblances

  16. 100 million years of multigene family evolution: origin and evolution of the avian MHC class IIB

    Czech Academy of Sciences Publication Activity Database

    Goebel, J.; Promerová, Marta; Bonadonna, F.; McCoy, K. D.; Serbielle, C.; Strandh, M.; Yannic, G.; Burri, R.; Fumagalli, L.

    2017-01-01

    Roč. 18, č. 460 (2017), s. 1-9 ISSN 1471-2164 R&D Projects: GA ČR GAP505/10/1871 Institutional support: RVO:68081766 Keywords : Birds * Birth -death evolution * Concerted evolution * Gene duplication * Gene conversion * Major histocompatibility complex * Recombination Subject RIV: EG - Zoology OBOR OECD: Genetics and heredity (medical genetics to be 3) Impact factor: 3.729, year: 2016

  17. 100 million years of multigene family evolution: origin and evolution of the avian MHC class IIB

    Czech Academy of Sciences Publication Activity Database

    Goebel, J.; Promerová, Marta; Bonadonna, F.; McCoy, K. D.; Serbielle, C.; Strandh, M.; Yannic, G.; Burri, R.; Fumagalli, L.

    2017-01-01

    Roč. 18, č. 460 (2017), s. 1-9 ISSN 1471-2164 R&D Projects: GA ČR GAP505/10/1871 Institutional support: RVO:68081766 Keywords : Birds * Birth-death evolution * Concerted evolution * Gene duplication * Gene conversion * Major histocompatibility complex * Recombination Subject RIV: EG - Zoology OBOR OECD: Genetics and heredity (medical genetics to be 3) Impact factor: 3.729, year: 2016

  18. Characterization of Small HSPs from Anemonia viridis Reveals Insights into Molecular Evolution of Alpha Crystallin Genes among Cnidarians

    OpenAIRE

    Nicosia, Aldo; Maggio, Teresa; Mazzola, Salvatore; Gianguzza, Fabrizio; Cuttitta, Angela; Costa, Salvatore

    2014-01-01

    Gene family encoding small Heat-Shock Proteins (sHSPs containing α-crystallin domain) are found both in prokaryotic and eukaryotic organisms; however, there is limited knowledge of their evolution. In this study, two small HSP genes termed AvHSP28.6 and AvHSP27, both organized in one intron and two exons, were characterised in the Mediterranean snakelocks anemone Anemonia viridis. The release of the genome sequence of Hydra magnipapillata and Nematostella vectensis enabled a comprehensive stu...

  19. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution.

    Science.gov (United States)

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-06

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening.

  20. Genome-Wide Analysis, Classification, Evolution, and Expression Analysis of the Cytochrome P450 93 Family in Land Plants.

    Directory of Open Access Journals (Sweden)

    Hai Du

    Full Text Available Cytochrome P450 93 family (CYP93 belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies-CYP93A-K, with the last two being identified first. CYP93A is the ancestor that was derived in flowering plants, and the remaining showed lineage-specific distribution-CYP93B and CYP93C are present in dicots; CYP93F is distributed only in Poaceae; CYP93G and CYP93J are monocot-specific; CYP93E is unique to legumes; CYP93H and CYP93K are only found in Aquilegia coerulea, and CYP93D is Brassicaceae-specific. Each subfamily generally has conserved gene numbers, structures, and characteristics, indicating functional conservation during evolution. Synonymous nucleotide substitution (dN/dS analysis showed that CYP93 genes are under strong negative selection. Comparative expression analyses of CYP93 genes in dicots and monocots revealed that they are preferentially expressed in the roots and tend to be induced by biotic and/or abiotic stresses, in accordance with their well-known functions in plant secondary biosynthesis.

  1. Genome-wide analysis of the GRAS gene family in physic nut (Jatropha curcas L.).

    Science.gov (United States)

    Wu, Z Y; Wu, P Z; Chen, Y P; Li, M R; Wu, G J; Jiang, H W

    2015-12-29

    GRAS proteins play vital roles in plant growth and development. Physic nut (Jatropha curcas L.) was found to have a total of 48 GRAS family members (JcGRAS), 15 more than those found in Arabidopsis. The JcGRAS genes were divided into 12 subfamilies or 15 ancient monophyletic lineages based on the phylogenetic analysis of GRAS proteins from both flowering and lower plants. The functions of GRAS genes in 9 subfamilies have been reported previously for several plants, while the genes in the remaining 3 subfamilies were of unknown function; we named the latter families U1 to U3. No member of U3 subfamily is present in Arabidopsis and Poaceae species according to public genome sequence data. In comparison with the number of GRAS genes in Arabidopsis, more were detected in physic nut, resulting from the retention of many ancient GRAS subfamilies and the formation of tandem repeats during evolution. No evidence of recent duplication among JcGRAS genes was observed in physic nut. Based on digital gene expression data, 21 of the 48 genes exhibited differential expression in four tissues analyzed. Two members of subfamily U3 were expressed only in buds and flowers, implying that they may play specific roles. Our results provide valuable resources for future studies on the functions of GRAS proteins in physic nut.

  2. Insights into the evolution and diversification of the AT-hook Motif Nuclear Localized gene family in land plants.

    Science.gov (United States)

    Zhao, Jianfei; Favero, David S; Qiu, Jiwen; Roalson, Eric H; Neff, Michael M

    2014-10-14

    Members of the ancient land-plant-specific transcription factor AT-Hook Motif Nuclear Localized (AHL) gene family regulate various biological processes. However, the relationships among the AHL genes, as well as their evolutionary history, still remain unexplored. We analyzed over 500 AHL genes from 19 land plant species, ranging from the early diverging Physcomitrella patens and Selaginella to a variety of monocot and dicot flowering plants. We classified the AHL proteins into three types (Type-I/-II/-III) based on the number and composition of their functional domains, the AT-hook motif(s) and PPC domain. We further inferred their phylogenies via Bayesian inference analysis and predicted gene gain/loss events throughout their diversification. Our analyses suggested that the AHL gene family emerged in embryophytes and further evolved into two distinct clades, with Type-I AHLs forming one clade (Clade-A), and the other two types together diversifying in another (Clade-B). The two AHL clades likely diverged before the separation of Physcomitrella patens from the vascular plant lineage. In angiosperms, Clade-A AHLs expanded into 5 subfamilies; while, the ones in Clade-B expanded into 4 subfamilies. Examination of their expression patterns suggests that the AHLs within each clade share similar expression patterns with each other; however, AHLs in one monophyletic clade exhibit distinct expression patterns from the ones in the other clade. Over-expression of a Glycine max AHL PPC domain in Arabidopsis thaliana recapitulates the phenotype observed when over-expressing its Arabidopsis thaliana counterpart. This result suggests that the AHL genes from different land plant species may share conserved functions in regulating plant growth and development. Our study further suggests that such functional conservation may be due to conserved physical interactions among the PPC domains of AHL proteins. Our analyses reveal a possible evolutionary scenario for the AHL gene family

  3. Characterization and Comparison of the CPK Gene Family in the Apple (Malus × domestica and Other Rosaceae Species and Its Response to Alternaria alternata Infection.

    Directory of Open Access Journals (Sweden)

    Menghan Wei

    Full Text Available As one of the Ca2+ sensors, calcium-dependent protein kinase (CPK plays vital roles in immune and stress signaling, growth and development, and hormone responses, etc. Recently, the whole genome of apple (Malus × domestica, pear (Pyrus communis, peach (Prunus persica, plum (Prunus mume and strawberry (Fragaria vesca in Rosaceae family has been fully sequenced. However, little is known about the CPK gene family in these Rosaceae species. In this study, 123 CPK genes were identified from five Rosaceae species, including 37 apple CPKs, 37 pear CPKs, 17 peach CPKs, 16 strawberry CPKs, and 16 plum CPKs. Based on the phylogenetic tree topology and structural characteristics, we divided the CPK gene family into 4 distinct subfamilies: Group I, II, III, and IV. Whole-genome duplication (WGD or segmental duplication played vital roles in the expansion of the CPK in these Rosaceae species. Most of segmental duplication pairs in peach and plum may have arisen from the γ triplication (~140 million years ago [MYA], while in apple genome, many duplicated genes may have been derived from a recent WGD (30~45 MYA. Purifying selection also played a critical role in the function evolution of CPK family genes. Expression of apple CPK genes in response to apple pathotype of Alternaria alternata was verified by analysis of quantitative real-time RT-PCR (qPCR. Expression data demonstrated that CPK genes in apple might have evolved independently in different biological contexts. The analysis of evolution history and expression profile laid a foundation for further examining the function and complexity of the CPK gene family in Rosaceae.

  4. The Evolution of the FT/TFL1 Genes in Amaranthaceae and Their Expression Patterns in the Course of Vegetative Growth and Flowering in Chenopodium rubrum

    Czech Academy of Sciences Publication Activity Database

    Drabešová, Jana; Černá, Lucie; Mašterová, Helena; Koloušková, Pavla; Potocký, Martin; Štorchová, Helena

    2016-01-01

    Roč. 6, č. 10 (2016), s. 3065-3076 ISSN 2160-1836 R&D Projects: GA ČR(CZ) GAP506/12/1359; GA ČR GA13-02290S Institutional support: RVO:61389030 Keywords : rna-seq data * locus-t * ft homologs * functional evolution * floral initiation * reference genome * arabidopsis * protein * quantification * activation * transcriptome * flowering locus t * TERMINAL FLOWER1 gene family * evolution * flowering * gene rearrangement * Amaranthaceae * Chenopodium rubrum Subject RIV: EF - Botanics Impact factor: 2.861, year: 2016

  5. Genome-wide identification and analysis of the SBP-box family genes in apple (Malus × domestica Borkh.).

    Science.gov (United States)

    Li, Jun; Hou, Hongmin; Li, Xiaoqin; Xiang, Jiang; Yin, Xiangjing; Gao, Hua; Zheng, Yi; Bassett, Carole L; Wang, Xiping

    2013-09-01

    SQUAMOSA promoter binding protein (SBP)-box genes encode a family of plant-specific transcription factors and play many crucial roles in plant development. In this study, 27 SBP-box gene family members were identified in the apple (Malus × domestica Borkh.) genome, 15 of which were suggested to be putative targets of MdmiR156. Plant SBPs were classified into eight groups according to the phylogenetic analysis of SBP-domain proteins. Gene structure, gene chromosomal location and synteny analyses of MdSBP genes within the apple genome demonstrated that tandem and segmental duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of the SBP-box gene family in apple. Additionally, synteny analysis between apple and Arabidopsis indicated that several paired homologs of MdSBP and AtSPL genes were located in syntenic genomic regions. Tissue-specific expression analysis of MdSBP genes in apple demonstrated their diversified spatiotemporal expression patterns. Most MdmiR156-targeted MdSBP genes, which had relatively high transcript levels in stems, leaves, apical buds and some floral organs, exhibited a more differential expression pattern than most MdmiR156-nontargeted MdSBP genes. Finally, expression analysis of MdSBP genes in leaves upon various plant hormone treatments showed that many MdSBP genes were responsive to different plant hormones, indicating that MdSBP genes may be involved in responses to hormone signaling during stress or in apple development. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  6. Parallel evolution of tetrodotoxin resistance in three voltage-gated sodium channel genes in the garter snake Thamnophis sirtalis.

    Science.gov (United States)

    McGlothlin, Joel W; Chuckalovcak, John P; Janes, Daniel E; Edwards, Scott V; Feldman, Chris R; Brodie, Edmund D; Pfrender, Michael E; Brodie, Edmund D

    2014-11-01

    Members of a gene family expressed in a single species often experience common selection pressures. Consequently, the molecular basis of complex adaptations may be expected to involve parallel evolutionary changes in multiple paralogs. Here, we use bacterial artificial chromosome library scans to investigate the evolution of the voltage-gated sodium channel (Nav) family in the garter snake Thamnophis sirtalis, a predator of highly toxic Taricha newts. Newts possess tetrodotoxin (TTX), which blocks Nav's, arresting action potentials in nerves and muscle. Some Thamnophis populations have evolved resistance to extremely high levels of TTX. Previous work has identified amino acid sites in the skeletal muscle sodium channel Nav1.4 that confer resistance to TTX and vary across populations. We identify parallel evolution of TTX resistance in two additional Nav paralogs, Nav1.6 and 1.7, which are known to be expressed in the peripheral nervous system and should thus be exposed to ingested TTX. Each paralog contains at least one TTX-resistant substitution identical to a substitution previously identified in Nav1.4. These sites are fixed across populations, suggesting that the resistant peripheral nerves antedate resistant muscle. In contrast, three sodium channels expressed solely in the central nervous system (Nav1.1-1.3) showed no evidence of TTX resistance, consistent with protection from toxins by the blood-brain barrier. We also report the exon-intron structure of six Nav paralogs, the first such analysis for snake genes. Our results demonstrate that the molecular basis of adaptation may be both repeatable across members of a gene family and predictable based on functional considerations. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Genome-Wide Analysis Suggests the Relaxed Purifying Selection Affect the Evolution of WOX Genes in Pyrus bretschneideri, Prunus persica, Prunus mume, and Fragaria vesca

    Directory of Open Access Journals (Sweden)

    Yunpeng Cao

    2017-06-01

    Full Text Available WUSCHEL-related homeobox (WOX family is one of the largest group of transcription factors (TFs specifically found in plant kingdom. WOX TFs play an important role in plant development processes and evolutionary novelties. Although the roles of WOXs in Arabidopsis and rice have been well-studied, however, little are known about the relationships among the main clades in the molecular evolution of these genes in Rosaceae. Here, we carried out a genome-wide analysis and identified 14, 10, 10, and 9 of WOX genes from four Rosaceae species (Fragaria vesca, Prunus persica, Prunus mume, and Pyrus bretschneideri, respectively. According to evolutionary analysis, as well as amino acid sequences of their homodomains, these genes were divided into three clades with nine subgroups. Furthermore, due to the conserved structural patterns among these WOX genes, it was proposed that there should exist some highly conserved regions of microsynteny in the four Rosaceae species. Moreover, most of WOX gene pairs were presented with the conserved orientation among syntenic genome regions. In addition, according to substitution models analysis using PMAL software, no significant positive selection was detected, but type I functional divergence was identified among certain amino acids in WOX protein. These results revealed that the relaxed purifying selection might be the main driving force during the evolution of WOX genes in the tested Rosaceae species. Our result will be useful for further precise research on evolution of the WOX genes in family Rosaceae.

  8. Genome-wide characterization of phenylalanine ammonia-lyase gene family in watermelon (Citrullus lanatus).

    Science.gov (United States)

    Dong, Chun-Juan; Shang, Qing-Mao

    2013-07-01

    Phenylalanine ammonia-lyase (PAL), the first enzyme in the phenylpropanoid pathway, plays a critical role in plant growth, development, and adaptation. PAL enzymes are encoded by a gene family in plants. Here, we report a genome-wide search for PAL genes in watermelon. A total of 12 PAL genes, designated ClPAL1-12, are identified . Nine are arranged in tandem in two duplication blocks located on chromosomes 4 and 7, and the other three ClPAL genes are distributed as single copies on chromosomes 2, 3, and 8. Both the cDNA and protein sequences of ClPALs share an overall high identity with each other. A phylogenetic analysis places 11 of the ClPALs into a separate cucurbit subclade, whereas ClPAL2, which belongs to neither monocots nor dicots, may serve as an ancestral PAL in plants. In the cucurbit subclade, seven ClPALs form homologous pairs with their counterparts from cucumber. Expression profiling reveals that 11 of the ClPAL genes are expressed and show preferential expression in the stems and male and female flowers. Six of the 12 ClPALs are moderately or strongly expressed in the fruits, particularly in the pulp, suggesting the potential roles of PAL in the development of fruit color and flavor. A promoter motif analysis of the ClPAL genes implies redundant but distinctive cis-regulatory structures for stress responsiveness. Finally, duplication events during the evolution and expansion of the ClPAL gene family are discussed, and the relationships between the ClPAL genes and their cucumber orthologs are estimated.

  9. Organization and evolution of the rat tyrosine hydroxylase gene

    International Nuclear Information System (INIS)

    Brown, E.R.; Coker, G.T. III; O'Malley, K.L.

    1987-01-01

    This report describes the organization of the rat tyrosine hydroxylase (TH) gene and compares its structure with the human phenylalanine hydroxylase gene. Both genes are single copy and contain 13 exons separated by 12 introns. Remarkably, the positions of 10 out 12 intron/exon boundaries are identical for the two genes. These results support the idea that these hydroxylases genes are members of a gene family which has a common evolutionary origin. The authors predict that this ancestral gene would have encoded exons similar to those of TH prior to evolutionary drift to other members of this gene family

  10. Molecular evolution of the odorant and gustatory receptor genes in lepidopteran insects: implications for their adaptation and speciation.

    Science.gov (United States)

    Engsontia, Patamarerk; Sangket, Unitsa; Chotigeat, Wilaiwan; Satasook, Chutamas

    2014-08-01

    Lepidoptera (comprised of butterflies and moths) is one of the largest groups of insects, including more than 160,000 described species. Chemoreception plays important roles in the adaptation of these species to a wide range of niches, e.g., plant hosts, egg-laying sites, and mates. This study investigated the molecular evolution of the lepidopteran odorant (Or) and gustatory receptor (Gr) genes using recently identified genes from Bombyx mori, Danaus plexippus, Heliconius melpomene, Plutella xylostella, Heliothis virescens, Manduca sexta, Cydia pomonella, and Spodoptera littoralis. A limited number of cases of large lineage-specific gene expansion are observed (except in the P. xylostella lineage), possibly due to selection against tandem gene duplication. There has been strong purifying selection during the evolution of both lepidopteran odorant and gustatory genes, as shown by the low ω values estimated through CodeML analysis, ranging from 0.0093 to 0.3926. However, purifying selection has been relaxed on some amino acid sites in these receptors, leading to sequence divergence, which is a precursor of positive selection on these sequences. Signatures of positive selection were detected only in a few loci from the lineage-specific analysis. Estimation of gene gains and losses suggests that the common ancestor of the Lepidoptera had fewer Or genes compared to extant species and an even more reduced number of Gr genes, particularly within the bitter receptor clade. Multiple gene gains and a few gene losses occurred during the evolution of Lepidoptera. Gene family expansion may be associated with the adaptation of lepidopteran species to plant hosts, especially after angiosperm radiation. Phylogenetic analysis of the moth sex pheromone receptor genes suggested that chromosomal translocations have occurred several times. New sex pheromone receptors have arisen through tandem gene duplication. Positive selection was detected at some amino acid sites predicted to be

  11. CRDB: database of chemosensory receptor gene families in vertebrate.

    Directory of Open Access Journals (Sweden)

    Dong Dong

    Full Text Available Chemosensory receptors (CR are crucial for animals to sense the environmental changes and survive on earth. The emergence of whole-genome sequences provides us an opportunity to identify the entire CR gene repertoires. To completely gain more insight into the evolution of CR genes in vertebrates, we identified the nearly all CR genes in 25 vertebrates using homology-based approaches. Among these CR gene repertoires, nearly half of them were identified for the first time in those previously uncharacterized species, such as the guinea pig, giant panda and elephant, etc. Consistent with previous findings, we found that the numbers of CR genes vary extensively among different species, suggesting an extreme form of 'birth-and-death' evolution. For the purpose of facilitating CR gene analysis, we constructed a database with the goals to provide a resource for CR genes annotation and a web tool for exploring their evolutionary patterns. Besides a search engine for the gene extraction from a specific chromosome region, an easy-to-use phylogenetic analysis tool was also provided to facilitate online phylogeny study of CR genes. Our work can provide a rigorous platform for further study on the evolution of CR genes in vertebrates.

  12. The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system

    OpenAIRE

    Vonk, Freek J.; Casewell, Nicholas R.; Henkel, Christiaan V.; Heimberg, Alysha M.; Jansen, Hans J.; McCleary, Ryan J. R.; Kerkkamp, Harald M. E.; Vos, Rutger A.; Guerreiro, Isabel; Calvete, Juan J.; Wüster, Wolfgang; Woods, Anthony E.; Logan, Jessica M.; Harrison, Robert A.; Castoe, Todd A.

    2013-01-01

    Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from ...

  13. Interferon induced IFIT family genes in host antiviral defense.

    Science.gov (United States)

    Zhou, Xiang; Michal, Jennifer J; Zhang, Lifan; Ding, Bo; Lunney, Joan K; Liu, Bang; Jiang, Zhihua

    2013-01-01

    Secretion of interferons (IFNs) from virus-infected cells is a hallmark of host antiviral immunity and in fact, IFNs exert their antiviral activities through the induction of antiviral proteins. The IFN-induced protein with tetratricopeptide repeats (IFITs) family is among hundreds of IFN-stimulated genes. This family contains a cluster of duplicated loci. Most mammals have IFIT1, IFIT2, IFIT3 and IFIT5; however, bird, marsupial, frog and fish have only IFIT5. Regardless of species, IFIT5 is always adjacent to SLC16A12. IFIT family genes are predominantly induced by type I and type III interferons and are regulated by the pattern recognition and the JAK-STAT signaling pathway. IFIT family proteins are involved in many processes in response to viral infection. However, some viruses can escape the antiviral functions of the IFIT family by suppressing IFIT family genes expression or methylation of 5' cap of viral molecules. In addition, the variants of IFIT family genes could significantly influence the outcome of hepatitis C virus (HCV) therapy. We believe that our current review provides a comprehensive picture for the community to understand the structure and function of IFIT family genes in response to pathogens in human, as well as in animals.

  14. Characterization of small HSPs from Anemonia viridis reveals insights into molecular evolution of alpha crystallin genes among cnidarians.

    Directory of Open Access Journals (Sweden)

    Aldo Nicosia

    Full Text Available Gene family encoding small Heat-Shock Proteins (sHSPs containing α-crystallin domain are found both in prokaryotic and eukaryotic organisms; however, there is limited knowledge of their evolution. In this study, two small HSP genes termed AvHSP28.6 and AvHSP27, both organized in one intron and two exons, were characterised in the Mediterranean snakelocks anemone Anemonia viridis. The release of the genome sequence of Hydra magnipapillata and Nematostella vectensis enabled a comprehensive study of the molecular evolution of α-crystallin gene family among cnidarians. Most of the H. magnipapillata sHSP genes share the same gene organization described for AvHSP28.6 and AvHSP27, differing from the sHSP genes of N. vectensis which mainly show an intronless architecture. The different genomic organization of sHSPs, the phylogenetic analyses based on protein sequences, and the relationships among Cnidarians, suggest that the A.viridis sHSPs represent the common ancestor from which H. magnipapillata genes directly evolved through segmental genome duplication. Additionally retroposition events may be considered responsible for the divergence of sHSP genes of N. vectensis from A. viridis. Analyses of transcriptional expression profile showed that AvHSP28.6 was constitutively expressed among different tissues from both ectodermal and endodermal layers of the adult sea anemones, under normal physiological conditions and also under different stress condition. Specifically, we profiled the transcriptional activation of AvHSP28.6 after challenges with different abiotic/biotic stresses showing induction by extreme temperatures, heavy metals exposure and immune stimulation. Conversely, no AvHSP27 transcript was detected in such dissected tissues, in adult whole body cDNA library or under stress conditions. Hence, the involvement of AvHSP28.6 gene in the sea anemone defensome is strongly suggested.

  15. Characterization of small HSPs from Anemonia viridis reveals insights into molecular evolution of alpha crystallin genes among cnidarians.

    Science.gov (United States)

    Nicosia, Aldo; Maggio, Teresa; Mazzola, Salvatore; Gianguzza, Fabrizio; Cuttitta, Angela; Costa, Salvatore

    2014-01-01

    Gene family encoding small Heat-Shock Proteins (sHSPs containing α-crystallin domain) are found both in prokaryotic and eukaryotic organisms; however, there is limited knowledge of their evolution. In this study, two small HSP genes termed AvHSP28.6 and AvHSP27, both organized in one intron and two exons, were characterised in the Mediterranean snakelocks anemone Anemonia viridis. The release of the genome sequence of Hydra magnipapillata and Nematostella vectensis enabled a comprehensive study of the molecular evolution of α-crystallin gene family among cnidarians. Most of the H. magnipapillata sHSP genes share the same gene organization described for AvHSP28.6 and AvHSP27, differing from the sHSP genes of N. vectensis which mainly show an intronless architecture. The different genomic organization of sHSPs, the phylogenetic analyses based on protein sequences, and the relationships among Cnidarians, suggest that the A.viridis sHSPs represent the common ancestor from which H. magnipapillata genes directly evolved through segmental genome duplication. Additionally retroposition events may be considered responsible for the divergence of sHSP genes of N. vectensis from A. viridis. Analyses of transcriptional expression profile showed that AvHSP28.6 was constitutively expressed among different tissues from both ectodermal and endodermal layers of the adult sea anemones, under normal physiological conditions and also under different stress condition. Specifically, we profiled the transcriptional activation of AvHSP28.6 after challenges with different abiotic/biotic stresses showing induction by extreme temperatures, heavy metals exposure and immune stimulation. Conversely, no AvHSP27 transcript was detected in such dissected tissues, in adult whole body cDNA library or under stress conditions. Hence, the involvement of AvHSP28.6 gene in the sea anemone defensome is strongly suggested.

  16. Genomic assessment of the evolution of the prion protein gene family in vertebrates.

    Science.gov (United States)

    Harrison, Paul M; Khachane, Amit; Kumar, Manish

    2010-05-01

    Prion diseases are devastating neurological disorders caused by the propagation of particles containing an alternative beta-sheet-rich form of the prion protein (PrP). Genes paralogous to PrP, called Doppel and Shadoo, have been identified, that also have neuropathological relevance. To aid in the further functional characterization of PrP and its relatives, we annotated completely the PrP gene family (PrP-GF), in the genomes of 42 vertebrates, through combined strategic application of gene prediction programs and advanced remote homology detection techniques (such as HMMs, PSI-TBLASTN and pGenThreader). We have uncovered several previously undescribed paralogous genes and pseudogenes. We find that current high-quality genomic evidence indicates that the PrP relative Doppel, was likely present in the last common ancestor of present-day Tetrapoda, but was lost in the bird lineage, since its divergence from reptiles. Using the new gene annotations, we have defined the consensus of structural features that are characteristic of the PrP and Doppel structures, across diverse Tetrapoda clades. Furthermore, we describe in detail a transcribed pseudogene derived from Shadoo that is conserved across primates, and that overlaps the meiosis gene, SYCE1, thus possibly regulating its expression. In addition, we analysed the locus of PRNP/PRND for significant conservation across the genomic DNA of eleven mammals, and determined the phylogenetic penetration of non-coding exons. The genomic evidence indicates that the second PRNP non-coding exon found in even-toed ungulates and rodents, is conserved in all high-coverage genome assemblies of primates (human, chimp, orang utan and macaque), and is, at least, likely to have fallen out of use during primate speciation. Furthermore, we have demonstrated that the PRNT gene (at the PRNP human locus) is conserved across at least sixteen mammals, and evolves like a long non-coding RNA, fashioned from fragments of ancient, long

  17. Rapid evolution of chemosensory receptor genes in a pair of sibling species of orchid bees (Apidae: Euglossini).

    Science.gov (United States)

    Brand, Philipp; Ramírez, Santiago R; Leese, Florian; Quezada-Euan, J Javier G; Tollrian, Ralph; Eltz, Thomas

    2015-08-28

    Insects rely more on chemical signals (semiochemicals) than on any other sensory modality to find, identify, and choose mates. In most insects, pheromone production is typically regulated through biosynthetic pathways, whereas pheromone sensory detection is controlled by the olfactory system. Orchid bees are exceptional in that their semiochemicals are not produced metabolically, but instead male bees collect odoriferous compounds (perfumes) from the environment and store them in specialized hind-leg pockets to subsequently expose during courtship display. Thus, the olfactory sensory system of orchid bees simultaneously controls male perfume traits (sender components) and female preferences (receiver components). This functional linkage increases the opportunities for parallel evolution of male traits and female preferences, particularly in response to genetic changes of chemosensory detection (e.g. Odorant Receptor genes). To identify whether shifts in pheromone composition among related lineages of orchid bees are associated with divergence in chemosensory genes of the olfactory periphery, we searched for patterns of divergent selection across the antennal transcriptomes of two recently diverged sibling species Euglossa dilemma and E. viridissima. We identified 3185 orthologous genes including 94 chemosensory loci from five different gene families (Odorant Receptors, Ionotropic Receptors, Gustatory Receptors, Odorant Binding Proteins, and Chemosensory Proteins). Our results revealed that orthologs with signatures of divergent selection between E. dilemma and E. viridissima were significantly enriched for chemosensory genes. Notably, elevated signals of divergent selection were almost exclusively observed among chemosensory receptors (i.e. Odorant Receptors). Our results suggest that rapid changes in the chemosensory gene family occurred among closely related species of orchid bees. These findings are consistent with the hypothesis that strong divergent selection

  18. Gene structure, transcripts and calciotropic effects of the PTH family of peptides in Xenopus and chicken

    Directory of Open Access Journals (Sweden)

    Power Deborah M

    2010-12-01

    Full Text Available Abstract Background Parathyroid hormone (PTH and PTH-related peptide (PTHrP belong to a family of endocrine factors that share a highly conserved N-terminal region (amino acids 1-34 and play key roles in calcium homeostasis, bone formation and skeletal development. Recently, PTH-like peptide (PTH-L was identified in teleost fish raising questions about the evolution of these proteins. Although PTH and PTHrP have been intensively studied in mammals their function in other vertebrates is poorly documented. Amphibians and birds occupy unique phylogenetic positions, the former at the transition of aquatic to terrestrial life and the latter at the transition to homeothermy. Moreover, both organisms have characteristics indicative of a complex system in calcium regulation. This study investigated PTH family evolution in vertebrates with special emphasis on Xenopus and chicken. Results The PTH-L gene is present throughout the vertebrates with the exception of placental mammals. Gene structure of PTH and PTH-L seems to be conserved in vertebrates while PTHrP gene structure is divergent and has acquired new exons and alternative promoters. Splice variants of PTHrP and PTH-L are common in Xenopus and chicken and transcripts of the former have a widespread tissue distribution, although PTH-L is more restricted. PTH is widely expressed in fish tissue but from Xenopus to mammals becomes largely restricted to the parathyroid gland. The N-terminal (1-34 region of PTH, PTHrP and PTH-L in Xenopus and chicken share high sequence conservation and the capacity to modify calcium fluxes across epithelia suggesting a conserved role in calcium metabolism possibly via similar receptors. Conclusions The parathyroid hormone family contains 3 principal members, PTH, PTHrP and the recently identified PTH-L. In teleosts there are 5 genes which encode PTHrP (2, PTH (2 and PTH-L and in tetrapods there are 3 genes (PTHrP, PTH and PTH-L, the exception is placental mammals which

  19. Microevolution of Virulence-Related Genes in Helicobacter pylori Familial Infection.

    Directory of Open Access Journals (Sweden)

    Yoshikazu Furuta

    Full Text Available Helicobacter pylori, a bacterial pathogen that can infect human stomach causing gastritis, ulcers and cancer, is known to have a high degree of genome/epigenome diversity as the result of mutation and recombination. The bacteria often infect in childhood and persist for the life of the host. One of the reasons of the rapid evolution of H. pylori is that it changes its genome drastically for adaptation to a new host. To investigate microevolution and adaptation of the H. pylori genome, we undertook whole genome sequencing of the same or very similar sequence type in multi-locus sequence typing (MLST with seven genes in members of the same family consisting of parents and children in Japan. Detection of nucleotide substitutions revealed likely transmission pathways involving children. Nonsynonymous (amino acid changing mutations were found in virulence-related genes (cag genes, vacA, hcpDX, tnfα, ggt, htrA and the collagenase gene, outer membrane protein (OMP genes and other cell surface-related protein genes, signal transduction genes and restriction-modification genes. We reconstructed various pathways by which H. pylori can adapt to a new human host, and our results raised the possibility that the mutational changes in virulence-related genes have a role in adaptation to a child host. Changes in restriction-modification genes might remodel the methylome and transcriptome to help adaptation. This study has provided insights into H. pylori transmission and virulence and has implications for basic research as well as clinical practice.

  20. Conditions for the Evolution of Gene Clusters in Bacterial Genomes

    Science.gov (United States)

    Ballouz, Sara; Francis, Andrew R.; Lan, Ruiting; Tanaka, Mark M.

    2010-01-01

    Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed to account for the evolution of these structures. For instance, natural selection may directly favour gene clusters through a variety of mechanisms, such as increased efficiency of coregulation. An alternative and controversial hypothesis is the selfish operon model, which asserts that clustered arrangements of genes are more easily transferred to other species, thus improving the prospects for survival of the cluster. According to another hypothesis (the persistence model), genes that are in close proximity are less likely to be disrupted by deletions. Here we develop computational models to study the conditions under which gene clusters can evolve and persist. First, we examine the selfish operon model by re-implementing the simulation and running it under a wide range of conditions. Second, we introduce and study a Moran process in which there is natural selection for gene clustering and rearrangement occurs by genome inversion events. Finally, we develop and study a model that includes selection and inversion, which tracks the occurrence and fixation of rearrangements. Surprisingly, gene clusters fail to evolve under a wide range of conditions. Factors that promote the evolution of gene clusters include a low number of genes in the pathway, a high population size, and in the case of the selfish operon model, a high horizontal transfer rate. The computational analysis here has shown that the evolution of gene clusters can occur under both direct and indirect selection as long as certain conditions hold. Under these conditions the selfish operon model is still viable as an explanation for the evolution of gene clusters. PMID:20168992

  1. The low-recombining pericentromeric region of barley restricts gene diversity and evolution but not gene expression

    Science.gov (United States)

    Baker, Katie; Bayer, Micha; Cook, Nicola; Dreißig, Steven; Dhillon, Taniya; Russell, Joanne; Hedley, Pete E; Morris, Jenny; Ramsay, Luke; Colas, Isabelle; Waugh, Robbie; Steffenson, Brian; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J

    2014-01-01

    The low-recombining pericentromeric region of the barley genome contains roughly a quarter of the genes of the species, embedded in low-recombining DNA that is rich in repeats and repressive chromatin signatures. We have investigated the effects of pericentromeric region residency upon the expression, diversity and evolution of these genes. We observe no significant difference in average transcript level or developmental RNA specificity between the barley pericentromeric region and the rest of the genome. In contrast, all of the evolutionary parameters studied here show evidence of compromised gene evolution in this region. First, genes within the pericentromeric region of wild barley show reduced diversity and significantly weakened purifying selection compared with the rest of the genome. Second, gene duplicates (ohnolog pairs) derived from the cereal whole-genome duplication event ca. 60MYa have been completely eliminated from the barley pericentromeric region. Third, local gene duplication in the pericentromeric region is reduced by 29% relative to the rest of the genome. Thus, the pericentromeric region of barley is a permissive environment for gene expression but has restricted gene evolution in a sizeable fraction of barley's genes. PMID:24947331

  2. Existence families, functional calculi and evolution equations

    CERN Document Server

    deLaubenfels, Ralph

    1994-01-01

    This book presents an operator-theoretic approach to ill-posed evolution equations. It presents the basic theory, and the more surprising examples, of generalizations of strongly continuous semigroups known as 'existent families' and 'regularized semigroups'. These families of operators may be used either to produce all initial data for which a solution in the original space exists, or to construct a maximal subspace on which the problem is well-posed. Regularized semigroups are also used to construct functional, or operational, calculi for unbounded operators. The book takes an intuitive and constructive approach by emphasizing the interaction between functional calculus constructions and evolution equations. One thinks of a semigroup generated by A as etA and thinks of a regularized semigroup generated by A as etA g(A), producing solutions of the abstract Cauchy problem for initial data in the image of g(A). Material that is scattered throughout numerous papers is brought together and presented in a fresh, ...

  3. Structure of Mycobacterium tuberculosis Rv2714, a representative of a duplicated gene family in Actinobacteria

    International Nuclear Information System (INIS)

    Graña, Martin; Bellinzoni, Marco; Miras, Isabelle; Fiez-Vandal, Cedric; Haouz, Ahmed; Shepard, William; Buschiazzo, Alejandro; Alzari, Pedro M.

    2009-01-01

    The crystal structure of Rv2714, a protein of unknown function from M. tuberculosis, has been determined at 2.6 Å resolution using single-wavelength anomalous diffraction methods. The gene Rv2714 from Mycobacterium tuberculosis, which codes for a hypothetical protein of unknown function, is a representative member of a gene family that is largely confined to the order Actinomycetales of Actinobacteria. Sequence analysis indicates the presence of two paralogous genes in most mycobacterial genomes and suggests that gene duplication was an ancient event in bacterial evolution. The crystal structure of Rv2714 has been determined at 2.6 Å resolution, revealing a trimer in which the topology of the protomer core is similar to that observed in a functionally diverse set of enzymes, including purine nucleoside phosphorylases, some carboxypeptidases, bacterial peptidyl-tRNA hydrolases and even the plastidic form of an intron splicing factor. However, some structural elements, such as a β-hairpin insertion involved in protein oligomerization and a C-terminal α-helical domain that serves as a lid to the putative substrate-binding (or ligand-binding) site, are only found in Rv2714 bacterial homologues and represent specific signatures of this protein family

  4. Structure of Mycobacterium tuberculosis Rv2714, a representative of a duplicated gene family in Actinobacteria

    Energy Technology Data Exchange (ETDEWEB)

    Graña, Martin; Bellinzoni, Marco [Institut Pasteur, Unité de Biochimie Structurale, URA CNRS 2185, 25 Rue du Dr Roux, 75724 Paris (France); Miras, Isabelle; Fiez-Vandal, Cedric; Haouz, Ahmed; Shepard, William [Institut Pasteur, Plate-forme de Cristallogenèse et Diffraction des Rayons X, 25 Rue du Dr Roux, 75724 Paris (France); Buschiazzo, Alejandro; Alzari, Pedro M., E-mail: alzari@pasteur.fr [Institut Pasteur, Unité de Biochimie Structurale, URA CNRS 2185, 25 Rue du Dr Roux, 75724 Paris (France)

    2009-10-01

    The crystal structure of Rv2714, a protein of unknown function from M. tuberculosis, has been determined at 2.6 Å resolution using single-wavelength anomalous diffraction methods. The gene Rv2714 from Mycobacterium tuberculosis, which codes for a hypothetical protein of unknown function, is a representative member of a gene family that is largely confined to the order Actinomycetales of Actinobacteria. Sequence analysis indicates the presence of two paralogous genes in most mycobacterial genomes and suggests that gene duplication was an ancient event in bacterial evolution. The crystal structure of Rv2714 has been determined at 2.6 Å resolution, revealing a trimer in which the topology of the protomer core is similar to that observed in a functionally diverse set of enzymes, including purine nucleoside phosphorylases, some carboxypeptidases, bacterial peptidyl-tRNA hydrolases and even the plastidic form of an intron splicing factor. However, some structural elements, such as a β-hairpin insertion involved in protein oligomerization and a C-terminal α-helical domain that serves as a lid to the putative substrate-binding (or ligand-binding) site, are only found in Rv2714 bacterial homologues and represent specific signatures of this protein family.

  5. Divergence and adaptive evolution of the gibberellin oxidase genes in plants.

    Science.gov (United States)

    Huang, Yuan; Wang, Xi; Ge, Song; Rao, Guang-Yuan

    2015-09-29

    The important phytohormone gibberellins (GAs) play key roles in various developmental processes. GA oxidases (GAoxs) are critical enzymes in GA synthesis pathway, but their classification, evolutionary history and the forces driving the evolution of plant GAox genes remain poorly understood. This study provides the first large-scale evolutionary analysis of GAox genes in plants by using an extensive whole-genome dataset of 41 species, representing green algae, bryophytes, pteridophyte, and seed plants. We defined eight subfamilies under the GAox family, namely C19-GA2ox, C20-GA2ox, GA20ox,GA3ox, GAox-A, GAox-B, GAox-C and GAox-D. Of these, subfamilies GAox-A, GAox-B, GAox-C and GAox-D are described for the first time. On the basis of phylogenetic analyses and characteristic motifs of GAox genes, we demonstrated a rapid expansion and functional divergence of the GAox genes during the diversification of land plants. We also detected the subfamily-specific motifs and potential sites of some GAox genes, which might have evolved under positive selection. GAox genes originated very early-before the divergence of bryophytes and the vascular plants and the diversification of GAox genes is associated with the functional divergence and could be driven by positive selection. Our study not only provides information on the classification of GAox genes, but also facilitates the further functional characterization and analysis of GA oxidases.

  6. Comparative genomic analysis of SET domain family reveals the origin, expansion, and putative function of the arthropod-specific SmydA genes as histone modifiers in insects.

    Science.gov (United States)

    Jiang, Feng; Liu, Qing; Wang, Yanli; Zhang, Jie; Wang, Huimin; Song, Tianqi; Yang, Meiling; Wang, Xianhui; Kang, Le

    2017-06-01

    The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain-containing genes (SET genes) in all of the 147 arthropod genomes sequenced at the time of carrying out this experiment to understand the evolutionary history by which SET domains have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, that is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution. © The Authors 2017. Published by Oxford University Press.

  7. Diversifying Selection in the Wheat Stem Rust Fungus Acts Predominantly on Pathogen-Associated Gene Families and Reveals Candidate Effectors

    Directory of Open Access Journals (Sweden)

    Jana eSperschneider

    2014-09-01

    Full Text Available Plant pathogens cause severe losses to crop plants and threaten global food production. One striking example is the wheat stem rust fungus, Puccinia graminis f. sp. tritici, which can rapidly evolve new virulent pathotypes in response to resistant host lines. Like several other filamentous fungal and oomycete plant pathogens, its genome features expanded gene families that have been implicated in host-pathogen interactions, possibly encoding effector proteins that interact directly with target host defence proteins. Previous efforts to understand virulence largely relied on the prediction of secreted, small and cysteine-rich proteins as candidate effectors and thus delivered an overwhelming number of candidates. Here, we implement an alternative analysis strategy that uses the signal of adaptive evolution as a line of evidence for effector function, combined with comparative information and expression data. We demonstrate that in planta up-regulated genes that are rapidly evolving are found almost exclusively in pathogen-associated gene families, affirming the impact of host-pathogen co-evolution on genome structure and the adaptive diversification of specialised gene families. In particular, we predict 42 effector candidates that are conserved only across pathogens, induced during infection and rapidly evolving. One of our top candidates has recently been shown to induce genotype-specific hypersensitive cell death in wheat. This shows that comparative genomics incorporating the evolutionary signal of adaptation is powerful for predicting effector candidates for laboratory verification. Our system can be applied to a wide range of pathogens and will give insight into host-pathogen dynamics, ultimately leading to progress in strategies for disease control.

  8. The Caenorhabditis chemoreceptor gene families

    OpenAIRE

    Robertson Hugh M; Thomas James H

    2008-01-01

    Abstract Background Chemoreceptor proteins mediate the first step in the transduction of environmental chemical stimuli, defining the breadth of detection and conferring stimulus specificity. Animal genomes contain families of genes encoding chemoreceptors that mediate taste, olfaction, and pheromone responses. The size and diversity of these families reflect the biology of chemoperception in specific species. Results Based on manual curation and sequence comparisons among putative G-protein-...

  9. Rate of evolution in brain-expressed genes in humans and other primates.

    Directory of Open Access Journals (Sweden)

    Hurng-Yi Wang

    2007-02-01

    Full Text Available Brain-expressed genes are known to evolve slowly in mammals. Nevertheless, since brains of higher primates have evolved rapidly, one might expect acceleration in DNA sequence evolution in their brain-expressed genes. In this study, we carried out full-length cDNA sequencing on the brain transcriptome of an Old World monkey (OWM and then conducted three-way comparisons among (i mouse, OWM, and human, and (ii OWM, chimpanzee, and human. Although brain-expressed genes indeed appear to evolve more rapidly in species with more advanced brains (apes > OWM > mouse, a similar lineage effect is observable for most other genes. The broad inclusion of genes in the reference set to represent the genomic average is therefore critical to this type of analysis. Calibrated against the genomic average, the rate of evolution among brain-expressed genes is probably lower (or at most equal in humans than in chimpanzee and OWM. Interestingly, the trend of slow evolution in coding sequence is no less pronounced among brain-specific genes, vis-à-vis brain-expressed genes in general. The human brain may thus differ from those of our close relatives in two opposite directions: (i faster evolution in gene expression, and (ii a likely slowdown in the evolution of protein sequences. Possible explanations and hypotheses are discussed.

  10. Genome-wide analysis of the MADS-box gene family in polyploid cotton (Gossypium hirsutum) and in its diploid parental species (Gossypium arboreum and Gossypium raimondii).

    Science.gov (United States)

    Nardeli, Sarah Muniz; Artico, Sinara; Aoyagi, Gustavo Mitsunori; de Moura, Stéfanie Menezes; da Franca Silva, Tatiane; Grossi-de-Sa, Maria Fatima; Romanel, Elisson; Alves-Ferreira, Marcio

    2018-06-01

    The MADS-box gene family encodes transcription factors that share a highly conserved domain known to bind to DNA. Members of this family control various processes of development in plants, from root formation to fruit ripening. In this work, a survey of diploid (Gossypium raimondii and Gossypium arboreum) and tetraploid (Gossypium hirsutum) cotton genomes found a total of 147, 133 and 207 MADS-box genes, respectively, distributed in the MIKC, Mα, Mβ, Mγ, and Mδ subclades. A comparative phylogenetic analysis among cotton species, Arabidopsis, poplar and grapevine MADS-box homologous genes allowed us to evaluate the evolution of each MADS-box lineage in cotton plants and identify sequences within well-established subfamilies. Chromosomal localization and phylogenetic analysis revealed that G. raimondii and G. arboreum showed a conserved evolution of the MIKC subclade and a distinct pattern of duplication events in the Mα, Mγ and Mδ subclades. Additionally, G. hirsutum showed a combination of its parental subgenomes followed by a distinct evolutionary history including gene gain and loss in each subclade. qPCR analysis revealed the expression patterns of putative homologs in the AP1, AP3, AGL6, SEP4, AGL15, AG, AGL17, TM8, SVP, SOC and TT16 subfamilies of G. hirsutum. The identification of putative cotton orthologs is discussed in the light of evolution and gene expression data from other plants. This analysis of the MADS-box genes in Gossypium species opens an avenue to understanding the origin and evolution of each gene subfamily within diploid and polyploid species and paves the way for functional studies in cotton species. Copyright © 2018 Elsevier Masson SAS. All rights reserved.

  11. Conditions for the evolution of gene clusters in bacterial genomes.

    Directory of Open Access Journals (Sweden)

    Sara Ballouz

    2010-02-01

    Full Text Available Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed to account for the evolution of these structures. For instance, natural selection may directly favour gene clusters through a variety of mechanisms, such as increased efficiency of coregulation. An alternative and controversial hypothesis is the selfish operon model, which asserts that clustered arrangements of genes are more easily transferred to other species, thus improving the prospects for survival of the cluster. According to another hypothesis (the persistence model, genes that are in close proximity are less likely to be disrupted by deletions. Here we develop computational models to study the conditions under which gene clusters can evolve and persist. First, we examine the selfish operon model by re-implementing the simulation and running it under a wide range of conditions. Second, we introduce and study a Moran process in which there is natural selection for gene clustering and rearrangement occurs by genome inversion events. Finally, we develop and study a model that includes selection and inversion, which tracks the occurrence and fixation of rearrangements. Surprisingly, gene clusters fail to evolve under a wide range of conditions. Factors that promote the evolution of gene clusters include a low number of genes in the pathway, a high population size, and in the case of the selfish operon model, a high horizontal transfer rate. The computational analysis here has shown that the evolution of gene clusters can occur under both direct and indirect selection as long as certain conditions hold. Under these conditions the selfish operon model is still viable as an explanation for the evolution of gene clusters.

  12. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution.

    Science.gov (United States)

    Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A

    2015-06-08

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  13. Genome-wide identification and characterization of the SBP-box gene family in Petunia.

    Science.gov (United States)

    Zhou, Qin; Zhang, Sisi; Chen, Feng; Liu, Baojun; Wu, Lan; Li, Fei; Zhang, Jiaqi; Bao, Manzhu; Liu, Guofeng

    2018-03-12

    -to-reproductive phase transition. Petunia genome contains at least 21 SPL genes, and most of the genes are expressed in different tissues. The PhSPL genes may play conserved and diverse roles in plant growth and development, including flowering regulation, leaf initiation, axillary bud and inflorescence development. This work provides a comprehensive understanding of the SBP-box gene family in Petunia and lays a significant foundation for future studies on the function and evolution of SPL genes in petunia.

  14. Comparative study of human mitochondrial proteome reveals extensive protein subcellular relocalization after gene duplications

    Directory of Open Access Journals (Sweden)

    Huang Yong

    2009-11-01

    Full Text Available Abstract Background Gene and genome duplication is the principle creative force in evolution. Recently, protein subcellular relocalization, or neolocalization was proposed as one of the mechanisms responsible for the retention of duplicated genes. This hypothesis received support from the analysis of yeast genomes, but has not been tested thoroughly on animal genomes. In order to evaluate the importance of subcellular relocalizations for retention of duplicated genes in animal genomes, we systematically analyzed nuclear encoded mitochondrial proteins in the human genome by reconstructing phylogenies of mitochondrial multigene families. Results The 456 human mitochondrial proteins selected for this study were clustered into 305 gene families including 92 multigene families. Among the multigene families, 59 (64% consisted of both mitochondrial and cytosolic (non-mitochondrial proteins (mt-cy families while the remaining 33 (36% were composed of mitochondrial proteins (mt-mt families. Phylogenetic analyses of mt-cy families revealed three different scenarios of their neolocalization following gene duplication: 1 relocalization from mitochondria to cytosol, 2 from cytosol to mitochondria and 3 multiple subcellular relocalizations. The neolocalizations were most commonly enabled by the gain or loss of N-terminal mitochondrial targeting signals. The majority of detected subcellular relocalization events occurred early in animal evolution, preceding the evolution of tetrapods. Mt-mt protein families showed a somewhat different pattern, where gene duplication occurred more evenly in time. However, for both types of protein families, most duplication events appear to roughly coincide with two rounds of genome duplications early in vertebrate evolution. Finally, we evaluated the effects of inaccurate and incomplete annotation of mitochondrial proteins and found that our conclusion of the importance of subcellular relocalization after gene duplication on

  15. Gene Conversion in Angiosperm Genomes with an Emphasis on Genes Duplicated by Polyploidization

    Directory of Open Access Journals (Sweden)

    Xi-Yin Wang

    2011-01-01

    Full Text Available Angiosperm genomes differ from those of mammals by extensive and recursive polyploidizations. The resulting gene duplication provides opportunities both for genetic innovation, and for concerted evolution. Though most genes may escape conversion by their homologs, concerted evolution of duplicated genes can last for millions of years or longer after their origin. Indeed, paralogous genes on two rice chromosomes duplicated an estimated 60–70 million years ago have experienced gene conversion in the past 400,000 years. Gene conversion preserves similarity of paralogous genes, but appears to accelerate their divergence from orthologous genes in other species. The mutagenic nature of recombination coupled with the buffering effect provided by gene redundancy, may facilitate the evolution of novel alleles that confer functional innovations while insulating biological fitness of affected plants. A mixed evolutionary model, characterized by a primary birth-and-death process and occasional homoeologous recombination and gene conversion, may best explain the evolution of multigene families.

  16. miR-92a family and their target genes in tumorigenesis and metastasis

    Energy Technology Data Exchange (ETDEWEB)

    Li, Molin, E-mail: molin_li@hotmail.com [Department of Pathophysiology, Basic Medical Science of Dalian Medical University, Dalian 116044 (China); Institute of Cancer Stem Cell, Dalian Medical University Cancer Center, Dalian 116044 (China); Guan, Xingfang; Sun, Yuqiang [Department of Pathophysiology, Basic Medical Science of Dalian Medical University, Dalian 116044 (China); Mi, Jun [Institute of Cancer Stem Cell, Dalian Medical University Cancer Center, Dalian 116044 (China); Shu, Xiaohong [College of Pharmacy, Dalian Medical University Cancer Center, Dalian 116044 (China); Liu, Fang [Department of Surgery, The Second Affiliated Hospital of Dalian Medical University, Dalian 116027 (China); Li, Chuangang, E-mail: li_chuangang@sina.com [Department of Surgery, The Second Affiliated Hospital of Dalian Medical University, Dalian 116027 (China)

    2014-04-15

    The miR-92a family, including miR-25, miR-92a-1, miR-92a-2 and miR-363, arises from three different paralog clusters miR-17-92, miR-106a-363, and miR-106b-25 that are highly conservative in the process of evolution, and it was thought as a group of microRNAs (miRNAs) correlated with endothelial cells. Aberrant expression of miR-92a family was detected in multiple cancers, and the disturbance of miR-92a family was related with tumorigenesis and tumor development. In this review, the progress on the relationship between miR-92a family and their target genes and malignant tumors will be summarized. - Highlights: • Aberrant expression of miR-92a, miR-25 and miR-363 can be observed in many kinds of malignant tumors. • The expression of miR-92a family is regulated by LOH, epigenetic alteration, transcriptional factors such as SP1, MYC, E2F, wild-type p53 etc. • Roles of miR-92a family in tumorigenesis and development: promoting cell proliferation, invasion and metastasis, inhibiting cell apoptosis.

  17. miR-92a family and their target genes in tumorigenesis and metastasis

    International Nuclear Information System (INIS)

    Li, Molin; Guan, Xingfang; Sun, Yuqiang; Mi, Jun; Shu, Xiaohong; Liu, Fang; Li, Chuangang

    2014-01-01

    The miR-92a family, including miR-25, miR-92a-1, miR-92a-2 and miR-363, arises from three different paralog clusters miR-17-92, miR-106a-363, and miR-106b-25 that are highly conservative in the process of evolution, and it was thought as a group of microRNAs (miRNAs) correlated with endothelial cells. Aberrant expression of miR-92a family was detected in multiple cancers, and the disturbance of miR-92a family was related with tumorigenesis and tumor development. In this review, the progress on the relationship between miR-92a family and their target genes and malignant tumors will be summarized. - Highlights: • Aberrant expression of miR-92a, miR-25 and miR-363 can be observed in many kinds of malignant tumors. • The expression of miR-92a family is regulated by LOH, epigenetic alteration, transcriptional factors such as SP1, MYC, E2F, wild-type p53 etc. • Roles of miR-92a family in tumorigenesis and development: promoting cell proliferation, invasion and metastasis, inhibiting cell apoptosis

  18. Evolutionary genomics and adaptive evolution of the hedgehog gene family (Shh, Ihh and Dhh) in vertebrates

    DEFF Research Database (Denmark)

    Pereira, Joana; Johnson, Warren E.; O'Brien, Stephen J.

    2014-01-01

    The Hedgehog (Hh) gene family codes for a class of secreted proteins composed of two active domains that act as signalling molecules during embryo development, namely for the development of the nervous and skeletal systems and the formation of the testis cord. While only one Hh gene is found typi...

  19. Genome-wide Identification and Expression Analysis of the CDPK Gene Family in Grape, Vitis spp.

    Science.gov (United States)

    Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang

    2015-06-30

    Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occured in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.

  20. Molecular evolution, characterization and expression analysis of SnRK2 gene family in Pak-choi (Brassica rapa ssp. chinensis

    Directory of Open Access Journals (Sweden)

    Zhinan eHuang

    2015-10-01

    Full Text Available Abstract: The sucrose non-fermenting 1-related protein kinase 2 (SnRK2 family members are plant-specific serine/threonine kinases that are involved in the plant response to abiotic stress and abscisic acid (ABA-dependent plant development. Further understanding of the evolutionary history and expression characteristics of these genes will help to elucidate the mechanisms of the stress tolerance in Pak-choi, an important green leafy vegetable in China. Thus, we investigated the evolutionary patterns, footprints and conservation of SnRK2 genes in selected plants and later cloned and analyzed SnRK2 genes in Pak-choi. We found that this gene family was preferentially retained in Brassicas after the Brassica-Arabidopsis thaliana split. Next, we cloned and sequenced 13 SnRK2 from both cDNA and DNA libraries of stress-induced Pak-choi, which were under conditions of ABA, salinity, cold, heat, and osmotic treatments. Most of the BcSnRK2s have eight exons and could be divided into three groups. The subcellular localization predictions suggested that the putative BcSnRK2 proteins were enriched in the nucleus. The results of an analysis of the expression patterns of the BcSnRK2 genes showed that BcSnRK2 group III genes were robustly induced by ABA treatments. Most of the BcSnRK2 genes were activated by low temperature, and the BcSnRK2.6 genes responded to both ABA and low temperature. In fact, most of the BcSnRK2 genes showed positive or negative regulation under ABA and low temperature treatments, suggesting that they may be global regulators that function at the intersection of multiple signaling pathways to play important roles in Pak-choi stress responses.

  1. The evolution of gene expression in primates

    OpenAIRE

    Tashakkori Ghanbarian, Avazeh

    2015-01-01

    The evolution of a gene’s expression profile is commonly assumed to be independent of its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between expression of neighboring genes in extant taxa. Indeed, in all eukaryotic genomes, genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their e...

  2. Recurrent APC gene mutations in Polish FAP families

    Directory of Open Access Journals (Sweden)

    Pławski Andrzej

    2007-12-01

    Full Text Available Abstract The molecular diagnostics of genetically conditioned disorders is based on the identification of the mutations in the predisposing genes. Hereditary cancer disorders of the gastrointestinal tracts are caused by mutations of the tumour suppressor genes or the DNA repair genes. Occurrence of recurrent mutation allows improvement of molecular diagnostics. The mutation spectrum in the genes causing hereditary forms of colorectal cancers in the Polish population was previously described. In the present work an estimation of the frequency of the recurrent mutations of the APC gene was performed. Eight types of mutations occurred in 19.4% of our FAP families and these constitute 43% of all Polish diagnosed families.

  3. Molecular evolution of rbcL in three gymnosperm families: identifying adaptive and coevolutionary patterns

    LENUS (Irish Health Repository)

    Sen, Lin

    2011-06-03

    Abstract Background The chloroplast-localized ribulose-1, 5-biphosphate carboxylase\\/oxygenase (Rubisco), the primary enzyme responsible for autotrophy, is instrumental in the continual adaptation of plants to variations in the concentrations of CO2. The large subunit (LSU) of Rubisco is encoded by the chloroplast rbcL gene. Although adaptive processes have been previously identified at this gene, characterizing the relationships between the mutational dynamics at the protein level may yield clues on the biological meaning of such adaptive processes. The role of such coevolutionary dynamics in the continual fine-tuning of RbcL remains obscure. Results We used the timescale and phylogenetic analyses to investigate and search for processes of adaptive evolution in rbcL gene in three gymnosperm families, namely Podocarpaceae, Taxaceae and Cephalotaxaceae. To understand the relationships between regions identified as having evolved under adaptive evolution, we performed coevolutionary analyses using the software CAPS. Importantly, adaptive processes were identified at amino acid sites located on the contact regions among the Rubisco subunits and on the interface between Rubisco and its activase. Adaptive amino acid replacements at these regions may have optimized the holoenzyme activity. This hypothesis was pinpointed by evidence originated from our analysis of coevolution that supported the correlated evolution between Rubisco and its activase. Interestingly, the correlated adaptive processes between both these proteins have paralleled the geological variation history of the concentration of atmospheric CO2. Conclusions The gene rbcL has experienced bursts of adaptations in response to the changing concentration of CO2 in the atmosphere. These adaptations have emerged as a result of a continuous dynamic of mutations, many of which may have involved innovation of functional Rubisco features. Analysis of the protein structure and the functional implications of such

  4. Functional requirements driving the gene duplication in 12 Drosophila species.

    Science.gov (United States)

    Zhong, Yan; Jia, Yanxiao; Gao, Yang; Tian, Dacheng; Yang, Sihai; Zhang, Xiaohui

    2013-08-15

    Gene duplication supplies the raw materials for novel gene functions and many gene families arisen from duplication experience adaptive evolution. Most studies of young duplicates have focused on mammals, especially humans, whereas reports describing their genome-wide evolutionary patterns across the closely related Drosophila species are rare. The sequenced 12 Drosophila genomes provide the opportunity to address this issue. In our study, 3,647 young duplicate gene families were identified across the 12 Drosophila species and three types of expansions, species-specific, lineage-specific and complex expansions, were detected in these gene families. Our data showed that the species-specific young duplicate genes predominated (86.6%) over the other two types. Interestingly, many independent species-specific expansions in the same gene family have been observed in many species, even including 11 or 12 Drosophila species. Our data also showed that the functional bias observed in these young duplicate genes was mainly related to responses to environmental stimuli and biotic stresses. This study reveals the evolutionary patterns of young duplicates across 12 Drosophila species on a genomic scale. Our results suggest that convergent evolution acts on young duplicate genes after the species differentiation and adaptive evolution may play an important role in duplicate genes for adaption to ecological factors and environmental changes in Drosophila.

  5. PlantTribes: a gene and gene family resource for comparative genomics in plants

    OpenAIRE

    Wall, P. Kerr; Leebens-Mack, Jim; Müller, Kai F.; Field, Dawn; Altman, Naomi S.; dePamphilis, Claude W.

    2007-01-01

    The PlantTribes database (http://fgp.huck.psu.edu/tribe.html) is a plant gene family database based on the inferred proteomes of five sequenced plant species: Arabidopsis thaliana, Carica papaya, Medicago truncatula, Oryza sativa and Populus trichocarpa. We used the graph-based clustering algorithm MCL [Van Dongen (Technical Report INS-R0010 2000) and Enright et al. (Nucleic Acids Res. 2002; 30: 1575–1584)] to classify all of these species’ protein-coding genes into putative gene families, ca...

  6. Analysis of Copy Number Variation in the Abp Gene Regions of Two House Mouse Subspecies Suggests Divergence during the Gene Family Expansions.

    Science.gov (United States)

    Pezer, Željka; Chung, Amanda G; Karn, Robert C; Laukaitis, Christina M

    2017-06-01

    The Androgen-binding protein ( Abp ) gene region of the mouse genome contains 64 genes, some encoding pheromones that influence assortative mating between mice from different subspecies. Using CNVnator and quantitative PCR, we explored copy number variation in this gene family in natural populations of Mus musculus domesticus ( Mmd ) and Mus musculus musculus ( Mmm ), two subspecies of house mice that form a narrow hybrid zone in Central Europe. We found that copy number variation in the center of the Abp gene region is very common in wild Mmd , primarily representing the presence/absence of the final duplications described for the mouse genome. Clustering of Mmd individuals based on this variation did not reflect their geographical origin, suggesting no population divergence in the Abp gene cluster. However, copy number variation patterns differ substantially between Mmd and other mouse taxa. Large blocks of Abp genes are absent in Mmm , Mus musculus castaneus and an outgroup, Mus spretus , although with differences in variation and breakpoint locations. Our analysis calls into question the reliance on a reference genome for interpreting the detailed organization of genes in taxa more distant from the Mmd reference genome. The polymorphic nature of the gene family expansion in all four taxa suggests that the number of Abp genes, especially in the central gene region, is not critical to the survival and reproduction of the mouse. However, Abp haplotypes of variable length may serve as a source of raw genetic material for new signals influencing reproductive communication and thus speciation of mice. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Genome-wide analysis of Aux/IAA gene family in Solanaceae species using tomato as a model.

    Science.gov (United States)

    Wu, Jian; Peng, Zhen; Liu, Songyu; He, Yanjun; Cheng, Lin; Kong, Fuling; Wang, Jie; Lu, Gang

    2012-04-01

    treatments even though at least one stress-related cis-element was identified in their promoter regions. In conclusion, our comparative analysis provides an insight into the evolution and expression patterns in various tissues and in response to auxin or stresses of the Aux/IAA family members in tomato, which will provide a very useful reference for cloning and functional analysis of each member of AUX/IAA gene family in Solanaceae crops.

  8. The evolution of milk casein genes from tooth genes before the origin of mammals.

    Science.gov (United States)

    Kawasaki, Kazuhiko; Lafont, Anne-Gaelle; Sire, Jean-Yves

    2011-07-01

    Caseins are among cardinal proteins that evolved in the lineage leading to mammals. In milk, caseins and calcium phosphate (CaP) form a huge complex called casein micelle. By forming the micelle, milk maintains high CaP concentrations, which help altricial mammalian neonates to grow bone and teeth. Two types of caseins are known. Ca-sensitive caseins (α(s)- and β-caseins) bind Ca but precipitate at high Ca concentrations, whereas Ca-insensitive casein (κ-casein) does not usually interact with Ca but instead stabilizes the micelle. Thus, it is thought that these two types of caseins are both necessary for stable micelle formation. Both types of caseins show high substitution rates, which make it difficult to elucidate the evolution of caseins. Yet, recent studies have revealed that all casein genes belong to the secretory calcium-binding phosphoprotein (SCPP) gene family that arose by gene duplication. In the present study, we investigated exon-intron structures and phylogenetic distributions of casein and other SCPP genes, particularly the odontogenic ameloblast-associated (ODAM) gene, the SCPP-Pro-Gln-rich 1 (SCPPPQ1) gene, and the follicular dendritic cell secreted peptide (FDCSP) gene. The results suggest that contemporary Ca-sensitive casein genes arose from a putative common ancestor, which we refer to as CSN1/2. The six putative exons comprising CSN1/2 are all found in SCPPPQ1, although ODAM also shares four of these exons. By contrast, the five exons of the Ca-insensitive casein gene are all reminiscent of FDCSP. The phylogenetic distribution of these genes suggests that both SCPPPQ1 and FDCSP arose from ODAM. We thus argue that all casein genes evolved from ODAM via two different pathways; Ca-sensitive casein genes likely originated directly from SCPPPQ1, whereas the Ca-insensitive casein genes directly differentiated from FDCSP. Further, expression of ODAM, SCPPPQ1, and FDCSP was detected in dental tissues, supporting the idea that both types of caseins

  9. Parsing parallel evolution: ecological divergence and differential gene expression in the adaptive radiations of thick-lipped Midas cichlid fishes from Nicaragua.

    Science.gov (United States)

    Manousaki, Tereza; Hull, Pincelli M; Kusche, Henrik; Machado-Schiaffino, Gonzalo; Franchini, Paolo; Harrod, Chris; Elmer, Kathryn R; Meyer, Axel

    2013-02-01

    The study of parallel evolution facilitates the discovery of common rules of diversification. Here, we examine the repeated evolution of thick lips in Midas cichlid fishes (the Amphilophus citrinellus species complex)-from two Great Lakes and two crater lakes in Nicaragua-to assess whether similar changes in ecology, phenotypic trophic traits and gene expression accompany parallel trait evolution. Using next-generation sequencing technology, we characterize transcriptome-wide differential gene expression in the lips of wild-caught sympatric thick- and thin-lipped cichlids from all four instances of repeated thick-lip evolution. Six genes (apolipoprotein D, myelin-associated glycoprotein precursor, four-and-a-half LIM domain protein 2, calpain-9, GTPase IMAP family member 8-like and one hypothetical protein) are significantly underexpressed in the thick-lipped morph across all four lakes. However, other aspects of lips' gene expression in sympatric morphs differ in a lake-specific pattern, including the magnitude of differentially expressed genes (97-510). Generally, fewer genes are differentially expressed among morphs in the younger crater lakes than in those from the older Great Lakes. Body shape, lower pharyngeal jaw size and shape, and stable isotopes (δ(13)C and δ(15)N) differ between all sympatric morphs, with the greatest differentiation in the Great Lake Nicaragua. Some ecological traits evolve in parallel (those related to foraging ecology; e.g. lip size, body and head shape) but others, somewhat surprisingly, do not (those related to diet and food processing; e.g. jaw size and shape, stable isotopes). Taken together, this case of parallelism among thick- and thin-lipped cichlids shows a mosaic pattern of parallel and nonparallel evolution. © 2012 Blackwell Publishing Ltd.

  10. Evolution of the defensin-like gene family in grass genomes

    Indian Academy of Sciences (India)

    . Jiandong Wu, Xiaolei Jing, Yang Zhao, Qing Dong, Haiyang Jiang and Qing Ma. J. Genet. 95, 53–62. Table 1. Information about DEFL genes in B. distachyon, O. sativa, S. bicolor and Z. mays. Species. Gene name. Gene identifer. ORF (aa).

  11. Convergent evolution, habitat shifts and variable diversification rates in the ovenbird-woodcreeper family (Furnariidae).

    Science.gov (United States)

    Irestedt, Martin; Fjeldså, Jon; Dalén, Love; Ericson, Per G P

    2009-11-21

    The Neotropical ovenbird-woodcreeper family (Furnariidae) is an avian group characterized by exceptionally diverse ecomorphological adaptations. For instance, members of the family are known to construct nests of a remarkable variety. This offers a unique opportunity to examine whether changes in nest design, accompanied by expansions into new habitats, facilitates diversification. We present a multi-gene phylogeny and age estimates for the ovenbird-woodcreeper family and use these results to estimate the degree of convergent evolution in both phenotype and habitat utilisation. Furthermore, we discuss whether variation in species richness among ovenbird clades could be explained by differences in clade-specific diversification rates, and whether these rates differ among lineages with different nesting habits. In addition, the systematic positions of some enigmatic ovenbird taxa and the postulated monophyly of some species-rich genera are evaluated. The phylogenetic results reveal new examples of convergent evolution and show that ovenbirds have independently colonized open habitats at least six times. The calculated age estimates suggest that the ovenbird-woodcreeper family started to diverge at ca 33 Mya, and that the timing of habitat shifts into open environments may be correlated with the aridification of South America during the last 15 My. The results also show that observed large differences in species richness among clades can be explained by a substantial variation in net diversification rates. The synallaxines, which generally are adapted to dry habitats and build exposed vegetative nests, had the highest diversification rate of all major furnariid clades. Several key features may have played an important role for the radiation and evolution of convergent phenotypes in the ovenbird-woodcreeper family. Our results suggest that changes in nest building strategy and adaptation to novel habitats may have played an important role in a diversification that

  12. Evolution of the vertebrate Pax4/6 class of genes with focus on its novel member, the Pax10 gene.

    Science.gov (United States)

    Feiner, Nathalie; Meyer, Axel; Kuraku, Shigehiro

    2014-06-19

    The members of the paired box (Pax) family regulate key developmental pathways in many metazoans as tissue-specific transcription factors. Vertebrate genomes typically possess nine Pax genes (Pax1-9), which are derived from four proto-Pax genes in the vertebrate ancestor that were later expanded through the so-called two-round (2R) whole-genome duplication. A recent study proposed that pax6a genes of a subset of teleost fishes (namely, acanthopterygians) are remnants of a paralog generated in the 2R genome duplication, to be renamed pax6.3, and reported one more group of vertebrate Pax genes (Pax6.2), most closely related to the Pax4/6 class. We propose to designate this new member Pax10 instead and reconstruct the evolutionary history of the Pax4/6/10 class with solid phylogenetic evidence. Our synteny analysis showed that Pax4, -6, and -10 originated in the 2R genome duplications early in vertebrate evolution. The phylogenetic analyses of relationships between teleost pax6a and other Pax4, -6, and -10 genes, however, do not support the proposed hypothesis of an ancient origin of the acanthopterygian pax6a genes in the 2R genome duplication. Instead, we confirmed the traditional scenario that the acanthopterygian pax6a is derived from the more recent teleost-specific genome duplication. Notably, Pax6 is present in all vertebrates surveyed to date, whereas Pax4 and -10 were lost multiple times in independent vertebrate lineages, likely because of their restricted expression patterns: Among Pax6-positive domains, Pax10 has retained expression in the adult retina alone, which we documented through in situ hybridization and quantitative reverse transcription polymerase chain reaction experiments on zebrafish, Xenopus, and anole lizard. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  13. Evolution families of conformal mappings with fixed points and the Löwner-Kufarev equation

    International Nuclear Information System (INIS)

    Goryainov, V V

    2015-01-01

    The paper is concerned with evolution families of conformal mappings of the unit disc to itself that fix an interior point and a boundary point. Conditions are obtained for the evolution families to be differentiable, and an existence and uniqueness theorem for an evolution equation is proved. A convergence theorem is established which describes the topology of locally uniform convergence of evolution families in terms of infinitesimal generating functions. The main result in this paper is the embedding theorem which shows that any conformal mapping of the unit disc to itself with two fixed points can be embedded into a differentiable evolution family of such mappings. This result extends the range of the parametric method in the theory of univalent functions. In this way the problem of the mutual change of the derivative at an interior point and the angular derivative at a fixed point on the boundary is solved for a class of mappings of the unit disc to itself. In particular, the rotation theorem is established for this class of mappings. Bibliography: 27 titles

  14. New Genes and Functional Innovation in Mammals.

    Science.gov (United States)

    Luis Villanueva-Cañas, José; Ruiz-Orera, Jorge; Agea, M Isabel; Gallo, Maria; Andreu, David; Albà, M Mar

    2017-07-01

    The birth of genes that encode new protein sequences is a major source of evolutionary innovation. However, we still understand relatively little about how these genes come into being and which functions they are selected for. To address these questions, we have obtained a large collection of mammalian-specific gene families that lack homologues in other eukaryotic groups. We have combined gene annotations and de novo transcript assemblies from 30 different mammalian species, obtaining ∼6,000 gene families. In general, the proteins in mammalian-specific gene families tend to be short and depleted in aromatic and negatively charged residues. Proteins which arose early in mammalian evolution include milk and skin polypeptides, immune response components, and proteins involved in reproduction. In contrast, the functions of proteins which have a more recent origin remain largely unknown, despite the fact that these proteins also have extensive proteomics support. We identify several previously described cases of genes originated de novo from noncoding genomic regions, supporting the idea that this mechanism frequently underlies the evolution of new protein-coding genes in mammals. Finally, we show that most young mammalian genes are preferentially expressed in testis, suggesting that sexual selection plays an important role in the emergence of new functional genes. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  15. The ALMT Gene Family Performs Multiple Functions in Plants

    Directory of Open Access Journals (Sweden)

    Jie Liu

    2018-02-01

    Full Text Available The aluminium activated malate transporter (ALMT gene family is named after the first member of the family identified in wheat (Triticum aestivum L.. The product of this gene controls resistance to aluminium (Al toxicity. ALMT genes encode transmembrane proteins that function as anion channels and perform multiple functions involving the transport of organic anions (e.g., carboxylates and inorganic anions in cells. They share a PF11744 domain and are classified in the Fusaric acid resistance protein-like superfamily, CL0307. The proteins typically have five to seven transmembrane regions in the N-terminal half and a long hydrophillic C-terminal tail but predictions of secondary structure vary. Although widely spread in plants, relatively little information is available on the roles performed by other members of this family. In this review, we summarized functions of ALMT gene families, including Al resistance, stomatal function, mineral nutrition, microbe interactions, fruit acidity, light response and seed development.

  16. Repeat-associated plasticity in the Helicobacter pylori RD gene family.

    Science.gov (United States)

    Shak, Joshua R; Dick, Jonathan J; Meinersmann, Richard J; Perez-Perez, Guillermo I; Blaser, Martin J

    2009-11-01

    The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3' region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5' region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host.

  17. Molecular evolution of flavonoid dioxygenases in the family Apiaceae.

    Science.gov (United States)

    Gebhardt, Yvonne; Witte, Simone; Forkmann, Gert; Lukacin, Richard; Matern, Ulrich; Martens, Stefan

    2005-06-01

    Plant species of the family Apiaceae are known to accumulate flavonoids mainly in the form of flavones and flavonols. Three 2-oxoglutarate-dependent dioxygenases, flavone synthase or flavanone 3 beta-hydroxylase and flavonol synthase are involved in the biosynthesis of these secondary metabolites. The corresponding genes were cloned recently from parsley (Petroselinum crispum) leaves. Flavone synthase I appears to be confined to the Apiaceae, and the unique occurrence as well as its high sequence similarity to flavanone 3beta-hydroxylase laid the basis for evolutionary studies. In order to examine the relationship of these two enzymes throughout the Apiaceae, RT-PCR based cloning and functional identification of flavone synthases I or flavanone 3beta-hydroxylases were accomplished from Ammi majus, Anethum graveolens, Apium graveolens, Pimpinella anisum, Conium maculatum and Daucus carota, yielding three additional synthase and three additional hydroxylase cDNAs. Molecular and phylogenetic analyses of these sequences were compatible with the phylogeny based on morphological characteristics and suggested that flavone synthase I most likely resulted from gene duplication of flavanone 3beta-hydroxylase, and functional diversification at some point during the development of the apiaceae subfamilies. Furthermore, the genomic sequences from Petroselinum crispum and Daucus carota revealed two introns in each of the synthases and a lack of introns in the hydroxylases. These results might be explained by intron losses from the hydroxylases occurring at a later stage of evolution.

  18. Genome-Wide Identification, Evolution and Expression Analysis of the Grape (Vitis vinifera L. Zinc Finger-Homeodomain Gene Family

    Directory of Open Access Journals (Sweden)

    Hao Wang

    2014-04-01

    Full Text Available Plant zinc finger-homeodomain (ZHD genes encode a family of transcription factors that have been demonstrated to play an important role in the regulation of plant growth and development. In this study, we identified a total of 13 ZHD genes (VvZHD in the grape genome that were further classified into at least seven groups. Genome synteny analysis revealed that a number of VvZHD genes were present in the corresponding syntenic blocks of Arabidopsis, indicating that they arose before the divergence of these two species. Gene expression analysis showed that the identified VvZHD genes displayed distinct spatiotemporal expression patterns, and were differentially regulated under various stress conditions and hormone treatments, suggesting that the grape VvZHDs might be also involved in plant response to a variety of biotic and abiotic insults. Our work provides insightful information and knowledge about the ZHD genes in grape, which provides a framework for further characterization of their roles in regulation of stress tolerance as well as other aspects of grape productivity.

  19. Human heavy-chain variable region gene family nonrandomly rearranged in familial chronic lymphocytic leukemia

    International Nuclear Information System (INIS)

    Shen, A.; Humphries, C.; Tucker, P.; Blattner, F.

    1987-01-01

    The authors have identified a family of human immunoglobulin heavy-chain variable-region (V/sub H/) genes, one member of which is rearranged in two affected members of a family in which the father and four of five siblings developed chronic lymphocytic leukemia. Cloning and sequencing of the rearranged V/sub H/ genes from leukemic lymphocytes of three affected siblings showed that two siblings had rearranged V/sub H/ genes (V/sub H/TS1 and V/sub H/WS1) that were 90% homologous. The corresponding germ-line gene, V/sub H/251, was found to part of a small (four gene) V/sub H/ gene family, which they term V/sub H/V. The DNA sequence homology to V/sub H/WS1 (95%) and V/sub H/TS1 (88%) and identical restriction sites on the 5' side of V/sub H/ confirm that rearrangement of V/sub H/251 followed by somatic mutation produced the identical V/sub H/ gene rearrangements in the two siblings. V/sub H/TS1 is not a functional V/sub H/ gene; a functional V/sub H/ rearrangement was found on the other chromosome of this patient. The other two siblings had different V/sub H/ gene rearrangements. All used different diversity genes. Mechanisms proposed for nonrandom selection of a single V/sub H/ gene include developmental regulation of this V/sub H/ gene rearrangement or selection of a subpopulation of B cells in which this V/sub H/ has been rearranged

  20. The SPINK gene family and celiac disease susceptibility

    NARCIS (Netherlands)

    Wapenaar, M.C.; Monsuur, A.J.; Poell, J.; Slot, R. van 't; Meijer, J.W.R.; Meijer, G.A.; Mulder, C.J.; Mearin, M.L.; Wijmenga, C.

    2007-01-01

    The gene family of serine protease inhibitors of the Kazal type (SPINK) are functional and positional candidate genes for celiac disease (CD). Our aim was to assess the gut mucosal gene expression and genetic association of SPINK1, -2, -4, and -5 in the Dutch CD population. Gene expression was

  1. The SPINK gene family and celiac disease susceptibility

    NARCIS (Netherlands)

    Wapenaar, Martin C.; Monsuur, Alienke J.; Poell, Jos; Slot, Ruben Van 't; Meijer, Jos W. R.; Meijer, Gerrit A.; Mulder, Chris J.; Mearin, Maria Luisa; Wijmenga, Cisca

    The gene family of serine protease inhibitors of the Kazal type (SPINK) are functional and positional candidate genes for celiac disease (CD). Our aim was to assess the gut mucosal gene expression and genetic association of SPINK1, -2, -4, and -5 in the Dutch CD population. Gene expression was

  2. Generic phylogeny, historical biogeography and character evolution of the cosmopolitan aquatic plant family Hydrocharitaceae

    Directory of Open Access Journals (Sweden)

    Chen Ling-Yun

    2012-03-01

    Full Text Available Abstract Background Hydrocharitaceae is a fully aquatic monocot family, consists of 18 genera with approximately 120 species. The family includes both fresh and marine aquatics and exhibits great diversity in form and habit including annual and perennial life histories; submersed, partially submersed and floating leaf habits and linear to orbicular leaf shapes. The family has a cosmopolitan distribution and is well represented in the Tertiary fossil record in Europe. At present, the historical biogeography of the family is not well understood and the generic relationships remain controversial. In this study we investigated the phylogeny and biogeography of Hydrocharitaceae by integrating fossils and DNA sequences from eight genes. We also conducted ancestral state reconstruction for three morphological characters. Results Phylogenetic analyses produced a phylogeny with most branches strongly supported by bootstrap values greater than 95 and Bayesian posterior probability values of 1.0. Stratiotes is the first diverging lineage with the remaining genera in two clades, one clade consists of Lagarosiphon, Ottelia, Blyxa, Apalanthe, Elodea and Egeria; and the other consists of Hydrocharis-Limnobium, Thalassia, Enhalus, Halophila, Najas, Hydrilla, Vallisneria, Nechamandra and Maidenia. Biogeographic analyses (DIVA, Mesquite and divergence time estimates (BEAST resolved the most recent common ancestor of Hydrocharitaceae as being in Asia during the Late Cretaceous and Palaeocene (54.7-72.6 Ma. Dispersals (including long-distance dispersal and migrations through Tethys seaway and land bridges probably played major roles in the intercontinental distribution of this family. Ancestral state reconstruction suggested that in Hydrocharitaceae evolution of dioecy is bidirectional, viz., from dioecy to hermaphroditism, and from hermaphroditism to dioecy, and that the aerial-submerged leaf habit and short-linear leaf shape are the ancestral states. Conclusions

  3. Generic phylogeny, historical biogeography and character evolution of the cosmopolitan aquatic plant family Hydrocharitaceae.

    Science.gov (United States)

    Chen, Ling-Yun; Chen, Jin-Ming; Gituru, Robert Wahiti; Wang, Qing-Feng

    2012-03-10

    Hydrocharitaceae is a fully aquatic monocot family, consists of 18 genera with approximately 120 species. The family includes both fresh and marine aquatics and exhibits great diversity in form and habit including annual and perennial life histories; submersed, partially submersed and floating leaf habits and linear to orbicular leaf shapes. The family has a cosmopolitan distribution and is well represented in the Tertiary fossil record in Europe. At present, the historical biogeography of the family is not well understood and the generic relationships remain controversial. In this study we investigated the phylogeny and biogeography of Hydrocharitaceae by integrating fossils and DNA sequences from eight genes. We also conducted ancestral state reconstruction for three morphological characters. Phylogenetic analyses produced a phylogeny with most branches strongly supported by bootstrap values greater than 95 and Bayesian posterior probability values of 1.0. Stratiotes is the first diverging lineage with the remaining genera in two clades, one clade consists of Lagarosiphon, Ottelia, Blyxa, Apalanthe, Elodea and Egeria; and the other consists of Hydrocharis-Limnobium, Thalassia, Enhalus, Halophila, Najas, Hydrilla, Vallisneria, Nechamandra and Maidenia. Biogeographic analyses (DIVA, Mesquite) and divergence time estimates (BEAST) resolved the most recent common ancestor of Hydrocharitaceae as being in Asia during the Late Cretaceous and Palaeocene (54.7-72.6 Ma). Dispersals (including long-distance dispersal and migrations through Tethys seaway and land bridges) probably played major roles in the intercontinental distribution of this family. Ancestral state reconstruction suggested that in Hydrocharitaceae evolution of dioecy is bidirectional, viz., from dioecy to hermaphroditism, and from hermaphroditism to dioecy, and that the aerial-submerged leaf habit and short-linear leaf shape are the ancestral states. Our study has shed light on the previously controversial

  4. The Genome of Tolypocladium inflatum: Evolution, Organization, and Expression of the Cyclosporin Biosynthetic Gene Cluster

    Science.gov (United States)

    Bushley, Kathryn E.; Raja, Rajani; Jaiswal, Pankaj; Cumbie, Jason S.; Nonogaki, Mariko; Boyd, Alexander E.; Owensby, C. Alisha; Knaus, Brian J.; Elser, Justin; Miller, Daniel; Di, Yanming; McPhail, Kerry L.; Spatafora, Joseph W.

    2013-01-01

    The ascomycete fungus Tolypocladium inflatum, a pathogen of beetle larvae, is best known as the producer of the immunosuppressant drug cyclosporin. The draft genome of T. inflatum strain NRRL 8044 (ATCC 34921), the isolate from which cyclosporin was first isolated, is presented along with comparative analyses of the biosynthesis of cyclosporin and other secondary metabolites in T. inflatum and related taxa. Phylogenomic analyses reveal previously undetected and complex patterns of homology between the nonribosomal peptide synthetase (NRPS) that encodes for cyclosporin synthetase (simA) and those of other secondary metabolites with activities against insects (e.g., beauvericin, destruxins, etc.), and demonstrate the roles of module duplication and gene fusion in diversification of NRPSs. The secondary metabolite gene cluster responsible for cyclosporin biosynthesis is described. In addition to genes necessary for cyclosporin biosynthesis, it harbors a gene for a cyclophilin, which is a member of a family of immunophilins known to bind cyclosporin. Comparative analyses support a lineage specific origin of the cyclosporin gene cluster rather than horizontal gene transfer from bacteria or other fungi. RNA-Seq transcriptome analyses in a cyclosporin-inducing medium delineate the boundaries of the cyclosporin cluster and reveal high levels of expression of the gene cluster cyclophilin. In medium containing insect hemolymph, weaker but significant upregulation of several genes within the cyclosporin cluster, including the highly expressed cyclophilin gene, was observed. T. inflatum also represents the first reference draft genome of Ophiocordycipitaceae, a third family of insect pathogenic fungi within the fungal order Hypocreales, and supports parallel and qualitatively distinct radiations of insect pathogens. The T. inflatum genome provides additional insight into the evolution and biosynthesis of cyclosporin and lays a foundation for further investigations of the role

  5. Asymmetric Evolution and Expansion of the NAC Transcription Factor in Polyploidized Cotton

    Directory of Open Access Journals (Sweden)

    Kai Fan

    2018-01-01

    Full Text Available Polyploidy in Gossypium hirsutum conferred different properties from its diploid ancestors under the regulation of transcription factors. The NAC transcription factor is a plant-specific family that can be related to plant growth and development. So far, little is known about the NAC family in cotton. This study identified 495 NAC genes in three cotton species and investigated the evolution and expansion of different genome-derived NAC genes in cotton. We revealed 15 distinct NAC subfamilies in cotton. Different subfamilies had different gene proportions, expansion rate, gene loss rate, and orthologous exchange rate. Paleohexaploidization (35% and cotton-specific decaploidy (32% might have primarily led to the expansion of the NAC family in cotton. Half of duplication events in G. hirsutum were inherited from its diploid ancestor, and others might have occurred after interspecific hybridization. In addition, NAC genes in the At and Dt subgenomes displayed asymmetric molecular evolution, as evidenced by their different gene loss rates, orthologous exchange, evolutionary rates, and expression levels. The dominant duplication event was different during the cotton evolutionary history. Different genome-derived NACs might have interacted with each other, which ultimately resulted in morphogenetic evolution. This study delineated the expansion and evolutionary history of the NAC family in cotton and illustrated the different fates of NAC genes during polyploidization.

  6. Characterization of the MLO gene family in Rosaceae and gene expression analysis in Malus domestica.

    Science.gov (United States)

    Pessina, Stefano; Pavan, Stefano; Catalano, Domenico; Gallotta, Alessandra; Visser, Richard G F; Bai, Yuling; Malnoy, Mickael; Schouten, Henk J

    2014-07-22

    Powdery mildew (PM) is a major fungal disease of thousands of plant species, including many cultivated Rosaceae. PM pathogenesis is associated with up-regulation of MLO genes during early stages of infection, causing down-regulation of plant defense pathways. Specific members of the MLO gene family act as PM-susceptibility genes, as their loss-of-function mutations grant durable and broad-spectrum resistance. We carried out a genome-wide characterization of the MLO gene family in apple, peach and strawberry, and we isolated apricot MLO homologs through a PCR-approach. Evolutionary relationships between MLO homologs were studied and syntenic blocks constructed. Homologs that are candidates for being PM susceptibility genes were inferred by phylogenetic relationships with functionally characterized MLO genes and, in apple, by monitoring their expression following inoculation with the PM causal pathogen Podosphaera leucotricha. Genomic tools available for Rosaceae were exploited in order to characterize the MLO gene family. Candidate MLO susceptibility genes were identified. In follow-up studies it can be investigated whether silencing or a loss-of-function mutations in one or more of these candidate genes leads to PM resistance.

  7. Genetic Variation of Goat Interferon Regulatory Factor 3 Gene and Its Implication in Goat Evolution.

    Science.gov (United States)

    Okpeku, Moses; Esmailizadeh, Ali; Adeola, Adeniyi C; Shu, Liping; Zhang, Yesheng; Wang, Yangzi; Sanni, Timothy M; Imumorin, Ikhide G; Peters, Sunday O; Zhang, Jiajin; Dong, Yang; Wang, Wen

    2016-01-01

    The immune systems are fundamentally vital for evolution and survival of species; as such, selection patterns in innate immune loci are of special interest in molecular evolutionary research. The interferon regulatory factor (IRF) gene family control many different aspects of the innate and adaptive immune responses in vertebrates. Among these, IRF3 is known to take active part in very many biological processes. We assembled and evaluated 1356 base pairs of the IRF3 gene coding region in domesticated goats from Africa (Nigeria, Ethiopia and South Africa) and Asia (Iran and China) and the wild goat (Capra aegagrus). Five segregating sites with θ value of 0.0009 for this gene demonstrated a low diversity across the goats' populations. Fu and Li tests were significantly positive but Tajima's D test was significantly negative, suggesting its deviation from neutrality. Neighbor joining tree of IRF3 gene in domesticated goats, wild goat and sheep showed that all domesticated goats have a closer relationship than with the wild goat and sheep. Maximum likelihood tree of the gene showed that different domesticated goats share a common ancestor and suggest single origin. Four unique haplotypes were observed across all the sequences, of which, one was particularly common to African goats (MOCH-K14-0425, Poitou and WAD). In assessing the evolution mode of the gene, we found that the codon model dN/dS ratio for all goats was greater than one. Phylogenetic Analysis by Maximum Likelihood (PAML) gave a ω0 (dN/dS) value of 0.067 with LnL value of -6900.3 for the first Model (M1) while ω2 = 1.667 in model M2 with LnL value of -6900.3 with positive selection inferred in 3 codon sites. Mechanistic empirical combination (MEC) model for evaluating adaptive selection pressure on particular codons also confirmed adaptive selection pressure in three codons (207, 358 and 408) in IRF3 gene. Positive diversifying selection inferred with recent evolutionary changes in domesticated goat IRF3

  8. Cytogenetics, conserved synteny and evolution of chicken fucosyltransferase genes compared to human

    NARCIS (Netherlands)

    Coullin, P.; Crooijmans, R.P.M.A.; Fillon, V.; Mollicone, R.; Groenen, M.A.M.; Adrien-Dehais, C.; Bernheim, A.; Zoorob, R.; Oriol, R.; Candelier, J.J.

    2003-01-01

    Fucosyltransferases appeared early in evolution, since they are present from bacteria to primates and the genes are well conserved. The aim of this work was to study these genes in the bird group, which is particularly attractive for the comprehension of the evolution of the vertebrate genome.

  9. The Vein Patterning 1 (VEP1 gene family laterally spread through an ecological network.

    Directory of Open Access Journals (Sweden)

    Rosa Tarrío

    Full Text Available Lateral gene transfer (LGT is a major evolutionary mechanism in prokaryotes. Knowledge about LGT--particularly, multicellular--eukaryotes has only recently started to accumulate. A widespread assumption sees the gene as the unit of LGT, largely because little is yet known about how LGT chances are affected by structural/functional features at the subgenic level. Here we trace the evolutionary trajectory of VEin Patterning 1, a novel gene family known to be essential for plant development and defense. At the subgenic level VEP1 encodes a dinucleotide-binding Rossmann-fold domain, in common with members of the short-chain dehydrogenase/reductase (SDR protein family. We found: i VEP1 likely originated in an aerobic, mesophilic and chemoorganotrophic α-proteobacterium, and was laterally propagated through nets of ecological interactions, including multiple LGTs between phylogenetically distant green plant/fungi-associated bacteria, and five independent LGTs to eukaryotes. Of these latest five transfers, three are ancient LGTs, implicating an ancestral fungus, the last common ancestor of land plants and an ancestral trebouxiophyte green alga, and two are recent LGTs to modern embryophytes. ii VEP1's rampant LGT behavior was enabled by the robustness and broad utility of the dinucleotide-binding Rossmann-fold, which provided a platform for the evolution of two unprecedented departures from the canonical SDR catalytic triad. iii The fate of VEP1 in eukaryotes has been different in different lineages, being ubiquitous and highly conserved in land plants, whereas fungi underwent multiple losses. And iv VEP1-harboring bacteria include non-phytopathogenic and phytopathogenic symbionts which are non-randomly distributed with respect to the type of harbored VEP1 gene. Our findings suggest that VEP1 may have been instrumental for the evolutionary transition of green plants to land, and point to a LGT-mediated 'Trojan Horse' mechanism for the evolution of

  10. The evolution of heart gene delivery vectors

    Science.gov (United States)

    Wasala, Nalinda B.; Shin, Jin-Hong; Duan, Dongsheng

    2012-01-01

    Gene therapy holds promise for treating numerous heart diseases. A key premise for the success of cardiac gene therapy is the development of powerful gene transfer vehicles that can achieve highly efficient and persistent gene transfer specifically in the heart. Other features of an ideal vector include negligible toxicity, minimal immunogenicity and easy manufacturing. Rapid progress in the fields of molecular biology and virology has offered great opportunities to engineer various genetic materials for heart gene delivery. Several nonviral vectors (e.g. naked plasmids, plasmid lipid/polymer complexes and oligonucleotides) have been tested. Commonly used viral vectors include lentivirus, adenovirus and adeno-associated virus. Among these, adeno-associated virus has shown many attractive features for pre-clinical experimentation in animal models of heart diseases. We review the history and evolution of these vectors for heart gene transfer. PMID:21837689

  11. Parallel Evolution of Genes and Languages in the Caucasus Region

    Science.gov (United States)

    Balanovsky, Oleg; Dibirova, Khadizhat; Dybo, Anna; Mudrak, Oleg; Frolova, Svetlana; Pocheshkhova, Elvira; Haber, Marc; Platt, Daniel; Schurr, Theodore; Haak, Wolfgang; Kuznetsova, Marina; Radzhabov, Magomed; Balaganskaya, Olga; Romanov, Alexey; Zakharova, Tatiana; Soria Hernanz, David F.; Zalloua, Pierre; Koshel, Sergey; Ruhlen, Merritt; Renfrew, Colin; Wells, R. Spencer; Tyler-Smith, Chris; Balanovska, Elena

    2012-01-01

    We analyzed 40 SNP and 19 STR Y-chromosomal markers in a large sample of 1,525 indigenous individuals from 14 populations in the Caucasus and 254 additional individuals representing potential source populations. We also employed a lexicostatistical approach to reconstruct the history of the languages of the North Caucasian family spoken by the Caucasus populations. We found a different major haplogroup to be prevalent in each of four sets of populations that occupy distinct geographic regions and belong to different linguistic branches. The haplogroup frequencies correlated with geography and, even more strongly, with language. Within haplogroups, a number of haplotype clusters were shown to be specific to individual populations and languages. The data suggested a direct origin of Caucasus male lineages from the Near East, followed by high levels of isolation, differentiation and genetic drift in situ. Comparison of genetic and linguistic reconstructions covering the last few millennia showed striking correspondences between the topology and dates of the respective gene and language trees, and with documented historical events. Overall, in the Caucasus region, unmatched levels of gene-language co-evolution occurred within geographically isolated populations, probably due to its mountainous terrain. PMID:21571925

  12. Sequence of the non-phosphorylating glyceraldehyde-3-phosphate dehydrogenase from Nicotiana plumbaginifolia and phylogenetic origin of the gene family.

    Science.gov (United States)

    Habenicht, A; Quesada, A; Cerff, R

    1997-10-01

    A cDNA-library has been constructed from Nicotiana plumbaginifolia seedlings, and the non-phosphorylating glyceraldehyde-3-phosphate dehydrogenase (GapN, EC 1.2.1.9) was isolated by plaque hybridization using the cDNA from pea as a heterologous probe. The cDNA comprises the entire GapN coding region. A putative polyadenylation signal is identified. Phylogenetic analysis based on the deduced amino acid sequences revealed that the GapN gene family represents a separate ancient branch within the aldehyde dehydrogenase superfamily. It can be shown that the GapN gene family and other distinct branches of the superfamily have its phylogenetic origin before the separation of primary life-forms. This further demonstrates that already very early in evolution, a broad diversification of the aldehyde dehydrogenases led to the formation of the superfamily.

  13. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.

    Science.gov (United States)

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-03-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.

  14. Molecular pathways to parallel evolution: I. Gene nexuses and their morphological correlates.

    Science.gov (United States)

    Zuckerkandl, E

    1994-12-01

    Aspects of the regulatory interactions among genes are probably as old as most genes are themselves. Correspondingly, similar predispositions to changes in such interactions must have existed for long evolutionary periods. Features of the structure and the evolution of the system of gene regulation furnish the background necessary for a molecular understanding of parallel evolution. Patently "unrelated" organs, such as the fat body of a fly and the liver of a mammal, can exhibit fractional homology, a fraction expected to become subject to quantitation. This also seems to hold for different organs in the same organism, such as wings and legs of a fly. In informational macromolecules, on the other hand, homology is indeed all or none. In the quite different case of organs, analogy is expected usually to represent attenuated homology. Many instances of putative convergence are likely to turn out to be predominantly parallel evolution, presumably including the case of the vertebrate and cephalopod eyes. Homology in morphological features reflects a similarity in networks of active genes. Similar nexuses of active genes can be established in cells of different embryological origins. Thus, parallel development can be considered a counterpart to parallel evolution. Specific macromolecular interactions leading to the regulation of the c-fos gene are given as an example of a "controller node" defined as a regulatory unit. Quantitative changes in gene control are distinguished from relational changes, and frequent parallelism in quantitative changes is noted in Drosophila enzymes. Evolutionary reversions in quantitative gene expression are also expected. The evolution of relational patterns is attributed to several distinct mechanisms, notably the shuffling of protein domains. The growth of such patterns may in part be brought about by a particular process of compensation for "controller gene diseases," a process that would spontaneously tend to lead to increased regulatory

  15. Genome-Wide Identification, Evolutionary and Expression Analyses of the GALACTINOL SYNTHASE Gene Family in Rapeseed and Tobacco

    Directory of Open Access Journals (Sweden)

    Yonghai Fan

    2017-12-01

    Full Text Available Galactinol synthase (GolS is a key enzyme in raffinose family oligosaccharide (RFO biosynthesis. The finding that GolS accumulates in plants exposed to abiotic stresses indicates RFOs function in environmental adaptation. However, the evolutionary relationships and biological functions of GolS family in rapeseed (Brassica napus and tobacco (Nicotiana tabacum remain unclear. In this study, we identified 20 BnGolS and 9 NtGolS genes. Subcellular localization predictions showed that most of the proteins are localized to the cytoplasm. Phylogenetic analysis identified a lost event of an ancient GolS copy in the Solanaceae and an ancient duplication event leading to evolution of GolS4/7 in the Brassicaceae. The three-dimensional structures of two GolS proteins were conserved, with an important DxD motif for binding to UDP-galactose (uridine diphosphate-galactose and inositol. Expression profile analysis indicated that BnGolS and NtGolS genes were expressed in most tissues and highly expressed in one or two specific tissues. Hormone treatments strongly induced the expression of most BnGolS genes and homologous genes in the same subfamilies exhibited divergent-induced expression. Our study provides a comprehensive evolutionary analysis of GolS genes among the Brassicaceae and Solanaceae as well as an insight into the biological function of GolS genes in hormone response in plants.

  16. The nitrate transporter (NRT gene family in poplar.

    Directory of Open Access Journals (Sweden)

    Hua Bai

    Full Text Available Nitrate is an important nutrient required for plant growth. It also acts as a signal regulating plant development. Nitrate is actively taken up and transported by nitrate transporters (NRT, which form a large family with many members and distinct functions. In contrast to Arabidopsis and rice there is little information about the NRT family in woody plants such as Populus. In this study, a comprehensive analysis of the Populus NRT family was performed. Sixty-eight PtNRT1/PTR, 6 PtNRT2, and 5 PtNRT3 genes were identified in the P. trichocarpa genome. Phylogenetic analysis confirmed that the genes of the NRT family are divided into three clades: NRT1/PTR with four subclades, NRT2, and NRT3. Topological analysis indicated that all members of PtNRT1/PTR and PtNRT2 have 8 to 12 trans-membrane domains, whereas the PtNRT3 proteins have no or up to two trans-membrane domains. Four PtNRT3 members were predicted as secreted proteins. Microarray analyses revealed tissue-specific expression patterns of PtNRT genes with distinct clusters of NRTs for roots, for the elongation zone of the apical stem segment and the developing xylem and a further cluster for leaves, bark and wood. A comparison of different poplar species (P. trichocarpa, P. tremula, P. euphratica, P. fremontii x P. angustifolia, and P. x canescens showed that the tissue-specific patterns of the NRT genes varied to some extent with species. Bioinformatic analysis of putative cis-regulatory elements in the promoter regions of PtNRT family retrieved motifs suggesting the regulation of the NRT genes by N metabolism, by energy and carbon metabolism, and by phytohormones and stress. Multivariate analysis suggested that the combination and abundance of motifs in distinct promoters may lead to tissue-specificity. Our genome wide analysis of the PtNRT genes provides a valuable basis for functional analysis towards understanding the role of nitrate transporters for tree growth.

  17. Molecular evolution and diversification of snake toxin genes, revealed by analysis of intron sequences.

    Science.gov (United States)

    Fujimi, T J; Nakajyo, T; Nishimura, E; Ogura, E; Tsuchiya, T; Tamiya, T

    2003-08-14

    The genes encoding erabutoxin (short chain neurotoxin) isoforms (Ea, Eb, and Ec), LsIII (long chain neurotoxin) and a novel long chain neurotoxin pseudogene were cloned from a Laticauda semifasciata genomic library. Short and long chain neurotoxin genes were also cloned from the genome of Laticauda laticaudata, a closely related species of L. semifasciata, by PCR. A putative matrix attached region (MAR) sequence was found in the intron I of the LsIII gene. Comparative analysis of 11 structurally relevant snake toxin genes (three-finger-structure toxins) revealed the molecular evolution of these toxins. Three-finger-structure toxin genes diverged from a common ancestor through two types of evolutionary pathways (long and short types), early in the course of evolution. At a later stage of evolution in each gene, the accumulation of mutations in the exons, especially exon II, by accelerated evolution may have caused the increased diversification in their functions. It was also revealed that the putative MAR sequence found in the LsIII gene was integrated into the gene after the species-level divergence.

  18. On the role of sparseness in the evolution of modularity in gene regulatory networks.

    Science.gov (United States)

    Espinosa-Soto, Carlos

    2018-05-01

    Modularity is a widespread property in biological systems. It implies that interactions occur mainly within groups of system elements. A modular arrangement facilitates adjustment of one module without perturbing the rest of the system. Therefore, modularity of developmental mechanisms is a major factor for evolvability, the potential to produce beneficial variation from random genetic change. Understanding how modularity evolves in gene regulatory networks, that create the distinct gene activity patterns that characterize different parts of an organism, is key to developmental and evolutionary biology. One hypothesis for the evolution of modules suggests that interactions between some sets of genes become maladaptive when selection favours additional gene activity patterns. The removal of such interactions by selection would result in the formation of modules. A second hypothesis suggests that modularity evolves in response to sparseness, the scarcity of interactions within a system. Here I simulate the evolution of gene regulatory networks and analyse diverse experimentally sustained networks to study the relationship between sparseness and modularity. My results suggest that sparseness alone is neither sufficient nor necessary to explain modularity in gene regulatory networks. However, sparseness amplifies the effects of forms of selection that, like selection for additional gene activity patterns, already produce an increase in modularity. That evolution of new gene activity patterns is frequent across evolution also supports that it is a major factor in the evolution of modularity. That sparseness is widespread across gene regulatory networks indicates that it may have facilitated the evolution of modules in a wide variety of cases.

  19. Gene duplication, modularity and adaptation in the evolution of the aflatoxin gene cluster

    Directory of Open Access Journals (Sweden)

    Jakobek Judy L

    2007-07-01

    Full Text Available Abstract Background The biosynthesis of aflatoxin (AF involves over 20 enzymatic reactions in a complex polyketide pathway that converts acetate and malonate to the intermediates sterigmatocystin (ST and O-methylsterigmatocystin (OMST, the respective penultimate and ultimate precursors of AF. Although these precursors are chemically and structurally very similar, their accumulation differs at the species level for Aspergilli. Notable examples are A. nidulans that synthesizes only ST, A. flavus that makes predominantly AF, and A. parasiticus that generally produces either AF or OMST. Whether these differences are important in the evolutionary/ecological processes of species adaptation and diversification is unknown. Equally unknown are the specific genomic mechanisms responsible for ordering and clustering of genes in the AF pathway of Aspergillus. Results To elucidate the mechanisms that have driven formation of these clusters, we performed systematic searches of aflatoxin cluster homologs across five Aspergillus genomes. We found a high level of gene duplication and identified seven modules consisting of highly correlated gene pairs (aflA/aflB, aflR/aflS, aflX/aflY, aflF/aflE, aflT/aflQ, aflC/aflW, and aflG/aflL. With the exception of A. nomius, contrasts of mean Ka/Ks values across all cluster genes showed significant differences in selective pressure between section Flavi and non-section Flavi species. A. nomius mean Ka/Ks values were more similar to partial clusters in A. fumigatus and A. terreus. Overall, mean Ka/Ks values were significantly higher for section Flavi than for non-section Flavi species. Conclusion Our results implicate several genomic mechanisms in the evolution of ST, OMST and AF cluster genes. Gene modules may arise from duplications of a single gene, whereby the function of the pre-duplication gene is retained in the copy (aflF/aflE or the copies may partition the ancestral function (aflA/aflB. In some gene modules, the

  20. The evolution of Dscam genes across the arthropods.

    Science.gov (United States)

    Armitage, Sophie A O; Freiburg, Rebecca Y; Kurtz, Joachim; Bravo, Ignacio G

    2012-04-13

    One way of creating phenotypic diversity is through alternative splicing of precursor mRNAs. A gene that has evolved a hypervariable form is Down syndrome cell adhesion molecule (Dscam-hv), which in Drosophila melanogaster can produce thousands of isoforms via mutually exclusive alternative splicing. The extracellular region of this protein is encoded by three variable exon clusters, each containing multiple exon variants. The protein is vital for neuronal wiring where the extreme variability at the somatic level is required for axonal guidance, and it plays a role in immunity where the variability has been hypothesised to relate to recognition of different antigens. Dscam-hv has been found across the Pancrustacea. Additionally, three paralogous non-hypervariable Dscam-like genes have also been described for D. melanogaster. Here we took a bioinformatics approach, building profile Hidden Markov Models to search across species for putative orthologs to the Dscam genes and for hypervariable alternatively spliced exons, and inferring the phylogenetic relationships among them. Our aims were to examine whether Dscam orthologs exist outside the Bilateria, whether the origin of Dscam-hv could lie outside the Pancrustacea, when the Dscam-like orthologs arose, how many alternatively spliced exons of each exon cluster were present in the most common recent ancestor, and how these clusters evolved. Our results suggest that the origin of Dscam genes may lie after the split between the Cnidaria and the Bilateria and supports the hypothesis that Dscam-hv originated in the common ancestor of the Pancrustacea. Our phylogeny of Dscam gene family members shows six well-supported clades: five containing Dscam-like genes and one containing all the Dscam-hv genes, a seventh clade contains arachnid putative Dscam genes. Furthermore, the exon clusters appear to have experienced different evolutionary histories. Dscam genes have undergone independent duplication events in the insects and

  1. The evolution of Dscam genes across the arthropods

    Directory of Open Access Journals (Sweden)

    Armitage Sophie AO

    2012-04-01

    Full Text Available Abstract Background One way of creating phenotypic diversity is through alternative splicing of precursor mRNAs. A gene that has evolved a hypervariable form is Down syndrome cell adhesion molecule (Dscam-hv, which in Drosophila melanogaster can produce thousands of isoforms via mutually exclusive alternative splicing. The extracellular region of this protein is encoded by three variable exon clusters, each containing multiple exon variants. The protein is vital for neuronal wiring where the extreme variability at the somatic level is required for axonal guidance, and it plays a role in immunity where the variability has been hypothesised to relate to recognition of different antigens. Dscam-hv has been found across the Pancrustacea. Additionally, three paralogous non-hypervariable Dscam-like genes have also been described for D. melanogaster. Here we took a bioinformatics approach, building profile Hidden Markov Models to search across species for putative orthologs to the Dscam genes and for hypervariable alternatively spliced exons, and inferring the phylogenetic relationships among them. Our aims were to examine whether Dscam orthologs exist outside the Bilateria, whether the origin of Dscam-hv could lie outside the Pancrustacea, when the Dscam-like orthologs arose, how many alternatively spliced exons of each exon cluster were present in the most common recent ancestor, and how these clusters evolved. Results Our results suggest that the origin of Dscam genes may lie after the split between the Cnidaria and the Bilateria and supports the hypothesis that Dscam-hv originated in the common ancestor of the Pancrustacea. Our phylogeny of Dscam gene family members shows six well-supported clades: five containing Dscam-like genes and one containing all the Dscam-hv genes, a seventh clade contains arachnid putative Dscam genes. Furthermore, the exon clusters appear to have experienced different evolutionary histories. Conclusions Dscam genes have

  2. Horizontal transfer, not duplication, drives the expansion of protein families in prokaryotes.

    Directory of Open Access Journals (Sweden)

    Todd J Treangen

    2011-01-01

    Full Text Available Gene duplication followed by neo- or sub-functionalization deeply impacts the evolution of protein families and is regarded as the main source of adaptive functional novelty in eukaryotes. While there is ample evidence of adaptive gene duplication in prokaryotes, it is not clear whether duplication outweighs the contribution of horizontal gene transfer in the expansion of protein families. We analyzed closely related prokaryote strains or species with small genomes (Helicobacter, Neisseria, Streptococcus, Sulfolobus, average-sized genomes (Bacillus, Enterobacteriaceae, and large genomes (Pseudomonas, Bradyrhizobiaceae to untangle the effects of duplication and horizontal transfer. After removing the effects of transposable elements and phages, we show that the vast majority of expansions of protein families are due to transfer, even among large genomes. Transferred genes--xenologs--persist longer in prokaryotic lineages possibly due to a higher/longer adaptive role. On the other hand, duplicated genes--paralogs--are expressed more, and, when persistent, they evolve slower. This suggests that gene transfer and gene duplication have very different roles in shaping the evolution of biological systems: transfer allows the acquisition of new functions and duplication leads to higher gene dosage. Accordingly, we show that paralogs share most protein-protein interactions and genetic regulators, whereas xenologs share very few of them. Prokaryotes invented most of life's biochemical diversity. Therefore, the study of the evolution of biology systems should explicitly account for the predominant role of horizontal gene transfer in the diversification of protein families.

  3. Phylogenetic relationships among Perissodactyla: secretoglobin 1A1 gene duplication and triplication in the Equidae family.

    Science.gov (United States)

    Côté, Olivier; Viel, Laurent; Bienzle, Dorothee

    2013-12-01

    Secretoglobin family 1A member 1 (SCGB 1A1) is a small anti-inflammatory and immunomodulatory protein that is abundantly secreted in airway surface fluids. We recently reported the existence of three distinct SCGB1A1 genes in the domestic horse genome as opposed to the single gene copy consensus present in other mammals. The origin of SCGB1A1 gene triplication and the evolutionary relationship of the three genes amongst Equidae family members are unknown. For this study, SCGB1A1 genomic data were collected from various Equus individuals including E. caballus, E. przewalskii, E. asinus, E. grevyi, and E. quagga. Three SCGB1A1 genes in E. przewalskii, two SCGB1A1 genes in E. asinus, and a single SCGB1A1 gene in E. grevyi and E. quagga were identified. Sequence analysis revealed that the non-synonymous nucleotide substitutions between the different equid genes coded for 17 amino acid changes. Most of these changes localized to the SCGB 1A1 central cavity that binds hydrophobic ligands, suggesting that this area of SCGB 1A1 evolved to accommodate diverse molecular interactions. Three-dimensional modeling of the proteins revealed that the size of the SCGB 1A1 central cavity is larger than that of SCGB 1A1A. Altogether, these findings suggest that evolution of the SCGB1A1 gene may parallel the separation of caballine and non-caballine species amongst Equidae, and may indicate an expansion of function for SCGB1A1 gene products. Copyright © 2013 Elsevier Inc. All rights reserved.

  4. Isolation, structural analysis, and expression characteristics of the maize nuclear factor Y gene families

    International Nuclear Information System (INIS)

    Zhang, Zhongbao; Li, Xianglong; Zhang, Chun; Zou, Huawen; Wu, Zhongyi

    2016-01-01

    NUCLEAR FACTOR-Y (NF-Y) has been shown to play an important role in growth, development, and response to environmental stress. A NF-Y complex, which consists of three subunits, NF-YA, NF-YB, and, NF-YC, binds to CCAAT sequences in a promoter to control the expression of target genes. Although NF-Y proteins have been reported in Arabidopsis and rice, a comprehensive and systematic analysis of ZmNF-Y genes has not yet been performed. To examine the functions of ZmNF-Y genes in this family, we isolated and characterized 50 ZmNF-Y (14 ZmNF-YA, 18 ZmNF-YB, and 18 ZmNF-YC) genes in an analysis of the maize genome. The 50 ZmNF-Y genes were distributed on all 10 maize chromosomes, and 12 paralogs were identified. Multiple alignments showed that maize ZmNF-Y family proteins had conserved regions and relatively variable N-terminal or C-terminal domains. The comparative syntenic map illustrated 40 paralogous NF-Y gene pairs among the 10 maize chromosomes. Microarray data showed that the ZmNF-Y genes had tissue-specific expression patterns in various maize developmental stages and in response to biotic and abiotic stresses. The results suggested that ZmNF-YB2, 4, 8, 10, 13, and 16 and ZmNF-YC6, 8, and 15 were induced, while ZmNF-YA1, 3, 4, 6, 7, 10, 12, and 13, ZmNF-YB15, and ZmNF-YC3 and 9 were suppressed by drought stress. ZmNF-YA3, ZmNF-YA8 and ZmNF-YA12 were upregulated after infection by the three pathogens, while ZmNF-YA1 and ZmNF-YB2 were suppressed. These results indicate that the ZmNF-Ys may have significant roles in the response to abiotic and biotic stresses. - Highlights: • We indicated a total of 50 members of ZmNF-Y gene family in maize genome. • We analyzed gene structure, protein architecture of ZmNF-Y genes. • Evolution pattern and phylogenic relationships were analyzed among 50 ZmNF-Y genes. • Expression pattern of ZmNF-Ys were detected in various maize tissues. • Transcript levels of ZmNF-Ys were measured under various abiotic and biotic stresses.

  5. Horizontal Gene Transfer Contributes to the Evolution of Arthropod Herbivory.

    Science.gov (United States)

    Wybouw, Nicky; Pauchet, Yannick; Heckel, David G; Van Leeuwen, Thomas

    2016-06-27

    Within animals, evolutionary transition toward herbivory is severely limited by the hostile characteristics of plants. Arthropods have nonetheless counteracted many nutritional and defensive barriers imposed by plants and are currently considered as the most successful animal herbivores in terrestrial ecosystems. We gather a body of evidence showing that genomes of various plant feeding insects and mites possess genes whose presence can only be explained by horizontal gene transfer (HGT). HGT is the asexual transmission of genetic information between reproductively isolated species. Although HGT is known to have great adaptive significance in prokaryotes, its impact on eukaryotic evolution remains obscure. Here, we show that laterally transferred genes into arthropods underpin many adaptations to phytophagy, including efficient assimilation and detoxification of plant produced metabolites. Horizontally acquired genes and the traits they encode often functionally diversify within arthropod recipients, enabling the colonization of more host plant species and organs. We demonstrate that HGT can drive metazoan evolution by uncovering its prominent role in the adaptations of arthropods to exploit plants. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  6. Characterization and phylogenetic analysis of α-gliadin gene ...

    Indian Academy of Sciences (India)

    Home; Journals; Journal of Genetics; Volume 93; Issue 3 ... Research Article Volume 93 Issue 3 December 2014 pp 725-731 ... Although the unique properties of wheat -gliadin gene family are well characterized, little is known about the evolution and genomic divergence of -gliadin gene family within the Triticeae.

  7. Isolation, characterization, and expression of Le-msx, a maternally expressed member of the msx gene family from the glossiphoniid leech, Helobdella.

    Science.gov (United States)

    Master, V A; Kourakis, M J; Martindale, M Q

    1996-12-01

    The msx gene family is one of the most highly conserved of the nonclustered homeobox-containing genes. We have isolated an msx homolog (Le-msx) from the glossiphoniid leech, Helobdella robusta, and characterized its pattern of expression by whole mount in situ hybridization. In situ expression and reverse transcription polymerase chain reaction (RT-PCR) data results show that Le-msx is a maternal transcript initially uniformly distributed in the cortex of immature oocytes that becomes asymmetrically localized to the polar regions of the uncleaved zygote. This is the earliest reported expression for the msx gene family and the first maternally expressed homeodomain-containing transcription factor reported in annelids. During embryonic development, Le-msx is expressed in all 10 embryonic stem cells and their segmental founder cell descendants. At midembryonic stages, Le-msx is expressed in the expanding germinal plate. Le-msx is confined to the central nervous system and nephridia at late (stage 9) stages and subsequently disappears from nephridia. In addition, we present a phylogenetic hypothesis for the evolution of the msx gene family, including the identification of a putative C. elegans msx homolog and the realignment of the sponge msx homolog to the NK class of homeodomain genes.

  8. The miR-10 microRNA precursor family

    DEFF Research Database (Denmark)

    Tehler, Disa; Høyland-Kroghsbo, Nina Molin; Lund, Anders H

    2011-01-01

    The miR-10 microRNA precursor family encodes a group of short non-coding RNAs involved in gene regulation. The miR-10 family is highly conserved and has sparked the interest of many research groups because of the genomic localization in the vicinity of, coexpression with and regulation of the Hox...... gene developmental regulators. Here, we review the current knowledge of the evolution, physiological function and involvement in cancer of this family of microRNAs....

  9. Molecular evolution across the Asteraceae: micro- and macroevolutionary processes.

    Science.gov (United States)

    Kane, Nolan C; Barker, Michael S; Zhan, Shing H; Rieseberg, Loren H

    2011-12-01

    The Asteraceae (Compositae) is a large family of over 20,000 wild, weedy, and domesticated species that comprise approximately 10% of all angiosperms, including annual and perennial herbs, shrubs and trees, and species on every continent except Antarctica. As a result, the Asteraceae provide a unique opportunity to understand the evolutionary genomics of lineage radiation and diversification at numerous phylogenetic scales. Using publicly available expressed sequence tags from 22 species representing four of the major Asteraceae lineages, we assessed neutral and nonneutral evolutionary processes across this diverse plant family. We used bioinformatic tools to identify candidate genes under selection in each species. Evolution at silent and coding sites were assessed for different Gene Ontology functional categories to compare rates of evolution over both short and long evolutionary timescales. Our results indicate that patterns of molecular change across the family are surprisingly consistent on a macroevolutionary timescale and much more so more than would be predicted from the analysis of one (or many) examples of microevolution. These analyses also point to particular classes of genes that may be crucial in shaping the radiation of this diverse plant family. Similar analyses of nuclear and chloroplast genes in six other plant families confirm that many of these patterns are common features of the plant kingdom.

  10. Phylogeny of sharks of the family Triakidae (Carcharhiniformes) and its implications for the evolution of carcharhiniform placental viviparity.

    Science.gov (United States)

    López, J Andrés; Ryburn, Julie A; Fedrigo, Olivier; Naylor, Gavin J P

    2006-07-01

    We present a study of inter- and intra-familial relationships of the carcharhiniform shark family Triakidae aimed at testing existing hypotheses of relationships for this group and at improving understanding of the evolution of reproductive traits in elasmobranchs. Our analyses and conclusions are based on evidence from DNA sequences of four protein-coding genes (three from the mitochondrial genome and a single copy nuclear gene) from eight of the nine genera and 20 of the 39 species currently assigned to the Triakidae. The sequence data offer strong support for the following previously proposed triakid clades: Galeorhinini (Hypogaleus+Galeorhinus); a subset of the Iagini (Furgaleus+Hemitriakis but not Iago); and part of the Triakinae (Mustelus, Scylliogaleus and part of Triakis). Interestingly, the molecular data provide considerable evidence of paraphyly of the genera Triakis and Mustelus. Our results suggest that the subgenera Triakis and Cazon of Triakis represent two distinct lineages that are only distantly related and that the genus Mustelus as currently defined does not constitute a monophyletic assemblage unless S. quecketti and some species of Triakis (subgenus Cazon) are included in Mustelus. Within our sample of species of Mustelus (including Cazon and Scylliogaleus), the sequence data support two well-defined clades that can be diagnosed by mode of reproduction (placental vs. aplacental species). The phylogenetic framework presented here is used to infer key events in the evolution and loss of placental viviparity among carcharhiniform sharks.

  11. Identification of a novel Gig2 gene family specific to non-amniote vertebrates.

    Directory of Open Access Journals (Sweden)

    Yi-Bing Zhang

    Full Text Available Gig2 (grass carp reovirus (GCRV-induced gene 2 is first identified as a novel fish interferon (IFN-stimulated gene (ISG. Overexpression of a zebrafish Gig2 gene can protect cultured fish cells from virus infection. In the present study, we identify a novel gene family that is comprised of genes homologous to the previously characterized Gig2. EST/GSS search and in silico cloning identify 190 Gig2 homologous genes in 51 vertebrate species ranged from lampreys to amphibians. Further large-scale search of vertebrate and invertebrate genome databases indicate that Gig2 gene family is specific to non-amniotes including lampreys, sharks/rays, ray-finned fishes and amphibians. Phylogenetic analysis and synteny analysis reveal lineage-specific expansion of Gig2 gene family and also provide valuable evidence for the fish-specific genome duplication (FSGD hypothesis. Although Gig2 family proteins exhibit no significant sequence similarity to any known proteins, a typical Gig2 protein appears to consist of two conserved parts: an N-terminus that bears very low homology to the catalytic domains of poly(ADP-ribose polymerases (PARPs, and a novel C-terminal domain that is unique to this gene family. Expression profiling of zebrafish Gig2 family genes shows that some duplicate pairs have diverged in function via acquisition of novel spatial and/or temporal expression under stresses. The specificity of this gene family to non-amniotes might contribute to a large extent to distinct physiology in non-amniote vertebrates.

  12. Analysis of ribosomal protein gene structures: implications for intron evolution.

    Directory of Open Access Journals (Sweden)

    2006-03-01

    Full Text Available Many spliceosomal introns exist in the eukaryotic nuclear genome. Despite much research, the evolution of spliceosomal introns remains poorly understood. In this paper, we tried to gain insights into intron evolution from a novel perspective by comparing the gene structures of cytoplasmic ribosomal proteins (CRPs and mitochondrial ribosomal proteins (MRPs, which are held to be of archaeal and bacterial origin, respectively. We analyzed 25 homologous pairs of CRP and MRP genes that together had a total of 527 intron positions. We found that all 12 of the intron positions shared by CRP and MRP genes resulted from parallel intron gains and none could be considered to be "conserved," i.e., descendants of the same ancestor. This was supported further by the high frequency of proto-splice sites at these shared positions; proto-splice sites are proposed to be sites for intron insertion. Although we could not definitively disprove that spliceosomal introns were already present in the last universal common ancestor, our results lend more support to the idea that introns were gained late. At least, our results show that MRP genes were intronless at the time of endosymbiosis. The parallel intron gains between CRP and MRP genes accounted for 2.3% of total intron positions, which should provide a reliable estimate for future inferences of intron evolution.

  13. Deciphering the Origin, Evolution, and Physiological Function of the Subtelomeric Aryl-Alcohol Dehydrogenase Gene Family in the Yeast Saccharomyces cerevisiae.

    Science.gov (United States)

    Yang, Dong-Dong; de Billerbeck, Gustavo M; Zhang, Jin-Jing; Rosenzweig, Frank; Francois, Jean-Marie

    2018-01-01

    Homology searches indicate that Saccharomyces cerevisiae strain BY4741 contains seven redundant genes that encode putative aryl-alcohol dehydrogenases (AAD). Yeast AAD genes are located in subtelomeric regions of different chromosomes, and their functional role(s) remain enigmatic. Here, we show that two of these genes, AAD4 and AAD14 , encode functional enzymes that reduce aliphatic and aryl-aldehydes concomitant with the oxidation of cofactor NADPH, and that Aad4p and Aad14p exhibit different substrate preference patterns. Other yeast AAD genes are undergoing pseudogenization. The 5' sequence of AAD15 has been deleted from the genome. Repair of an AAD3 missense mutation at the catalytically essential Tyr 73 residue did not result in a functional enzyme. However, ancestral-state reconstruction by fusing Aad6 with Aad16 and by N-terminal repair of Aad10 restores NADPH-dependent aryl-alcohol dehydrogenase activities. Phylogenetic analysis indicates that AAD genes are narrowly distributed in wood-saprophyte fungi and in yeast that occupy lignocellulosic niches. Because yeast AAD genes exhibit activity on veratraldehyde, cinnamaldehyde, and vanillin, they could serve to detoxify aryl-aldehydes released during lignin degradation. However, none of these compounds induce yeast AAD gene expression, and Aad activities do not relieve aryl-aldehyde growth inhibition. Our data suggest an ancestral role for AAD genes in lignin degradation that is degenerating as a result of yeast's domestication and use in brewing, baking, and other industrial applications. IMPORTANCE Functional characterization of hypothetical genes remains one of the chief tasks of the postgenomic era. Although the first Saccharomyces cerevisiae genome sequence was published over 20 years ago, 22% of its estimated 6,603 open reading frames (ORFs) remain unverified. One outstanding example of this category of genes is the enigmatic seven-member AAD family. Here, we demonstrate that proteins encoded by two

  14. The function and evolution of Wnt genes in arthropods.

    Science.gov (United States)

    Murat, Sophie; Hopfen, Corinna; McGregor, Alistair P

    2010-11-01

    Wnt signalling is required for a wide range of developmental processes, from cleavage to patterning and cell migration. There are 13 subfamilies of Wnt ligand genes and this diverse repertoire appeared very early in metazoan evolution. In this review, we first summarise the known Wnt gene repertoire in various arthropods. Insects appear to have lost several Wnt subfamilies, either generally, such as Wnt3, or in lineage specific patterns, for example, the loss of Wnt7 in Anopheles. In Drosophila and Acyrthosiphon, only seven and six Wnt subfamilies are represented, respectively; however, the finding of nine Wnt genes in Tribolium suggests that arthropods had a larger repertoire ancestrally. We then discuss what is currently known about the expression and developmental function of Wnt ligands in Drosophila and other insects in comparison to other arthropods, such as the spiders Achaearanea and Cupiennius. We conclude that studies of Wnt genes have given us much insight into the developmental roles of some of these ligands. However, given the frequent loss of Wnt genes in insects and the derived development of Drosophila, further studies of these important genes are required in a broader range of arthropods to fully understand their developmental function and evolution. Copyright © 2010 Elsevier Ltd. All rights reserved.

  15. The Symbiodinium kawagutii genome illuminates dinoflagellate gene expression and coral symbiosis

    DEFF Research Database (Denmark)

    Lin, Senjie; Cheng, Shifeng; Song, Bo

    2015-01-01

    Symbiodinium-specific gene families. No whole-genome duplication was observed, but instead we found active (retro) transposition and gene family expansion, especially in processes important for successful symbiosis with corals. We also documented genes potentially governing sexual reproduction and cyst...... the molecular basis and evolution of coral symbiosis....

  16. Hox genes and evolution [version 1; referees: 3 approved

    Directory of Open Access Journals (Sweden)

    Steven M. Hrycaj

    2016-05-01

    Full Text Available Hox proteins are a deeply conserved group of transcription factors originally defined for their critical roles in governing segmental identity along the antero-posterior (AP axis in Drosophila. Over the last 30 years, numerous data generated in evolutionarily diverse taxa have clearly shown that changes in the expression patterns of these genes are closely associated with the regionalization of the AP axis, suggesting that Hox genes have played a critical role in the evolution of novel body plans within Bilateria. Despite this deep functional conservation and the importance of these genes in AP patterning, key questions remain regarding many aspects of Hox biology. In this commentary, we highlight recent reports that have provided novel insight into the origins of the mammalian Hox cluster, the role of Hox genes in the generation of a limbless body plan, and a novel putative mechanism in which Hox genes may encode specificity along the AP axis. Although the data discussed here offer a fresh perspective, it is clear that there is still much to learn about Hox biology and the roles it has played in the evolution of the Bilaterian body plan.

  17. The sociobiology of genes: the gene's eye view as a unifying behavioural-ecological framework for biological evolution.

    Science.gov (United States)

    De Tiège, Alexis; Van de Peer, Yves; Braeckman, Johan; Tanghe, Koen B

    2017-11-22

    Although classical evolutionary theory, i.e., population genetics and the Modern Synthesis, was already implicitly 'gene-centred', the organism was, in practice, still generally regarded as the individual unit of which a population is composed. The gene-centred approach to evolution only reached a logical conclusion with the advent of the gene-selectionist or gene's eye view in the 1960s and 1970s. Whereas classical evolutionary theory can only work with (genotypically represented) fitness differences between individual organisms, gene-selectionism is capable of working with fitness differences among genes within the same organism and genome. Here, we explore the explanatory potential of 'intra-organismic' and 'intra-genomic' gene-selectionism, i.e., of a behavioural-ecological 'gene's eye view' on genetic, genomic and organismal evolution. First, we give a general outline of the framework and how it complements the-to some extent-still 'organism-centred' approach of classical evolutionary theory. Secondly, we give a more in-depth assessment of its explanatory potential for biological evolution, i.e., for Darwin's 'common descent with modification' or, more specifically, for 'historical continuity or homology with modular evolutionary change' as it has been studied by evolutionary developmental biology (evo-devo) during the last few decades. In contrast with classical evolutionary theory, evo-devo focuses on 'within-organism' developmental processes. Given the capacity of gene-selectionism to adopt an intra-organismal gene's eye view, we outline the relevance of the latter model for evo-devo. Overall, we aim for the conceptual integration between the gene's eye view on the one hand, and more organism-centred evolutionary models (both classical evolutionary theory and evo-devo) on the other.

  18. Genome-wide analysis of the sox family in the calcareous sponge Sycon ciliatum: multiple genes with unique expression patterns

    Directory of Open Access Journals (Sweden)

    Fortunato Sofia

    2012-07-01

    Full Text Available Abstract Background Sox genes are HMG-domain containing transcription factors with important roles in developmental processes in animals; many of them appear to have conserved functions among eumetazoans. Demosponges have fewer Sox genes than eumetazoans, but their roles remain unclear. The aim of this study is to gain insight into the early evolutionary history of the Sox gene family by identification and expression analysis of Sox genes in the calcareous sponge Sycon ciliatum. Methods Calcaronean Sox related sequences were retrieved by searching recently generated genomic and transcriptome sequence resources and analyzed using variety of phylogenetic methods and identification of conserved motifs. Expression was studied by whole mount in situ hybridization. Results We have identified seven Sox genes and four Sox-related genes in the complete genome of Sycon ciliatum. Phylogenetic and conserved motif analyses showed that five of Sycon Sox genes represent groups B, C, E, and F present in cnidarians and bilaterians. Two additional genes are classified as Sox genes but cannot be assigned to specific subfamilies, and four genes are more similar to Sox genes than to other HMG-containing genes. Thus, the repertoire of Sox genes is larger in this representative of calcareous sponges than in the demosponge Amphimedon queenslandica. It remains unclear whether this is due to the expansion of the gene family in Sycon or a secondary reduction in the Amphimedon genome. In situ hybridization of Sycon Sox genes revealed a variety of expression patterns during embryogenesis and in specific cell types of adult sponges. Conclusions In this study, we describe a large family of Sox genes in Sycon ciliatum with dynamic expression patterns, indicating that Sox genes are regulators in development and cell type determination in sponges, as observed in higher animals. The revealed differences between demosponge and calcisponge Sox genes repertoire highlight the need to

  19. TreeFam: a curated database of phylogenetic trees of animal gene families

    DEFF Research Database (Denmark)

    Li, Heng; Coghlan, Avril; Ruan, Jue

    2006-01-01

    TreeFam is a database of phylogenetic trees of gene families found in animals. It aims to develop a curated resource that presents the accurate evolutionary history of all animal gene families, as well as reliable ortholog and paralog assignments. Curated families are being added progressively......, based on seed alignments and trees in a similar fashion to Pfam. Release 1.1 of TreeFam contains curated trees for 690 families and automatically generated trees for another 11 646 families. These represent over 128 000 genes from nine fully sequenced animal genomes and over 45 000 other animal proteins...

  20. Large Diversity of Nonstandard Genes and Dynamic Evolution of Chloroplast Genomes in Siphonous Green Algae (Bryopsidales, Chlorophyta).

    Science.gov (United States)

    Cremen, Ma Chiela M; Leliaert, Frederik; Marcelino, Vanessa R; Verbruggen, Heroen

    2018-04-01

    Chloroplast genomes have undergone tremendous alterations through the evolutionary history of the green algae (Chloroplastida). This study focuses on the evolution of chloroplast genomes in the siphonous green algae (order Bryopsidales). We present five new chloroplast genomes, which along with existing sequences, yield a data set representing all but one families of the order. Using comparative phylogenetic methods, we investigated the evolutionary dynamics of genomic features in the order. Our results show extensive variation in chloroplast genome architecture and intron content. Variation in genome size is accounted for by the amount of intergenic space and freestanding open reading frames that do not show significant homology to standard plastid genes. We show the diversity of these nonstandard genes based on their conserved protein domains, which are often associated with mobile functions (reverse transcriptase/intron maturase, integrases, phage- or plasmid-DNA primases, transposases, integrases, ligases). Investigation of the introns showed proliferation of group II introns in the early evolution of the order and their subsequent loss in the core Halimedineae, possibly through RT-mediated intron loss.

  1. Enamelin/ameloblastin gene polymorphisms in autosomal amelogenesis imperfecta among Syrian families.

    Science.gov (United States)

    Dashash, Mayssoon; Bazrafshani, Mohamed Riza; Poulton, Kay; Jaber, Saaed; Naeem, Emad; Blinkhorn, Anthony Stevenson

    2011-02-01

      This study was undertaken to investigate whether a single G deletion within a series of seven G residues (codon 196) at the exon 9-intron 9 boundary of the enamelin gene ENAM and a tri-nucleotide deletion at codon 180 in exon 7 (GGA vs deletion) of ameloblastin gene AMBN could have a role in autosomal amelogenesis imperfecta among affected Syrian families.   A new technique - size-dependent, deletion screening - was developed to detect nucleotide deletion in ENAM and AMBN genes. Twelve Syrian families with autosomal-dominant or -recessive amelogenesis imperfecta were included.   A homozygous/heterozygous mutation in the ENAM gene (152/152, 152/153) was identified in affected members of three families with autosomal-dominant amelogenesis imperfecta and one family with autosomal-recessive amelogenesis imperfecta. A heterozygous mutation (222/225) in the AMBN gene was identified. However, no disease causing mutations was found. The present findings provide useful information for the implication of ENAM gene polymorphism in autosomal-dominant/-recessive amelogenesis imperfecta.   Further investigations are required to identify other genes responsible for the various clinical phenotypes. © 2010 Blackwell Publishing Asia Pty Ltd.

  2. The molecular evolution of cytochrome P450 genes within and between drosophila species.

    Science.gov (United States)

    Good, Robert T; Gramzow, Lydia; Battlay, Paul; Sztal, Tamar; Batterham, Philip; Robin, Charles

    2014-04-20

    We map 114 gene gains and 74 gene losses in the P450 gene family across the phylogeny of 12 Drosophila species by examining the congruence of gene trees and species trees. Although the number of P450 genes varies from 74 to 94 in the species examined, we infer that there were at least 77 P450 genes in the ancestral Drosophila genome. One of the most striking observations in the data set is the elevated loss of P450 genes in the Drosophila sechellia lineage. The gain and loss events are not evenly distributed among the P450 genes-with 30 genes showing no gene gains or losses whereas others show as many as 20 copy number changes among the species examined. The P450 gene clades showing the fewest number of gene gain and loss events tend to be those evolving with the most purifying selection acting on the protein sequences, although there are exceptions, such as the rapid rate of amino acid replacement observed in the single copy phantom (Cyp306a1) gene. Within D. melanogaster, we observe gene copy number polymorphism in ten P450 genes including multiple cases of interparalog chimeras. Nonallelic homologous recombination (NAHR) has been associated with deleterious mutations in humans, but here we provide a second possible example of an NAHR event in insect P450s being adaptive. Specifically, we find that a polymorphic Cyp12a4/Cyp12a5 chimera correlates with resistance to an insecticide. Although we observe such interparalog exchange in our within-species data sets, we have little evidence of it between species, raising the possibility that such events may occur more frequently than appreciated but are masked by subsequent sequence change. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. Genome-Wide Identification and Functional Analysis of the Calcineurin B-like Protein and Calcineurin B-like Protein-Interacting Protein Kinase Gene Families in Turnip (Brassica rapa var. rapa

    Directory of Open Access Journals (Sweden)

    Xin Yin

    2017-07-01

    Full Text Available The calcineurin B-like protein (CBL–CBL-interacting protein kinase (CIPK complex has been identified as a primary component in calcium sensors that perceives various stress signals. Turnip (Brassica rapa var. rapa has been widely cultivated in the Qinghai–Tibet Plateau for a century as a food crop of worldwide economic significance. These CBL–CIPK complexes have been demonstrated to play crucial roles in plant response to various environmental stresses. However, no report is available on the genome-wide characterization of these two gene families in turnip. In the present study, 19 and 51 members of the BrrCBL and BrrCIPK genes, respectively, are first identified in turnip and phylogenetically grouped into three and two distinct clusters, respectively. The expansion of these two gene families is mainly attributable to segmental duplication. Moreover, the differences in expression patterns in quantitative real-time PCR, as well as interaction profiles in the yeast two-hybrid assay, suggest the functional divergence of paralog genes during long-term evolution in turnip. Overexpressing and complement lines in Arabidopsis reveal that BrrCBL9.2 improves, but BrrCBL9.1 does not affect, salt tolerance in Arabidopsis. Thus, the expansion of the BrrCBL and BrrCIPK gene families enables the functional differentiation and evolution of some new gene functions of paralog genes. These paralog genes then play prominent roles in turnip's adaptation to the adverse environment of the Qinghai–Tibet Plateau. Overall, the study results contribute to our understanding of the functions of the CBL–CIPK complex and provide basis for selecting appropriate genes for the in-depth functional studies of BrrCBL–BrrCIPK in turnip.

  4. Rapid evolution of the sequences and gene repertoires of secreted proteins in bacteria.

    Directory of Open Access Journals (Sweden)

    Teresa Nogueira

    Full Text Available Proteins secreted to the extracellular environment or to the periphery of the cell envelope, the secretome, play essential roles in foraging, antagonistic and mutualistic interactions. We hypothesize that arms races, genetic conflicts and varying selective pressures should lead to the rapid change of sequences and gene repertoires of the secretome. The analysis of 42 bacterial pan-genomes shows that secreted, and especially extracellular proteins, are predominantly encoded in the accessory genome, i.e. among genes not ubiquitous within the clade. Genes encoding outer membrane proteins might engage more frequently in intra-chromosomal gene conversion because they are more often in multi-genic families. The gene sequences encoding the secretome evolve faster than the rest of the genome and in particular at non-synonymous positions. Cell wall proteins in Firmicutes evolve particularly fast when compared with outer membrane proteins of Proteobacteria. Virulence factors are over-represented in the secretome, notably in outer membrane proteins, but cell localization explains more of the variance in substitution rates and gene repertoires than sequence homology to known virulence factors. Accordingly, the repertoires and sequences of the genes encoding the secretome change fast in the clades of obligatory and facultative pathogens and also in the clades of mutualists and free-living bacteria. Our study shows that cell localization shapes genome evolution. In agreement with our hypothesis, the repertoires and the sequences of genes encoding secreted proteins evolve fast. The particularly rapid change of extracellular proteins suggests that these public goods are key players in bacterial adaptation.

  5. The 5S rDNA family evolves through concerted and birth-and-death evolution in fish genomes: an example from freshwater stingrays

    Science.gov (United States)

    2011-01-01

    Background Ribosomal 5S genes are well known for the critical role they play in ribosome folding and functionality. These genes are thought to evolve in a concerted fashion, with high rates of homogenization of gene copies. However, the majority of previous analyses regarding the evolutionary process of rDNA repeats were conducted in invertebrates and plants. Studies have also been conducted on vertebrates, but these analyses were usually restricted to the 18S, 5.8S and 28S rRNA genes. The recent identification of divergent 5S rRNA gene paralogs in the genomes of elasmobranches and teleost fishes indicate that the eukaryotic 5S rRNA gene family has a more complex genomic organization than previously thought. The availability of new sequence data from lower vertebrates such as teleosts and elasmobranches enables an enhanced evolutionary characterization of 5S rDNA among vertebrates. Results We identified two variant classes of 5S rDNA sequences in the genomes of Potamotrygonidae stingrays, similar to the genomes of other vertebrates. One class of 5S rRNA genes was shared only by elasmobranches. A broad comparative survey among 100 vertebrate species suggests that the 5S rRNA gene variants in fishes originated from rounds of genome duplication. These variants were then maintained or eliminated by birth-and-death mechanisms, under intense purifying selection. Clustered multiple copies of 5S rDNA variants could have arisen due to unequal crossing over mechanisms. Simultaneously, the distinct genome clusters were independently homogenized, resulting in the maintenance of clusters of highly similar repeats through concerted evolution. Conclusions We believe that 5S rDNA molecular evolution in fish genomes is driven by a mixed mechanism that integrates birth-and-death and concerted evolution. PMID:21627815

  6. Frequent and recent retrotransposition of orthologous genes plays a role in the evolution of sperm glycolytic enzymes

    Directory of Open Access Journals (Sweden)

    de Villena Fernando

    2010-05-01

    Full Text Available Abstract Background The central metabolic pathway of glycolysis converts glucose to pyruvate, with the net production of 2 ATP and 2 NADH per glucose molecule. Each of the ten reactions in this pathway is typically catalyzed by multiple isozymes encoded by a multigene family. Several isozymes in this pathway are expressed only during spermatogenesis, and gene targeting studies indicate that they are essential for sperm function and male fertility in mouse. At least three of the novel glycolytic isozymes are encoded by retrogenes (Pgk2, Aldoart1, and Aldoart2. Their restricted expression profile suggests that retrotransposition may play a significant role in the evolution of sperm glycolytic enzymes. Results We conducted a comprehensive genomic analysis of glycolytic enzymes in the human and mouse genomes and identified several intronless copies for all enzymes in the pathway, except Pfk. Within each gene family, a single orthologous gene was typically retrotransposed frequently and independently in both species. Several retroposed sequences maintained open reading frames (ORFs and/or provided evidence of alternatively spliced exons. We analyzed expression of sequences with ORFs and Gpi1 transcript in mouse spermatogenic cells. Conclusions Our analysis detected frequent, recent, and lineage-specific retrotransposition of orthologous glycolytic enzymes in the human and mouse genomes. Retrotransposition events are associated with LINE/LTR and genomic integration is random. We found evidence for the alternative splicing of parent genes. Many retroposed sequences have maintained ORFs, suggesting a functional role for these genes.

  7. Evolution of trappin genes in mammals

    Directory of Open Access Journals (Sweden)

    Furutani Yutaka

    2010-01-01

    Full Text Available Abstract Background Trappin is a multifunctional host-defense peptide that has antiproteolytic, antiinflammatory, and antimicrobial activities. The numbers and compositions of trappin paralogs vary among mammalian species: human and sheep have a single trappin-2 gene; mouse and rat have no trappin gene; pig and cow have multiple trappin genes; and guinea pig has a trappin gene and two other derivativegenes. Independent duplications of trappin genes in pig and cow were observed recently after the species were separated. To determine whether these trappin gene duplications are restricted only to certain mammalian lineages, we analyzed recently-developed genome databases for the presence of duplicate trappin genes. Results The database analyses revealed that: 1 duplicated trappin multigenes were found recently in the nine-banded armadillo; 2 duplicated two trappin genes had been found in the Afrotherian species (elephant, tenrec, and hyrax since ancient days; 3 a single trappin-2 gene was found in various eutherians species; and 4 no typical trappin gene has been found in chicken, zebra finch, and opossum. Bayesian analysis estimated the date of the duplication of trappin genes in the Afrotheria, guinea pig, armadillo, cow, and pig to be 244, 35, 11, 13, and 3 million-years ago, respectively. The coding regions of trappin multigenes of almadillo, bovine, and pig evolved much faster than the noncoding exons, introns, and the flanking regions, showing that these genes have undergone accelerated evolution, and positive Darwinian selection was observed in pig-specific trappin paralogs. Conclusion These results suggest that trappin is an eutherian-specific molecule and eutherian genomes have the potential to form trappin multigenes.

  8. Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns

    Science.gov (United States)

    2012-01-01

    Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot genomes indicated that

  9. Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns

    Directory of Open Access Journals (Sweden)

    Barvkar Vitthal T

    2012-05-01

    Full Text Available Abstract Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L. is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N. Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST, microarray data and reverse transcription quantitative real time PCR (RT-qPCR. Seventy-three per cent of these genes (100 out of 137 showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot

  10. Positive selection and ancient duplications in the evolution of class B floral homeotic genes of orchids and grasses

    Directory of Open Access Journals (Sweden)

    Koch Marcus A

    2009-04-01

    Full Text Available Abstract Background Positive selection is recognized as the prevalence of nonsynonymous over synonymous substitutions in a gene. Models of the functional evolution of duplicated genes consider neofunctionalization as key to the retention of paralogues. For instance, duplicate transcription factors are specifically retained in plant and animal genomes and both positive selection and transcriptional divergence appear to have played a role in their diversification. However, the relative impact of these two factors has not been systematically evaluated. Class B MADS-box genes, comprising DEF-like and GLO-like genes, encode developmental transcription factors essential for establishment of perianth and male organ identity in the flowers of angiosperms. Here, we contrast the role of positive selection and the known divergence in expression patterns of genes encoding class B-like MADS-box transcription factors from monocots, with emphasis on the family Orchidaceae and the order Poales. Although in the monocots these two groups are highly diverse and have a strongly canalized floral morphology, there is no information on the role of positive selection in the evolution of their distinctive flower morphologies. Published research shows that in Poales, class B-like genes are expressed in stamens and in lodicules, the perianth organs whose identity might also be specified by class B-like genes, like the identity of the inner tepals of their lily-like relatives. In orchids, however, the number and pattern of expression of class B-like genes have greatly diverged. Results The DEF-like genes from Orchidaceae form four well-supported, ancient clades of orthologues. In contrast, orchid GLO-like genes form a single clade of ancient orthologues and recent paralogues. DEF-like genes from orchid clade 2 (OMADS3-like genes are under less stringent purifying selection than the other orchid DEF-like and GLO-like genes. In comparison with orchids, purifying selection

  11. Teleost Fish-Specific Preferential Retention of Pigmentation Gene-Containing Families After Whole Genome Duplications in Vertebrates

    Science.gov (United States)

    Lorin, Thibault; Brunet, Frédéric G.; Laudet, Vincent; Volff, Jean-Nicolas

    2018-01-01

    Vertebrate pigmentation is a highly diverse trait mainly determined by neural crest cell derivatives. It has been suggested that two rounds (1R/2R) of whole-genome duplications (WGDs) at the basis of vertebrates allowed changes in gene regulation associated with neural crest evolution. Subsequently, the teleost fish lineage experienced other WGDs, including the teleost-specific Ts3R before teleost radiation and the more recent Ss4R at the basis of salmonids. As the teleost lineage harbors the highest number of pigment cell types and pigmentation diversity in vertebrates, WGDs might have contributed to the evolution and diversification of the pigmentation gene repertoire in teleosts. We have compared the impact of the basal vertebrate 1R/2R duplications with that of the teleost-specific Ts3R and salmonid-specific Ss4R WGDs on 181 gene families containing genes involved in pigmentation. We show that pigmentation genes (PGs) have been globally more frequently retained as duplicates than other genes after Ts3R and Ss4R but not after the early 1R/2R. This is also true for non-pigmentary paralogs of PGs, suggesting that the function in pigmentation is not the sole key driver of gene retention after WGDs. On the long-term, specific categories of PGs have been repeatedly preferentially retained after ancient 1R/2R and Ts3R WGDs, possibly linked to the molecular nature of their proteins (e.g., DNA binding transcriptional regulators) and their central position in protein-protein interaction networks. Taken together, our results support a major role of WGDs in the diversification of the pigmentation gene repertoire in the teleost lineage, with a possible link with the diversity of pigment cell lineages observed in these animals compared to other vertebrates. PMID:29599177

  12. Modular evolution of glutathione peroxidase genes in association with different biochemical properties of their encoded proteins in invertebrate animals

    Directory of Open Access Journals (Sweden)

    Zo Young-Gun

    2009-04-01

    Full Text Available Abstract Background Phospholipid hydroperoxide glutathione peroxidases (PHGPx, the most abundant isoforms of GPx families, interfere directly with hydroperoxidation of lipids. Biochemical properties of these proteins vary along with their donor organisms, which has complicated the phylogenetic classification of diverse PHGPx-like proteins. Despite efforts for comprehensive analyses, the evolutionary aspects of GPx genes in invertebrates remain largely unknown. Results We isolated GPx homologs via in silico screening of genomic and/or expressed sequence tag databases of eukaryotic organisms including protostomian species. Genes showing strong similarity to the mammalian PHGPx genes were commonly found in all genomes examined. GPx3- and GPx7-like genes were additionally detected from nematodes and platyhelminths, respectively. The overall distribution of the PHGPx-like proteins with different biochemical properties was biased across taxa; selenium- and glutathione (GSH-dependent proteins were exclusively detected in platyhelminth and deuterostomian species, whereas selenium-independent and thioredoxin (Trx-dependent enzymes were isolated in the other taxa. In comparison of genomic organization, the GSH-dependent PHGPx genes showed a conserved architectural pattern, while their Trx-dependent counterparts displayed complex exon-intron structures. A codon for the resolving Cys engaged in reductant binding was found to be substituted in a series of genes. Selection pressure to maintain the selenocysteine codon in GSH-dependent genes also appeared to be relaxed during their evolution. With the dichotomized fashion in genomic organizations, a highly polytomic topology of their phylogenetic trees implied that the GPx genes have multiple evolutionary intermediate forms. Conclusion Comparative analysis of invertebrate GPx genes provides informative evidence to support the modular pathways of GPx evolution, which have been accompanied with sporadic

  13. The molecular evolution of the p120-catenin subfamily and its functional associations.

    Directory of Open Access Journals (Sweden)

    Robert H Carnahan

    2010-12-01

    Full Text Available p120-catenin (p120 is the prototypical member of a subclass of armadillo-related proteins that includes δ-catenin/NPRAP, ARVCF, p0071, and the more distantly related plakophilins 1-3. In vertebrates, p120 is essential in regulating surface expression and stability of all classical cadherins, and directly interacts with Kaiso, a BTB/ZF family transcription factor.To clarify functional relationships between these proteins and how they relate to the classical cadherins, we have examined the proteomes of 14 diverse vertebrate and metazoan species. The data reveal a single ancient δ-catenin-like p120 family member present in the earliest metazoans and conserved throughout metazoan evolution. This single p120 family protein is present in all protostomes, and in certain early-branching chordate lineages. Phylogenetic analyses suggest that gene duplication and functional diversification into "p120-like" and "δ-catenin-like" proteins occurred in the urochordate-vertebrate ancestor. Additional gene duplications during early vertebrate evolution gave rise to the seven vertebrate p120 family members. Kaiso family members (i.e., Kaiso, ZBTB38 and ZBTB4 are found only in vertebrates, their origin following that of the p120-like gene lineage and coinciding with the evolution of vertebrate-specific mechanisms of epigenetic gene regulation by CpG island methylation.The p120 protein family evolved from a common δ-catenin-like ancestor present in all metazoans. Through several rounds of gene duplication and diversification, however, p120 evolved in vertebrates into an essential, ubiquitously expressed protein, whereas loss of the more selectively expressed δ-catenin, p0071 and ARVCF are tolerated in most species. Together with phylogenetic studies of the vertebrate cadherins, our data suggest that the p120-like and δ-catenin-like genes co-evolved separately with non-neural (E- and P-cadherin and neural (N- and R-cadherin cadherin lineages, respectively. The

  14. Characterization of the avian Trojan gene family reveals contrasting evolutionary constraints.

    Science.gov (United States)

    Petrov, Petar; Syrjänen, Riikka; Smith, Jacqueline; Gutowska, Maria Weronika; Uchida, Tatsuya; Vainio, Olli; Burt, David W

    2015-01-01

    "Trojan" is a leukocyte-specific, cell surface protein originally identified in the chicken. Its molecular function has been hypothesized to be related to anti-apoptosis and the proliferation of immune cells. The Trojan gene has been localized onto the Z sex chromosome. The adjacent two genes also show significant homology to Trojan, suggesting the existence of a novel gene/protein family. Here, we characterize this Trojan family, identify homologues in other species and predict evolutionary constraints on these genes. The two Trojan-related proteins in chicken were predicted as a receptor-type tyrosine phosphatase and a transmembrane protein, bearing a cytoplasmic immuno-receptor tyrosine-based activation motif. We identified the Trojan gene family in ten other bird species and found related genes in three reptiles and a fish species. The phylogenetic analysis of the homologues revealed a gradual diversification among the family members. Evolutionary analyzes of the avian genes predicted that the extracellular regions of the proteins have been subjected to positive selection. Such selection was possibly a response to evolving interacting partners or to pathogen challenges. We also observed an almost complete lack of intracellular positively selected sites, suggesting a conserved signaling mechanism of the molecules. Therefore, the contrasting patterns of selection likely correlate with the interaction and signaling potential of the molecules.

  15. Herbicide resistance-endowing ACCase gene mutations in hexaploid wild oat (Avena fatua): insights into resistance evolution in a hexaploid species

    Science.gov (United States)

    Yu, Q; Ahmad-Hamdani, M S; Han, H; Christoffers, M J; Powles, S B

    2013-01-01

    Many herbicide-resistant weed species are polyploids, but far too little about the evolution of resistance mutations in polyploids is understood. Hexaploid wild oat (Avena fatua) is a global crop weed and many populations have evolved herbicide resistance. We studied plastidic acetyl-coenzyme A carboxylase (ACCase)-inhibiting herbicide resistance in hexaploid wild oat and revealed that resistant individuals can express one, two or three different plastidic ACCase gene resistance mutations (Ile-1781-Leu, Asp-2078-Gly and Cys-2088-Arg). Using ACCase resistance mutations as molecular markers, combined with genetic, molecular and biochemical approaches, we found in individual resistant wild-oat plants that (1) up to three unlinked ACCase gene loci assort independently following Mendelian laws for disomic inheritance, (2) all three of these homoeologous ACCase genes were transcribed, with each able to carry its own mutation and (3) in a hexaploid background, each individual ACCase resistance mutation confers relatively low-level herbicide resistance, in contrast to high-level resistance conferred by the same mutations in unrelated diploid weed species of the Poaceae (grass) family. Low resistance conferred by individual ACCase resistance mutations is likely due to a dilution effect by susceptible ACCase expressed by homoeologs in hexaploid wild oat and/or differential expression of homoeologous ACCase gene copies. Thus, polyploidy in hexaploid wild oat may slow resistance evolution. Evidence of coexisting non-target-site resistance mechanisms among wild-oat populations was also revealed. In all, these results demonstrate that herbicide resistance and its evolution can be more complex in hexaploid wild oat than in unrelated diploid grass weeds. Our data provide a starting point for the daunting task of understanding resistance evolution in polyploids. PMID:23047200

  16. Taxonomically restricted genes are associated with the evolution of sociality in the honey bee

    Directory of Open Access Journals (Sweden)

    Tsutsui Neil D

    2011-03-01

    Full Text Available Abstract Background Studies have shown that taxonomically restricted genes are significant in number and important for the evolution of lineage specific traits. Social insects have gained many novel morphological and behavioral traits relative to their solitary ancestors. The task repertoire of an advanced social insect, for example, can be 40-50 tasks, about twice that of a solitary wasp or bee. The genetic basis of this expansion in behavioral repertoire is still poorly understood, and a role for taxonomically restricted genes has not been explored at the whole genome level. Results Here we present comparative genomics results suggesting that taxonomically restricted genes may have played an important role in generating the expansion of behavioral repertoire associated with the evolution of eusociality. First, we show that the current honey bee official gene set contains about 700 taxonomically restricted genes. These are split between orphans, genes found only in the Hymenoptera, and genes found only in insects. Few of the orphans or genes restricted to the Hymenoptera have been the focus of experimental work, but several of those that have are associated with novel eusocial traits or traits thought to have changed radically as a consequence of eusociality. Second, we predicted that if taxonomically restricted genes are important for generating novel eusocial traits, then they should be expressed with greater frequency in workers relative to the queen, as the workers exhibit most of the novel behavior of the honey bee relative to their solitary ancestors. We found support for this prediction. Twice as many taxonomically restricted genes were found amongst the genes with higher expression in workers compared to those with higher expression in queens. Finally, we compiled an extensive list of candidate taxonomically restricted genes involved in eusocial evolution by analyzing several caste specific gene expression data sets. Conclusions This

  17. Evolution of the genus Aristolochia - Systematics, Molecular Evolution and Ecology

    OpenAIRE

    Wanke, Stefan

    2007-01-01

    Evolution of Piperales – matK gene and trnK intron sequence data reveal lineage specific resolution contrast. Piperales are one of the largest basal angiosperm orders with a nearly worldwide distribution. The order includes three species rich genera, Piper (ca. 1,000 species), Peperomia (ca. 1,500-1,700 species), and Aristolochia s. l. (ca. 500 species). Sequences of the matK gene and the non-coding trnK group II intron are analysed for a dense set of 105 taxa representing all families (excep...

  18. Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

    DEFF Research Database (Denmark)

    Richards, Stephen; Liu, Yue; Bettencourt, Brian R.

    2005-01-01

    years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences......We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each...... between the species-but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence...

  19. Characterization of the Carbohydrate Binding Module 18 gene family in the amphibian pathogen Batrachochytrium dendrobatidis.

    Science.gov (United States)

    Liu, Peng; Stajich, Jason E

    2015-04-01

    Batrachochytrium dendrobatidis (Bd) is the causative agent of chytridiomycosis responsible for worldwide decline in amphibian populations. Previous analysis of the Bd genome revealed a unique expansion of the carbohydrate-binding module family 18 (CBM18) predicted to be a sub-class of chitin recognition domains. CBM expansions have been linked to the evolution of pathogenicity in a variety of fungal species by protecting the fungus from the host. Based on phylogenetic analysis and presence of additional protein domains, the gene family can be classified into 3 classes: Tyrosinase-, Deacetylase-, and Lectin-like. Examination of the mRNA expression levels from sporangia and zoospores of nine of the cbm18 genes found that the Lectin-like genes had the highest expression while the Tyrosinase-like genes showed little expression, especially in zoospores. Heterologous expression of GFP-tagged copies of four CBM18 genes in Saccharomyces cerevisiae demonstrated that two copies containing secretion signal peptides are trafficked to the cell boundary. The Lectin-like genes cbm18-ll1 and cbm18-ll2 co-localized with the chitinous cell boundaries visualized by staining with calcofluor white. In vitro assays of the full length and single domain copies from CBM18-LL1 demonstrated chitin binding and no binding to cellulose or xylan. Expressed CBM18 domain proteins were demonstrated to protect the fungus, Trichoderma reeseii, in vitro against hydrolysis from exogenously added chitinase, likely by binding and limiting exposure of fungal chitin. These results demonstrate that cbm18 genes can play a role in fungal defense and expansion of their copy number may be an important pathogenicity factor of this emerging infectious disease of amphibians. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Are there laws of genome evolution?

    Directory of Open Access Journals (Sweden)

    Eugene V Koonin

    2011-08-01

    Full Text Available Research in quantitative evolutionary genomics and systems biology led to the discovery of several universal regularities connecting genomic and molecular phenomic variables. These universals include the log-normal distribution of the evolutionary rates of orthologous genes; the power law-like distributions of paralogous family size and node degree in various biological networks; the negative correlation between a gene's sequence evolution rate and expression level; and differential scaling of functional classes of genes with genome size. The universals of genome evolution can be accounted for by simple mathematical models similar to those used in statistical physics, such as the birth-death-innovation model. These models do not explicitly incorporate selection; therefore, the observed universal regularities do not appear to be shaped by selection but rather are emergent properties of gene ensembles. Although a complete physical theory of evolutionary biology is inconceivable, the universals of genome evolution might qualify as "laws of evolutionary genomics" in the same sense "law" is understood in modern physics.

  1. Quantum selfish gene (biological evolution in terms of quantum mechanics)

    OpenAIRE

    Ozhigov, Yuri I.

    2013-01-01

    I propose to treat the biological evolution of genoms by means of quantum mechanical tools. We start with the concept of meta- gene, which specifies the "selfish gene" of R.Dawkins. Meta- gene encodes the abstract living unity, which can live relatively independently of the others, and can contain a few real creatures. Each population of living creatures we treat as the wave function on meta- genes, which module squared is the total number of creatures with the given meta-gene, and the phase ...

  2. Prevalence of variations in melanoma susceptibility genes among Slovenian melanoma families

    Directory of Open Access Journals (Sweden)

    Besic Nikola

    2008-09-01

    Full Text Available Abstract Background Two high-risk genes have been implicated in the development of CM (cutaneous melanoma. Germline mutations of the CDKN2A gene are found in CDK4 gene reported to date. Beside those high penetrance genes, certain allelic variants of the MC1R gene modify the risk of developing the disease. The aims of our study were: to determine the prevalence of germline CDKN2A mutations and variants in members of families with familial CM and in patients with multiple primary CM; to search for possible CDK4 mutations, and to determine the frequency of variations in the MC1R gene. Methods From January 2001 until January 2007, 64 individuals were included in the study. The group included 28 patients and 7 healthy relatives belonging to 25 families, 26 patients with multiple primary tumors and 3 children with CM. Additionally 54 healthy individuals were included as a control group. Mutations and variants of the melanoma susceptibility genes were identified by direct sequencing. Results Seven families with CDKN2A mutations were discovered (7/25 or 28.0%. The L94Q mutation found in one family had not been previously reported in other populations. The D84N variant, with possible biological impact, was discovered in the case of patient without family history but with multiple primary CM. Only one mutation carrier was found in the control group. Further analysis revealed that c.540C>T heterozygous carriers were more common in the group of CM patients and their healthy relatives (11/64 vs. 2/54. One p14ARF variant was discovered in the control group and no mutations of the CDK4 gene were found. Most frequently found variants of the MC1R gene were T314T, V60L, V92M, R151C, R160W and R163Q with frequencies slightly higher in the group of patients and their relatives than in the group of controls, but the difference was statistically insignificant. Conclusion The present study has shown high prevalence of p16INK4A mutations in Slovenian population of

  3. Analysis of factor VIII gene inversions in 164 unrelated hemophilia A families

    Energy Technology Data Exchange (ETDEWEB)

    Vnencak-Jones, L.; Phillips, J.A. III; Janco, R.L. [Vanderbilt Univ. School of Medicine, Nashville, TN (United States)] [and others

    1994-09-01

    Hemophilia A is an X-linked recessive disease with variable phenotype and both heterogeneous and wide spread mutations in the factor VIII (F8) gene. As a result, diagnostic carrier or prenatal testing often relies upon laborious DNA linkage analysis. Recently, inversion mutations resulting from an intrachromosomal recombination between DNA sequences in one of two A genes {approximately}500 kb upstream from the F8 gene and a homologous A gene in intron 22 of the F8 gene were identified and found in 45% of severe hemophiliacs. We have analyzed banked DNA collected since 1986 from affected males or obligate carrier females representing 164 unrelated hemophilia A families. The disease was sporadic in 37%, familial in 54% and in 10% of families incomplete information was given. A unique deletion was identified in 1/164, a normal pattern was observed in 110/164 (67%), and 53/164 (32%) families had inversion mutations with 43/53 (81%) involving the distal A gene (R3 pattern) and 10/53 (19%) involving the proximal A gene (R2 pattern). While 19% of all rearrangements were R2, in 35 families with severe disease (< 1% VIII:C activity) all 16 rearrangements seen were R3. In 18 families with the R3 pattern and known activities, 16 (89%) had levels < 1%, with the remaining 2 families having {le} 2.4% activity. Further, 18 referrals specifically noted the production of inhibitors and 8/18 (45%) had the R3 pattern. Our findings demonstrate that the R3 inversion mutation patterns is (1) only seen with VIII:C activity levels of {le} 2.4%, (2) seen in 46% of families with severe hemophilia, (3) seen in 45% of hemophiliacs known to have inhibitors, (4) not correlated with sporadic or familial disease and (5) not in disequilibrium with the Bcl I or Taq I intron 18 or ST14 polymorphisms. Finally, in families positive for an inversion mutation, direct testing offers a highly accurate and less expensive alternative to DNA linkage analysis.

  4. The human protein disulfide isomerase gene family

    Directory of Open Access Journals (Sweden)

    Galligan James J

    2012-07-01

    Full Text Available Abstract Enzyme-mediated disulfide bond formation is a highly conserved process affecting over one-third of all eukaryotic proteins. The enzymes primarily responsible for facilitating thiol-disulfide exchange are members of an expanding family of proteins known as protein disulfide isomerases (PDIs. These proteins are part of a larger superfamily of proteins known as the thioredoxin protein family (TRX. As members of the PDI family of proteins, all proteins contain a TRX-like structural domain and are predominantly expressed in the endoplasmic reticulum. Subcellular localization and the presence of a TRX domain, however, comprise the short list of distinguishing features required for gene family classification. To date, the PDI gene family contains 21 members, varying in domain composition, molecular weight, tissue expression, and cellular processing. Given their vital role in protein-folding, loss of PDI activity has been associated with the pathogenesis of numerous disease states, most commonly related to the unfolded protein response (UPR. Over the past decade, UPR has become a very attractive therapeutic target for multiple pathologies including Alzheimer disease, Parkinson disease, alcoholic and non-alcoholic liver disease, and type-2 diabetes. Understanding the mechanisms of protein-folding, specifically thiol-disulfide exchange, may lead to development of a novel class of therapeutics that would help alleviate a wide range of diseases by targeting the UPR.

  5. Nearly Complete 28S rRNA Gene Sequences Confirm New Hypotheses of Sponge Evolution

    Science.gov (United States)

    Thacker, Robert W.; Hill, April L.; Hill, Malcolm S.; Redmond, Niamh E.; Collins, Allen G.; Morrow, Christine C.; Spicer, Lori; Carmack, Cheryl A.; Zappe, Megan E.; Pohlmann, Deborah; Hall, Chelsea; Diaz, Maria C.; Bangalore, Purushotham V.

    2013-01-01

    The highly collaborative research sponsored by the NSF-funded Assembling the Porifera Tree of Life (PorToL) project is providing insights into some of the most difficult questions in metazoan systematics. Our understanding of phylogenetic relationships within the phylum Porifera has changed considerably with increased taxon sampling and data from additional molecular markers. PorToL researchers have falsified earlier phylogenetic hypotheses, discovered novel phylogenetic alliances, found phylogenetic homes for enigmatic taxa, and provided a more precise understanding of the evolution of skeletal features, secondary metabolites, body organization, and symbioses. Some of these exciting new discoveries are shared in the papers that form this issue of Integrative and Comparative Biology. Our analyses of over 300 nearly complete 28S ribosomal subunit gene sequences provide specific case studies that illustrate how our dataset confirms new hypotheses of sponge evolution. We recovered monophyletic clades for all 4 classes of sponges, as well as the 4 major clades of Demospongiae (Keratosa, Myxospongiae, Haploscleromorpha, and Heteroscleromorpha), but our phylogeny differs in several aspects from traditional classifications. In most major clades of sponges, families within orders appear to be paraphyletic. Although additional sampling of genes and taxa are needed to establish whether this pattern results from a lack of phylogenetic resolution or from a paraphyletic classification system, many of our results are congruent with those obtained from 18S ribosomal subunit gene sequences and complete mitochondrial genomes. These data provide further support for a revision of the traditional classification of sponges. PMID:23748742

  6. Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility.

    Science.gov (United States)

    Nuttle, Xander; Giannuzzi, Giuliana; Duyzend, Michael H; Schraiber, Joshua G; Narvaiza, Iñigo; Sudmant, Peter H; Penn, Osnat; Chiatante, Giorgia; Malig, Maika; Huddleston, John; Benner, Chris; Camponeschi, Francesca; Ciofi-Baffoni, Simone; Stessman, Holly A F; Marchetto, Maria C N; Denman, Laura; Harshman, Lana; Baker, Carl; Raja, Archana; Penewit, Kelsi; Janke, Nicolette; Tang, W Joyce; Ventura, Mario; Banci, Lucia; Antonacci, Francesca; Akey, Joshua M; Amemiya, Chris T; Gage, Fred H; Reymond, Alexandre; Eichler, Evan E

    2016-08-11

    Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates, including more recently the genomes of archaic hominins. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for approximately 1% of cases of autism and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. Here we reconstruct the evolutionary history of the locus and identify bolA family member 2 (BOLA2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95-kilobase-pair segment containing BOLA2 duplicated across the critical region approximately 282 thousand years ago (ka), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carried one or more copies of the duplication, which nearly fixed early in the human lineage--a pattern unlikely to have arisen so rapidly in the absence of selection (P sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the H. sapiens lineage about 282 ka simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.

  7. Dichotomy in the NRT gene families of dicots and grass species.

    Directory of Open Access Journals (Sweden)

    Darren Plett

    Full Text Available A large proportion of the nitrate (NO(3(- acquired by plants from soil is actively transported via members of the NRT families of NO(3(- transporters. In Arabidopsis, the NRT1 family has eight functionally characterised members and predominantly comprises low-affinity transporters; the NRT2 family contains seven members which appear to be high-affinity transporters; and there are two NRT3 (NAR2 family members which are known to participate in high-affinity transport. A modified reciprocal best hit (RBH approach was used to identify putative orthologues of the Arabidopsis NRT genes in the four fully sequenced grass genomes (maize, rice, sorghum, Brachypodium. We also included the poplar genome in our analysis to establish whether differences between Arabidopsis and the grasses may be generally applicable to monocots and dicots. Our analysis reveals fundamental differences between Arabidopsis and the grass species in the gene number and family structure of all three families of NRT transporters. All grass species possessed additional NRT1.1 orthologues and appear to lack NRT1.6/NRT1.7 orthologues. There is significant separation in the NRT2 phylogenetic tree between NRT2 genes from dicots and grass species. This indicates that determination of function of NRT2 genes in grass species will not be possible in cereals based simply on sequence homology to functionally characterised Arabidopsis NRT2 genes and that proper functional analysis will be required. Arabidopsis has a unique NRT3.2 gene which may be a fusion of the NRT3.1 and NRT3.2 genes present in all other species examined here. This work provides a framework for future analysis of NO(3(- transporters and NO(3(- transport in grass crop species.

  8. Evolution of GHF5 endoglucanase gene structure in plant-parasitic nematodes: no evidence for an early domain shuffling event

    Directory of Open Access Journals (Sweden)

    Gheysen Godelieve

    2008-11-01

    Full Text Available Abstract Background Endo-1,4-beta-glucanases or cellulases from the glycosyl hydrolase family 5 (GHF5 have been found in numerous bacteria and fungi, and recently also in higher eukaryotes, particularly in plant-parasitic nematodes (PPN. The origin of these genes has been attributed to horizontal gene transfer from bacteria, although there still is a lot of uncertainty about the origin and structure of the ancestral GHF5 PPN endoglucanase. It is not clear whether this ancestral endoglucanase consisted of the whole gene cassette, containing a catalytic domain and a carbohydrate-binding module (CBM, type 2 in PPN and bacteria or only of the catalytic domain while the CBM2 was retrieved by domain shuffling later in evolution. Previous studies on the evolution of these genes have focused primarily on data of sedentary nematodes, while in this study, extra data from migratory nematodes were included. Results Two new endoglucanases from the migratory nematodes Pratylenchus coffeae and Ditylenchus africanus were included in this study. The latter one is the first gene isolated from a PPN of a different superfamily (Sphaerularioidea; all previously known nematode endoglucanases belong to the superfamily Tylenchoidea (order Rhabditida. Phylogenetic analyses were conducted with the PPN GHF5 endoglucanases and homologous endoglucanases from bacterial and other eukaryotic lineages such as beetles, fungi and plants. No statistical incongruence between the phylogenetic trees deduced from the catalytic domain and the CBM2 was found, which could suggest that both domains have evolved together. Furthermore, based on gene structure data, we inferred a model for the evolution of the GHF5 endoglucanase gene structure in plant-parasitic nematodes. Our data confirm a close relationship between Pratylenchus spp. and the root knot nematodes, while some Radopholus similis endoglucanases are more similar to cyst nematode genes. Conclusion We conclude that the ancestral

  9. Evolution of GHF5 endoglucanase gene structure in plant-parasitic nematodes: no evidence for an early domain shuffling event.

    Science.gov (United States)

    Kyndt, Tina; Haegeman, Annelies; Gheysen, Godelieve

    2008-11-03

    Endo-1,4-beta-glucanases or cellulases from the glycosyl hydrolase family 5 (GHF5) have been found in numerous bacteria and fungi, and recently also in higher eukaryotes, particularly in plant-parasitic nematodes (PPN). The origin of these genes has been attributed to horizontal gene transfer from bacteria, although there still is a lot of uncertainty about the origin and structure of the ancestral GHF5 PPN endoglucanase. It is not clear whether this ancestral endoglucanase consisted of the whole gene cassette, containing a catalytic domain and a carbohydrate-binding module (CBM, type 2 in PPN and bacteria) or only of the catalytic domain while the CBM2 was retrieved by domain shuffling later in evolution. Previous studies on the evolution of these genes have focused primarily on data of sedentary nematodes, while in this study, extra data from migratory nematodes were included. Two new endoglucanases from the migratory nematodes Pratylenchus coffeae and Ditylenchus africanus were included in this study. The latter one is the first gene isolated from a PPN of a different superfamily (Sphaerularioidea); all previously known nematode endoglucanases belong to the superfamily Tylenchoidea (order Rhabditida). Phylogenetic analyses were conducted with the PPN GHF5 endoglucanases and homologous endoglucanases from bacterial and other eukaryotic lineages such as beetles, fungi and plants. No statistical incongruence between the phylogenetic trees deduced from the catalytic domain and the CBM2 was found, which could suggest that both domains have evolved together. Furthermore, based on gene structure data, we inferred a model for the evolution of the GHF5 endoglucanase gene structure in plant-parasitic nematodes. Our data confirm a close relationship between Pratylenchus spp. and the root knot nematodes, while some Radopholus similis endoglucanases are more similar to cyst nematode genes. We conclude that the ancestral PPN GHF5 endoglucanase gene most probably consisted of

  10. Fast rate of evolution in alternatively spliced coding regions of mammalian genes

    Directory of Open Access Journals (Sweden)

    Nurtdinov Ramil N

    2006-04-01

    Full Text Available Abstract Background At least half of mammalian genes are alternatively spliced. Alternative isoforms are often genome-specific and it has been suggested that alternative splicing is one of the major mechanisms for generating protein diversity in the course of evolution. Another way of looking at alternative splicing is to consider sequence evolution of constitutive and alternative regions of protein-coding genes. Indeed, it turns out that constitutive and alternative regions evolve in different ways. Results A set of 3029 orthologous pairs of human and mouse alternatively spliced genes was considered. The rate of nonsynonymous substitutions (dN, the rate of synonymous substitutions (dS, and their ratio (ω = dN/dS appear to be significantly higher in alternatively spliced coding regions compared to constitutive regions. When N-terminal, internal and C-terminal alternatives are analysed separately, C-terminal alternatives appear to make the main contribution to the observed difference. The effects become even more pronounced in a subset of fast evolving genes. Conclusion These results provide evidence of weaker purifying selection and/or stronger positive selection in alternative regions and thus one more confirmation of accelerated evolution in alternative regions. This study corroborates the theory that alternative splicing serves as a testing ground for molecular evolution.

  11. Undefined familial colorectal cancer and the role of pleiotropism in cancer susceptibility genes.

    Science.gov (United States)

    Dobbins, Sara E; Broderick, Peter; Chubb, Daniel; Kinnersley, Ben; Sherborne, Amy L; Houlston, Richard S

    2016-10-01

    Although family history is a major risk factor for colorectal cancer (CRC) a genetic diagnosis cannot be obtained in over 50 % of familial cases when screened for known CRC cancer susceptibility genes. The genetics of undefined-familial CRC is complex and recent studies have implied additional clinically actionable mutations for CRC in susceptibility genes for other cancers. To clarify the contribution of non-CRC susceptibility genes to undefined-familial CRC we conducted a mutational screen of 114 cancer susceptibility genes in 847 patients with early-onset undefined-familial CRC and 1609 controls by analysing high-coverage exome sequencing data. We implemented American College of Medical Genetics and Genomics standards and guidelines for assigning pathogenicity to variants. Globally across all 114 cancer susceptibility genes no statistically significant enrichment of likely pathogenic variants was shown (6.7 % cases 57/847, 5.3 % controls 85/1609; P = 0.15). Moreover there was no significant enrichment of mutations in genes such as TP53 or BRCA2 which have been proposed for clinical testing in CRC. In conclusion, while we identified genes that may be considered interesting candidates as determinants of CRC risk warranting further research, there is currently scant evidence to support a role for genes other than those responsible for established CRC syndromes in the clinical management of familial CRC.

  12. Aux/IAA Gene Family in Plants: Molecular Structure, Regulation, and Function

    Directory of Open Access Journals (Sweden)

    Jie Luo

    2018-01-01

    Full Text Available Auxin plays a crucial role in the diverse cellular and developmental responses of plants across their lifespan. Plants can quickly sense and respond to changes in auxin levels, and these responses involve several major classes of auxin-responsive genes, including the Auxin/Indole-3-Acetic Acid (Aux/IAA family, the auxin response factor (ARF family, small auxin upregulated RNA (SAUR, and the auxin-responsive Gretchen Hagen3 (GH3 family. Aux/IAA proteins are short-lived nuclear proteins comprising several highly conserved domains that are encoded by the auxin early response gene family. These proteins have specific domains that interact with ARFs and inhibit the transcription of genes activated by ARFs. Molecular studies have revealed that Aux/IAA family members can form diverse dimers with ARFs to regulate genes in various ways. Functional analyses of Aux/IAA family members have indicated that they have various roles in plant development, such as root development, shoot growth, and fruit ripening. In this review, recently discovered details regarding the molecular characteristics, regulation, and protein–protein interactions of the Aux/IAA proteins are discussed. These details provide new insights into the molecular basis of the Aux/IAA protein functions in plant developmental processes.

  13. Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life

    OpenAIRE

    Kannan, Lavanya; Li, Hua; Rubinstein, Boris; Mushegian, Arcady

    2013-01-01

    Background The problem of probabilistic inference of gene content in the last common ancestor of several extant species with completely sequenced genomes is: for each gene that is conserved in all or some of the genomes, assign the probability that its ancestral gene was present in the genome of their last common ancestor. Results We have developed a family of models of gene gain and gene loss in evolution, and applied the maximum-likelihood approach that uses phylogenetic tree of prokaryotes...

  14. Characterization and gene expression analysis of the cir multi-gene family of plasmodium chabaudi chabaudi (AS)

    KAUST Repository

    Lawton, Jennifer

    2012-03-29

    Background: The pir genes comprise the largest multi-gene family in Plasmodium, with members found in P. vivax, P. knowlesi and the rodent malaria species. Despite comprising up to 5% of the genome, little is known about the functions of the proteins encoded by pir genes. P. chabaudi causes chronic infection in mice, which may be due to antigenic variation. In this model, pir genes are called cirs and may be involved in this mechanism, allowing evasion of host immune responses. In order to fully understand the role(s) of CIR proteins during P. chabaudi infection, a detailed characterization of the cir gene family was required.Results: The cir repertoire was annotated and a detailed bioinformatic characterization of the encoded CIR proteins was performed. Two major sub-families were identified, which have been named A and B. Members of each sub-family displayed different amino acid motifs, and were thus predicted to have undergone functional divergence. In addition, the expression of the entire cir repertoire was analyzed via RNA sequencing and microarray. Up to 40% of the cir gene repertoire was expressed in the parasite population during infection, and dominant cir transcripts could be identified. In addition, some differences were observed in the pattern of expression between the cir subgroups at the peak of P. chabaudi infection. Finally, specific cir genes were expressed at different time points during asexual blood stages.Conclusions: In conclusion, the large number of cir genes and their expression throughout the intraerythrocytic cycle of development indicates that CIR proteins are likely to be important for parasite survival. In particular, the detection of dominant cir transcripts at the peak of P. chabaudi infection supports the idea that CIR proteins are expressed, and could perform important functions in the biology of this parasite. Further application of the methodologies described here may allow the elucidation of CIR sub-family A and B protein

  15. Characterization and gene expression analysis of the cir multi-gene family of plasmodium chabaudi chabaudi (AS

    Directory of Open Access Journals (Sweden)

    Lawton Jennifer

    2012-03-01

    Full Text Available Abstract Background The pir genes comprise the largest multi-gene family in Plasmodium, with members found in P. vivax, P. knowlesi and the rodent malaria species. Despite comprising up to 5% of the genome, little is known about the functions of the proteins encoded by pir genes. P. chabaudi causes chronic infection in mice, which may be due to antigenic variation. In this model, pir genes are called cirs and may be involved in this mechanism, allowing evasion of host immune responses. In order to fully understand the role(s of CIR proteins during P. chabaudi infection, a detailed characterization of the cir gene family was required. Results The cir repertoire was annotated and a detailed bioinformatic characterization of the encoded CIR proteins was performed. Two major sub-families were identified, which have been named A and B. Members of each sub-family displayed different amino acid motifs, and were thus predicted to have undergone functional divergence. In addition, the expression of the entire cir repertoire was analyzed via RNA sequencing and microarray. Up to 40% of the cir gene repertoire was expressed in the parasite population during infection, and dominant cir transcripts could be identified. In addition, some differences were observed in the pattern of expression between the cir subgroups at the peak of P. chabaudi infection. Finally, specific cir genes were expressed at different time points during asexual blood stages. Conclusions In conclusion, the large number of cir genes and their expression throughout the intraerythrocytic cycle of development indicates that CIR proteins are likely to be important for parasite survival. In particular, the detection of dominant cir transcripts at the peak of P. chabaudi infection supports the idea that CIR proteins are expressed, and could perform important functions in the biology of this parasite. Further application of the methodologies described here may allow the elucidation of CIR sub-family

  16. Characterization and gene expression analysis of the cir multi-gene family of plasmodium chabaudi chabaudi (AS)

    KAUST Repository

    Lawton, Jennifer; Brugat, Thibaut; Yan, Yam Xue; Reid, Adam James; Bö hme, Ulrike; Otto, Thomas Dan; Pain, Arnab; Jackson, Andrew; Berriman, Matthew; Cunningham, Deirdre; Preiser, Peter; Langhorne, Jean

    2012-01-01

    Background: The pir genes comprise the largest multi-gene family in Plasmodium, with members found in P. vivax, P. knowlesi and the rodent malaria species. Despite comprising up to 5% of the genome, little is known about the functions of the proteins encoded by pir genes. P. chabaudi causes chronic infection in mice, which may be due to antigenic variation. In this model, pir genes are called cirs and may be involved in this mechanism, allowing evasion of host immune responses. In order to fully understand the role(s) of CIR proteins during P. chabaudi infection, a detailed characterization of the cir gene family was required.Results: The cir repertoire was annotated and a detailed bioinformatic characterization of the encoded CIR proteins was performed. Two major sub-families were identified, which have been named A and B. Members of each sub-family displayed different amino acid motifs, and were thus predicted to have undergone functional divergence. In addition, the expression of the entire cir repertoire was analyzed via RNA sequencing and microarray. Up to 40% of the cir gene repertoire was expressed in the parasite population during infection, and dominant cir transcripts could be identified. In addition, some differences were observed in the pattern of expression between the cir subgroups at the peak of P. chabaudi infection. Finally, specific cir genes were expressed at different time points during asexual blood stages.Conclusions: In conclusion, the large number of cir genes and their expression throughout the intraerythrocytic cycle of development indicates that CIR proteins are likely to be important for parasite survival. In particular, the detection of dominant cir transcripts at the peak of P. chabaudi infection supports the idea that CIR proteins are expressed, and could perform important functions in the biology of this parasite. Further application of the methodologies described here may allow the elucidation of CIR sub-family A and B protein

  17. Characterization of the avian Trojan gene family reveals contrasting evolutionary constraints.

    Directory of Open Access Journals (Sweden)

    Petar Petrov

    Full Text Available "Trojan" is a leukocyte-specific, cell surface protein originally identified in the chicken. Its molecular function has been hypothesized to be related to anti-apoptosis and the proliferation of immune cells. The Trojan gene has been localized onto the Z sex chromosome. The adjacent two genes also show significant homology to Trojan, suggesting the existence of a novel gene/protein family. Here, we characterize this Trojan family, identify homologues in other species and predict evolutionary constraints on these genes. The two Trojan-related proteins in chicken were predicted as a receptor-type tyrosine phosphatase and a transmembrane protein, bearing a cytoplasmic immuno-receptor tyrosine-based activation motif. We identified the Trojan gene family in ten other bird species and found related genes in three reptiles and a fish species. The phylogenetic analysis of the homologues revealed a gradual diversification among the family members. Evolutionary analyzes of the avian genes predicted that the extracellular regions of the proteins have been subjected to positive selection. Such selection was possibly a response to evolving interacting partners or to pathogen challenges. We also observed an almost complete lack of intracellular positively selected sites, suggesting a conserved signaling mechanism of the molecules. Therefore, the contrasting patterns of selection likely correlate with the interaction and signaling potential of the molecules.

  18. Plant ion channels: gene families, physiology, and functional genomics analyses.

    Science.gov (United States)

    Ward, John M; Mäser, Pascal; Schroeder, Julian I

    2009-01-01

    Distinct potassium, anion, and calcium channels in the plasma membrane and vacuolar membrane of plant cells have been identified and characterized by patch clamping. Primarily owing to advances in Arabidopsis genetics and genomics, and yeast functional complementation, many of the corresponding genes have been identified. Recent advances in our understanding of ion channel genes that mediate signal transduction and ion transport are discussed here. Some plant ion channels, for example, ALMT and SLAC anion channel subunits, are unique. The majority of plant ion channel families exhibit homology to animal genes; such families include both hyperpolarization- and depolarization-activated Shaker-type potassium channels, CLC chloride transporters/channels, cyclic nucleotide-gated channels, and ionotropic glutamate receptor homologs. These plant ion channels offer unique opportunities to analyze the structural mechanisms and functions of ion channels. Here we review gene families of selected plant ion channel classes and discuss unique structure-function aspects and their physiological roles in plant cell signaling and transport.

  19. Global Analysis of miRNA Gene Clusters and Gene Families Reveals Dynamic and Coordinated Expression

    Directory of Open Access Journals (Sweden)

    Li Guo

    2014-01-01

    Full Text Available To further understand the potential expression relationships of miRNAs in miRNA gene clusters and gene families, a global analysis was performed in 4 paired tumor (breast cancer and adjacent normal tissue samples using deep sequencing datasets. The compositions of miRNA gene clusters and families are not random, and clustered and homologous miRNAs may have close relationships with overlapped miRNA species. Members in the miRNA group always had various expression levels, and even some showed larger expression divergence. Despite the dynamic expression as well as individual difference, these miRNAs always indicated consistent or similar deregulation patterns. The consistent deregulation expression may contribute to dynamic and coordinated interaction between different miRNAs in regulatory network. Further, we found that those clustered or homologous miRNAs that were also identified as sense and antisense miRNAs showed larger expression divergence. miRNA gene clusters and families indicated important biological roles, and the specific distribution and expression further enrich and ensure the flexible and robust regulatory network.

  20. Novel genetic variants in miR-191 gene and familial ovarian cancer

    International Nuclear Information System (INIS)

    Shen, Jie; DiCioccio, Richard; Odunsi, Kunle; Lele, Shashikant B; Zhao, Hua

    2010-01-01

    Half of the familial aggregation of ovarian cancer can't be explained by any known risk genes, suggesting the existence of other genetic risk factors. Some of these unknown factors may not be traditional protein encoding genes. MicroRNA (miRNA) plays a critical role in tumorigenesis, but it is still unknown if variants in miRNA genes lead to predisposition to cancer. Considering the fact that miRNA regulates a number of tumor suppressor genes (TSGs) and oncogenes, genetic variations in miRNA genes could affect the levels of expression of TSGs or oncogenes and, thereby, cancer risk. To test this hypothesis in familial ovarian cancer, we screened for genetic variants in thirty selected miRNA genes, which are predicted to regulate key ovarian cancer genes and are reported to be misexpressed in ovarian tumor tissues, in eighty-three patients with familial ovarian cancer. All of the patients are non-carriers of any known BRCA1/2 or mismatch repair (MMR) gene mutations. Seven novel genetic variants were observed in four primary or precursor miRNA genes. Among them, three rare variants were found in the precursor or primary precursor of the miR-191 gene. In functional assays, the one variant located in the precursor of miR-191 resulted in conformational changes in the predicted secondary structures, and consequently altered the expression of mature miR-191. In further analysis, we found that this particular variant exists in five family members who had ovarian cancer. Our findings suggest that there are novel genetic variants in miRNA genes, and those certain genetic variants in miRNA genes can affect the expression of mature miRNAs and, consequently, might alter the regulation of TSGs or oncogenes. Additionally, the variant might be potentially associated with the development of familial ovarian cancer

  1. Molecular analysis of the NDP gene in two families with Norrie disease.

    Science.gov (United States)

    Rivera-Vega, M Refugio; Chiñas-Lopez, Silvet; Vaca, Ana Luisa Jimenez; Arenas-Sordo, M Luz; Kofman-Alfaro, Susana; Messina-Baas, Olga; Cuevas-Covarrubias, Sergio Alberto

    2005-04-01

    To describe the molecular defects in the Norrie disease protein (NDP) gene in two families with Norrie disease (ND). We analysed two families with ND at molecular level through polymerase chain reaction, DNA sequence analysis and GeneScan. Two molecular defects found in the NDP gene were: a missense mutation (265C > G) within codon 97 that resulted in the interchange of arginine by proline, and a partial deletion in the untranslated 3' region of exon 3 of the NDP gene. Clinical findings were more severe in the family that presented the partial deletion. We also diagnosed the carrier status of one daughter through GeneScan; this method proved to be a useful tool for establishing female carriers of ND. Here we report two novel mutations in the NDP gene in Mexican patients and propose that GeneScan is a viable mean of establishing ND carrier status.

  2. Ancient signals: comparative genomics of plant MAPK and MAPKK gene families

    DEFF Research Database (Denmark)

    Hamel, Louis-Philippe; Nicole, Marie-Claude; Sritubtim, Somrudee

    2006-01-01

    MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, and their components are encoded by highly conserved genes. The recent availability of genome sequences for rice and poplar now makes it possible to examine how well the previously described...... Arabidopsis MAPK and MAPKK gene family structures represent the broader evolutionary situation in plants, and analysis of gene expression data for MPK and MKK genes in all three species allows further refinement of those families, based on functionality. The Arabidopsis MAPK nomenclature appears sufficiently...

  3. Polymorphism in the interferon-{alpha} gene family

    Energy Technology Data Exchange (ETDEWEB)

    Golovleva, I.; Lundgren, E.; Beckman, L. [Univ. of Umea (Sweden); Kandefer-Szerszen, M. [Maria Curie-Sklodowska Univ., Lublin (Poland)

    1996-09-01

    A pronounced genetic polymorphism of the interferon type I gene family has been assumed on the basis of RFLP analysis of the genomic region as well as the large number of sequences published compared to the number of loci. However, IFNA2 is the only locus that has been carefully analyzed concerning gene frequency, and only naturally occurring rare alleles have been found. We have extended the studies on a variation of expressed sequences by studying the IFNA1, IFNA2, IFNA10, IFNA13, IFNA14, and IFNA17 genes. Genomic white-blood-cell DNA from a population sample of blood donors and from a family material were screened by single-nucleotide primer extension (allele-specific primer extension) of PCR fragments. Because of sequence similarities, in some cases {open_quotes}nested{close_quotes} PCR was used, and, when applicable, restriction analysis or control sequencing was performed. All individuals carried the interferon-{alpha} 1 and interferon-{alpha} 13 variants but not the LeIF D variant. At the IFNA2 and IFNA14 loci only one sequence variant was found, while in the IFNA10 and IFNA17 groups two alleles were detected in each group. The IFNA10 and IFNA17 alleles segregated in families and showed a close fit to the Hardy-Weinberg equilibrium. There was a significant linkage disequilibrium between IFNA10 and IFNA17 alleles. The fact that the extent of genetic polymorphism was lower than expected suggests that a majority of the previously described gene sequences represent nonpolymorphic rare mutants that may have arisen in tumor cell lines. 44 refs., 4 figs., 4 tabs.

  4. Science & Society seminar: Evolution is not only a story of genes

    CERN Multimedia

    2002-01-01

    Memes are behaviours and ideas copied from person to person by imitation. These include songs, habits, skills, inventions and ways of doing things. Darwinian evolutionary theory, which holds that genes control the traits of organisms, has traditionally explained human nature. Susan Blackmore offers a new look at evolution, and considers evolving memes as well as genes. This will be the subject of the next Science and Society seminar, 'The evolution of Meme machines', that will take place on Thursday 24 October. According to the meme idea, everything changed in human evolution when imitation first appeared because imitation let loose a new replicator, the meme. Since that time, two replicators have been driving human evolution, not one. This is why humans have such big brains, and why they alone produce and understand grammatical language, sing, dance, wear clothes and have complex cumulative cultures. Unlike other brains, human brains had to solve the problem of choosing which memes to imitate. In other wor...

  5. A Unique Evolution of the S2 Gene of Equine Infectious Anemia Virus in Hosts Correlated with Particular Infection Statuses

    Science.gov (United States)

    Wang, Xue-Feng; Wang, Shuai; Liu, Qiang; Lin, Yue-Zhi; Du, Cheng; Tang, Yan-Dong; Na, Lei; Wang, Xiaojun; Zhou, Jian-Hua

    2014-01-01

    Equine infectious anemia virus (EIAV) is a member of the Lentivirus genus in the Retroviridae family that exhibits a genomic structure similar to that of HIV-1. The S2 accessory proteins play important roles in viral replication in vivo and in viral pathogenicity; however, studies on S2 evolution in vivo are limited. This study analyzed the evolutionary characteristics of the S2 gene of a pathogenic EIAV strain, EIAVLN40, in four experimentally infected horses. The results demonstrated that 14.7% (10 of 68 residues) of the stable amino acid mutations occurred longitudinally in S2 during a 150-day infection period. Further analysis revealed that six of the ten mutated residues were positively selected during the infection. Alignment and phylogenetic analyses showed that the S2 gene sequences of viruses isolated from the infected horses at the early stage of EIAVLN40 infection were highly homologous and similar to the vaccine-specific sequence. The S2 gene variants isolated from the febrile episodes and late phase of infection became homologous to the S2 gene sequence of the inoculating EIAVLN40 strain. Our results indicate that the S2 gene evolves in diversity and divergence in vivo in different stages of EIAV infection and that this evolution correlates with the pathogenicity of the virus. PMID:25390683

  6. Duplication and relocation of the functional DPY19L2 gene within low copy repeats

    Directory of Open Access Journals (Sweden)

    Cheung Joseph

    2006-03-01

    Full Text Available Abstract Background Low copy repeats (LCRs are thought to play an important role in recent gene evolution, especially when they facilitate gene duplications. Duplicate genes are fundamental to adaptive evolution, providing substrates for the development of new or shared gene functions. Moreover, silencing of duplicate genes can have an indirect effect on adaptive evolution by causing genomic relocation of functional genes. These changes are theorized to have been a major factor in speciation. Results Here we present a novel example showing functional gene relocation within a LCR. We characterize the genomic structure and gene content of eight related LCRs on human Chromosomes 7 and 12. Two members of a novel transmembrane gene family, DPY19L, were identified in these regions, along with six transcribed pseudogenes. One of these genes, DPY19L2, is found on Chromosome 12 and is not syntenic with its mouse orthologue. Instead, the human locus syntenic to mouse Dpy19l2 contains a pseudogene, DPY19L2P1. This indicates that the ancestral copy of this gene has been silenced, while the descendant copy has remained active. Thus, the functional copy of this gene has been relocated to a new genomic locus. We then describe the expansion and evolution of the DPY19L gene family from a single gene found in invertebrate animals. Ancient duplications have led to multiple homologues in different lineages, with three in fish, frogs and birds and four in mammals. Conclusion Our results show that the DPY19L family has expanded throughout the vertebrate lineage and has undergone recent primate-specific evolution within LCRs.

  7. Evolution and Expression Patterns of CYC/TB1 Genes in Anacyclus: Phylogenetic Insights for Floral Symmetry Genes in Asteraceae

    Science.gov (United States)

    Bello, María A.; Cubas, Pilar; Álvarez, Inés; Sanjuanbenito, Guillermo; Fuertes-Aguilar, Javier

    2017-01-01

    Homologs of the CYC/TB1 gene family have been independently recruited many times across the eudicots to control aspects of floral symmetry The family Asteraceae exhibits the largest known diversification in this gene paralog family accompanied by a parallel morphological floral richness in its specialized head-like inflorescence. In Asteraceae, whether or not CYC/TB1 gene floral symmetry function is preserved along organismic and gene lineages is unknown. In this study, we used phylogenetic, structural and expression analyses focused on the highly derived genus Anacyclus (tribe Anthemidae) to address this question. Phylogenetic reconstruction recovered eight main gene lineages present in Asteraceae: two from CYC1, four from CYC2 and two from CYC3-like genes. The species phylogeny was recovered in most of the gene lineages, allowing the delimitation of orthologous sets of CYC/TB1 genes in Asteraceae. Quantitative real-time PCR analysis indicated that in Anacyclus three of the four isolated CYC2 genes are more highly expressed in ray flowers. The expression of the four AcCYC2 genes overlaps in several organs including the ligule of ray flowers, as well as in anthers and ovules throughout development. PMID:28487706

  8. msh/Msx gene family in neural development.

    Science.gov (United States)

    Ramos, Casto; Robert, Benoît

    2005-11-01

    The involvement of Msx homeobox genes in skull and tooth formation has received a great deal of attention. Recent studies also indicate a role for the msh/Msx gene family in development of the nervous system. In this article, we discuss the functions of these transcription factors in neural-tissue organogenesis. We will deal mainly with the interactions of the Drosophila muscle segment homeobox (msh) gene with other homeobox genes and the repressive cascade that leads to neuroectoderm patterning; the role of Msx genes in neural-crest induction, focusing especially on the differences between lower and higher vertebrates; their implication in patterning of the vertebrate neural tube, particularly in diencephalon midline formation. Finally, we will examine the distinct activities of Msx1, Msx2 and Msx3 genes during neurogenesis, taking into account their relationships with signalling molecules such as BMP.

  9. Darwinian Evolution of Mutualistic RNA Replicators with Different Genes

    Science.gov (United States)

    Mizuuchi, R.; Ichihashi, N.

    2017-07-01

    We report a sustainable long-term replication and evolution of two distinct cooperative RNA replicators encoding different genes. One of the RNAs evolved to maintain or increase the cooperativity, despite selective advantage of selfish mutations.

  10. Genome-Wide Identification and Analysis of the TIFY Gene Family in Grape

    Science.gov (United States)

    Zhang, Yucheng; Gao, Min; Singer, Stacy D.; Fei, Zhangjun; Wang, Hua; Wang, Xiping

    2012-01-01

    Background The TIFY gene family constitutes a plant-specific group of genes with a broad range of functions. This family encodes four subfamilies of proteins, including ZML, TIFY, PPD and JASMONATE ZIM-Domain (JAZ) proteins. JAZ proteins are targets of the SCFCOI1 complex, and function as negative regulators in the JA signaling pathway. Recently, it has been reported in both Arabidopsis and rice that TIFY genes, and especially JAZ genes, may be involved in plant defense against insect feeding, wounding, pathogens and abiotic stresses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant TIFY family members is limited, especially in a woody species such as grape. Methodology/Principal Findings A total of two TIFY, four ZML, two PPD and 11 JAZ genes were identified in the Vitis vinifera genome. Phylogenetic analysis of TIFY protein sequences from grape, Arabidopsis and rice indicated that the grape TIFY proteins are more closely related to those of Arabidopsis than those of rice. Both segmental and tandem duplication events have been major contributors to the expansion of the grape TIFY family. In addition, synteny analysis between grape and Arabidopsis demonstrated that homologues of several grape TIFY genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of lineages that led to grape and Arabidopsis. Analyses of microarray and quantitative real-time RT-PCR expression data revealed that grape TIFY genes are not a major player in the defense against biotrophic pathogens or viruses. However, many of these genes were responsive to JA and ABA, but not SA or ET. Conclusion The genome-wide identification, evolutionary and expression analyses of grape TIFY genes should facilitate further research of this gene family and provide new insights regarding their evolutionary history and regulatory control. PMID:22984514

  11. The relaxin family peptide receptors and their ligands: new developments and paradigms in the evolution from jawless fish to mammals.

    Science.gov (United States)

    Yegorov, Sergey; Bogerd, Jan; Good, Sara V

    2014-12-01

    Relaxin family peptide receptors (Rxfps) and their ligands, relaxin (Rln) and insulin-like (Insl) peptides, are broadly implicated in the regulation of reproductive and neuroendocrine processes in mammals. Most placental mammals harbour genes for four receptors, namely rxfp1, rxfp2, rxfp3 and rxfp4. The number and identity of rxfps in other vertebrates are immensely variable, which is probably attributable to intraspecific variation in reproductive and neuroendocrine regulation. Here, we highlight several interesting, but greatly overlooked, aspects of the rln/insl-rxfp evolutionary history: the ancient origin, recruitment of novel receptors, diverse roles of selection, differential retention and lineage-specific loss of genes over evolutionary time. The tremendous diversity of rln/insl and rxfp genes appears to have arisen from two divergent receptors and one ligand that were duplicated by whole genome duplications (WGD) in early vertebrate evolution, although several genes, notably relaxin in mammals, were also duplicated via small scale duplications. Duplication and loss of genes have varied across lineages: teleosts retained more WGD-derived genes, dominated by those thought to be involved in neuroendocrine regulation (rln3, insl5 and rxfp 3/4 genes), while eutherian mammals witnessed the diversification and rapid evolution of genes involved in reproduction (rln/insl3). Several genes that arose early in evolutionary history were lost in most mammals, but retained in teleosts and, to a lesser extent, in early diverging tetrapods. To elaborate on their evolutionary history, we provide updated phylogenies of the Rxfp1/2 and Rxfp3/4 receptors and their ligands, including new sequences from early diverging vertebrate taxa such as coelacanth, skate, spotted gar, and lamprey. We also summarize the recent progress made towards understanding the functional biology of Rxfps in non-mammalian taxa, providing a new conceptual framework for research on Rxfp signaling across

  12. Glutamine synthetase gene evolution: A good molecular clock

    International Nuclear Information System (INIS)

    Pesole, G.; Lanvave, C.; Saccone, C.; Bozzetti, M.P.; Preparata, G.

    1991-01-01

    Glutamine synthetase gene evolution in various animals, plants, and bacteria was evaluated by a general stationary Markov model. The evolutionary process proved to be unexpectedly regular even for a time span as long as that between the divergence of prokaryotes from eukaryotes. This enabled us to draw phylogenetic trees for species whose phylogeny cannot be easily reconstructed from the fossil record. The calculation of the times of divergence of the various organelle-specific enzymes led us to hypothesize that the pea and bean chloroplast genes for these enzymes originated from the duplication of nuclear genes as a result of the different metabolic needs of the various species. The data indicate that the duplication of plastid glutamine synthetase genes occurred long after the endosymbiotic events that produced the organelles themselves

  13. Genome-Wide Identification and Comparative Analysis of the 3-Hydroxy-3-methylglutaryl Coenzyme A Reductase (HMGR Gene Family in Gossypium

    Directory of Open Access Journals (Sweden)

    Wei Liu

    2018-01-01

    Full Text Available Terpenes are the largest and most diverse class of secondary metabolites in plants and play a very important role in plant adaptation to environment. 3-Hydroxy-3-methylglutaryl coenzyme A reductase (HMGR is a rate-limiting enzyme in the process of terpene biosynthesis in the cytosol. Previous study found the HMGR genes underwent gene expansion in Gossypium raimondii, but the characteristics and evolution of the HMGR gene family in Gossypium genus are unclear. In this study, genome-wide identification and comparative study of HMGR gene family were carried out in three Gossypium species with genome sequences, i.e., G. raimondii, Gossypium arboreum, and Gossypium hirsutum. In total, nine, nine and 18 HMGR genes were identified in G. raimondii, G. arboreum, and G. hirsutum, respectively. The results indicated that the HMGR genes underwent gene expansion and a unique gene cluster containing four HMGR genes was found in all the three Gossypium species. The phylogenetic analysis suggested that the expansion of HMGR genes had occurred in their common ancestor. There was a pseudogene that had a 10-bp deletion resulting in a frameshift mutation and could not be translated into functional proteins in G. arboreum and the A-subgenome of G. hirsutum. The expression profiles of the two pseudogenes showed that they had tissue-specific expression. Additionally, the expression pattern of the pseudogene in the A-subgenome of G. hirsutum was similar to its paralogous gene in the D-subgenome of G. hirsutum. Our results provide useful information for understanding cytosolic terpene biosynthesis in Gossypium species.

  14. Gene conversion in the rice genome

    DEFF Research Database (Denmark)

    Xu, Shuqing; Clark, Terry; Zheng, Hongkun

    2008-01-01

    -chromosomal conversions distributed between chromosome 1 and 5, 2 and 6, and 3 and 5 are more frequent than genome average (Z-test, P ... is not tightly linked to natural selection in the rice genome. To assess the contribution of segmental duplication on gene conversion statistics, we determined locations of conversion partners with respect to inter-chromosomal segment duplication. The number of conversions associated with segmentation is less...... involved in conversion events. CONCLUSION: The evolution of gene families in the rice genome may have been accelerated by conversion with pseudogenes. Our analysis suggests a possible role for gene conversion in the evolution of pathogen-response genes....

  15. The FGGY carbohydrate kinase family: insights into the evolution of functional specificities.

    Directory of Open Access Journals (Sweden)

    Ying Zhang

    2011-12-01

    Full Text Available Function diversification in large protein families is a major mechanism driving expansion of cellular networks, providing organisms with new metabolic capabilities and thus adding to their evolutionary success. However, our understanding of the evolutionary mechanisms of functional diversity in such families is very limited, which, among many other reasons, is due to the lack of functionally well-characterized sets of proteins. Here, using the FGGY carbohydrate kinase family as an example, we built a confidently annotated reference set (CARS of proteins by propagating experimentally verified functional assignments to a limited number of homologous proteins that are supported by their genomic and functional contexts. Then, we analyzed, on both the phylogenetic and the molecular levels, the evolution of different functional specificities in this family. The results show that the different functions (substrate specificities encoded by FGGY kinases have emerged only once in the evolutionary history following an apparently simple divergent evolutionary model. At the same time, on the molecular level, one isofunctional group (L-ribulokinase, AraB evolved at least two independent solutions that employed distinct specificity-determining residues for the recognition of a same substrate (L-ribulose. Our analysis provides a detailed model of the evolution of the FGGY kinase family. It also shows that only combined molecular and phylogenetic approaches can help reconstruct a full picture of functional diversifications in such diverse families.

  16. [Evolution of stress in families of children with attention deficit hyperactivity disorder].

    Science.gov (United States)

    Guerro-Prado, D; Mardomingo-Sanz, M L; Ortiz-Guerra, J J; García-García, P; Soler-López, B

    2015-11-01

    The objective of this study was to assess the evolution of stress in families of children and adolescents who start psychopharmacological treatment after being diagnosed with attention deficit hyperactivity disorder (ADHD), and the ability to detect this change using the FSI (Family Strain Index) questionnaire. Forty eight (48) specialists in child-adolescent psychiatry or neuropediatrics included 429 families of children diagnosed with ADHD, represented by the father, mother or guardian of the child. In the baseline visit, and at two and four months, the intensity of the symptoms of ADHD was evaluated using the abbreviated Conners scale, and family stress was evaluated using the FSI questionnaire. The following was observed: a) an improvement in the overall FSI score and in all its dimensions (Pchildren or adolescents (420) received treatment with modified-release methylphenidate. There was a significant relationship between the positive evolution of symptoms in children with ADHD and the reduction of family stress, as evaluated by the FSI questionnaire, after starting psychopharmacological treatment. This study showed a great sensitivity to change in the clinical situation of patients with ADHD, evaluated through the stress it produces on its families. It is recommended to use this questionnaire as an indirect measurement of the repercussions of the disorder on the environment of the child with ADHD in terms of family stress. Copyright © 2014 Asociación Española de Pediatría. Published by Elsevier España, S.L.U. All rights reserved.

  17. Amino acid transporter expansions associated with the evolution of obligate endosymbiosis in sap-feeding insects (Hemiptera: sternorrhyncha).

    Science.gov (United States)

    Dahan, Romain A; Duncan, Rebecca P; Wilson, Alex C C; Dávalos, Liliana M

    2015-03-25

    Mutualistic obligate endosymbioses shape the evolution of endosymbiont genomes, but their impact on host genomes remains unclear. Insects of the sub-order Sternorrhyncha (Hemiptera) depend on bacterial endosymbionts for essential amino acids present at low abundances in their phloem-based diet. This obligate dependency has been proposed to explain why multiple amino acid transporter genes are maintained in the genomes of the insect hosts. We implemented phylogenetic comparative methods to test whether amino acid transporters have proliferated in sternorrhynchan genomes at rates grater than expected by chance. By applying a series of methods to reconcile gene and species trees, inferring the size of gene families in ancestral lineages, and simulating the null process of birth and death in multi-gene families, we uncovered a 10-fold increase in duplication rate in the AAAP family of amino acid transporters within Sternorrhyncha. This gene family expansion was unmatched in other closely related clades lacking endosymbionts that provide essential amino acids. Our findings support the influence of obligate endosymbioses on host genome evolution by both inferring significant expansions of gene families involved in symbiotic interactions, and discovering increases in the rate of duplication associated with multiple emergences of obligate symbiosis in Sternorrhyncha.

  18. The Evolution of the American Family and Its Effects on Health Behavior Choices.

    Science.gov (United States)

    Manning, Terri M.

    The evolution of the family concerns health educators because family environment has been consistently linked to development of various addictions and negative behaviors, such as drug and alcohol abuse, eating disorders, workaholism, excessive exercise, sexual promiscuity, vandalism, youth crime, and violence and abuse. It is recognized that a…

  19. Genome-wide identification and characterization of R2R3MYB family in Rosaceae.

    Science.gov (United States)

    González, Máximo; Carrasco, Basilio; Salazar, Erika

    2016-09-01

    Transcription factors R2R3MYB family have been associated with the control of secondary metabolites, development of structures, cold tolerance and response to biotic and abiotic stress, among others. In recent years, genomes of Rosaceae botanical family are available. Although this information has been used to study the karyotype evolution of these species from an ancestral genome, there are no studies that treat the evolution and diversity of gene families present in these species or in the botanical family. Here we present the first comparative study of the R2R3MYB subfamily of transcription factors in three species of Rosaceae family (Malus domestica, Prunus persica and Fragaria vesca). We described 186, 98 and 86 non-redundant gene models for apple, peach and strawberry, respectively. In this research, we analyzed the intron-exon structure and genomic distribution of R2R3MYB families mentioned above. The phylogenetic comparisons revealed putative functions of some R2R3MYB transcription factors. This analysis found 44 functional subgroups, seven of which were unique for Rosaceae. In addition, our results showed a highly collinearity among some genes revealing the existence of conserved gene models between the three species studied. Although some gene models in these species have been validated under several approaches, more research in the Rosaceae family is necessary to determine gene expression patterns in specific tissues and development stages to facilitate understanding of the regulatory and biochemical mechanism in this botanical family.

  20. Evolution dynamics of a model for gene duplication under adaptive conflict

    Science.gov (United States)

    Ancliff, Mark; Park, Jeong-Man

    2014-06-01

    We present and solve the dynamics of a model for gene duplication showing escape from adaptive conflict. We use a Crow-Kimura quasispecies model of evolution where the fitness landscape is a function of Hamming distances from two reference sequences, which are assumed to optimize two different gene functions, to describe the dynamics of a mixed population of individuals with single and double copies of a pleiotropic gene. The evolution equations are solved through a spin coherent state path integral, and we find two phases: one is an escape from an adaptive conflict phase, where each copy of a duplicated gene evolves toward subfunctionalization, and the other is a duplication loss of function phase, where one copy maintains its pleiotropic form and the other copy undergoes neutral mutation. The phase is determined by a competition between the fitness benefits of subfunctionalization and the greater mutational load associated with maintaining two gene copies. In the escape phase, we find a dynamics of an initial population of single gene sequences only which escape adaptive conflict through gene duplication and find that there are two time regimes: until a time t* single gene sequences dominate, and after t* double gene sequences outgrow single gene sequences. The time t* is identified as the time necessary for subfunctionalization to evolve and spread throughout the double gene sequences, and we show that there is an optimum mutation rate which minimizes this time scale.

  1. t2prhd: a tool to study the patterns of repeat evolution

    Directory of Open Access Journals (Sweden)

    Pénzes Zsolt

    2008-01-01

    Full Text Available Abstract Background The models developed to characterize the evolution of multigene families (such as the birth-and-death and the concerted models have also been applied on the level of sequence repeats inside a gene/protein. Phylogenetic reconstruction is the method of choice to study the evolution of gene families and also sequence repeats in the light of these models. The characterization of the gene family evolution in view of the evolutionary models is done by the evaluation of the clustering of the sequences with the originating loci in mind. As the locus represents positional information, it is straightforward that in the case of the repeats the exact position in the sequence should be used, as the simple numbering according to repeat order can be misleading. Results We have developed a novel rapid visual approach to study repeat evolution, that takes into account the exact repeat position in a sequence. The "pairwise repeat homology diagram" visualizes sequence repeats detected by a profile HMM in a pair of sequences and highlights their homology relations inferred by a phylogenetic tree. The method is implemented in a Perl script (t2prhd available for downloading at http://t2prhd.sourceforge.net and is also accessible as an online tool at http://t2prhd.brc.hu. The power of the method is demonstrated on the EGF-like and fibronectin-III-like (Fn-III domain repeats of three selected mammalian Tenascin sequences. Conclusion Although pairwise repeat homology diagrams do not carry all the information provided by the phylogenetic tree, they allow a rapid and intuitive assessment of repeat evolution. We believe, that t2prhd is a helpful tool with which to study the pattern of repeat evolution. This method can be particularly useful in cases of large datasets (such as large gene families, as the command line interface makes it possible to automate the generation of pairwise repeat homology diagrams with the aid of scripts.

  2. Evolution of the HIV-1 nef gene in HLA-B*57 Positive Elite Suppressors

    Directory of Open Access Journals (Sweden)

    Siliciano Robert F

    2010-11-01

    Full Text Available Abstract Elite controllers or suppressors (ES are HIV-1 infected patients who maintain viral loads of gag and nef in HLA-B*57 positive ES. We previously showed evolution in the gag gene of ES which surprisingly was mostly due to synonymous mutations rather than non-synonymous mutation in targeted CTL epitopes. This finding could be the result of structural constraints on Gag, and we therefore examined the less conserved nef gene. We found slow evolution of nef in plasma virus in some ES. This evolution is mostly due to synonymous mutations and occurs at a rate similar to that seen in the gag gene in the same patients. The results provide further evidence of ongoing viral replication in ES and suggest that the nef and gag genes in these patients respond similarly to selective pressure from the host.

  3. Genome-wide identification of the SWEET gene family in wheat.

    Science.gov (United States)

    Gao, Yue; Wang, Zi Yuan; Kumar, Vikranth; Xu, Xiao Feng; Yuan, De Peng; Zhu, Xiao Feng; Li, Tian Ya; Jia, Baolei; Xuan, Yuan Hu

    2018-02-05

    The SWEET (sugars will eventually be exported transporter) family is a newly characterized group of sugar transporters. In plants, the key roles of SWEETs in phloem transport, nectar secretion, pollen nutrition, stress tolerance, and plant-pathogen interactions have been identified. SWEET family genes have been characterized in many plant species, but a comprehensive analysis of SWEET members has not yet been performed in wheat. Here, 59 wheat SWEETs (hereafter TaSWEETs) were identified through homology searches. Analyses of phylogenetic relationships, numbers of transmembrane helices (TMHs), gene structures, and motifs showed that TaSWEETs carrying 3-7 TMHs could be classified into four clades with 10 different types of motifs. Examination of the expression patterns of 18 SWEET genes revealed that a few are tissue-specific while most are ubiquitously expressed. In addition, the stem rust-mediated expression patterns of SWEET genes were monitored using a stem rust-susceptible cultivar, 'Little Club' (LC). The resulting data showed that the expression of five out of the 18 SWEETs tested was induced following inoculation. In conclusion, we provide the first comprehensive analysis of the wheat SWEET gene family. Information regarding the phylogenetic relationships, gene structures, and expression profiles of SWEET genes in different tissues and following stem rust disease inoculation will be useful in identifying the potential roles of SWEETs in specific developmental and pathogenic processes. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. Transcriptome characterisation of the ant Formica exsecta with new insights into the evolution of desaturase genes in social hymenoptera.

    Directory of Open Access Journals (Sweden)

    Hélène Badouin

    Full Text Available BACKGROUND: Despite the recent sequencing of seven ant genomes, no genomic data are available for the genus Formica, an important group for the study of eusocial traits. We sequenced the transcriptome of the ant Formica exsecta with the 454 FLX Titanium technology from a pooled sample of workers from 70 Finnish colonies. RESULTS: About 1,000,000 reads were obtained from a normalised cDNA library. We compared the assemblers MIRA3.0 and Newbler2.6 and showed that the latter performed better on this dataset due to a new option which is dedicated to improve contig formation in low depth portions of the assemblies. The 29,579 contigs represent 27 Mb. 50% showed similarity with known proteins and 25% could be assigned a category of gene ontology. We found more than 13,000 high-quality single nucleotide polymorphisms. The Δ9 desaturase gene family is an important multigene family involved in chemical communication in insects. We found six Δ9 desaturases in this Formica exsecta transcriptome dataset that were used to reconstruct a maximum-likelihood phylogeny of insect desaturases and to test for signatures of positive selection in this multigene family in ant lineages. We found differences with previous phylogenies of this gene family in ants, and found two clades potentially under positive selection. CONCLUSION: This first transcriptome reference sequence of Formica exsecta provided sequence and polymorphism data that will allow researchers working on Formica ants to develop studies to tackle the genetic basis of eusocial phenotypes. In addition, this study provided some general guidelines for de novo transcriptome assembly that should be useful for future transcriptome sequencing projects. Finally, we found potential signatures of positive selection in some clades of the Δ9 desaturase gene family in ants, which suggest the potential role of sequence divergence and adaptive evolution in shaping the large diversity of chemical cues in social insects.

  5. Identification and characterization of NF-YB family genes in tung tree.

    Science.gov (United States)

    Yang, Susu; Wang, Yangdong; Yin, Hengfu; Guo, Haobo; Gao, Ming; Zhu, Huiping; Chen, Yicun

    2015-12-01

    The NF-YB transcription factor gene family encodes a subunit of the CCAAT box-binding factor (CBF), a highly conserved trimeric activator that strongly binds to the CCAAT box promoter element. Studies on model plants have shown that NF-YB proteins participate in important developmental and physiological processes, but little is known about NF-YB proteins in trees. Here, we identified seven NF-YB transcription factor-encoding genes in Vernicia fordii, an important oilseed tree in China. A phylogenetic analysis separated the genes into two groups; non-LEC1 type (VfNF-YB1, 5, 7, 9, 11, 13) and LEC1-type (VfNF-YB 14). A gene structure analysis showed that VfNF-YB 5 has three introns and the other genes have no introns. The seven VfNF-YB sequences contain highly conserved domains, a disordered region at the N terminus, and two long helix structures at the C terminus. Phylogenetic analyses showed that VfNF-YB family genes are highly homologous to GmNF-YB genes, and many of them are closely related to functionally characterized NF-YBs. In expression analyses of various tissues (root, stem, leaf, and kernel) and the root during pathogen infection, VfNF-YB1, 5, and 11 were dominantly expressed in kernels, and VfNF-YB7 and 9 were expressed only in the root. Different VfNF-YB family genes showed different responses to pathogen infection, suggesting that they play different roles in the pathogen response. Together, these findings represent the first extensive evaluation of the NF-YB family in tung tree and provide a foundation for dissecting the functions of VfNF-YB genes in seed development, stress adaption, fatty acid synthesis, and pathogen response.

  6. Familial placement and relations of Rehmannia and Triaenophora (Scrophulariaceae s.l.) inferred from five gene regions.

    Science.gov (United States)

    Xia, Zhi; Wang, Yin-Zheng; Smith, James F

    2009-02-01

    Accurate classification systems based on evolution are imperative for biological investigations. The recent explosion of molecular phylogenetics has resulted in a much improved classification of angiosperms. More than five phylogenetic lineages have been recognized from Scrophulariaceae sensu lato since the family was determined to be polyphyletic; however, questions remain about the genera that have not been assigned to one of the segregate families of Scrophulariaceae s.l. Rehmannia Liboschitz and Triaenophora Solereder are such genera with uncertain familial placement. There also is debate whether Triaenophora should be segregated from Rehmannia. To evaluate the phylogenetic relations between Rehmannia and Triaenophora, to find their closest relatives, and to verify their familial placement, we conducted phylogenetic analyses of the sequences of one nuclear DNA (ITS) region and four chloroplast DNA gene regions (trnL-F, rps16, rbcL, and rps2) individually and combined. The analyses showed that Rehmannia and Triaenophora are each strongly supported as monophyletic and together are sister to Orobanchaceae. This relation was corroborated by phytochemical and morphological data. Based on these data, we suggest that Rehmannia and Triaenophora represent the second nonparasitic branch sister to the remainder of Orobanchaceae (including Lindenbergia).

  7. High level of microsynteny and purifying selection affect the evolution of WRKY family in Gramineae.

    Science.gov (United States)

    Jin, Jing; Kong, Jingjing; Qiu, Jianle; Zhu, Huasheng; Peng, Yuancheng; Jiang, Haiyang

    2016-01-01

    The WRKY gene family, which encodes proteins in the regulation processes of diverse developmental stages, is one of the largest families of transcription factors in higher plants. In this study, by searching for interspecies gene colinearity (microsynteny) and dating the age distributions of duplicated genes, we found 35 chromosomal segments of subgroup I genes of WRKY family (WRKY I) in four Gramineae species (Brachypodium, rice, sorghum, and maize) formed eight orthologous groups. After a stepwise gene-by-gene reciprocal comparison of all the protein sequences in the WRKY I gene flanking areas, highly conserved regions of microsynteny were found in the four Gramineae species. Most gene pairs showed conserved orientation within syntenic genome regions. Furthermore, tandem duplication events played the leading role in gene expansion. Eventually, environmental selection pressure analysis indicated strong purifying selection for the WRKY I genes in Gramineae, which may have been followed by gene loss and rearrangement. The results presented in this study provide basic information of Gramineae WRKY I genes and form the foundation for future functional studies of these genes. High level of microsynteny in the four grass species provides further evidence that a large-scale genome duplication event predated speciation.

  8. Pareto evolution of gene networks: an algorithm to optimize multiple fitness objectives

    International Nuclear Information System (INIS)

    Warmflash, Aryeh; Siggia, Eric D; Francois, Paul

    2012-01-01

    The computational evolution of gene networks functions like a forward genetic screen to generate, without preconceptions, all networks that can be assembled from a defined list of parts to implement a given function. Frequently networks are subject to multiple design criteria that cannot all be optimized simultaneously. To explore how these tradeoffs interact with evolution, we implement Pareto optimization in the context of gene network evolution. In response to a temporal pulse of a signal, we evolve networks whose output turns on slowly after the pulse begins, and shuts down rapidly when the pulse terminates. The best performing networks under our conditions do not fall into categories such as feed forward and negative feedback that also encode the input–output relation we used for selection. Pareto evolution can more efficiently search the space of networks than optimization based on a single ad hoc combination of the design criteria. (paper)

  9. Pareto evolution of gene networks: an algorithm to optimize multiple fitness objectives.

    Science.gov (United States)

    Warmflash, Aryeh; Francois, Paul; Siggia, Eric D

    2012-10-01

    The computational evolution of gene networks functions like a forward genetic screen to generate, without preconceptions, all networks that can be assembled from a defined list of parts to implement a given function. Frequently networks are subject to multiple design criteria that cannot all be optimized simultaneously. To explore how these tradeoffs interact with evolution, we implement Pareto optimization in the context of gene network evolution. In response to a temporal pulse of a signal, we evolve networks whose output turns on slowly after the pulse begins, and shuts down rapidly when the pulse terminates. The best performing networks under our conditions do not fall into categories such as feed forward and negative feedback that also encode the input-output relation we used for selection. Pareto evolution can more efficiently search the space of networks than optimization based on a single ad hoc combination of the design criteria.

  10. An ancient history of gene duplications, fusions and losses in the evolution of APOBEC3 mutators in mammals

    Science.gov (United States)

    2012-01-01

    Background The APOBEC3 (A3) genes play a key role in innate antiviral defense in mammals by introducing directed mutations in the DNA. The human genome encodes for seven A3 genes, with multiple splice alternatives. Different A3 proteins display different substrate specificity, but the very basic question on how discerning self from non-self still remains unresolved. Further, the expression of A3 activity/ies shapes the way both viral and host genomes evolve. Results We present here a detailed temporal analysis of the origin and expansion of the A3 repertoire in mammals. Our data support an evolutionary scenario where the genome of the mammalian ancestor encoded for at least one ancestral A3 gene, and where the genome of the ancestor of placental mammals (and possibly of the ancestor of all mammals) already encoded for an A3Z1-A3Z2-A3Z3 arrangement. Duplication events of the A3 genes have occurred independently in different lineages: humans, cats and horses. In all of them, gene duplication has resulted in changes in enzyme activity and/or substrate specificity, in a paradigmatic example of convergent adaptive evolution at the genomic level. Finally, our results show that evolutionary rates for the three A3Z1, A3Z2 and A3Z3 motifs have significantly decreased in the last 100 Mya. The analysis constitutes a textbook example of the evolution of a gene locus by duplication and sub/neofunctionalization in the context of virus-host arms race. Conclusions Our results provide a time framework for identifying ancestral and derived genomic arrangements in the APOBEC loci, and to date the expansion of this gene family for different lineages through time, as a response to changes in viral/retroviral/retrotransposon pressure. PMID:22640020

  11. Evolution of life cycle, colony morphology, and host specificity in the family Hydractiniidae (Hydrozoa, Cnidaria).

    Science.gov (United States)

    Miglietta, Maria Pia; Cunningham, Clifford W

    2012-12-01

    Biased transitions are common throughout the tree of life. The class hydrozoa is no exception, having lost the feeding medusa stage at least 70 times. The family hydractiniidae includes one lineage with pelagic medusae (Podocoryna) and several without (e.g., Hydractinia). The benthic colony stage also varies widely in host specificity and in colony form. The five-gene phylogeny presented here requires multiple transitions between character states for medusae, host specificity, and colony phenotype. Significant phylogenetic correlations exist between medusoid form, colony morphology, and host specificity. Species with nonfeeding medusae are usually specialized on a single host type, and reticulate colonies are correlated with nonmotile hosts. The history of feeding medusae is less certain. Podocoryna is nested within five lineages lacking medusae. This requires either repeated losses of medusae, or the remarkable re-evolution of a feeding medusa after at least 150 million years. Traditional ancestral reconstruction favors medusa regain, but a likelihood framework testing biased transitions cannot distinguish between multiple losses versus regain. A hypothesis of multiple losses of feeding medusae requires transient selection pressure favoring such a loss. Populations of species with feeding medusae are always locally rare and lack of feeding medusae does not result in restricted species distribution around the world. © 2012 The Author(s). Evolution© 2012 The Society for the Study of Evolution.

  12. The underlying mechanisms of genetic innovation and speciation in the family Corynebacteriaceae: A phylogenomics approach.

    Science.gov (United States)

    Zhi, Xiao-Yang; Jiang, Zhao; Yang, Ling-Ling; Huang, Ying

    2017-02-01

    The pangenome of a bacterial species population is formed by genetic reduction and genetic expansion over the long course of evolution. Gene loss is a pervasive source of genetic reduction, and (exogenous and endogenous) gene gain is the main driver of genetic expansion. To understand the genetic innovation and speciation of the family Corynebacteriaceae, which cause a wide range of serious infections in humans and animals, we analyzed the pangenome of this family, and reconstructed its phylogeny using a phylogenomics approach. Genetic variations have occurred throughout the whole evolutionary history of the Corynebacteriaceae. Gene loss has been the primary force causing genetic changes, not only in terms of the number of protein families affected, but also because of its continuity on the time series. The variation in metabolism caused by these genetic changes mainly occurred for membrane transporters, two-component systems, and metabolism related to amino acids and carbohydrates. Interestingly, horizontal gene transfer (HGT) not only caused changes related to pathogenicity, but also triggered the acquisition of antimicrobial resistance. The Darwinian theory of evolution did not adequately explain the effects of dispersive HGT and/or gene loss in the evolution of the Corynebacteriaceae. These findings provide new insight into the evolution and speciation of Corynebacteriaceae and advance our understanding of the genetic innovation in microbial populations. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Horizontal gene transfer and the evolution of transcriptionalregulation in Escherichia coli

    Energy Technology Data Exchange (ETDEWEB)

    Price, Morgan N.; Dehal, Paramvir S.; Arkin, Adam P.

    2007-12-20

    Background: Most bacterial genes were acquired by horizontalgene transfer from other bacteria instead of being inherited bycontinuous vertical descent from an ancient ancestor}. To understand howthe regulation of these {acquired} genes evolved, we examined theevolutionary histories of transcription factors and of regulatoryinteractions from the model bacterium Escherichia coli K12. Results:Although most transcription factors have paralogs, these usually arose byhorizontal gene transfer rather than by duplication within the E. colilineage, as previously believed. In general, most neighbor regulators --regulators that are adjacent to genes that they regulate -- were acquiredby horizontal gene transfer, while most global regulators evolvedvertically within the gamma-Proteobacteria. Neighbor regulators wereoften acquired together with the adjacent operon that they regulate, sothe proximity might be maintained by repeated transfers (like "selfishoperons"). Many of the as-yet-uncharacterized (putative) regulators havealso been acquired together with adjacent genes, so we predict that theseare neighbor regulators as well. When we analyzed the histories ofregulatory interactions, we found that the evolution of regulation byduplication was rare, and surprisingly, many of the regulatoryinteractions that are shared between paralogs result from convergentevolution. Another surprise was that horizontally transferred genes aremore likely than other genes to be regulated by multiple regulators, andmost of this complex regulation probably evolved after the transfer.Conclusions: Our results highlight the rapid evolution of niche-specificgene regulation in bacteria.

  14. Reflection of the Evolution of the Family Institution in Russkaya Pravda

    Directory of Open Access Journals (Sweden)

    Oleg Vasilyevich Inshakov

    2016-12-01

    Full Text Available The problems of maintaining conditions of expanded reproduction of family as a basic unit of society remain relevant in the Russian state policy for a millennium. An important example of this gives the first written code of laws Russkaya Pravda, in which the standards reflecting the interests of the institutions of state and family harmonization had been recorded and developed from the beginning of the 11th century. The evolution of this document from Russkaya Pravda by Jaroslav Mudryj (not earlier than 1016 to Russkaya Pravda by the Jaroslavichi (about 1072 and later, discloses the change of status, importance and role of small family in a period of transition from heathenism to Christianity, strengthening rational basis of economy, weakening tribal relations within large families, and rejecting the outdated forms of marriage. The analysis of Russkaya Pravda reveals the systemic complexity of family as a complicated institution, resulting from a combination of a plurality of specific institutions in the framework of this primary group of society organization is manifested. It is reasonable to allocate institutions of different levels in the system of family relations. These institutions are implemented by its members, have fixed status, ensure system’s safety, and contribute to sustainable and expanded reproduction. The evolution of Russkaya Pravda shows awareness of increasing importance of family reproduction for preserving the quality and quantity of people as a natural and social foundation of the state, and as a decisive factor of strengthening the ancient Rus. Protection of family members and property is becoming the priority of state policy under the pressure of complex social processes. Russkaya Pravda has considerable heuristic potential for systemic institutional analysis of the family’s structure, functions and forms in ancient Rus in the 11th century. Implementing systemic institutional research of a family in the old Russian

  15. Functional evolution of ADAMTS genes: Evidence from analyses of phylogeny and gene organization

    Directory of Open Access Journals (Sweden)

    Van Meir Erwin G

    2005-02-01

    Full Text Available Abstract Background The ADAMTS (A Disintegrin-like and Metalloprotease with Thrombospondin motifs proteins are a family of metalloproteases with sequence similarity to the ADAM proteases, that contain the thrombospondin type 1 sequence repeat motifs (TSRs common to extracellular matrix proteins. ADAMTS proteins have recently gained attention with the discovery of their role in a variety of diseases, including tissue and blood disorders, cancer, osteoarthritis, Alzheimer's and the genetic syndromes Weill-Marchesani syndrome (ADAMTS10, thrombotic thrombocytopenic purpura (ADAMTS13, and Ehlers-Danlos syndrome type VIIC (ADAMTS2 in humans and belted white-spotting mutation in mice (ADAMTS20. Results Phylogenetic analysis and comparison of the exon/intron organization of vertebrate (Homo, Mus, Fugu, chordate (Ciona and invertebrate (Drosophila and Caenorhabditis ADAMTS homologs has elucidated the evolutionary relationships of this important gene family, which comprises 19 members in humans. Conclusions The evolutionary history of ADAMTS genes in vertebrate genomes has been marked by rampant gene duplication, including a retrotransposition that gave rise to a distinct ADAMTS subfamily (ADAMTS1, -4, -5, -8, -15 that may have distinct aggrecanase and angiogenesis functions.

  16. Genomewide analysis of TCP transcription factor gene family in ...

    Indian Academy of Sciences (India)

    Home; Journals; Journal of Genetics; Volume 93; Issue 3. Genomewide ... Teosinte branched1/cycloidea/proliferating cell factor1 (TCP) proteins are a large family of transcriptional regulators in angiosperms. They are ... To the best of our knowledge, this is the first study of a genomewide analysis of apple TCP gene family.

  17. A six-gene phylogeny provides new insights into choanoflagellate evolution.

    Science.gov (United States)

    Carr, Martin; Richter, Daniel J; Fozouni, Parinaz; Smith, Timothy J; Jeuck, Alexandra; Leadbeater, Barry S C; Nitsche, Frank

    2017-02-01

    Recent studies have shown that molecular phylogenies of the choanoflagellates (Class Choanoflagellatea) are in disagreement with their traditional taxonomy, based on morphology, and that Choanoflagellatea requires considerable taxonomic revision. Furthermore, phylogenies suggest that the morphological and ecological evolution of the group is more complex than has previously been recognized. Here we address the taxonomy of the major choanoflagellate order Craspedida, by erecting four new genera. The new genera are shown to be morphologically, ecologically and phylogenetically distinct from other choanoflagellate taxa. Furthermore, we name five novel craspedid species, as well as formally describe ten species that have been shown to be either misidentified or require taxonomic revision. Our revised phylogeny, including 18 new species and sequence data for two additional genes, provides insights into the morphological and ecological evolution of the choanoflagellates. We examine the distribution within choanoflagellates of these two additional genes, EF-1A and EFL, closely related translation GTPases which are required for protein synthesis. Mapping the presence and absence of these genes onto the phylogeny highlights multiple events of gene loss within the choanoflagellates. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  18. Genome Wide Identification, Phylogeny, and Expression of Aquaporin Genes in Common Carp (Cyprinus carpio.

    Directory of Open Access Journals (Sweden)

    Chuanju Dong

    Full Text Available Aquaporins (Aqps are integral membrane proteins that facilitate the transport of water and small solutes across cell membranes. Among vertebrate species, Aqps are highly conserved in both gene structure and amino acid sequence. These proteins are vital for maintaining water homeostasis in living organisms, especially for aquatic animals such as teleost fish. Studies on teleost Aqps are mainly limited to several model species with diploid genomes. Common carp, which has a tetraploidized genome, is one of the most common aquaculture species being adapted to a wide range of aquatic environments. The complete common carp genome has recently been released, providing us the possibility for gene evolution of aqp gene family after whole genome duplication.In this study, we identified a total of 37 aqp genes from common carp genome. Phylogenetic analysis revealed that most of aqps are highly conserved. Comparative analysis was performed across five typical vertebrate genomes. We found that almost all of the aqp genes in common carp were duplicated in the evolution of the gene family. We postulated that the expansion of the aqp gene family in common carp was the result of an additional whole genome duplication event and that the aqp gene family in other teleosts has been lost in their evolution history with the reason that the functions of genes are redundant and conservation. Expression patterns were assessed in various tissues, including brain, heart, spleen, liver, intestine, gill, muscle, and skin, which demonstrated the comprehensive expression profiles of aqp genes in the tetraploidized genome. Significant gene expression divergences have been observed, revealing substantial expression divergences or functional divergences in those duplicated aqp genes post the latest WGD event.To some extent, the gene families are also considered as a unique source for evolutionary studies. Moreover, the whole set of common carp aqp gene family provides an

  19. Convergent evolution of RFX transcription factors and ciliary genes predated the origin of metazoans

    Directory of Open Access Journals (Sweden)

    Chen Nansheng

    2010-05-01

    Full Text Available Abstract Background Intraflagellar transport (IFT genes, which are critical for the development and function of cilia and flagella in metazoans, are tightly regulated by the Regulatory Factor X (RFX transcription factors (TFs. However, how and when their evolutionary relationship was established remains unknown. Results We have identified evidence suggesting that RFX TFs and IFT genes evolved independently and their evolution converged before the first appearance of metazoans. Both ciliary genes and RFX TFs exist in all metazoans as well as some unicellular eukaryotes. However, while RFX TFs and IFT genes are found simultaneously in all sequenced metazoan genomes, RFX TFs do not co-exist with IFT genes in most pre-metazoans and thus do not regulate them in these organisms. For example, neither the budding yeast nor the fission yeast possesses cilia although both have well-defined RFX TFs. Conversely, most unicellular eukaryotes, including the green alga Chlamydomonas reinhardtii, have typical cilia and well conserved IFT genes but lack RFX TFs. Outside of metazoans, RFX TFs and IFT genes co-exist only in choanoflagellates including M. brevicollis, and only one fungus Allomyces macrogynus of the 51 sequenced fungus genomes. M. brevicollis has two putative RFX genes and a full complement of ciliary genes. Conclusions The evolution of RFX TFs and IFT genes were independent in pre-metazoans. We propose that their convergence in evolution, or the acquired transcriptional regulation of IFT genes by RFX TFs, played a pivotal role in the establishment of metazoan.

  20. Genome-wide identification and characterization of the bHLH gene family in tomato.

    Science.gov (United States)

    Sun, Hua; Fan, Hua-Jie; Ling, Hong-Qing

    2015-01-22

    The basic helix-loop-helix (bHLH) proteins are a large superfamily of transcription factors, and play a central role in a wide range of metabolic, physiological, and developmental processes in higher organisms. Tomato is an important vegetable crop, and its genome sequence has been published recently. However, the bHLH gene family of tomato has not been systematically identified and characterized yet. In this study, we identified 159 bHLH protein-encoding genes (SlbHLH) in tomato genome and analyzed their structures. Although bHLH domains were conserved among the bHLH proteins between tomato and Arabidopsis, the intron sequences and distribution of tomato bHLH genes were extremely different compared with Arabidopsis. The gene duplication analysis showed that 58.5% and 6.3% of SlbHLH genes belonged to low-stringency and high-stringency duplication, respectively, indicating that the SlbHLH genes are mainly generated via short low-stringency region duplication in tomato. Subsequently, we classified the SlbHLH genes into 21 subfamilies by phylogenetic tree analysis, and predicted their possible functions by comparison with their homologous genes of Arabidopsis. Moreover, the expression profile analysis of SlbHLH genes from 10 different tissues showed that 21 SlbHLH genes exhibited tissue-specific expression. Further, we identified that 11 SlbHLH genes were associated with fruit development and ripening (eight of them associated with young fruit development and three with fruit ripening). The evolutionary analysis revealed that 92% SlbHLH genes might be evolved from ancestor(s) originated from early land plant, and 8% from algae. In this work, we systematically identified SlbHLHs by analyzing the tomato genome sequence using a set of bioinformatics approaches, and characterized their chromosomal distribution, gene structures, duplication, phylogenetic relationship and expression profiles, as well predicted their possible biological functions via comparative analysis

  1. The SKP1-like gene family of Arabidopsis exhibits a high degree of differential gene expression and gene product interaction during development.

    Directory of Open Access Journals (Sweden)

    Mohammad H Dezfulian

    Full Text Available The Arabidopsis thaliana genome encodes several families of polypeptides that are known or predicted to participate in the formation of the SCF-class of E3-ubiquitin ligase complexes. One such gene family encodes the Skp1-like class of polypeptide subunits, where 21 genes have been identified and are known to be expressed in Arabidopsis. Phylogenetic analysis based on deduced polypeptide sequence organizes the family of ASK proteins into 7 clades. The complexity of the ASK gene family, together with the close structural similarity among its members raises the prospect of significant functional redundancy among select paralogs. We have assessed the potential for functional redundancy within the ASK gene family by analyzing an expanded set of criteria that define redundancy with higher resolution. The criteria used include quantitative expression of locus-specific transcripts using qRT-PCR, assessment of the sub-cellular localization of individual ASK:YFP auto-fluorescent fusion proteins expressed in vivo as well as the in planta assessment of individual ASK-F-Box protein interactions using bimolecular fluorescent complementation techniques in combination with confocal imagery in live cells. The results indicate significant functional divergence of steady state transcript abundance and protein-protein interaction specificity involving ASK proteins in a pattern that is poorly predicted by sequence-based phylogeny. The information emerging from this and related studies will prove important for defining the functional intersection of expression, localization and gene product interaction that better predicts the formation of discrete SCF complexes, as a prelude to investigating their molecular mode of action.

  2. The Role of a Novel TRMT1 Gene Mutation and Rare GRM1 Gene Defect in Intellectual Disability in Two Azeri Families.

    Science.gov (United States)

    Davarniya, Behzad; Hu, Hao; Kahrizi, Kimia; Musante, Luciana; Fattahi, Zohreh; Hosseini, Masoumeh; Maqsoud, Fariba; Farajollahi, Reza; Wienker, Thomas F; Ropers, H Hilger; Najmabadi, Hossein

    2015-01-01

    Cognitive impairment or intellectual disability (ID) is a widespread neurodevelopmental disorder characterized by low IQ (below 70). ID is genetically heterogeneous and is estimated to affect 1-3% of the world's population. In affected children from consanguineous families, autosomal recessive inheritance is common, and identifying the underlying genetic cause is an important issue in clinical genetics. In the framework of a larger project, aimed at identifying candidate genes for autosomal recessive intellectual disorder (ARID), we recently carried out single nucleotide polymorphism-based genome-wide linkage analysis in several families from Ardabil province in Iran. The identification of homozygosity-by-descent loci in these families, in combination with whole exome sequencing, led us to identify possible causative homozygous changes in two families. In the first family, a missense variant was found in GRM1 gene, while in the second family, a frameshift alteration was identified in TRMT1, both of which were found to co-segregate with the disease. GRM1, a known causal gene for autosomal recessive spinocerebellar ataxia (SCAR13, MIM#614831), encodes the metabotropic glutamate receptor1 (mGluR1). This gene plays an important role in synaptic plasticity and cerebellar development. Conversely, the TRMT1 gene encodes a tRNA methyltransferase that dimethylates a single guanine residue at position 26 of most tRNAs using S-adenosyl methionine as the methyl group donor. We recently presented TRMT1 as a candidate gene for ARID in a consanguineous Iranian family (Najmabadi et al., 2011). We believe that this second Iranian family with a biallelic loss-of-function mutation in TRMT1 gene supports the idea that this gene likely has function in development of the disorder.

  3. Brain evolution relating to family, play, and the separation call.

    Science.gov (United States)

    MacLean, P D

    1985-04-01

    Mammals stem from the mammal-like reptiles (therapsids) that were widely prevalent in Pangaea 250 million years ago. In the evolutionary transition from reptiles to mammals, three key developments were (1) nursing, in conjunction with maternal care; (2) audiovocal communication for maintaining maternal-offspring contact; and (3) play. The separation call perhaps ranks as the earliest and most basic mammalian vocalization, while play may have functioned originally to promote harmony in the nest. How did such family related behavior develop? In its evolution, the forebrain of advanced mammals has expanded as a triune structure that anatomically and chemically reflects ancestral commonalities with reptiles, early mammals, and late mammals. Recent findings suggest that the development of the behavioral triad in question may have depended on the evolution of the thalamocingulate division of the limbic system, a derivative from early mammals. The thalamocingulate division (which has no distinctive counterpart in the reptilian brain) is, in turn, geared in with the prefrontal neocortex that, in human beings, may be inferred to play a key role in familial acculturation.

  4. Molecular and morphological systematics of the Ellisellidae (Coelenterata: Octocorallia): Parallel evolution in a globally distributed family of octocorals

    KAUST Repository

    Bilewitch, Jaret P.

    2014-04-01

    The octocorals of the Ellisellidae constitute a diverse and widely distributed family with subdivisions into genera based on colonial growth forms. Branching patterns are repeated in several genera and congeners often display region-specific variations in a given growth form. We examined the systematic patterns of ellisellid genera and the evolution of branching form diversity using molecular phylogenetic and ancestral morphological reconstructions. Six of eight included genera were found to be polyphyletic due to biogeographical incompatibility with current taxonomic assignments and the creation of at least six new genera plus several reassignments among existing genera is necessary. Phylogenetic patterns of diversification of colony branching morphology displayed a similar transformation order in each of the two primary ellisellid clades, with a sea fan form estimated as the most-probable common ancestor with likely origins in the Indo-Pacific region. The observed parallelism in evolution indicates the existence of a constraint on the genetic elements determining ellisellid colonial morphology. However, the lack of correspondence between levels of genetic divergence and morphological diversity among genera suggests that future octocoral studies should focus on the role of changes in gene regulation in the evolution of branching patterns. © 2014 Elsevier Inc.

  5. Molecular and morphological systematics of the Ellisellidae (Coelenterata: Octocorallia): Parallel evolution in a globally distributed family of octocorals

    KAUST Repository

    Bilewitch, Jaret P.; Ekins, Merrick; Hooper, John; Degnan, Sandie M.

    2014-01-01

    The octocorals of the Ellisellidae constitute a diverse and widely distributed family with subdivisions into genera based on colonial growth forms. Branching patterns are repeated in several genera and congeners often display region-specific variations in a given growth form. We examined the systematic patterns of ellisellid genera and the evolution of branching form diversity using molecular phylogenetic and ancestral morphological reconstructions. Six of eight included genera were found to be polyphyletic due to biogeographical incompatibility with current taxonomic assignments and the creation of at least six new genera plus several reassignments among existing genera is necessary. Phylogenetic patterns of diversification of colony branching morphology displayed a similar transformation order in each of the two primary ellisellid clades, with a sea fan form estimated as the most-probable common ancestor with likely origins in the Indo-Pacific region. The observed parallelism in evolution indicates the existence of a constraint on the genetic elements determining ellisellid colonial morphology. However, the lack of correspondence between levels of genetic divergence and morphological diversity among genera suggests that future octocoral studies should focus on the role of changes in gene regulation in the evolution of branching patterns. © 2014 Elsevier Inc.

  6. Gene arrangement and sequence of mitochondrial genomes yield insights into the phylogeny and evolution of bees and sphecid wasps (Hymenoptera: Apoidea).

    Science.gov (United States)

    Zheng, Bo-Ying; Cao, Li-Jun; Tang, Pu; van Achterberg, Kees; Hoffmann, Ary A; Chen, Hua-Yan; Chen, Xue-Xin; Wei, Shu-Jun

    2018-07-01

    The Apoidea represent a large and common superfamily of the Hymenoptera including the bees and sphecid wasps. A robust phylogenetic tree is essential to understanding the diversity, taxonomy and evolution of the Apoidea. In this study, features of apoid mitochondrial genomes were used to reconstruct phylogenetic relationships. Twelve apoid mitochondrial genomes were newly sequenced, representing six families and nine subfamilies. Gene rearrangement events have occurred in all apoid mitochondrial genomes sequenced to date. Sphecid wasps have both tRNA and protein-coding gene rearrangements in 5 of 8 species. In bees, the only rearranged genes are tRNAs; long-tongued bees (Apidae + Megachilidae) are characterized by movement of trnA to the trnI-trnQ-trnM tRNA cluster. Phylogenetic analyses of mitochondrial gene sequences support the known paraphyly of sphecid wasps, with bees nested within this clade. The Ampulicidae is sister to the remaining Apoidea. Crabronidae is paraphyletic, split into Crabronidae s.s. and Philanthidae, with the latter group a sister clade to bees. The monophyletic bees are either classified into two clades, long-tongued bees (Apidae + Megachilidae) and short-tongued bees (Andrenidae + Halictidae + Colletidae + Melitidae), or three groups with the Melitidae sister to the other bees. Our study showed that both gene sequences and arrangements provide information on the phylogeny of apoid families. Copyright © 2018 Elsevier Inc. All rights reserved.

  7. Disruption of the petal identity gene APETALA3-3 is highly correlated with loss of petals within the buttercup family (Ranunculaceae).

    Science.gov (United States)

    Zhang, Rui; Guo, Chunce; Zhang, Wengen; Wang, Peipei; Li, Lin; Duan, Xiaoshan; Du, Qinggao; Zhao, Liang; Shan, Hongyan; Hodges, Scott A; Kramer, Elena M; Ren, Yi; Kong, Hongzhi

    2013-03-26

    Absence of petals, or being apetalous, is usually one of the most important features that characterizes a group of flowering plants at high taxonomic ranks (i.e., family and above). The apetalous condition, however, appears to be the result of parallel or convergent evolution with unknown genetic causes. Here we show that within the buttercup family (Ranunculaceae), apetalous genera in at least seven different lineages were all derived from petalous ancestors, indicative of parallel petal losses. We also show that independent petal losses within this family were strongly associated with decreased or eliminated expression of a single floral organ identity gene, APETALA3-3 (AP3-3), apparently owing to species-specific molecular lesions. In an apetalous mutant of Nigella, insertion of a transposable element into the second intron has led to silencing of the gene and transformation of petals into sepals. In several naturally occurring apetalous genera, such as Thalictrum, Beesia, and Enemion, the gene has either been lost altogether or disrupted by deletions in coding or regulatory regions. In Clematis, a large genus in which petalous species evolved secondarily from apetalous ones, the gene exhibits hallmarks of a pseudogene. These results suggest that, as a petal identity gene, AP3-3 has been silenced or down-regulated by different mechanisms in different evolutionary lineages. This also suggests that petal identity did not evolve many times independently across the Ranunculaceae but was lost in numerous instances. The genetic mechanisms underlying the independent petal losses, however, may be complex, with disruption of AP3-3 being either cause or effect.

  8. Gene Environment Interactions and Predictors of Colorectal Cancer in Family-Based, Multi-Ethnic Groups.

    Science.gov (United States)

    Shiao, S Pamela K; Grayson, James; Yu, Chong Ho; Wasek, Brandi; Bottiglieri, Teodoro

    2018-02-16

    For the personalization of polygenic/omics-based health care, the purpose of this study was to examine the gene-environment interactions and predictors of colorectal cancer (CRC) by including five key genes in the one-carbon metabolism pathways. In this proof-of-concept study, we included a total of 54 families and 108 participants, 54 CRC cases and 54 matched family friends representing four major racial ethnic groups in southern California (White, Asian, Hispanics, and Black). We used three phases of data analytics, including exploratory, family-based analyses adjusting for the dependence within the family for sharing genetic heritage, the ensemble method, and generalized regression models for predictive modeling with a machine learning validation procedure to validate the results for enhanced prediction and reproducibility. The results revealed that despite the family members sharing genetic heritage, the CRC group had greater combined gene polymorphism rates than the family controls ( p relation to gene-environment interactions in the prevention of CRC.

  9. Comprehensive analysis of the flowering genes in Chinese cabbage and examination of evolutionary pattern of CO-like genes in plant kingdom

    Science.gov (United States)

    Song, Xiaoming; Duan, Weike; Huang, Zhinan; Liu, Gaofeng; Wu, Peng; Liu, Tongkun; Li, Ying; Hou, Xilin

    2015-09-01

    In plants, flowering is the most important transition from vegetative to reproductive growth. The flowering patterns of monocots and eudicots are distinctly different, but few studies have described the evolutionary patterns of the flowering genes in them. In this study, we analysed the evolutionary pattern, duplication and expression level of these genes. The main results were as follows: (i) characterization of flowering genes in monocots and eudicots, including the identification of family-specific, orthologous and collinear genes; (ii) full characterization of CONSTANS-like genes in Brassica rapa (BraCOL genes), the key flowering genes; (iii) exploration of the evolution of COL genes in plant kingdom and construction of the evolutionary pattern of COL genes; (iv) comparative analysis of CO and FT genes between Brassicaceae and Grass, which identified several family-specific amino acids, and revealed that CO and FT protein structures were similar in B. rapa and Arabidopsis but different in rice; and (v) expression analysis of photoperiod pathway-related genes in B. rapa under different photoperiod treatments by RT-qPCR. This analysis will provide resources for understanding the flowering mechanisms and evolutionary pattern of COL genes. In addition, this genome-wide comparative study of COL genes may also provide clues for evolution of other flowering genes.

  10. Investigation of genes encoding calcineurin B-like protein family in legumes and their expression analyses in chickpea (Cicer arietinum L.).

    Science.gov (United States)

    Meena, Mukesh Kumar; Ghawana, Sanjay; Sardar, Atish; Dwivedi, Vikas; Khandal, Hitaishi; Roy, Riti; Chattopadhyay, Debasis

    2015-01-01

    Calcium ion (Ca2+) is a ubiquitous second messenger that transmits various internal and external signals including stresses and, therefore, is important for plants' response process. Calcineurin B-like proteins (CBLs) are one of the plant calcium sensors, which sense and convey the changes in cytosolic Ca2+-concentration for response process. A search in four leguminous plant (soybean, Medicago truncatula, common bean and chickpea) genomes identified 9 to 15 genes in each species that encode CBL proteins. Sequence analyses of CBL peptides and coding sequences (CDS) suggested that there are nine original CBL genes in these legumes and some of them were multiplied during whole genome or local gene duplication. Coding sequences of chickpea CBL genes (CaCBL) were cloned from their cDNAs and sequenced, and their annotations in the genome assemblies were corrected accordingly. Analyses of protein sequences and gene structures of CBL family in plant kingdom indicated its diverse origin but showed a remarkable conservation in overall protein structure with appearance of complex gene structure in the course of evolution. Expression of CaCBL genes in different tissues and in response to different stress and hormone treatment were studied. Most of the CaCBL genes exhibited high expression in flowers. Expression profile of CaCBL genes in response to different abiotic stresses and hormones related to development and stresses (ABA, auxin, cytokinin, SA and JA) at different time intervals suggests their diverse roles in development and plant defence in addition to abiotic stress tolerance. These data not only contribute to a better understanding of the complex regulation of chickpea CBL gene family, but also provide valuable information for further research in chickpea functional genomics.

  11. Genomewide analysis of MATE-type gene family in maize reveals ...

    Indian Academy of Sciences (India)

    Huasheng Zhu and Jiandong Wu contributed equally to this work. As a group of secondary active transporters, the MATE gene family consists of multiple genes that widely exist in ..... Roots of the stress-treated plants were collected at 0,.

  12. Mutation analysis of pre-mRNA splicing genes in Chinese families with retinitis pigmentosa

    Science.gov (United States)

    Pan, Xinyuan; Chen, Xue; Liu, Xiaoxing; Gao, Xiang; Kang, Xiaoli; Xu, Qihua; Chen, Xuejuan; Zhao, Kanxing; Zhang, Xiumei; Chu, Qiaomei; Wang, Xiuying

    2014-01-01

    Purpose Seven genes involved in precursor mRNA (pre-mRNA) splicing have been implicated in autosomal dominant retinitis pigmentosa (adRP). We sought to detect mutations in all seven genes in Chinese families with RP, to characterize the relevant phenotypes, and to evaluate the prevalence of mutations in splicing genes in patients with adRP. Methods Six unrelated families from our adRP cohort (42 families) and two additional families with RP with uncertain inheritance mode were clinically characterized in the present study. Targeted sequence capture with next-generation massively parallel sequencing (NGS) was performed to screen mutations in 189 genes including all seven pre-mRNA splicing genes associated with adRP. Variants detected with NGS were filtered with bioinformatics analyses, validated with Sanger sequencing, and prioritized with pathogenicity analysis. Results Mutations in pre-mRNA splicing genes were identified in three individual families including one novel frameshift mutation in PRPF31 (p.Leu366fs*1) and two known mutations in SNRNP200 (p.Arg681His and p.Ser1087Leu). The patients carrying SNRNP200 p.R681H showed rapid disease progression, and the family carrying p.S1087L presented earlier onset ages and more severe phenotypes compared to another previously reported family with p.S1087L. In five other families, we identified mutations in other RP-related genes, including RP1 p. Ser781* (novel), RP2 p.Gln65* (novel) and p.Ile137del (novel), IMPDH1 p.Asp311Asn (recurrent), and RHO p.Pro347Leu (recurrent). Conclusions Mutations in splicing genes identified in the present and our previous study account for 9.5% in our adRP cohort, indicating the important role of pre-mRNA splicing deficiency in the etiology of adRP. Mutations in the same splicing gene, or even the same mutation, could correlate with different phenotypic severities, complicating the genotype–phenotype correlation and clinical prognosis. PMID:24940031

  13. Localization of 18S ribosomal genes in suckermouth armoured catfishes Loricariidae (Teleostei, Siluriformes with discussion on the Ag-NOR evolution

    Directory of Open Access Journals (Sweden)

    Anderson Alves

    2012-09-01

    Full Text Available The family Loricariidae with about 690 species divided into six subfamilies, is one of the world’s largest fish families. Cytogenetic studies conducted in the family showed that among 90 species analyzed the diploid number ranges from 2n=38 in Ancistrus sp. to 2n=96 in Hemipsilichthys gobio Luetken, 1874. In the present study, fluorescence in situ hybridization (FISH was employed to determine the chromosomal localization of the 18S rDNA gene in four suckermouth armoured catfishes: Kronichthys lacerta (Nichols, 1919, Pareiorhaphis splendens (Bizerril, 1995, Liposarcus multiradiatus (Hancock, 1828 and Hypostomus prope plecostomus (Linnaeus, 1758. All species analyzed showed one chromosome pair with 18S rDNA sequences, as observed in the previous Ag-NORs analyses. The presence of size and numerical polymorphism was observed and discussed, with proposing a hypothesis of the Ag-NOR evolution in Loricariidae.

  14. Members of the barley NAC transcription factor gene family show differential co-regulation with senescence-associated genes during senescence of flag leaves

    DEFF Research Database (Denmark)

    Christiansen, Michael W; Gregersen, Per L.

    2014-01-01

    -expressed with members of the NAC gene family. In conclusion, a list of up to 15 NAC genes from barley that are strong candidates for being regulatory factors of importance for senescence and biotic stress-related traits affecting the productivity of cereal crop plants has been generated. Furthermore, a list of 71...... in the NAC transcription factor family during senescence of barley flag leaves was studied. Several members of the NAC transcription factor gene family were up-regulated during senescence in a microarray experiment, together with a large range of senescence-associated genes, reflecting the coordinated...... activation of degradation processes in senescing barley leaf tissues. This picture was confirmed in a detailed quantitative reverse transcription–PCR (qRT–PCR) experiment, which also showed distinct gene expression patterns for different members of the NAC gene family, suggesting a group of ~15 out of the 47...

  15. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization.

    Science.gov (United States)

    Seibt, Kathrin M; Wenke, Torsten; Muders, Katja; Truberg, Bernd; Schmidt, Thomas

    2016-05-01

    Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  16. The Role of a Novel TRMT1 Gene Mutation and Rare GRM1 Gene Defect in Intellectual Disability in Two Azeri Families.

    Directory of Open Access Journals (Sweden)

    Behzad Davarniya

    Full Text Available Cognitive impairment or intellectual disability (ID is a widespread neurodevelopmental disorder characterized by low IQ (below 70. ID is genetically heterogeneous and is estimated to affect 1-3% of the world's population. In affected children from consanguineous families, autosomal recessive inheritance is common, and identifying the underlying genetic cause is an important issue in clinical genetics. In the framework of a larger project, aimed at identifying candidate genes for autosomal recessive intellectual disorder (ARID, we recently carried out single nucleotide polymorphism-based genome-wide linkage analysis in several families from Ardabil province in Iran. The identification of homozygosity-by-descent loci in these families, in combination with whole exome sequencing, led us to identify possible causative homozygous changes in two families. In the first family, a missense variant was found in GRM1 gene, while in the second family, a frameshift alteration was identified in TRMT1, both of which were found to co-segregate with the disease. GRM1, a known causal gene for autosomal recessive spinocerebellar ataxia (SCAR13, MIM#614831, encodes the metabotropic glutamate receptor1 (mGluR1. This gene plays an important role in synaptic plasticity and cerebellar development. Conversely, the TRMT1 gene encodes a tRNA methyltransferase that dimethylates a single guanine residue at position 26 of most tRNAs using S-adenosyl methionine as the methyl group donor. We recently presented TRMT1 as a candidate gene for ARID in a consanguineous Iranian family (Najmabadi et al., 2011. We believe that this second Iranian family with a biallelic loss-of-function mutation in TRMT1 gene supports the idea that this gene likely has function in development of the disorder.

  17. The Role of a Novel TRMT1 Gene Mutation and Rare GRM1 Gene Defect in Intellectual Disability in Two Azeri Families

    Science.gov (United States)

    Kahrizi, Kimia; Musante, Luciana; Fattahi, Zohreh; Hosseini, Masoumeh; Maqsoud, Fariba; Farajollahi, Reza; Wienker, Thomas F.; Ropers, H. Hilger; Najmabadi, Hossein

    2015-01-01

    Cognitive impairment or intellectual disability (ID) is a widespread neurodevelopmental disorder characterized by low IQ (below 70). ID is genetically heterogeneous and is estimated to affect 1–3% of the world’s population. In affected children from consanguineous families, autosomal recessive inheritance is common, and identifying the underlying genetic cause is an important issue in clinical genetics. In the framework of a larger project, aimed at identifying candidate genes for autosomal recessive intellectual disorder (ARID), we recently carried out single nucleotide polymorphism-based genome-wide linkage analysis in several families from Ardabil province in Iran. The identification of homozygosity-by-descent loci in these families, in combination with whole exome sequencing, led us to identify possible causative homozygous changes in two families. In the first family, a missense variant was found in GRM1 gene, while in the second family, a frameshift alteration was identified in TRMT1, both of which were found to co-segregate with the disease. GRM1, a known causal gene for autosomal recessive spinocerebellar ataxia (SCAR13, MIM#614831), encodes the metabotropic glutamate receptor1 (mGluR1). This gene plays an important role in synaptic plasticity and cerebellar development. Conversely, the TRMT1 gene encodes a tRNA methyltransferase that dimethylates a single guanine residue at position 26 of most tRNAs using S-adenosyl methionine as the methyl group donor. We recently presented TRMT1 as a candidate gene for ARID in a consanguineous Iranian family (Najmabadi et al., 2011). We believe that this second Iranian family with a biallelic loss-of-function mutation in TRMT1 gene supports the idea that this gene likely has function in development of the disorder. PMID:26308914

  18. Gene duplication, silencing and expression alteration govern the molecular evolution of PRC2 genes in plants.

    Science.gov (United States)

    Furihata, Hazuka Y; Suenaga, Kazuya; Kawanabe, Takahiro; Yoshida, Takanori; Kawabe, Akira

    2016-10-13

    PRC2 genes were analyzed for their number of gene duplications, d N /d S ratios and expression patterns among Brassicaceae and Gramineae species. Although both amino acid sequences and copy number of the PRC2 genes were generally well conserved in both Brassicaceae and Gramineae species, we observed that some rapidly evolving genes experienced duplications and expression pattern changes. After multiple duplication events, all but one or two of the duplicated copies tend to be silenced. Silenced copies were reactivated in the endosperm and showed ectopic expression in developing seeds. The results indicated that rapid evolution of some PRC2 genes is initially caused by a relaxation of selective constraint following the gene duplication events. Several loci could become maternally expressed imprinted genes and acquired functional roles in the endosperm.

  19. Relaxed evolution in the tyrosine aminotransferase gene tat in old world fruit bats (Chiroptera: Pteropodidae).

    Science.gov (United States)

    Shen, Bin; Fang, Tao; Yang, Tianxiao; Jones, Gareth; Irwin, David M; Zhang, Shuyi

    2014-01-01

    Frugivorous and nectarivorous bats fuel their metabolism mostly by using carbohydrates and allocate the restricted amounts of ingested proteins mainly for anabolic protein syntheses rather than for catabolic energy production. Thus, it is possible that genes involved in protein (amino acid) catabolism may have undergone relaxed evolution in these fruit- and nectar-eating bats. The tyrosine aminotransferase (TAT, encoded by the Tat gene) is the rate-limiting enzyme in the tyrosine catabolic pathway. To test whether the Tat gene has undergone relaxed evolution in the fruit- and nectar-eating bats, we obtained the Tat coding region from 20 bat species including four Old World fruit bats (Pteropodidae) and two New World fruit bats (Phyllostomidae). Phylogenetic reconstructions revealed a gene tree in which all echolocating bats (including the New World fruit bats) formed a monophyletic group. The phylogenetic conflict appears to stem from accelerated TAT protein sequence evolution in the Old World fruit bats. Our molecular evolutionary analyses confirmed a change in the selection pressure acting on Tat, which was likely caused by a relaxation of the evolutionary constraints on the Tat gene in the Old World fruit bats. Hepatic TAT activity assays showed that TAT activities in species of the Old World fruit bats are significantly lower than those of insectivorous bats and omnivorous mice, which was not caused by a change in TAT protein levels in the liver. Our study provides unambiguous evidence that the Tat gene has undergone relaxed evolution in the Old World fruit bats in response to changes in their metabolism due to the evolution of their special diet.

  20. The ACBP gene family in Rhodnius prolixus

    DEFF Research Database (Denmark)

    Majerowicz, David; Hannibal-Bach, Hans K; Castro, Rodolfo S C

    2016-01-01

    The acyl-CoA-binding proteins (ACBP) constitute a family of conserved proteins that bind acyl-CoA with high affinity and protect it from hydrolysis. Thus, ACBPs may have essential roles in basal cellular lipid metabolism. The genome of the insect Rhodnius prolixus encodes five ACBP genes similar...

  1. Trans gene regulation in adaptive evolution: a genetic algorithm model.

    Science.gov (United States)

    Behera, N; Nanjundiah, V

    1997-09-21

    This is a continuation of earlier studies on the evolution of infinite populations of haploid genotypes within a genetic algorithm framework. We had previously explored the evolutionary consequences of the existence of indeterminate-"plastic"-loci, where a plastic locus had a finite probability in each generation of functioning (being switched "on") or not functioning (being switched "off"). The relative probabilities of the two outcomes were assigned on a stochastic basis. The present paper examines what happens when the transition probabilities are biased by the presence of regulatory genes. We find that under certain conditions regulatory genes can improve the adaptation of the population and speed up the rate of evolution (on occasion at the cost of lowering the degree of adaptation). Also, the existence of regulatory loci potentiates selection in favour of plasticity. There is a synergistic effect of regulatory genes on plastic alleles: the frequency of such alleles increases when regulatory loci are present. Thus, phenotypic selection alone can be a potentiating factor in a favour of better adaptation. Copyright 1997 Academic Press Limited.

  2. Identification of the 14-3-3 gene family in Rafflesia cantleyi

    Science.gov (United States)

    Rosli, Khadijah; Wan, Kiew-Lian

    2018-04-01

    Rafflesia is known to be the largest flower in the world. Due to its size and appearance, it is considered to be very unique. Little is known about the molecular biology of this rare parasitic flowering plant as it is very difficult to locate and has a short life-span as a flower. Physiological activities in plants are regulated by signalling regulators such as the members of the 14-3-3 gene family. The number of members of this gene family varies in plants and there are thirteen known members in Arabidopsis thaliana. Their role is to bind to phosphorylated targets to complete signal transduction processes. Sequence comparison using BLAST of transcriptome data from three different Rafflesia cantleyi floral bud stages against the Swissprot database revealed 27 transcripts annotated as members of this gene family. All of the transcripts were expressed during floral bud stage 1 (S1) while 14 and four transcripts were expressed during floral bud stages 2 (S2) and 3 (S3), respectively. Significant downregulation was recorded for six and nine transcripts at S1 vs. S2 and S2 vs. S3 respectively. This gene family may play a critical role as signalling regulators during the development of Rafflesia floral bud.

  3. Developmental evolution in social insects: regulatory networks from genes to societies.

    Science.gov (United States)

    Linksvayer, Timothy A; Fewell, Jennifer H; Gadau, Jürgen; Laubichler, Manfred D

    2012-05-01

    The evolution and development of complex phenotypes in social insect colonies, such as queen-worker dimorphism or division of labor, can, in our opinion, only be fully understood within an expanded mechanistic framework of Developmental Evolution. Conversely, social insects offer a fertile research area in which fundamental questions of Developmental Evolution can be addressed empirically. We review the concept of gene regulatory networks (GRNs) that aims to fully describe the battery of interacting genomic modules that are differentially expressed during the development of individual organisms. We discuss how distinct types of network models have been used to study different levels of biological organization in social insects, from GRNs to social networks. We propose that these hierarchical networks spanning different organizational levels from genes to societies should be integrated and incorporated into full GRN models to elucidate the evolutionary and developmental mechanisms underlying social insect phenotypes. Finally, we discuss prospects and approaches to achieve such an integration. © 2012 WILEY PERIODICALS, INC.

  4. Genome-Wide Identification, Evolution and Functional Divergence of MYB Transcription Factors in Chinese White Pear (Pyrus bretschneideri).

    Science.gov (United States)

    Li, Xiaolong; Xue, Cheng; Li, Jiaming; Qiao, Xin; Li, Leiting; Yu, Li'ang; Huang, Yuhua; Wu, Jun

    2016-04-01

    The MYB superfamily is large and functionally diverse in plants. To date, MYB family genes have not yet been identified in Chinese white pear (Pyrus bretschneideri), and their functions remain unclear. In this study, we identified 231 genes as candidate MYB genes and divided them into four subfamilies. The R2R3-MYB (PbrMYB) family shared an R2R3 domain with 104 amino acid residues, including five conserved tryptophan residues. The Pbr MYB family was divided into 37 functional subgroups including 33 subgroups which contained both MYB genes of Rosaceae plants and AtMYB genes, and four subgroups which included only Rosaceae MYB genes or AtMYB genes. PbrMYB genes with similar functions clustered into the same subgroup, indicating functional conservation. We also found that whole-genome duplication (WGD) and dispersed duplications played critical roles in the expansion of the MYB family. The 87 Pbr MYB duplicated gene pairs dated back to the two WGD events. Purifying selection was the primary force driving Pbr MYB gene evolution. The 15 gene pairs presented 1-7 codon sites under positive selection. A total of 147 expressed genes were identified from RNA-sequencing data of fruit, and six Pbr MYB members in subgroup C1 were identified as important candidate genes in the regulation of lignin synthesis by quantitative real-time PCR analysis. Further correlation analysis revealed that six PbrMYBs were significantly correlated with five structural gene families (F5H, HCT, CCR, POD and C3'H) in the lignin pathway. The phylogenetic, evolution and expression analyses of the MYB gene family in Chinese white pear establish a solid foundation for future comprehensive functional analysis of Pbr MYB genes. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  5. Phylogenomic analysis of secondary metabolism genes sheds light on their evolution in Aspergilli

    DEFF Research Database (Denmark)

    Theobald, Sebastian; Vesth, Tammi Camilla; Rasmussen, Jane Lind Nybo

    .Natural products are encoded by genes located in close proximity, called secondary metabolic gene clusters, which makes them interesting targets for genomic analysis. We use a modified version of the Secondary Metabolite Unique Regions Finder (SMURF) algorithm, combined with InterPro annotations to create...... approximate maximum likelihood trees of conserved domains from secondary metabolic genes across 56 species, giving insights into the secondary metabolism gene diversity and evolution.In this study we can describe the evolution of non ribosomal peptide synthetases (NRPS), polyketide synthases (PKS) and hybrids.......In the aspMine project, we are sequencing and analyzing over 300 species of Aspergilli, agroup of filamentous fungi rich in natural compounds. The vast amount of data obtained from these species challenges the way we were mining for products and requires new pipelines for secondary metabolite analysis...

  6. Divergent evolution of multiple virus-resistance genes from a progenitor in Capsicum spp.

    Science.gov (United States)

    Kim, Saet-Byul; Kang, Won-Hee; Huy, Hoang Ngoc; Yeom, Seon-In; An, Jeong-Tak; Kim, Seungill; Kang, Min-Young; Kim, Hyun Jung; Jo, Yeong Deuk; Ha, Yeaseong; Choi, Doil; Kang, Byoung-Cheorl

    2017-01-01

    Plants have evolved hundreds of nucleotide-binding and leucine-rich domain proteins (NLRs) as potential intracellular immune receptors, but the evolutionary mechanism leading to the ability to recognize specific pathogen effectors is elusive. Here, we cloned Pvr4 (a Potyvirus resistance gene in Capsicum annuum) and Tsw (a Tomato spotted wilt virus resistance gene in Capsicum chinense) via a genome-based approach using independent segregating populations. The genes both encode typical NLRs and are located at the same locus on pepper chromosome 10. Despite the fact that these two genes recognize completely different viral effectors, the genomic structures and coding sequences of the two genes are strikingly similar. Phylogenetic studies revealed that these two immune receptors diverged from a progenitor gene of a common ancestor. Our results suggest that sequence variations caused by gene duplication and neofunctionalization may underlie the evolution of the ability to specifically recognize different effectors. These findings thereby provide insight into the divergent evolution of plant immune receptors. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  7. APC gene mutations and extraintestinal phenotype of familial adenomatous polyposis

    NARCIS (Netherlands)

    Giardiello, F. M.; Petersen, G. M.; Piantadosi, S.; Gruber, S. B.; Traboulsi, E. I.; Offerhaus, G. J.; Muro, K.; Krush, A. J.; Booker, S. V.; Luce, M. C.; Laken, S. J.; Kinzler, K. W.; Vogelstein, B.; Hamilton, S. R.

    1997-01-01

    Familial adenomatous polyposis (FAP) is caused by germline mutation of the adenomatous polyposis coli (APC) gene on chromosome 5q. This study assessed genotype-phenotype correlations for extraintestinal lesions in FAP. Mutations of the APC gene were compared with the occurrence of seven

  8. Redundancy and divergence in the amyloid precursor protein family.

    Science.gov (United States)

    Shariati, S Ali M; De Strooper, Bart

    2013-06-27

    Gene duplication provides genetic material required for functional diversification. An interesting example is the amyloid precursor protein (APP) protein family. The APP gene family has experienced both expansion and contraction during evolution. The three mammalian members have been studied quite extensively in combined knock out models. The underlying assumption is that APP, amyloid precursor like protein 1 and 2 (APLP1, APLP2) are functionally redundant. This assumption is primarily supported by the similarities in biochemical processing of APP and APLPs and on the fact that the different APP genes appear to genetically interact at the level of the phenotype in combined knockout mice. However, unique features in each member of the APP family possibly contribute to specification of their function. In the current review, we discuss the evolution and the biology of the APP protein family with special attention to the distinct properties of each homologue. We propose that the functions of APP, APLP1 and APLP2 have diverged after duplication to contribute distinctly to different neuronal events. Our analysis reveals that APLP2 is significantly diverged from APP and APLP1. Copyright © 2013 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  9. Distribution and evolution of genes responsible for biosynthesis of mycotoxins in Fusarium

    Science.gov (United States)

    Fusarium secondary metabolites (SMs) include some of the mycotoxins of greatest concern to food and feed safety. In fungi, genes directly involved in synthesis of the same SM are typically located adjacent to one another in gene clusters. To better understand the distribution and evolution of mycoto...

  10. Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution.

    Directory of Open Access Journals (Sweden)

    Jonathan eFilée

    2015-06-01

    Full Text Available Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales. Origin and evolution of these Giant Viruses (GVs remain open questions. Interestingly, availability of a collection of closely related GV genomes enabling genomic comparisons offer the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for 5 groups of viruses belonging to the Mimiviridae and Phycodnaviridae families show that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over the time with gene gains compensating the different losses. In addition, each lineage displays specific patterns of genome evolution. Mimiviridae (megaviruses and mimiviruses and Chlorella Phycodnaviruses evolved mainly by duplications and losses of genes belonging to large paralogous families (including movements of diverse mobiles genetic elements, whereas Micromonas and Ostreococcus Phycodnaviruses derive most of their genetic novelties thought lateral gene transfers. Taken together, these data support an accordion-like model of evolution in which GV genomes have undergone successive steps of gene gain and gene loss, accrediting the hypothesis that genome gigantism appears early, before the diversification of the different GV lineages.

  11. Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution.

    Science.gov (United States)

    Filée, Jonathan

    2015-01-01

    Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales). Origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, availability of a collection of closely related GV genomes enabling genomic comparisons offer the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for five groups of viruses belonging to the Mimiviridae and Phycodnaviridae families show that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over the time with gene gains compensating the different losses. In addition, each lineage displays specific patterns of genome evolution. Mimiviridae (megaviruses and mimiviruses) and Chlorella Phycodnaviruses evolved mainly by duplications and losses of genes belonging to large paralogous families (including movements of diverse mobiles genetic elements), whereas Micromonas and Ostreococcus Phycodnaviruses derive most of their genetic novelties thought lateral gene transfers. Taken together, these data support an accordion-like model of evolution in which GV genomes have undergone successive steps of gene gain and gene loss, accrediting the hypothesis that genome gigantism appears early, before the diversification of the different GV lineages.

  12. Evolution in the block: common elements of 5S rDNA organization and evolutionary patterns in distant fish genera.

    Science.gov (United States)

    Campo, Daniel; García-Vázquez, Eva

    2012-01-01

    The 5S rDNA is organized in the genome as tandemly repeated copies of a structural unit composed of a coding sequence plus a nontranscribed spacer (NTS). The coding region is highly conserved in the evolution, whereas the NTS vary in both length and sequence. It has been proposed that 5S rRNA genes are members of a gene family that have arisen through concerted evolution. In this study, we describe the molecular organization and evolution of the 5S rDNA in the genera Lepidorhombus and Scophthalmus (Scophthalmidae) and compared it with already known 5S rDNA of the very different genera Merluccius (Merluccidae) and Salmo (Salmoninae), to identify common structural elements or patterns for understanding 5S rDNA evolution in fish. High intra- and interspecific diversity within the 5S rDNA family in all the genera can be explained by a combination of duplications, deletions, and transposition events. Sequence blocks with high similarity in all the 5S rDNA members across species were identified for the four studied genera, with evidences of intense gene conversion within noncoding regions. We propose a model to explain the evolution of the 5S rDNA, in which the evolutionary units are blocks of nucleotides rather than the entire sequences or single nucleotides. This model implies a "two-speed" evolution: slow within blocks (homogenized by recombination) and fast within the gene family (diversified by duplications and deletions).

  13. Assembled Plastid and Mitochondrial Genomes, as well as Nuclear Genes, Place the Parasite Family Cynomoriaceae in the Saxifragales.

    Science.gov (United States)

    Bellot, Sidonie; Cusimano, Natalie; Luo, Shixiao; Sun, Guiling; Zarre, Shahin; Gröger, Andreas; Temsch, Eva; Renner, Susanne S

    2016-08-03

    Cynomoriaceae, one of the last unplaced families of flowering plants, comprise one or two species or subspecies of root parasites that occur from the Mediterranean to the Gobi Desert. Using Illumina sequencing, we assembled the mitochondrial and plastid genomes as well as some nuclear genes of a Cynomorium specimen from Italy. Selected genes were also obtained by Sanger sequencing from individuals collected in China and Iran, resulting in matrices of 33 mitochondrial, 6 nuclear, and 14 plastid genes and rDNAs enlarged to include a representative angiosperm taxon sampling based on data available in GenBank. We also compiled a new geographic map to discern possible discontinuities in the parasites' occurrence. Cynomorium has large genomes of 13.70-13.61 (Italy) to 13.95-13.76 pg (China). Its mitochondrial genome consists of up to 49 circular subgenomes and has an overall gene content similar to that of photosynthetic angiosperms, while its plastome retains only 27 of the normally 116 genes. Nuclear, plastid and mitochondrial phylogenies place Cynomoriaceae in Saxifragales, and we found evidence for several horizontal gene transfers from different hosts, as well as intracellular gene transfers. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  14. Comparative Analysis of Gene Expression for Convergent Evolution of Camera Eye Between Octopus and Human

    Science.gov (United States)

    Ogura, Atsushi; Ikeo, Kazuho; Gojobori, Takashi

    2004-01-01

    Although the camera eye of the octopus is very similar to that of humans, phylogenetic and embryological analyses have suggested that their camera eyes have been acquired independently. It has been known as a typical example of convergent evolution. To study the molecular basis of convergent evolution of camera eyes, we conducted a comparative analysis of gene expression in octopus and human camera eyes. We sequenced 16,432 ESTs of the octopus eye, leading to 1052 nonredundant genes that have matches in the protein database. Comparing these 1052 genes with 13,303 already-known ESTs of the human eye, 729 (69.3%) genes were commonly expressed between the human and octopus eyes. On the contrary, when we compared octopus eye ESTs with human connective tissue ESTs, the expression similarity was quite low. To trace the evolutionary changes that are potentially responsible for camera eye formation, we also compared octopus-eye ESTs with the completed genome sequences of other organisms. We found that 1019 out of the 1052 genes had already existed at the common ancestor of bilateria, and 875 genes were conserved between humans and octopuses. It suggests that a larger number of conserved genes and their similar gene expression may be responsible for the convergent evolution of the camera eye. PMID:15289475

  15. Search for intracranial aneurysm susceptibility gene(s using Finnish families

    Directory of Open Access Journals (Sweden)

    Ryynänen Markku

    2002-08-01

    Full Text Available Abstract Background Cerebrovascular disease is the third leading cause of death in the United States, and about one-fourth of cerebrovascular deaths are attributed to ruptured intracranial aneurysms (IA. Epidemiological evidence suggests that IAs cluster in families, and are therefore probably genetic. Identification of individuals at risk for developing IAs by genetic tests will allow concentration of diagnostic imaging on high-risk individuals. We used model-free linkage analysis based on allele sharing with a two-stage design for a genome-wide scan to identify chromosomal regions that may harbor IA loci. Methods We previously estimated sibling relative risk in the Finnish population at between 9 and 16, and proceeded with a genome-wide scan for loci predisposing to IA. In 85 Finnish families with two or more affected members, 48 affected sibling pairs (ASPs were available for our genetic study. Power calculations indicated that 48 ASPs were adequate to identify chromosomal regions likely to harbor predisposing genes and that a liberal stage I lod score threshold of 0.8 provided a reasonable balance between detection of false positive regions and failure to detect real loci with moderate effect. Results Seven chromosomal regions exceeded the stage I lod score threshold of 0.8 and five exceeded 1.0. The most significant region, on chromosome 19q, had a maximum multipoint lod score (MLS of 2.6. Conclusions Our study provides evidence for the locations of genes predisposing to IA. Further studies are necessary to elucidate the genes and their role in the pathophysiology of IA, and to design genetic tests.

  16. Expressional and Biochemical Characterization of Rice Disease Resistance Gene Xa3/Xa26 Family

    Institute of Scientific and Technical Information of China (English)

    Songjie Xu; Yinglong Cao; Xianghua Li; Shiping Wang

    2007-01-01

    The rice (Oryza sativa L.) Xa3/Xa26 gene, conferring race-specific resistance to bacterial blight disease and encoding a leucine-rich repeat (LRR) receptor kinase-like protein, belongs to a multigene family consisting of tandem clustered homologous genes, colocalizing with several uncharacterized genes for resistance to bacterial blight or fungal blast. To provide more information on the expressional and biochemical characteristics of the Xa3/Xa26 family, we analyzed the family members. Four Xa3/Xa26 family members in the indica rice variety Teqing, which carries a bacterial blight resistance gene with a chromosomal location tightly linked to Xa3/Xa26, and five Xa3/Xa26 family members in the japonica rice variety Nipponbare, which carries at least one uncharacterized blast resistance gene, were constitutively expressed in leaf tissue. The result suggests that some of the family members may be candidates of these uncharacterized resistance genes. At least five putative N-glycosylation sites in the LRR domain of XA3/XA26 protein are not glycosylated. The XA3/XA26 and its family members MRKa and MRKc all possess the consensus sequences of paired cysteines, which putatively function in dimerization of the receptor proteins for signal transduction, immediately before the first LRR and immediately after the last LRR. However, no homo-dimer between the XA3/XA26 molecules or hetero-dimer between XA3/XA26 and MRKa or MRKc were formed, indicating that XA3/XA26 protein might function either as a monomer or a hetero-dimer formed with other protein outside of the XA3/XA26 family. These results provide valuable information for further extensive investigation into this multiple protein family.

  17. Combining phylogenetic and syntenic analyses for understanding the evolution of TCP ECE genes in eudicots.

    Directory of Open Access Journals (Sweden)

    Hélène L Citerne

    Full Text Available TCP ECE genes encode transcription factors which have received much attention for their repeated recruitment in the control of floral symmetry in core eudicots, and more recently in monocots. Major duplications of TCP ECE genes have been described in core eudicots, but the evolutionary history of this gene family is unknown in basal eudicots. Reconstructing the phylogeny of ECE genes in basal eudicots will help set a framework for understanding the functional evolution of these genes. TCP ECE genes were sequenced in all major lineages of basal eudicots and Gunnera which belongs to the sister clade to all other core eudicots. We show that in these lineages they have a complex evolutionary history with repeated duplications. We estimate the timing of the two major duplications already identified in the core eudicots within a timeframe before the divergence of Gunnera and after the divergence of Proteales. We also use a synteny-based approach to examine the extent to which the expansion of TCP ECE genes in diverse eudicot lineages may be due to genome-wide duplications. The three major core-eudicot specific clades share a number of collinear genes, and their common evolutionary history may have originated at the γ event. Genomic comparisons in Arabidopsis thaliana and Solanumlycopersicum highlight their separate polyploid origin, with syntenic fragments with and without TCP ECE genes showing differential gene loss and genomic rearrangements. Comparison between recently available genomes from two basal eudicots Aquilegiacoerulea and Nelumbonucifera suggests that the two TCP ECE paralogs in these species are also derived from large-scale duplications. TCP ECE loci from basal eudicots share many features with the three main core eudicot loci, and allow us to infer the makeup of the ancestral eudicot locus.

  18. Identification of genes that have undergone adaptive evolution in ...

    African Journals Online (AJOL)

    Cassava (Manihot esculenta) is a vital food security crop and staple in Africa, yet cassava brown streak disease (CBSD) and cassava mosaic disease result in substantial yield losses. The aim of this study was to identify genes that have undergone positive selection during adaptive evolution, from CBSD resistant, tolerant ...

  19. A family with X-linked anophthalmia: exclusion of SOX3 as a candidate gene.

    Science.gov (United States)

    Slavotinek, Anne; Lee, Stephen S; Hamilton, Steven P

    2005-10-01

    We report on a four-generation family with X-linked anophthalmia in four affected males and show that this family has LOD scores consistent with linkage to Xq27, the third family reported to be linked to the ANOP1 locus. We sequenced the SOX3 gene at Xq27 as a candidate gene for the X-linked anophthalmia based on the high homology of this gene to SOX2, a gene previously mutated in bilateral anophthlamia. However, no amino acid sequence alterations were identified in SOX3. We have improved the definition of the phenotype in males with anophthalmia linked to the ANOP1 locus, as microcephaly, ocular colobomas, and severe renal malformations have not been described in families linked to ANOP1. (c) 2005 Wiley-Liss, Inc.

  20. The apolipoprotein L family of programmed cell death and immunity genes rapidly evolved in primates at discrete sites of host-pathogen interactions.

    Science.gov (United States)

    Smith, Eric E; Malik, Harmit S

    2009-05-01

    Apolipoprotein L1 (APOL1) is a human protein that confers immunity to Trypanosoma brucei infections but can be countered by a trypanosome-encoded antagonist SRA. APOL1 belongs to a family of programmed cell death genes whose proteins can initiate host apoptosis or autophagic death. We report here that all six members of the APOL gene family (APOL1-6) present in humans have rapidly evolved in simian primates. APOL6, furthermore, shows evidence of an adaptive sweep during recent human evolution. In each APOL gene tested, we found rapidly evolving codons in or adjacent to the SRA-interacting protein domain (SID), which is the domain of APOL1 that interacts with SRA. In APOL6, we also found a rapidly changing 13-amino-acid cluster in the membrane-addressing domain (MAD), which putatively functions as a pH sensor and regulator of cell death. We predict that APOL genes are antagonized by pathogens by at least two distinct mechanisms: SID antagonists, which include SRA, that interact with the SID of various APOL proteins, and MAD antagonists that interact with the MAD hinge base of APOL6. These antagonists either block or prematurely cause APOL-mediated programmed cell death of host cells to benefit the infecting pathogen. These putative interactions must occur inside host cells, in contrast to secreted APOL1 that trafficks to the trypanosome lysosome. Hence, the dynamic APOL gene family appears to be an important link between programmed cell death of host cells and immunity to pathogens.

  1. [Genome-wide identification and bioinformatic analysis of PPR gene family in tomato].

    Science.gov (United States)

    Ding, Anming; Li, Ling; Qu, Xu; Sun, Tingting; Chen, Yaqiong; Zong, Peng; Li, Zunqiang; Gong, Daping; Sun, Yuhe

    2014-01-01

    Pentatricopeptide repeats (PPRs) genes constitute one of the largest gene families in plants, which play a broad and essential role in plant growth and development. In this study, the protein sequences annotated by the tomato (S. lycopersicum L.) genome project were screened with the Pfam PPR sequences. A total of 471 putative PPR-encoding genes were identified. Based on the motifs defined in A. thaliana L., protein structure and conserved sequences for each tomato motif were analyzed. We also analyzed phylogenetic relationship, subcellular localization, expression and GO analysis of the identified gene sequences. Our results demonstrate that tomato PPR gene family contains two subfamilies, P and PLS, each accounting for half of the family. PLS subfamily can be divided into four subclasses i.e., PLS, E, E+ and DYW. Each subclass of sequences forms a clade in the phylogenetic tree. The PPR motifs were found highly conserved among plants. The tomato PPR genes were distributed over 12 chromosomes and most of them lack introns. The majority of PPR proteins harbor mitochondrial or chloroplast localization sequences, whereas GO analysis showed that most PPR proteins participate in RNA-related biological processes.

  2. NDP gene mutations in 14 French families with Norrie disease.

    Science.gov (United States)

    Royer, Ghislaine; Hanein, Sylvain; Raclin, Valérie; Gigarel, Nadine; Rozet, Jean-Michel; Munnich, Arnold; Steffann, Julie; Dufier, Jean-Louis; Kaplan, Josseline; Bonnefont, Jean-Paul

    2003-12-01

    Norrie disease is a rare X-inked recessive condition characterized by congenital blindness and occasionally deafness and mental retardation in males. This disease has been ascribed to mutations in the NDP gene on chromosome Xp11.1. Previous investigations of the NDP gene have identified largely sixty disease-causing sequence variants. Here, we report on ten different NDP gene allelic variants in fourteen of a series of 21 families fulfilling inclusion criteria. Two alterations were intragenic deletions and eight were nucleotide substitutions or splicing variants, six of them being hitherto unreported, namely c.112C>T (p.Arg38Cys), c.129C>G (p.His43Gln), c.133G>A (p.Val45Met), c.268C>T (p.Arg90Cys), c.382T>C (p.Cys128Arg), c.23479-1G>C (unknown). No NDP gene sequence variant was found in seven of the 21 families. This observation raises the issue of misdiagnosis, phenocopies, or existence of other X-linked or autosomal genes, the mutations of which would mimic the Norrie disease phenotype. Copyright 2003 Wiley-Liss, Inc.

  3. Contrasting patterns of evolution of 45S and 5S rDNA families uncover new aspects in the genome constitution of the agronomically important grass Thinopyrum intermedium (Triticeae).

    Science.gov (United States)

    Mahelka, Václav; Kopecky, David; Baum, Bernard R

    2013-09-01

    We employed sequencing of clones and in situ hybridization (genomic and fluorescent in situ hybridization [GISH and rDNA-FISH]) to characterize both the sequence variation and genomic organization of 45S (herein ITS1-5.8S-ITS2 region) and 5S (5S gene + nontranscribed spacer) ribosomal DNA (rDNA) families in the allohexaploid grass Thinopyrum intermedium. Both rDNA families are organized within several rDNA loci within all three subgenomes of the allohexaploid species. Both families have undergone different patterns of evolution. The 45S rDNA family has evolved in a concerted manner: internal transcribed spacer (ITS) sequences residing within the arrays of two subgenomes out of three got homogenized toward one major ribotype, whereas the third subgenome contained a minor proportion of distinct unhomogenized copies. Homogenization mechanisms such as unequal crossover and/or gene conversion were coupled with the loss of certain 45S rDNA loci. Unlike in the 45S family, the data suggest that neither interlocus homogenization among homeologous chromosomes nor locus loss occurred in 5S rDNA. Consistently with other Triticeae, the 5S rDNA family in intermediate wheatgrass comprised two distinct array types-the long- and short-spacer unit classes. Within the long and short units, we distinguished five and three different types, respectively, likely representing homeologous unit classes donated by putative parental species. Although the major ITS ribotype corresponds in our phylogenetic analysis to the E-genome species, the minor ribotype corresponds to Dasypyrum. 5S sequences suggested the contributions from Pseudoroegneria, Dasypyrum, and Aegilops. The contribution from Aegilops to the intermediate wheatgrass' genome is a new finding with implications in wheat improvement. We discuss rDNA evolution and potential origin of intermediate wheatgrass.

  4. Relaxed evolution in the tyrosine aminotransferase gene tat in old world fruit bats (Chiroptera: Pteropodidae.

    Directory of Open Access Journals (Sweden)

    Bin Shen

    Full Text Available Frugivorous and nectarivorous bats fuel their metabolism mostly by using carbohydrates and allocate the restricted amounts of ingested proteins mainly for anabolic protein syntheses rather than for catabolic energy production. Thus, it is possible that genes involved in protein (amino acid catabolism may have undergone relaxed evolution in these fruit- and nectar-eating bats. The tyrosine aminotransferase (TAT, encoded by the Tat gene is the rate-limiting enzyme in the tyrosine catabolic pathway. To test whether the Tat gene has undergone relaxed evolution in the fruit- and nectar-eating bats, we obtained the Tat coding region from 20 bat species including four Old World fruit bats (Pteropodidae and two New World fruit bats (Phyllostomidae. Phylogenetic reconstructions revealed a gene tree in which all echolocating bats (including the New World fruit bats formed a monophyletic group. The phylogenetic conflict appears to stem from accelerated TAT protein sequence evolution in the Old World fruit bats. Our molecular evolutionary analyses confirmed a change in the selection pressure acting on Tat, which was likely caused by a relaxation of the evolutionary constraints on the Tat gene in the Old World fruit bats. Hepatic TAT activity assays showed that TAT activities in species of the Old World fruit bats are significantly lower than those of insectivorous bats and omnivorous mice, which was not caused by a change in TAT protein levels in the liver. Our study provides unambiguous evidence that the Tat gene has undergone relaxed evolution in the Old World fruit bats in response to changes in their metabolism due to the evolution of their special diet.

  5. Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads.

    Science.gov (United States)

    Huson, Daniel H; Tappu, Rewati; Bazinet, Adam L; Xie, Chao; Cummings, Michael P; Nieselt, Kay; Williams, Rohan

    2017-01-25

    Microbiome sequencing projects typically collect tens of millions of short reads per sample. Depending on the goals of the project, the short reads can either be subjected to direct sequence analysis or be assembled into longer contigs. The assembly of whole genomes from metagenomic sequencing reads is a very difficult problem. However, for some questions, only specific genes of interest need to be assembled. This is then a gene-centric assembly where the goal is to assemble reads into contigs for a family of orthologous genes. We present a new method for performing gene-centric assembly, called protein-alignment-guided assembly, and provide an implementation in our metagenome analysis tool MEGAN. Genes are assembled on the fly, based on the alignment of all reads against a protein reference database such as NCBI-nr. Specifically, the user selects a gene family based on a classification such as KEGG and all reads binned to that gene family are assembled. Using published synthetic community metagenome sequencing reads and a set of 41 gene families, we show that the performance of this approach compares favorably with that of full-featured assemblers and that of a recently published HMM-based gene-centric assembler, both in terms of the number of reference genes detected and of the percentage of reference sequence covered. Protein-alignment-guided assembly of orthologous gene families complements whole-metagenome assembly in a new and very useful way.

  6. Diagnosing CADASIL using MRI: evidence from families with known mutations of Notch 3 gene

    International Nuclear Information System (INIS)

    Chawda, S.J.; Lange, R.P.J. de; St-Clair, D.; Hourihan, M.D.; Halpin, S.F.S.

    2000-01-01

    Clinical data and MRI findings are presented on 18 subjects from two families with neuropathologically confirmed CADASIL. DNA analysis revealed mutations in exon 4 of Notch 3 gene in both families. All family members with mutations in Notch 3 gene had extensive abnormalities on MRI, principally lesions in the white matter of the frontal lobes and in the external capsules. Of several family members in whom a diagnosis of CADASIL was suspected on the basis of minor symptoms, one had MRI changes consistent with CADASIL; none of these cases carried a mutation in the Notch 3 gene. MRI and clinical features that may alert the radiologist to the diagnosis of CADASIL are reviewed. However, a wide differential diagnosis exists for the MRI appearances of CADASIL, including multiple sclerosis and small-vessel disease secondary to hypertension. The definitive diagnosis cannot be made on MRI alone and requires additional evidence, where available, from a positive family history and by screening DNA for mutations of Notch 3 gene. (orig.)

  7. Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes

    Directory of Open Access Journals (Sweden)

    Galperin Michael Y

    2003-01-01

    Full Text Available Abstract Background Comparative analysis of sequenced genomes reveals numerous instances of apparent horizontal gene transfer (HGT, at least in prokaryotes, and indicates that lineage-specific gene loss might have been even more common in evolution. This complicates the notion of a species tree, which needs to be re-interpreted as a prevailing evolutionary trend, rather than the full depiction of evolution, and makes reconstruction of ancestral genomes a non-trivial task. Results We addressed the problem of constructing parsimonious scenarios for individual sets of orthologous genes given a species tree. The orthologous sets were taken from the database of Clusters of Orthologous Groups of proteins (COGs. We show that the phyletic patterns (patterns of presence-absence in completely sequenced genomes of almost 90% of the COGs are inconsistent with the hypothetical species tree. Algorithms were developed to reconcile the phyletic patterns with the species tree by postulating gene loss, COG emergence and HGT (the latter two classes of events were collectively treated as gene gains. We prove that each of these algorithms produces a parsimonious evolutionary scenario, which can be represented as mapping of loss and gain events on the species tree. The distribution of the evolutionary events among the tree nodes substantially depends on the underlying assumptions of the reconciliation algorithm, e.g. whether or not independent gene gains (gain after loss after gain are permitted. Biological considerations suggest that, on average, gene loss might be a more likely event than gene gain. Therefore different gain penalties were used and the resulting series of reconstructed gene sets for the last universal common ancestor (LUCA of the extant life forms were analysed. The number of genes in the reconstructed LUCA gene sets grows as the gain penalty increases. However, qualitative examination of the LUCA versions reconstructed with different gain penalties

  8. Genome-wide analysis of the WRKY gene family in cotton.

    Science.gov (United States)

    Dou, Lingling; Zhang, Xiaohong; Pang, Chaoyou; Song, Meizhen; Wei, Hengling; Fan, Shuli; Yu, Shuxun

    2014-12-01

    WRKY proteins are major transcription factors involved in regulating plant growth and development. Although many studies have focused on the functional identification of WRKY genes, our knowledge concerning many areas of WRKY gene biology is limited. For example, in cotton, the phylogenetic characteristics, global expression patterns, molecular mechanisms regulating expression, and target genes/pathways of WRKY genes are poorly characterized. Therefore, in this study, we present a genome-wide analysis of the WRKY gene family in cotton (Gossypium raimondii and Gossypium hirsutum). We identified 116 WRKY genes in G. raimondii from the completed genome sequence, and we cloned 102 WRKY genes in G. hirsutum. Chromosomal location analysis indicated that WRKY genes in G. raimondii evolved mainly from segmental duplication followed by tandem amplifications. Phylogenetic analysis of alga, bryophyte, lycophyta, monocot and eudicot WRKY domains revealed family member expansion with increasing complexity of the plant body. Microarray, expression profiling and qRT-PCR data revealed that WRKY genes in G. hirsutum may regulate the development of fibers, anthers, tissues (roots, stems, leaves and embryos), and are involved in the response to stresses. Expression analysis showed that most group II and III GhWRKY genes are highly expressed under diverse stresses. Group I members, representing the ancestral form, seem to be insensitive to abiotic stress, with low expression divergence. Our results indicate that cotton WRKY genes might have evolved by adaptive duplication, leading to sensitivity to diverse stresses. This study provides fundamental information to inform further analysis and understanding of WRKY gene functions in cotton species.

  9. Extensive gene remodeling in the viral world: new evidence for nongradual evolution in the mobilome network.

    Science.gov (United States)

    Jachiet, Pierre-Alain; Colson, Philippe; Lopez, Philippe; Bapteste, Eric

    2014-08-07

    Complex nongradual evolutionary processes such as gene remodeling are difficult to model, to visualize, and to investigate systematically. Despite these challenges, the creation of composite (or mosaic) genes by combination of genetic segments from unrelated gene families was established as an important adaptive phenomena in eukaryotic genomes. In contrast, almost no general studies have been conducted to quantify composite genes in viruses. Although viral genome mosaicism has been well-described, the extent of gene mosaicism and its rules of emergence remain largely unexplored. Applying methods from graph theory to inclusive similarity networks, and using data from more than 3,000 complete viral genomes, we provide the first demonstration that composite genes in viruses are 1) functionally biased, 2) involved in key aspects of the arm race between cells and viruses, and 3) can be classified into two distinct types of composite genes in all viral classes. Beyond the quantification of the widespread recombination of genes among different viruses of the same class, we also report a striking sharing of genetic information between viruses of different classes and with different nucleic acid types. This latter discovery provides novel evidence for the existence of a large and complex mobilome network, which appears partly bound by the sharing of genetic information and by the formation of composite genes between mobile entities with different genetic material. Considering that there are around 10E31 viruses on the planet, gene remodeling appears as a hugely significant way of generating and moving novel sequences between different kinds of organisms on Earth. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  10. Genome organization and expression of the rat ACBP gene family

    DEFF Research Database (Denmark)

    Mandrup, S; Andreasen, P H; Knudsen, J

    1993-01-01

    pool former. We have molecularly cloned and characterized the rat ACBP gene family which comprises one expressed and four processed pseudogenes. One of these was shown to exist in two allelic forms. A comprehensive computer-aided analysis of the promoter region of the expressed ACBP gene revealed...

  11. Evolution-development congruence in pattern formation dynamics: Bifurcations in gene expression and regulation of networks structures.

    Science.gov (United States)

    Kohsokabe, Takahiro; Kaneko, Kunihiko

    2016-01-01

    Search for possible relationships between phylogeny and ontogeny is important in evolutionary-developmental biology. Here we uncover such relationships by numerical evolution and unveil their origin in terms of dynamical systems theory. By representing developmental dynamics of spatially located cells with gene expression dynamics with cell-to-cell interaction under external morphogen gradient, gene regulation networks are evolved under mutation and selection with the fitness to approach a prescribed spatial pattern of expressed genes. For most numerical evolution experiments, evolution of pattern over generations and development of pattern by an evolved network exhibit remarkable congruence. Both in the evolution and development pattern changes consist of several epochs where stripes are formed in a short time, while for other temporal regimes, pattern hardly changes. In evolution, these quasi-stationary regimes are generations needed to hit relevant mutations, while in development, they are due to some gene expression that varies slowly and controls the pattern change. The morphogenesis is regulated by combinations of feedback or feedforward regulations, where the upstream feedforward network reads the external morphogen gradient, and generates a pattern used as a boundary condition for the later patterns. The ordering from up to downstream is common in evolution and development, while the successive epochal changes in development and evolution are represented as common bifurcations in dynamical-systems theory, which lead to the evolution-development congruence. Mechanism of exceptional violation of the congruence is also unveiled. Our results provide a new look on developmental stages, punctuated equilibrium, developmental bottlenecks, and evolutionary acquisition of novelty in morphogenesis. © 2015 The Authors. Journal of Experimental Zoology Part B: Molecular and Developmental Evolution Published by Wiley Periodicals, Inc.

  12. Adaptive molecular evolution of the Major Histocompatibility Complex genes, DRA and DQA, in the genus Equus.

    Science.gov (United States)

    Kamath, Pauline L; Getz, Wayne M

    2011-05-18

    Major Histocompatibility Complex (MHC) genes are central to vertebrate immune response and are believed to be under balancing selection by pathogens. This hypothesis has been supported by observations of extremely high polymorphism, elevated nonsynonymous to synonymous base pair substitution rates and trans-species polymorphisms at these loci. In equids, the organization and variability of this gene family has been described, however the full extent of diversity and selection is unknown. As selection is not expected to act uniformly on a functional gene, maximum likelihood codon-based models of selection that allow heterogeneity in selection across codon positions can be valuable for examining MHC gene evolution and the molecular basis for species adaptations. We investigated the evolution of two class II MHC genes of the Equine Lymphocyte Antigen (ELA), DRA and DQA, in the genus Equus with the addition of novel alleles identified in plains zebra (E. quagga, formerly E. burchelli). We found that both genes exhibited a high degree of polymorphism and inter-specific sharing of allele lineages. To our knowledge, DRA allelic diversity was discovered to be higher than has ever been observed in vertebrates. Evidence was also found to support a duplication of the DQA locus. Selection analyses, evaluated in terms of relative rates of nonsynonymous to synonymous mutations (dN/dS) averaged over the gene region, indicated that the majority of codon sites were conserved and under purifying selection (dN

  13. Adaptive molecular evolution of the Major Histocompatibility Complex genes, DRA and DQA, in the genus Equus

    Directory of Open Access Journals (Sweden)

    Getz Wayne M

    2011-05-01

    Full Text Available Abstract Background Major Histocompatibility Complex (MHC genes are central to vertebrate immune response and are believed to be under balancing selection by pathogens. This hypothesis has been supported by observations of extremely high polymorphism, elevated nonsynonymous to synonymous base pair substitution rates and trans-species polymorphisms at these loci. In equids, the organization and variability of this gene family has been described, however the full extent of diversity and selection is unknown. As selection is not expected to act uniformly on a functional gene, maximum likelihood codon-based models of selection that allow heterogeneity in selection across codon positions can be valuable for examining MHC gene evolution and the molecular basis for species adaptations. Results We investigated the evolution of two class II MHC genes of the Equine Lymphocyte Antigen (ELA, DRA and DQA, in the genus Equus with the addition of novel alleles identified in plains zebra (E. quagga, formerly E. burchelli. We found that both genes exhibited a high degree of polymorphism and inter-specific sharing of allele lineages. To our knowledge, DRA allelic diversity was discovered to be higher than has ever been observed in vertebrates. Evidence was also found to support a duplication of the DQA locus. Selection analyses, evaluated in terms of relative rates of nonsynonymous to synonymous mutations (dN/dS averaged over the gene region, indicated that the majority of codon sites were conserved and under purifying selection (dN dS. However, the most likely evolutionary codon models allowed for variable rates of selection across codon sites at both loci and, at the DQA, supported the hypothesis of positive selection acting on specific sites. Conclusions Observations of elevated genetic diversity and trans-species polymorphisms supported the conclusion that balancing selection may be acting on these loci. Furthermore, at the DQA, positive selection was

  14. Evolutionary relationship and structural characterization of the EPF/EPFL gene family.

    Science.gov (United States)

    Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu

    2013-01-01

    EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes.

  15. Phylogeny and adaptive evolution of the brain-development gene microcephalin (MCPH1 in cetaceans

    Directory of Open Access Journals (Sweden)

    Montgomery Stephen H

    2011-04-01

    Full Text Available Abstract Background Representatives of Cetacea have the greatest absolute brain size among animals, and the largest relative brain size aside from humans. Despite this, genes implicated in the evolution of large brain size in primates have yet to be surveyed in cetaceans. Results We sequenced ~1240 basepairs of the brain development gene microcephalin (MCPH1 in 38 cetacean species. Alignments of these data and a published complete sequence from Tursiops truncatus with primate MCPH1 were utilized in phylogenetic analyses and to estimate ω (rate of nonsynonymous substitution/rate of synonymous substitution using site and branch models of molecular evolution. We also tested the hypothesis that selection on MCPH1 was correlated with brain size in cetaceans using a continuous regression analysis that accounted for phylogenetic history. Our analyses revealed widespread signals of adaptive evolution in the MCPH1 of Cetacea and in other subclades of Mammalia, however, there was not a significant positive association between ω and brain size within Cetacea. Conclusion In conjunction with a recent study of Primates, we find no evidence to support an association between MCPH1 evolution and the evolution of brain size in highly encephalized mammalian species. Our finding of significant positive selection in MCPH1 may be linked to other functions of the gene.

  16. Death and resurrection of the human IRGM gene.

    Directory of Open Access Journals (Sweden)

    Cemalettin Bekpen

    2009-03-01

    Full Text Available Immunity-related GTPases (IRG play an important role in defense against intracellular pathogens. One member of this gene family in humans, IRGM, has been recently implicated as a risk factor for Crohn's disease. We analyzed the detailed structure of this gene family among primates and showed that most of the IRG gene cluster was deleted early in primate evolution, after the divergence of the anthropoids from prosimians ( about 50 million years ago. Comparative sequence analysis of New World and Old World monkey species shows that the single-copy IRGM gene became pseudogenized as a result of an Alu retrotransposition event in the anthropoid common ancestor that disrupted the open reading frame (ORF. We find that the ORF was reestablished as a part of a polymorphic stop codon in the common ancestor of humans and great apes. Expression analysis suggests that this change occurred in conjunction with the insertion of an endogenous retrovirus, which altered the transcription initiation, splicing, and expression profile of IRGM. These data argue that the gene became pseudogenized and was then resurrected through a series of complex structural events and suggest remarkable functional plasticity where alleles experience diverse evolutionary pressures over time. Such dynamism in structure and evolution may be critical for a gene family locked in an arms race with an ever-changing repertoire of intracellular parasites.

  17. Gene finding with a hidden Markov model of genome structure and evolution

    DEFF Research Database (Denmark)

    Pedersen, Jakob Skou; Hein, Jotun

    2003-01-01

    the model are linear in alignment length and genome number. The model is applied to the problem of gene finding. The benefit of modelling sequence evolution is demonstrated both in a range of simulations and on a set of orthologous human/mouse gene pairs. AVAILABILITY: Free availability over the Internet...

  18. Investigation of genes encoding calcineurin B-like protein family in legumes and their expression analyses in chickpea (Cicer arietinum L..

    Directory of Open Access Journals (Sweden)

    Mukesh Kumar Meena

    Full Text Available Calcium ion (Ca2+ is a ubiquitous second messenger that transmits various internal and external signals including stresses and, therefore, is important for plants' response process. Calcineurin B-like proteins (CBLs are one of the plant calcium sensors, which sense and convey the changes in cytosolic Ca2+-concentration for response process. A search in four leguminous plant (soybean, Medicago truncatula, common bean and chickpea genomes identified 9 to 15 genes in each species that encode CBL proteins. Sequence analyses of CBL peptides and coding sequences (CDS suggested that there are nine original CBL genes in these legumes and some of them were multiplied during whole genome or local gene duplication. Coding sequences of chickpea CBL genes (CaCBL were cloned from their cDNAs and sequenced, and their annotations in the genome assemblies were corrected accordingly. Analyses of protein sequences and gene structures of CBL family in plant kingdom indicated its diverse origin but showed a remarkable conservation in overall protein structure with appearance of complex gene structure in the course of evolution. Expression of CaCBL genes in different tissues and in response to different stress and hormone treatment were studied. Most of the CaCBL genes exhibited high expression in flowers. Expression profile of CaCBL genes in response to different abiotic stresses and hormones related to development and stresses (ABA, auxin, cytokinin, SA and JA at different time intervals suggests their diverse roles in development and plant defence in addition to abiotic stress tolerance. These data not only contribute to a better understanding of the complex regulation of chickpea CBL gene family, but also provide valuable information for further research in chickpea functional genomics.

  19. NU-IN: Nucleotide evolution and input module for the EvolSimulator genome simulation platform

    Directory of Open Access Journals (Sweden)

    Barker Michael S

    2010-08-01

    Full Text Available Abstract Background There is increasing demand to test hypotheses that contrast the evolution of genes and gene families among genomes, using simulations that work across these levels of organization. The EvolSimulator program was developed recently to provide a highly flexible platform for forward simulations of amino acid evolution in multiple related lineages of haploid genomes, permitting copy number variation and lateral gene transfer. Synonymous nucleotide evolution is not currently supported, however, and would be highly advantageous for comparisons to full genome, transcriptome, and single nucleotide polymorphism (SNP datasets. In addition, EvolSimulator creates new genomes for each simulation, and does not allow the input of user-specified sequences and gene family information, limiting the incorporation of further biological realism and/or user manipulations of the data. Findings We present modified C++ source code for the EvolSimulator platform, which we provide as the extension module NU-IN. With NU-IN, synonymous and non-synonymous nucleotide evolution is fully implemented, and the user has the ability to use real or previously-simulated sequence data to initiate a simulation of one or more lineages. Gene family membership can be optionally specified, as well as gene retention probabilities that model biased gene retention. We provide PERL scripts to assist the user in deriving this information from previous simulations. We demonstrate the features of NU-IN by simulating genome duplication (polyploidy in the presence of ongoing copy number variation in an evolving lineage. This example is initiated with real genomic data, and produces output that we analyse directly with existing bioinformatic pipelines. Conclusions The NU-IN extension module is a publicly available open source software (GNU GPLv3 license extension to EvolSimulator. With the NU-IN module, users are now able to simulate both drift and selection at the nucleotide

  20. [Study of gene mutation and pathogenetic mechanism for a family with Waardenburg syndrome].

    Science.gov (United States)

    Chen, Hongsheng; Liao, Xinbin; Liu, Yalan; He, Chufeng; Zhang, Hua; Jiang, Lu; Feng, Yong; Mei, Lingyun

    2017-08-10

    To explore the pathogenetic mechanism of a family affected with Waardenburg syndrome. Clinical data of the family was collected. Potential mutation of the MITF, SOX10 and SNAI2 genes were screened. Plasmids for wild type (WT) and mutant MITF proteins were constructed to determine their exogenous expression and subcellular distribution by Western blotting and immunofluorescence assay, respectively. A heterozygous c.763C>T (p.R255X) mutation was detected in exon 8 of the MITF gene in the proband and all other patients from the family. No pathological mutation of the SOX10 and SNAI2 genes was detected. The DNA sequences of plasmids of MITF wild and mutant MITF R255X were confirmed. Both proteins were detected with the expected size. WT MITF protein only localized in the nucleus, whereas R255X protein showed aberrant localization in the nucleus as well as the cytoplasm. The c.763C>T mutation of the MITF gene probably underlies the disease in this family. The mutation can affect the subcellular distribution of MITF proteins in vitro, which may shed light on the molecular mechanism of Waardenburg syndrome caused by mutations of the MITF gene.

  1. Toxin gene determination and evolution in scorpaenoid fish.

    Science.gov (United States)

    Chuang, Po-Shun; Shiao, Jen-Chieh

    2014-09-01

    In this study, we determine the toxin genes from both cDNA and genomic DNA of four scorpaenoid fish and reconstruct their evolutionary relationship. The deduced protein sequences of the two toxin subunits in Sebastapistes strongia, Scorpaenopsis oxycephala, and Sebastiscus marmoratus are about 700 amino acid, similar to the sizes of the stonefish (Synanceia horrida, and Synanceia verrucosa) and lionfish (Pterois antennata and Pterois volitans) toxins previously published. The intron positions are highly conserved among these species, which indicate the applicability of gene finding by using genomic DNA template. The phylogenetic analysis shows that the two toxin subunits were duplicated prior to the speciation of Scorpaenoidei. The precedence of the gene duplication over speciation indicates that the toxin genes may be common to the whole family of Scorpaeniform. Furthermore, one additional toxin gene has been determined in the genomic DNA of Dendrochirus zebra. The phylogenetic analysis suggests that an additional gene duplication occurred before the speciation of the lionfish (Pteroinae) and a pseudogene may be generally present in the lineage of lionfish. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. A novel AVP gene mutation in a Turkish family with neurohypophyseal diabetes insipidus.

    Science.gov (United States)

    Ilhan, M; Tiryakioglu, N O; Karaman, O; Coskunpinar, E; Yildiz, R S; Turgut, S; Tiryakioglu, D; Toprak, H; Tasan, E

    2016-03-01

    Familial neurohypophyseal diabetes insipidus (FNDI) is a rare, autosomal dominant, inherited disorder which is characterized by severe polydipsia and polyuria generally presenting in early childhood. In the present study, we aimed to analyze the AVP gene in a Turkish family with FNDI. Four patients with neurohypophyseal diabetes insipidus and ten healthy members of the family were studied. Diabetes insipidus was diagnosed by the water deprivation test in affected family members. Mutation analysis was performed by sequencing the whole coding region of AVP-NPII gene using DNA isolated from peripheral blood samples. Urine osmolality was low (C in all patients. c.-3A>C mutation in 5'UTR of AVP gene in this family might lead to the truncation of signal peptide, aggregation of AVP in the cytoplasm instead of targeting in the endoplasmic reticulum, thereby could disrupt AVP secretion without causing neuronal cytotoxicity, which might explain the presence of bright spot. The predicted effect of this mutation should be investigated by further in vitro molecular studies.

  3. The polyphenol oxidase gene family in land plants: Lineage-specific duplication and expansion

    Directory of Open Access Journals (Sweden)

    Tran Lan T

    2012-08-01

    Full Text Available Abstract Background Plant polyphenol oxidases (PPOs are enzymes that typically use molecular oxygen to oxidize ortho-diphenols to ortho-quinones. These commonly cause browning reactions following tissue damage, and may be important in plant defense. Some PPOs function as hydroxylases or in cross-linking reactions, but in most plants their physiological roles are not known. To better understand the importance of PPOs in the plant kingdom, we surveyed PPO gene families in 25 sequenced genomes from chlorophytes, bryophytes, lycophytes, and flowering plants. The PPO genes were then analyzed in silico for gene structure, phylogenetic relationships, and targeting signals. Results Many previously uncharacterized PPO genes were uncovered. The moss, Physcomitrella patens, contained 13 PPO genes and Selaginella moellendorffii (spike moss and Glycine max (soybean each had 11 genes. Populus trichocarpa (poplar contained a highly diversified gene family with 11 PPO genes, but several flowering plants had only a single PPO gene. By contrast, no PPO-like sequences were identified in several chlorophyte (green algae genomes or Arabidopsis (A. lyrata and A. thaliana. We found that many PPOs contained one or two introns often near the 3’ terminus. Furthermore, N-terminal amino acid sequence analysis using ChloroP and TargetP 1.1 predicted that several putative PPOs are synthesized via the secretory pathway, a unique finding as most PPOs are predicted to be chloroplast proteins. Phylogenetic reconstruction of these sequences revealed that large PPO gene repertoires in some species are mostly a consequence of independent bursts of gene duplication, while the lineage leading to Arabidopsis must have lost all PPO genes. Conclusion Our survey identified PPOs in gene families of varying sizes in all land plants except in the genus Arabidopsis. While we found variation in intron numbers and positions, overall PPO gene structure is congruent with the phylogenetic

  4. Gene expression and adaptive noncoding changes during human evolution.

    Science.gov (United States)

    Babbitt, Courtney C; Haygood, Ralph; Nielsen, William J; Wray, Gregory A

    2017-06-05

    Despite evidence for adaptive changes in both gene expression and non-protein-coding, putatively regulatory regions of the genome during human evolution, the relationship between gene expression and adaptive changes in cis-regulatory regions remains unclear. Here we present new measurements of gene expression in five tissues of humans and chimpanzees, and use them to assess this relationship. We then compare our results with previous studies of adaptive noncoding changes, analyzing correlations at the level of gene ontology groups, in order to gain statistical power to detect correlations. Consistent with previous studies, we find little correlation between gene expression and adaptive noncoding changes at the level of individual genes; however, we do find significant correlations at the level of biological function ontology groups. The types of function include processes regulated by specific transcription factors, responses to genetic or chemical perturbations, and differentiation of cell types within the immune system. Among functional categories co-enriched with both differential expression and noncoding adaptation, prominent themes include cancer, particularly epithelial cancers, and neural development and function.

  5. Linkage and candidate gene analysis of X-linked familial exudative vitreoretinopathy.

    Science.gov (United States)

    Shastry, B S; Hejtmancik, J F; Plager, D A; Hartzer, M K; Trese, M T

    1995-05-20

    Familial exudative vitreoretinopathy (FEVR) is a hereditary eye disorder characterized by avascularity of the peripheral retina, retinal exudates, tractional detachment, and retinal folds. The disorder is most commonly transmitted as an autosomal dominant trait, but X-linked transmission also occurs. To initiate the process of identifying the gene responsible for the X-linked disorder, linkage analysis has been performed with three previously unreported three- or four-generation families. Two-point analysis showed linkage to MAOA (Zmax = 2.1, theta max = 0) and DXS228 (Zmax = 0.5, theta max = 0.11), and this was further confirmed by multipoint analysis with these same markers (Zmax = 2.81 at MAOA), which both lie near the gene causing Norrie disease. Molecular genetic analysis further reveals a missense mutation (R121W) in the third exon of the Norrie's disease gene that perfectly cosegregates with the disease through three generations in one family. This mutation was not detected in the unaffected family members and six normal unrelated controls, suggesting that it is likely to be the pathogenic mutation. Additionally, a polymorphic missense mutation (H127R) was detected in a severely affected patient.

  6. 5S rRNA gene arrangements in protists: a case of nonadaptive evolution.

    Science.gov (United States)

    Drouin, Guy; Tsang, Corey

    2012-06-01

    Given their high copy number and high level of expression, one might expect that both the sequence and organization of eukaryotic ribosomal RNA genes would be conserved during evolution. Although the organization of 18S, 5.8S and 28S ribosomal RNA genes is indeed relatively well conserved, that of 5S rRNA genes is much more variable. Here, we review the different types of 5S rRNA gene arrangements which have been observed in protists. This includes linkages to the other ribosomal RNA genes as well as linkages to ubiquitin, splice-leader, snRNA and tRNA genes. Mapping these linkages to independently derived phylogenies shows that these diverse linkages have repeatedly been gained and lost during evolution. This argues against such linkages being the primitive condition not only in protists but also in other eukaryote species. Because the only characteristic the diverse genes with which 5S rRNA genes are found linked with is that they are tandemly repeated, these arrangements are unlikely to provide any selective advantage. Rather, the observed high variability in 5S rRNA genes arrangements is likely the result of the fact that 5S rRNA genes contain internal promoters, that these genes are often transposed by diverse recombination mechanisms and that these new gene arrangements are rapidly homogenized by unequal crossingovers and/or by gene conversions events in species with short generation times and frequent founder events.

  7. Extraordinary molecular evolution in the PRDM9 fertility gene.

    Directory of Open Access Journals (Sweden)

    James H Thomas

    2009-12-01

    Full Text Available Recent work indicates that allelic incompatibility in the mouse PRDM9 (Meisetz gene can cause hybrid male sterility, contributing to genetic isolation and potentially speciation. The only phenotype of mouse PRDM9 knockouts is a meiosis I block that causes sterility in both sexes. The PRDM9 gene encodes a protein with histone H3(K4 trimethyltransferase activity, a KRAB domain, and a DNA-binding domain consisting of multiple tandem C2H2 zinc finger (ZF domains. We have analyzed human coding polymorphism and interspecies evolutionary changes in the PRDM9 gene. The ZF domains of PRDM9 are evolving very rapidly, with compelling evidence of positive selection in primates. Positively selected amino acids are predominantly those known to make nucleotide specific contacts in C2H2 zinc fingers. These results suggest that PRDM9 is subject to recurrent selection to change DNA-binding specificity. The human PRDM9 protein is highly polymorphic in its ZF domains and nearly all polymorphisms affect the same nucleotide contact residues that are subject to positive selection. ZF domain nucleotide sequences are strongly homogenized within species, indicating that interfinger recombination contributes to their evolution. PRDM9 has previously been assumed to be a transcription factor required to induce meiosis specific genes, a role that is inconsistent with its molecular evolution. We suggest instead that PRDM9 is involved in some aspect of centromere segregation conflict and that rapidly evolving centromeric DNA drives changes in PRDM9 DNA-binding domains.

  8. Common mutations identified in the MLH1 gene in familial Lynch syndrome

    Directory of Open Access Journals (Sweden)

    Jisha Elias

    2017-12-01

    In this study we identified three families with Lynch syndrome from a rural cancer center in western India (KCHRC, Goraj, Gujarat, where 70-75 CRC patients are seen annually. DNA isolated from the blood of consented family members of all three families (8-10 members/family was subjected to NGS sequencing methods on an Illumina HiSeq 4000 platform. We identified unique mutations in the MLH1 gene in all three HNPCC family members. Two of the three unrelated families shared a common mutation (154delA and 156delA. Total 8 members of a family were identified as carriers for 156delA mutation of which 5 members were unaffected while 3 were affected (age of onset: 1 member <30yrs & 2 were>40yr. The family with 154delA mutation showed 2 affected members (>40yr carrying the mutations.LYS618DEL mutation found in 8 members of the third family showed that both affected and unaffected carried the mutation. Thus the common mutations identified in the MLH1 gene in two unrelated families had a high risk for lynch syndrome especially above the age of 40.

  9. The (d)evolution of methanotrophy in the Beijerinckiaceae--a comparative genomics analysis.

    Science.gov (United States)

    Tamas, Ivica; Smirnova, Angela V; He, Zhiguo; Dunfield, Peter F

    2014-02-01

    The alphaproteobacterial family Beijerinckiaceae contains generalists that grow on a wide range of substrates, and specialists that grow only on methane and methanol. We investigated the evolution of this family by comparing the genomes of the generalist organotroph Beijerinckia indica, the facultative methanotroph Methylocella silvestris and the obligate methanotroph Methylocapsa acidiphila. Highly resolved phylogenetic construction based on universally conserved genes demonstrated that the Beijerinckiaceae forms a monophyletic cluster with the Methylocystaceae, the only other family of alphaproteobacterial methanotrophs. Phylogenetic analyses also demonstrated a vertical inheritance pattern of methanotrophy and methylotrophy genes within these families. Conversely, many lateral gene transfer (LGT) events were detected for genes encoding carbohydrate transport and metabolism, energy production and conversion, and transcriptional regulation in the genome of B. indica, suggesting that it has recently acquired these genes. A key difference between the generalist B. indica and its specialist methanotrophic relatives was an abundance of transporter elements, particularly periplasmic-binding proteins and major facilitator transporters. The most parsimonious scenario for the evolution of methanotrophy in the Alphaproteobacteria is that it occurred only once, when a methylotroph acquired methane monooxygenases (MMOs) via LGT. This was supported by a compositional analysis suggesting that all MMOs in Alphaproteobacteria methanotrophs are foreign in origin. Some members of the Beijerinckiaceae subsequently lost methanotrophic functions and regained the ability to grow on multicarbon energy substrates. We conclude that B. indica is a recidivist multitroph, the only known example of a bacterium having completely abandoned an evolved lifestyle of specialized methanotrophy.

  10. The evolution of CHROMOMETHYLASES and gene body DNA methylation in plants

    OpenAIRE

    Leebens-Mack, Jim; Griffin, Patrick; Rohr, Nicholas; Niederhuth, Chad; Ji, Lexiang; Bewick, Adam; Schmitz, Robert

    2017-01-01

    Background The evolution of gene body methylation (gbM), its origins, and its functional consequences are poorly understood. By pairing the largest collection of transcriptomes (>1000) and methylomes (77) across Viridiplantae, we provide novel insights into the evolution of gbM and its relationship to CHROMOMETHYLASE (CMT) proteins. Results CMTs are evolutionary conserved DNA methyltransferases in Viridiplantae. Duplication events gave rise to what are now referred to as CMT1, 2 and 3. Indepe...

  11. Adaptive evolution of mitochondrial energy metabolism genes associated with increased energy demand in flying insects.

    Science.gov (United States)

    Yang, Yunxia; Xu, Shixia; Xu, Junxiao; Guo, Yan; Yang, Guang

    2014-01-01

    Insects are unique among invertebrates for their ability to fly, which raises intriguing questions about how energy metabolism in insects evolved and changed along with flight. Although physiological studies indicated that energy consumption differs between flying and non-flying insects, the evolution of molecular energy metabolism mechanisms in insects remains largely unexplored. Considering that about 95% of adenosine triphosphate (ATP) is supplied by mitochondria via oxidative phosphorylation, we examined 13 mitochondrial protein-encoding genes to test whether adaptive evolution of energy metabolism-related genes occurred in insects. The analyses demonstrated that mitochondrial DNA protein-encoding genes are subject to positive selection from the last common ancestor of Pterygota, which evolved primitive flight ability. Positive selection was also found in insects with flight ability, whereas no significant sign of selection was found in flightless insects where the wings had degenerated. In addition, significant positive selection was also identified in the last common ancestor of Neoptera, which changed its flight mode from direct to indirect. Interestingly, detection of more positively selected genes in indirect flight rather than direct flight insects suggested a stronger selective pressure in insects having higher energy consumption. In conclusion, mitochondrial protein-encoding genes involved in energy metabolism were targets of adaptive evolution in response to increased energy demands that arose during the evolution of flight ability in insects.

  12. Insight into pattern of codon biasness and nucleotide base usage in serotonin receptor gene family from different mammalian species.

    Science.gov (United States)

    Dass, J Febin Prabhu; Sudandiradoss, C

    2012-07-15

    5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.

  13. Concerted and nonconcerted evolution of the Hsp70 gene superfamily in two sibling species of nematodes.

    Science.gov (United States)

    Nikolaidis, Nikolas; Nei, Masatoshi

    2004-03-01

    We have identified the Hsp70 gene superfamily of the nematode Caenorhabditis briggsae and investigated the evolution of these genes in comparison with Hsp70 genes from C. elegans, Drosophila, and yeast. The Hsp70 genes are classified into three monophyletic groups according to their subcellular localization, namely, cytoplasm (CYT), endoplasmic reticulum (ER), and mitochondria (MT). The Hsp110 genes can be classified into the polyphyletic CYT group and the monophyletic ER group. The different Hsp70 and Hsp110 groups appeared to evolve following the model of divergent evolution. This model can also explain the evolution of the ER and MT genes. On the other hand, the CYT genes are divided into heat-inducible and constitutively expressed genes. The constitutively expressed genes have evolved more or less following the birth-and-death process, and the rates of gene birth and gene death are different between the two nematode species. By contrast, some heat-inducible genes show an intraspecies phylogenetic clustering. This suggests that they are subject to sequence homogenization resulting from gene conversion-like events. In addition, the heat-inducible genes show high levels of sequence conservation in both intra-species and inter-species comparisons, and in most cases, amino acid sequence similarity is higher than nucleotide sequence similarity. This indicates that purifying selection also plays an important role in maintaining high sequence similarity among paralogous Hsp70 genes. Therefore, we suggest that the CYT heat-inducible genes have been subjected to a combination of purifying selection, birth-and-death process, and gene conversion-like events.

  14. Population genomic scans suggest novel genes underlie convergent flowering time evolution in the introduced range of Arabidopsis thaliana.

    Science.gov (United States)

    Gould, Billie A; Stinchcombe, John R

    2017-01-01

    A long-standing question in evolutionary biology is whether the evolution of convergent phenotypes results from selection on the same heritable genetic components. Using whole-genome sequencing and genome scans, we tested whether the evolution of parallel longitudinal flowering time clines in the native and introduced ranges of Arabidopsis thaliana has a similar genetic basis. We found that common variants of large effect on flowering time in the native range do not appear to have been under recent strong selection in the introduced range. We identified a set of 38 new candidate genes that are putatively linked to the evolution of flowering time. A high degree of conditional neutrality of flowering time variants between the native and introduced range may preclude parallel evolution at the level of genes. Overall, neither gene pleiotropy nor available standing genetic variation appears to have restricted the evolution of flowering time to high-frequency variants from the native range or to known flowering time pathway genes. © 2016 John Wiley & Sons Ltd.

  15. On theoretical models of gene expression evolution with random genetic drift and natural selection.

    Directory of Open Access Journals (Sweden)

    Osamu Ogasawara

    2009-11-01

    Full Text Available The relative contributions of natural selection and random genetic drift are a major source of debate in the study of gene expression evolution, which is hypothesized to serve as a bridge from molecular to phenotypic evolution. It has been suggested that the conflict between views is caused by the lack of a definite model of the neutral hypothesis, which can describe the long-run behavior of evolutionary change in mRNA abundance. Therefore previous studies have used inadequate analogies with the neutral prediction of other phenomena, such as amino acid or nucleotide sequence evolution, as the null hypothesis of their statistical inference.In this study, we introduced two novel theoretical models, one based on neutral drift and the other assuming natural selection, by focusing on a common property of the distribution of mRNA abundance among a variety of eukaryotic cells, which reflects the result of long-term evolution. Our results demonstrated that (1 our models can reproduce two independently found phenomena simultaneously: the time development of gene expression divergence and Zipf's law of the transcriptome; (2 cytological constraints can be explicitly formulated to describe long-term evolution; (3 the model assuming that natural selection optimized relative mRNA abundance was more consistent with previously published observations than the model of optimized absolute mRNA abundances.The models introduced in this study give a formulation of evolutionary change in the mRNA abundance of each gene as a stochastic process, on the basis of previously published observations. This model provides a foundation for interpreting observed data in studies of gene expression evolution, including identifying an adequate time scale for discriminating the effect of natural selection from that of random genetic drift of selectively neutral variations.

  16. The importance of melanoma inhibitory activity gene family in the tumor progression of oral cancer.

    Science.gov (United States)

    Sasahira, Tomonori; Bosserhoff, Anja Katrin; Kirita, Tadaaki

    2018-05-01

    Oral squamous cell carcinoma has a high potential for locoregional invasion and nodal metastasis. Consequently, early detection of such malignancies is of immense importance. The melanoma inhibitory activity (MIA) gene family comprises MIA, MIA2, transport and Golgi organization protein 1 (TANGO), and otoraplin (OTOR). These members of the MIA gene family have a highly conserved Src homology 3 (SH3)-like structure. Although the molecules of this family share 34-45% amino acid homology and 47-59% cDNA sequence homology, those members, excluding OTOR, play different tumor-associated functions. MIA has a pivotal role in the progression and metastasis of melanoma; MIA2 and TANGO have been suggested to possess tumor-suppressive functions; and OTOR is uniquely expressed in cochlea of the inner ear. Therefore, the definite functions of the MIA gene family in cancer cells remain unclear. Since the members of the MIA gene family are secreted proteins, these molecules might be useful tumor markers that can be detected in the body fluids, including serum and saliva. In this review, we described the molecular biological functions of the MIA gene family in oral cancer. © 2018 Japanese Society of Pathology and John Wiley & Sons Australia, Ltd.

  17. Sensitive Periods, Vasotocin-Family Peptides, and the Evolution and Development of Social Behavior

    Directory of Open Access Journals (Sweden)

    Nicole M. Baran

    2017-08-01

    Full Text Available Nonapeptides, by modulating the activity of neural circuits in specific social contexts, provide an important mechanism underlying the evolution of diverse behavioral phenotypes across vertebrate taxa. Vasotocin-family nonapeptides, in particular, have been found to be involved in behavioral plasticity and diversity in social behavior, including seasonal variation, sexual dimorphism, and species differences. Although nonapeptides have been the focus of a great deal of research over the last several decades, the vast majority of this work has focused on adults. However, behavioral diversity may also be explained by the ways in which these peptides shape neural circuits and influence social processes during development. In this review, I synthesize comparative work on vasotocin-family peptides during development and classic work on early forms of social learning in developmental psychobiology. I also summarize recent work demonstrating that early life manipulations of the nonapeptide system alter attachment, affiliation, and vocal learning in zebra finches. I thus hypothesize that vasotocin-family peptides are involved in the evolution of social behaviors through their influence on learning during sensitive periods in social development.

  18. Evolutionary relationship and structural characterization of the EPF/EPFL gene family.

    Directory of Open Access Journals (Sweden)

    Naoki Takata

    Full Text Available EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes.

  19. The gene transformer-2 of Anastrepha fruit flies (Diptera, Tephritidae) and its evolution in insects.

    Science.gov (United States)

    Sarno, Francesca; Ruiz, María F; Eirín-López, José M; Perondini, André L P; Selivon, Denise; Sánchez, Lucas

    2010-05-13

    In the tephritids Ceratitis, Bactrocera and Anastrepha, the gene transformer provides the memory device for sex determination via its auto-regulation; only in females is functional Tra protein produced. To date, the isolation and characterisation of the gene transformer-2 in the tephritids has only been undertaken in Ceratitis, and it has been shown that its function is required for the female-specific splicing of doublesex and transformer pre-mRNA. It therefore participates in transformer auto-regulatory function. In this work, the characterisation of this gene in eleven tephritid species belonging to the less extensively analysed genus Anastrepha was undertaken in order to throw light on the evolution of transformer-2. The gene transformer-2 produces a protein of 249 amino acids in both sexes, which shows the features of the SR protein family. No significant partially spliced mRNA isoform specific to the male germ line was detected, unlike in Drosophila. It is transcribed in both sexes during development and in adult life, in both the soma and germ line. The injection of Anastrepha transformer-2 dsRNA into Anastrepha embryos caused a change in the splicing pattern of the endogenous transformer and doublesex pre-mRNA of XX females from the female to the male mode. Consequently, these XX females were transformed into pseudomales. The comparison of the eleven Anastrepha Transformer-2 proteins among themselves, and with the Transformer-2 proteins of other insects, suggests the existence of negative selection acting at the protein level to maintain Transformer-2 structural features. These results indicate that transformer-2 is required for sex determination in Anastrepha through its participation in the female-specific splicing of transformer and doublesex pre-mRNAs. It is therefore needed for the auto-regulation of the gene transformer. Thus, the transformer/transfomer-2 > doublesex elements at the bottom of the cascade, and their relationships, probably represent

  20. Evolution of stress-regulated gene expression in duplicate genes of Arabidopsis thaliana.

    Directory of Open Access Journals (Sweden)

    Cheng Zou

    2009-07-01

    Full Text Available Due to the selection pressure imposed by highly variable environmental conditions, stress sensing and regulatory response mechanisms in plants are expected to evolve rapidly. One potential source of innovation in plant stress response mechanisms is gene duplication. In this study, we examined the evolution of stress-regulated gene expression among duplicated genes in the model plant Arabidopsis thaliana. Key to this analysis was reconstructing the putative ancestral stress regulation pattern. By comparing the expression patterns of duplicated genes with the patterns of their ancestors, duplicated genes likely lost and gained stress responses at a rapid rate initially, but the rate is close to zero when the synonymous substitution rate (a proxy for time is > approximately 0.8. When considering duplicated gene pairs, we found that partitioning of putative ancestral stress responses occurred more frequently compared to cases of parallel retention and loss. Furthermore, the pattern of stress response partitioning was extremely asymmetric. An analysis of putative cis-acting DNA regulatory elements in the promoters of the duplicated stress-regulated genes indicated that the asymmetric partitioning of ancestral stress responses are likely due, at least in part, to differential loss of DNA regulatory elements; the duplicated genes losing most of their stress responses were those that had lost more of the putative cis-acting elements. Finally, duplicate genes that lost most or all of the ancestral responses are more likely to have gained responses to other stresses. Therefore, the retention of duplicates that inherit few or no functions seems to be coupled to neofunctionalization. Taken together, our findings provide new insight into the patterns of evolutionary changes in gene stress responses after duplication and lay the foundation for testing the adaptive significance of stress regulatory changes under highly variable biotic and abiotic environments.

  1. Identification of pathogenic gene variants in small families with intellectually disabled siblings by exome sequencing.

    Science.gov (United States)

    Schuurs-Hoeijmakers, Janneke H M; Vulto-van Silfhout, Anneke T; Vissers, Lisenka E L M; van de Vondervoort, Ilse I G M; van Bon, Bregje W M; de Ligt, Joep; Gilissen, Christian; Hehir-Kwa, Jayne Y; Neveling, Kornelia; del Rosario, Marisol; Hira, Gausiya; Reitano, Santina; Vitello, Aurelio; Failla, Pinella; Greco, Donatella; Fichera, Marco; Galesi, Ornella; Kleefstra, Tjitske; Greally, Marie T; Ockeloen, Charlotte W; Willemsen, Marjolein H; Bongers, Ernie M H F; Janssen, Irene M; Pfundt, Rolph; Veltman, Joris A; Romano, Corrado; Willemsen, Michèl A; van Bokhoven, Hans; Brunner, Han G; de Vries, Bert B A; de Brouwer, Arjan P M

    2013-12-01

    Intellectual disability (ID) is a common neurodevelopmental disorder affecting 1-3% of the general population. Mutations in more than 10% of all human genes are considered to be involved in this disorder, although the majority of these genes are still unknown. We investigated 19 small non-consanguineous families with two to five affected siblings in order to identify pathogenic gene variants in known, novel and potential ID candidate genes. Non-consanguineous families have been largely ignored in gene identification studies as small family size precludes prior mapping of the genetic defect. Using exome sequencing, we identified pathogenic mutations in three genes, DDHD2, SLC6A8, and SLC9A6, of which the latter two have previously been implicated in X-linked ID phenotypes. In addition, we identified potentially pathogenic mutations in BCORL1 on the X-chromosome and in MCM3AP, PTPRT, SYNE1, and ZNF528 on autosomes. We show that potentially pathogenic gene variants can be identified in small, non-consanguineous families with as few as two affected siblings, thus emphasising their value in the identification of syndromic and non-syndromic ID genes.

  2. Exclusion of known gene for enamel development in two Brazilian families with amelogenesis imperfecta.

    Science.gov (United States)

    Santos, Maria C L G; Hart, P Suzanne; Ramaswami, Mukundhan; Kanno, Cláudia M; Hart, Thomas C; Line, Sergio R P

    2007-01-31

    Amelogenesis imperfecta (AI) is a genetically heterogeneous group of diseases that result in defective development of tooth enamel. Mutations in several enamel proteins and proteinases have been associated with AI. The object of this study was to evaluate evidence of etiology for the six major candidate gene loci in two Brazilian families with AI. Genomic DNA was obtained from family members and all exons and exon-intron boundaries of the ENAM, AMBN, AMELX, MMP20, KLK4 and Amelotin gene were amplified and sequenced. Each family was also evaluated for linkage to chromosome regions known to contain genes important in enamel development. The present study indicates that the AI in these two families is not caused by any of the known loci for AI or any of the major candidate genes proposed in the literature. These findings indicate extensive genetic heterogeneity for non-syndromic AI.

  3. Genome-wide evolutionary characterization and expression analyses of major latex protein (MLP) family genes in Vitis vinifera.

    Science.gov (United States)

    Zhang, Ningbo; Li, Ruimin; Shen, Wei; Jiao, Shuzhen; Zhang, Junxiang; Xu, Weirong

    2018-04-27

    The major latex protein/ripening-related protein (MLP/RRP) subfamily is known to be involved in a wide range of biological processes of plant development and various stress responses. However, the biological function of MLP/RRP proteins is still far from being clear and identification of them may provide important clues for understanding their roles. Here, we report a genome-wide evolutionary characterization and gene expression analysis of the MLP family in European Vitis species. A total of 14 members, was found in the grape genome, all of which are located on chromosome 1, where are predominantly arranged in tandem clusters. We have noticed, most surprisingly, promoter-sharing by several non-identical but highly similar gene members to a greater extent than expected by chance. Synteny analysis between the grape and Arabidopsis thaliana genomes suggested that 3 grape MLP genes arose before the divergence of the two species. Phylogenetic analysis provided further insights into the evolutionary relationship between the genes, as well as their putative functions, and tissue-specific expression analysis suggested distinct biological roles for different members. Our expression data suggested a couple of candidate genes involved in abiotic stresses and phytohormone responses. The present work provides new insight into the evolution and regulation of Vitis MLP genes, which represent targets for future studies and inclusion in tolerance-related molecular breeding programs.

  4. Whole genome identification, phylogeny and evolution of the cytochrome P450 family 2 (CYP2) sub-families in birds

    DEFF Research Database (Denmark)

    Almeida, Daniela; Maldonado, Emanuel; Khan, Imran

    2016-01-01

    The cytochrome P450 (CYP) superfamily defends organisms from endogenous and noxious environmental compounds, and thus is crucial for survival. However, beyond mammals the molecular evolution of CYP2 subfamilies is poorly understood. Here, we characterized the CYP2 family across 48 novel avian who...

  5. Evolution of the NANOG pseudogene family in the human and chimpanzee genomes

    Directory of Open Access Journals (Sweden)

    Maughan Peter J

    2006-02-01

    Full Text Available Abstract Background The NANOG gene is expressed in mammalian embryonic stem cells where it maintains cellular pluripotency. An unusually large family of pseudogenes arose from it with one unprocessed and ten processed pseudogenes in the human genome. This article compares the NANOG gene and its pseudogenes in the human and chimpanzee genomes and derives an evolutionary history of this pseudogene family. Results The NANOG gene and all pseudogenes except NANOGP8 are present at their expected orthologous chromosomal positions in the chimpanzee genome when compared to the human genome, indicating that their origins predate the human-chimpanzee divergence. Analysis of flanking DNA sequences demonstrates that NANOGP8 is absent from the chimpanzee genome. Conclusion Based on the most parsimonious ordering of inferred source-gene mutations, the deduced evolutionary origins for the NANOG pseudogene family in the human and chimpanzee genomes, in order of most ancient to most recent, are NANOGP6, NANOGP5, NANOGP3, NANOGP10, NANOGP2, NANOGP9, NANOGP7, NANOGP1, and NANOGP4. All of these pseudogenes were fixed in the genome of the human-chimpanzee common ancestor. NANOGP8 is the most recent pseudogene and it originated exclusively in the human lineage after the human-chimpanzee divergence. NANOGP1 is apparently an unprocessed pseudogene. Comparison of its sequence to the functional NANOG gene's reading frame suggests that this apparent pseudogene remained functional after duplication and, therefore, was subject to selection-driven conservation of its reading frame, and that it may retain some functionality or that its loss of function may be evolutionarily recent.

  6. Molecular Evolution of the Infrared Sensory Gene TRPA1 in Snakes and Implications for Functional Studies

    Science.gov (United States)

    Jiang, Ke; Zhang, Peng

    2011-01-01

    TRPA1 is a calcium ion channel protein recently identified as the infrared receptor in pit organ-containing snakes. Therefore, understanding the molecular evolution of TRPA1 may help to illuminate the origin of “heat vision” in snakes and reveal the molecular mechanism of infrared sensitivity for TRPA1. To this end, we sequenced the infrared sensory gene TRPA1 in 24 snake species, representing nine snake families and multiple non-snake outgroups. We found that TRPA1 is under strong positive selection in the pit-bearing snakes studied, but not in other non-pit snakes and non-snake vertebrates. As a comparison, TRPV1, a gene closely related to TRPA1, was found to be under strong purifying selection in all the species studied, with no difference in the strength of selection between pit-bearing snakes and non-pit snakes. This finding demonstrates that the adaptive evolution of TRPA1 specifically occurred within the pit-bearing snakes and may be related to the functional modification for detecting infrared radiation. In addition, by comparing the TRPA1 protein sequences, we identified 11 amino acid sites that were diverged in pit-bearing snakes but conserved in non-pit snakes and other vertebrates, 21 sites that were diverged only within pit-vipers but conserved in the remaining snakes. These specific amino acid substitutions may be potentially functional important for infrared sensing. PMID:22163322

  7. Contrasting patterns in the evolution of the Rab GTPase family in Archaeplastida

    Directory of Open Access Journals (Sweden)

    Romana Petrželková

    2014-12-01

    Full Text Available Rab GTPases are a vast group of proteins serving a role of master regulators in membrane trafficking in eukaryotes. Previous studies delineated some 23 Rab and Rab-like paralogs ancestral for eukaryotes and mapped their current phylogenetic distribution, but the analyses relied on a limited sampling of the eukaryotic diversity. Taking advantage of the recent growth of genome and transcriptome resources for phylogenetically diverse plants and algae, we reanalyzed the evolution of the Rab family in eukaryotes with the primary plastid, collectively constituting the presumably monophyletic supergroup Archaeplastida. Our most important novel findings are as follows: (i the ancestral set of Rabs in Archaeplastida included not only the paralogs Rab1, Rab2, Rab5, Rab6, Rab7, Rab8, Rab11, Rab18, Rab23, Rab24, Rab28, IFT27, and RTW (=Rabl2, as suggested previously, but also Rab14 and Rab34, because Rab14 exists in glaucophytes and Rab34 is present in glaucophytes and some green algae; (ii except in embryophytes, Rab gene duplications have been rare in Archaeplastida. Most notable is the independent emergence of divergent, possibly functionally novel, in-paralogs of Rab1 and Rab11 in several archaeplastidial lineages; (iii recurrent gene losses have been a significant factor shaping Rab gene complements in archaeplastidial species; for example, the Rab21 paralog was lost at least six times independently within Archaeplastida, once in the lineage leading to the “core” eudicots; (iv while the glaucophyte Cyanophora paradoxa has retained the highest number of ancestral Rab paralogs among all archaeplastidial species studied so far, rhodophytes underwent an extreme reduction of the Rab gene set along their stem lineage, resulting in only six paralogs (Rab1, Rab2, Rab6, Rab7, Rab11, and Rab18 present in modern red algae. Especially notable is the absence of Rab5, a virtually universal paralog essential for the endocytic pathway, suggesting that endocytosis

  8. Evolution and expression analysis of the soybean glutamate ...

    Indian Academy of Sciences (India)

    Evolution and expression analysis of the soybean glutamate decarboxylase gene family. TAE KYUNG HYUN, SEUNG HEE EOM, XIAO HAN and JU-SUNG KIM http://www.ias.ac.in/jbiosci. J. Biosci. 39(5), December 2014, 899–907, © Indian Academy of Sciences. Supplementary material. Supplementary figure 1.

  9. Molecular evolution of a Y chromosome to autosome gene duplication in Drosophila.

    Science.gov (United States)

    Dyer, Kelly A; White, Brooke E; Bray, Michael J; Piqué, Daniel G; Betancourt, Andrea J

    2011-03-01

    In contrast to the rest of the genome, the Y chromosome is restricted to males and lacks recombination. As a result, Y chromosomes are unable to respond efficiently to selection, and newly formed Y chromosomes degenerate until few genes remain. The rapid loss of genes from newly formed Y chromosomes has been well studied, but gene loss from highly degenerate Y chromosomes has only recently received attention. Here, we identify and characterize a Y to autosome duplication of the male fertility gene kl-5 that occurred during the evolution of the testacea group species of Drosophila. The duplication was likely DNA based, as other Y-linked genes remain on the Y chromosome, the locations of introns are conserved, and expression analyses suggest that regulatory elements remain linked. Genetic mapping reveals that the autosomal copy of kl-5 resides on the dot chromosome, a tiny autosome with strongly suppressed recombination. Molecular evolutionary analyses show that autosomal copies of kl-5 have reduced polymorphism and little recombination. Importantly, the rate of protein evolution of kl-5 has increased significantly in lineages where it is on the dot versus Y linked. Further analyses suggest this pattern is a consequence of relaxed purifying selection, rather than adaptive evolution. Thus, although the initial fixation of the kl-5 duplication may have been advantageous, slightly deleterious mutations have accumulated in the dot-linked copies of kl-5 faster than in the Y-linked copies. Because the dot chromosome contains seven times more genes than the Y and is exposed to selection in both males and females, these results suggest that the dot suffers the deleterious effects of genetic linkage to more selective targets compared with the Y chromosome. Thus, a highly degenerate Y chromosome may not be the worst environment in the genome, as is generally thought, but may in fact be protected from the accumulation of deleterious mutations relative to other nonrecombining

  10. The SOD gene family in tomato: identification, phylogenetic relationships and expression patterns

    Directory of Open Access Journals (Sweden)

    kun feng

    2016-08-01

    Full Text Available Superoxide dismutases (SODs are critical antioxidant enzymes that protect organisms from reactive oxygen species (ROS caused by adverse conditions, and have been widely found in the cytoplasm, chloroplasts, and mitochondria of eukaryotic and prokaryotic cells. Tomato (Solanum lycopersicum L. is an important economic crop and is cultivated worldwide. However, abiotic and biotic stresses severely hinder growth and development of the plant, which affects the production and quality of the crop. To reveal the potential roles of SOD genes under various stresses, we performed a systematic analysis of the tomato SOD gene family and analyzed the expression patterns of SlSOD genes in response to abiotic stresses at the whole-genome level. The characteristics of the SlSOD gene family were determined by analyzing gene structure, conserved motifs, chromosomal distribution, phylogenetic relationships, and expression patterns. We determined that there are at least nine SOD genes in tomato, including four Cu/ZnSODs, three FeSODs, and one MnSOD, and they are unevenly distributed on 12 chromosomes. Phylogenetic analyses of SOD genes from tomato and other plant species were separated into two groups with a high bootstrap value, indicating that these SOD genes were present before the monocot-dicot split. Additionally, many cis-elements that respond to different stresses were found in the promoters of nine SlSOD genes. Gene expression analysis based on RNA-seq data showed that most genes were expressed in all tested tissues, with the exception of SlSOD6 and SlSOD8, which were only expressed in young fruits. Microarray data analysis showed that most members of the SlSOD gene family were altered under salt- and drought-stress conditions. This genome-wide analysis of SlSOD genes helps to clarify the function of SlSOD genes under different stress conditions and provides information to aid in further understanding the evolutionary relationships of SOD genes in plants.

  11. Bioinformatics Analysis of MAPKKK Family Genes in Medicago truncatula

    Directory of Open Access Journals (Sweden)

    Wei Li

    2016-04-01

    Full Text Available Mitogen‐activated protein kinase kinase kinase (MAPKKK is a component of the MAPK cascade pathway that plays an important role in plant growth, development, and response to abiotic stress, the functions of which have been well characterized in several plant species, such as Arabidopsis, rice, and maize. In this study, we performed genome‐wide and systemic bioinformatics analysis of MAPKKK family genes in Medicago truncatula. In total, there were 73 MAPKKK family members identified by search of homologs, and they were classified into three subfamilies, MEKK, ZIK, and RAF. Based on the genomic duplication function, 72 MtMAPKKK genes were located throughout all chromosomes, but they cluster in different chromosomes. Using microarray data and high‐throughput sequencing‐data, we assessed their expression profiles in growth and development processes; these results provided evidence for exploring their important functions in developmental regulation, especially in the nodulation process. Furthermore, we investigated their expression in abiotic stresses by RNA‐seq, which confirmed their critical roles in signal transduction and regulation processes under stress. In summary, our genome‐wide, systemic characterization and expressional analysis of MtMAPKKK genes will provide insights that will be useful for characterizing the molecular functions of these genes in M. truncatula.

  12. Dynamic Changes in Yeast Phosphatase Families Allow for Specialization in Phosphate and Thiamine Starvation.

    Science.gov (United States)

    Nahas, John V; Iosue, Christine L; Shaik, Noor F; Selhorst, Kathleen; He, Bin Z; Wykoff, Dennis D

    2018-05-10

    Convergent evolution is often due to selective pressures generating a similar phenotype. We observe relatively recent duplications in a spectrum of Saccharomycetaceae yeast species resulting in multiple phosphatases that are regulated by different nutrient conditions - thiamine and phosphate starvation. This specialization is both transcriptional and at the level of phosphatase substrate specificity. In Candida glabrata , loss of the ancestral phosphatase family was compensated by the co-option of a different histidine phosphatase family with three paralogs. Using RNA-seq and functional assays, we identify one of these paralogs, CgPMU3 , as a thiamine phosphatase. We further determine that the 81% identical paralog CgPMU2 does not encode thiamine phosphatase activity; however, both are capable of cleaving the phosphatase substrate, 1-napthyl-phosphate. We functionally demonstrate that members of this family evolved novel enzymatic functions for phosphate and thiamine starvation, and are regulated transcriptionally by either nutrient condition, and observe similar trends in other yeast species. This independent, parallel evolution involving two different families of histidine phosphatases suggests that there were likely similar selective pressures on multiple yeast species to recycle thiamine and phosphate. In this work, we focused on duplication and specialization, but there is also repeated loss of phosphatases, indicating that the expansion and contraction of the phosphatase family is dynamic in many Ascomycetes. The dynamic evolution of the phosphatase gene families is perhaps just one example of how gene duplication, co-option, and transcriptional and functional specialization together allow species to adapt to their environment with existing genetic resources. Copyright © 2018, G3: Genes, Genomes, Genetics.

  13. New genus and species of the extinct aphid family Szelegiewicziidae and their implications for aphid evolution.

    Science.gov (United States)

    Wegierek, Piotr; Żyła, Dagmara; Homan, Agnieszka; Cai, Chenyang; Huang, Diying

    2017-10-24

    Recently, we are witnessing an increased appreciation for the importance of the fossil record in phylogenetics and testing various evolutionary hypotheses. However, this approach brings many challenges, especially for such a complex group as aphids and requires a thorough morphological analysis of the extinct groups. The extinct aphid family Szelegiewicziidae is supposed to be one of the oviparous lineages in aphid evolution. New material from the rock fossil deposits of Shar Teg (Upper Jurassic of Mongolia), Baissa (Lower Cretaceous of Siberia-Russia), and Burmese amber (Upper Cretaceous of Myanmar) allowed us to undertake a more detailed examination of the morphological features and carry out an analysis of the taxonomical composition and evolution of the family. This led us to the conclusion that evolution of the body plan and wing structure was similar in different, often not closely related groups, probably as a result of convergence. Additionally, we present a description of a new genus and two species (Tinaphis mongolica Żyła &Wegierek, sp. nov., and Feroorbis burmensis Wegierek & Huang, gen. et sp. nov.) that belong to this family.

  14. New genus and species of the extinct aphid family Szelegiewicziidae and their implications for aphid evolution

    Science.gov (United States)

    Wegierek, Piotr; Żyła, Dagmara; Homan, Agnieszka; Cai, Chenyang; Huang, Diying

    2017-12-01

    Recently, we are witnessing an increased appreciation for the importance of the fossil record in phylogenetics and testing various evolutionary hypotheses. However, this approach brings many challenges, especially for such a complex group as aphids and requires a thorough morphological analysis of the extinct groups. The extinct aphid family Szelegiewicziidae is supposed to be one of the oviparous lineages in aphid evolution. New material from the rock fossil deposits of Shar Teg (Upper Jurassic of Mongolia), Baissa (Lower Cretaceous of Siberia-Russia), and Burmese amber (Upper Cretaceous of Myanmar) allowed us to undertake a more detailed examination of the morphological features and carry out an analysis of the taxonomical composition and evolution of the family. This led us to the conclusion that evolution of the body plan and wing structure was similar in different, often not closely related groups, probably as a result of convergence. Additionally, we present a description of a new genus and two species ( Tinaphis mongolica Żyła &Wegierek, sp. nov., and Feroorbis burmensis Wegierek & Huang, gen. et sp. nov.) that belong to this family.

  15. Genetic recombination as a major cause of mutagenesis in the human globin gene clusters.

    Science.gov (United States)

    Borg, Joseph; Georgitsi, Marianthi; Aleporou-Marinou, Vassiliki; Kollia, Panagoula; Patrinos, George P

    2009-12-01

    Homologous recombination is a frequent phenomenon in multigene families and as such it occurs several times in both the alpha- and beta-like globin gene families. In numerous occasions, genetic recombination has been previously implicated as a major mechanism that drives mutagenesis in the human globin gene clusters, either in the form of unequal crossover or gene conversion. Unequal crossover results in the increase or decrease of the human globin gene copies, accompanied in the majority of cases with minor phenotypic consequences, while gene conversion contributes either to maintaining sequence homogeneity or generating sequence diversity. The role of genetic recombination, particularly gene conversion in the evolution of the human globin gene families has been discussed elsewhere. Here, we summarize our current knowledge and review existing experimental evidence outlining the role of genetic recombination in the mutagenic process in the human globin gene families.

  16. Genomic evolution of 11 type strains within family Planctomycetaceae.

    Directory of Open Access Journals (Sweden)

    Min Guo

    Full Text Available The species in family Planctomycetaceae are ideal groups for investigating the origin of eukaryotes. Their cells are divided by a lipidic intracytoplasmic membrane and they share a number of eukaryote-like molecular characteristics. However, their genomic structures, potential abilities, and evolutionary status are still unknown. In this study, we searched for common protein families and a core genome/pan genome based on 11 sequenced species in family Planctomycetaceae. Then, we constructed phylogenetic tree based on their 832 common protein families. We also annotated the 11 genomes using the Clusters of Orthologous Groups database. Moreover, we predicted and reconstructed their core/pan metabolic pathways using the KEGG (Kyoto Encyclopedia of Genes and Genomes orthology system. Subsequently, we identified genomic islands (GIs and structural variations (SVs among the five complete genomes and we specifically investigated the integration of two Planctomycetaceae plasmids in all 11 genomes. The results indicate that Planctomycetaceae species share diverse genomic variations and unique genomic characteristics, as well as have huge potential for human applications.

  17. Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family

    Directory of Open Access Journals (Sweden)

    Nordlund Henri R

    2005-03-01

    Full Text Available Abstract Background A chicken egg contains several biotin-binding proteins (BBPs, whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins.

  18. [Mutation analysis of FGFR3 gene in a family featuring hereditary dwarfism].

    Science.gov (United States)

    Zhang, Qiong; Jiang, Hai-ou; Quan, Qing-li; Li, Jun; He, Ting; Huang, Xue-shuang

    2011-12-01

    To investigate the clinical symptoms and potential mutation in FGFR3 gene for a family featuring hereditary dwarfism in order to attain diagnosis and provide prenatal diagnosis. Five patients and two unaffected relatives from the family, in addition with 100 healthy controls, were recruited. Genome DNA was extracted. Exons 10 and 13 of the FGFR3 gene were amplified using polymerase chain reaction (PCR). PCR products were sequenced in both directions. All patients had similar features including short stature, short limbs, lumbar hyperlordosis but normal craniofacial features. A heterozygous mutation G1620T (N540K) was identified in the cDNA from all patients but not in the unaffected relatives and 100 control subjects. A heterozygous G380R mutation was excluded. The hereditary dwarfism featured by this family has been caused by hypochondroplasia (HCH) due to a N540K mutation in the FGFR3 gene.

  19. A study of the evolution of human microRNAs by their apparent repression effectiveness on target genes.

    Directory of Open Access Journals (Sweden)

    Yong Huang

    Full Text Available BACKGROUND: Even though the genomes of many model species have already been sequenced, our knowledge of gene regulation in evolution is still very limited. One big obstacle is that it is hard to predict the target genes of transcriptional factors accurately from sequences. In this respect, microRNAs (miRNAs are different from transcriptional factors, as target genes of miRNAs can be readily predicted from sequences. This feature of miRNAs offers an unprecedented vantage point for evolutionary analysis of gene regulation. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we analyzed a particular aspect of miRNA evolution, the differences in the "apparent repression effectiveness (ARE" between human miRNAs of different conservational levels. ARE is a measure we designed to evaluate the repression effect of miRNAs on target genes based on publicly available gene expression data in normal tissues and miRNA targeting and expression data. We found that ARE values of more conserved miRNAs are significantly higher than those of less conserved miRNAs in general. We also found the gain in expression abundance and broadness of miRNAs in evolution contributed to the gain in ARE. CONCLUSIONS/SIGNIFICANCE: The ARE measure quantifies the repressive effects of miRNAs and enables us to study the influences of many factors on miRNA-mediated repression, such as conservational levels and expression levels of miRNAs. The gain in ARE can be explained by the existence of a trend of miRNAs in evolution to effectively control more target genes, which is beneficial to the miRNAs but not necessarily to the organism at all times. Our results from miRNAs gave us an insight of the complex interplay between regulators and target genes in evolution.

  20. Identification and description of three families with familial Alzheimer disease that segregate variants in the SORL1 gene.

    Science.gov (United States)

    Thonberg, Håkan; Chiang, Huei-Hsin; Lilius, Lena; Forsell, Charlotte; Lindström, Anna-Karin; Johansson, Charlotte; Björkström, Jenny; Thordardottir, Steinunn; Sleegers, Kristel; Van Broeckhoven, Christine; Rönnbäck, Annica; Graff, Caroline

    2017-06-09

    Alzheimer disease (AD) is a progressive neurodegenerative disorder and the most common form of dementia. The majority of AD cases are sporadic, while up to 5% are families with an early onset AD (EOAD). Mutations in one of the three genes: amyloid beta precursor protein (APP), presenilin 1 (PSEN1) or presenilin 2 (PSEN2) can be disease causing. However, most EOAD families do not carry mutations in any of these three genes, and candidate genes, such as the sortilin-related receptor 1 (SORL1), have been suggested to be potentially causative. To identify AD causative variants, we performed whole-exome sequencing on five individuals from a family with EOAD and a missense variant, p.Arg1303Cys (c.3907C > T) was identified in SORL1 which segregated with disease and was further characterized with immunohistochemistry on two post mortem autopsy cases from the same family. In a targeted re-sequencing effort on independent index patients from 35 EOAD-families, a second SORL1 variant, c.3050-2A > G, was found which segregated with the disease in 3 affected and was absent in one unaffected family member. The c.3050-2A > G variant is located two nucleotides upstream of exon 22 and was shown to cause exon 22 skipping, resulting in a deletion of amino acids Gly1017- Glu1074 of SORL1. Furthermore, a third SORL1 variant, c.5195G > C, recently identified in a Swedish case control cohort included in the European Early-Onset Dementia (EU EOD) consortium study, was detected in two affected siblings in a third family with familial EOAD. The finding of three SORL1-variants that segregate with disease in three separate families with EOAD supports the involvement of SORL1 in AD pathology. The cause of these rare monogenic forms of EOAD has proven difficult to find and the use of exome and genome sequencing may be a successful route to target them.