evolutionarily conserved sequences: Topics by WorldWideScience.org

Sample records for evolutionarily conserved sequences

Evolutionarily conserved regulation of TOR signalling.

Science.gov (United States)

Takahara, Terunao; Maeda, Tatsuya

2013-07-01

The target of rapamycin (TOR) is an evolutionarily conserved protein kinase that regulates cell growth in response to various environmental as well as intracellular cues through the formation of 2 distinct TOR complexes (TORC), TORC1 and TORC2. Dysregulation of TORC1 and TORC2 activity is closely associated with various diseases, including diabetes, cancer and neurodegenerative disorders. Over the past few years, new regulatory mechanisms of TORC1 and TORC2 activity have been elucidated. Furthermore, recent advances in the study of TOR inhibitors have revealed previously unrecognized cellular functions of TORC1. In this review, we briefly summarize the current understanding of the evolutionarily conserved TOR signalling from upstream regulators to downstream events.
Genomic Imprinting Was Evolutionarily Conserved during Wheat Polyploidization.

Science.gov (United States)

Yang, Guanghui; Liu, Zhenshan; Gao, Lulu; Yu, Kuohai; Feng, Man; Yao, Yingyin; Peng, Huiru; Hu, Zhaorong; Sun, Qixin; Ni, Zhongfu; Xin, Mingming

2018-01-01

Genomic imprinting is an epigenetic phenomenon that causes genes to be differentially expressed depending on their parent of origin. To evaluate the evolutionary conservation of genomic imprinting and the effects of ploidy on this process, we investigated parent-of-origin-specific gene expression patterns in the endosperm of diploid ( Aegilops spp), tetraploid, and hexaploid wheat ( Triticum spp) at various stages of development via high-throughput transcriptome sequencing. We identified 91, 135, and 146 maternally or paternally expressed genes (MEGs or PEGs, respectively) in diploid, tetraploid, and hexaploid wheat, respectively, 52.7% of which exhibited dynamic expression patterns at different developmental stages. Gene Ontology enrichment analysis suggested that MEGs and PEGs were involved in metabolic processes and DNA-dependent transcription, respectively. Nearly half of the imprinted genes exhibited conserved expression patterns during wheat hexaploidization. In addition, 40% of the homoeolog pairs originating from whole-genome duplication were consistently maternally or paternally biased in the different subgenomes of hexaploid wheat. Furthermore, imprinted expression was found for 41.2% and 50.0% of homolog pairs that evolved by tandem duplication after genome duplication in tetraploid and hexaploid wheat, respectively. These results suggest that genomic imprinting was evolutionarily conserved between closely related Triticum and Aegilops species and in the face of polyploid hybridization between species in these genera. © 2018 American Society of Plant Biologists. All rights reserved.
Molecular dissection of a contiguous gene syndrome: Frequent submicroscopic deletions, evolutionarily conserved sequences, and a hypomethylated island in the Miller-Dieker chromosome region

International Nuclear Information System (INIS)

Ledbetter, D.H.; Ledbetter, S.A.; vanTuinen, P.

1989-01-01

The Miller-Dieker syndrome (MDS), composed of characteristic facial abnormalities and a severe neuronal migration disorder affecting the cerebral cortex, is caused by visible or submicroscopic deletions of chromosome band 17p13. Twelve anonymous DNA markers were tested against a panel of somatic cell hybrids containing 17p deletions from seven MDS patients. All patients, including three with normal karyotypes, are deleted for a variable set of 5-12 markers. Two highly polymorphic VNTR (variable number of tandem repeats) probes, YNZ22 and YNH37, are codeleted in all patients tested and make molecular diagnosis for this disorder feasible. By pulsed-field gel electrophoresis, YNZ22 and YNH37 were shown to be within 30 kilobases (kb) of each other. Cosmid clones containing both VNTR sequences were identified, and restriction mapping showed them to be 100 kb were completely deleted in all patients, providing a minimum estimate of the size of the MDS critical region. A hypomethylated island and evolutionarily conserved sequences were identified within this 100-kb region, indications of the presence of one or more expressed sequences potentially involved in the pathophysiology of this disorder. The conserved sequences were mapped to mouse chromosome 11 by using mouse-rat somatic cell hybrids, extending the remarkable homology between human chromosome 17 and mouse chromosome 11 by 30 centimorgans, into the 17p telomere region
Violation of an evolutionarily conserved immunoglobulin diversity gene sequence preference promotes production of dsDNA-specific IgG antibodies.

Directory of Open Access Journals (Sweden)

Aaron Silva-Sanchez

Full Text Available Variability in the developing antibody repertoire is focused on the third complementarity determining region of the H chain (CDR-H3, which lies at the center of the antigen binding site where it often plays a decisive role in antigen binding. The power of VDJ recombination and N nucleotide addition has led to the common conception that the sequence of CDR-H3 is unrestricted in its variability and random in its composition. Under this view, the immune response is solely controlled by somatic positive and negative clonal selection mechanisms that act on individual B cells to promote production of protective antibodies and prevent the production of self-reactive antibodies. This concept of a repertoire of random antigen binding sites is inconsistent with the observation that diversity (DH gene segment sequence content by reading frame (RF is evolutionarily conserved, creating biases in the prevalence and distribution of individual amino acids in CDR-H3. For example, arginine, which is often found in the CDR-H3 of dsDNA binding autoantibodies, is under-represented in the commonly used DH RFs rearranged by deletion, but is a frequent component of rarely used inverted RF1 (iRF1, which is rearranged by inversion. To determine the effect of altering this germline bias in DH gene segment sequence on autoantibody production, we generated mice that by genetic manipulation are forced to utilize an iRF1 sequence encoding two arginines. Over a one year period we collected serial serum samples from these unimmunized, specific pathogen-free mice and found that more than one-fifth of them contained elevated levels of dsDNA-binding IgG, but not IgM; whereas mice with a wild type DH sequence did not. Thus, germline bias against the use of arginine enriched DH sequence helps to reduce the likelihood of producing self-reactive antibodies.
An evolutionarily conserved gene, FUWA, plays a role in determining panicle architecture, grain shape and grain weight in rice.

Science.gov (United States)

Chen, Jun; Gao, He; Zheng, Xiao-Ming; Jin, Mingna; Weng, Jian-Feng; Ma, Jin; Ren, Yulong; Zhou, Kunneng; Wang, Qi; Wang, Jie; Wang, Jiu-Lin; Zhang, Xin; Cheng, Zhijun; Wu, Chuanyin; Wang, Haiyang; Wan, Jian-Min

2015-08-01

Plant breeding relies on creation of novel allelic combinations for desired traits. Identification and utilization of beneficial alleles, rare alleles and evolutionarily conserved genes in the germplasm (referred to as 'hidden' genes) provide an effective approach to achieve this goal. Here we show that a chemically induced null mutation in an evolutionarily conserved gene, FUWA, alters multiple important agronomic traits in rice, including panicle architecture, grain shape and grain weight. FUWA encodes an NHL domain-containing protein, with preferential expression in the root meristem, shoot apical meristem and inflorescences, where it restricts excessive cell division. Sequence analysis revealed that FUWA has undergone a bottleneck effect, and become fixed in landraces and modern cultivars during domestication and breeding. We further confirm a highly conserved role of FUWA homologs in determining panicle architecture and grain development in rice, maize and sorghum through genetic transformation. Strikingly, knockdown of the FUWA transcription level by RNA interference results in an erect panicle and increased grain size in both indica and japonica genetic backgrounds. This study illustrates an approach to create new germplasm with improved agronomic traits for crop breeding by tapping into evolutionary conserved genes. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
On the relationship between residue structural environment and sequence conservation in proteins.

Science.gov (United States)

Liu, Jen-Wei; Lin, Jau-Ji; Cheng, Chih-Wen; Lin, Yu-Feng; Hwang, Jenn-Kang; Huang, Tsun-Tsao

2017-09-01

Residues that are crucial to protein function or structure are usually evolutionarily conserved. To identify the important residues in protein, sequence conservation is estimated, and current methods rely upon the unbiased collection of homologous sequences. Surprisingly, our previous studies have shown that the sequence conservation is closely correlated with the weighted contact number (WCN), a measure of packing density for residue's structural environment, calculated only based on the C α positions of a protein structure. Moreover, studies have shown that sequence conservation is correlated with environment-related structural properties calculated based on different protein substructures, such as a protein's all atoms, backbone atoms, side-chain atoms, or side-chain centroid. To know whether the C α atomic positions are adequate to show the relationship between residue environment and sequence conservation or not, here we compared C α atoms with other substructures in their contributions to the sequence conservation. Our results show that C α positions are substantially equivalent to the other substructures in calculations of various measures of residue environment. As a result, the overlapping contributions between C α atoms and the other substructures are high, yielding similar structure-conservation relationship. Take the WCN as an example, the average overlapping contribution to sequence conservation is 87% between C α and all-atom substructures. These results indicate that only C α atoms of a protein structure could reflect sequence conservation at the residue level. © 2017 Wiley Periodicals, Inc.
Sequence of cDNAs for mammalian H2A. Z, an evolutionarily diverged but highly conserved basal histone H2A isoprotein species

Energy Technology Data Exchange (ETDEWEB)

Hatch, C L; Bonner, W M

1988-02-11

The nucleotide sequences of cDNAs for the evolutionarily diverged but highly conserved basal H2A isoprotein, H2A.Z, have been determined for the rat, cow, and human. As a basal histone, H2A.Z is synthesized throughout the cell cycle at a constant rate, unlinked to DNA replication, and at a much lower rate in quiescent cells. Each of the cDNA isolates encodes the entire H2A.Z polypeptide. The human isolate is about 1.0 kilobases long. It contains a coding region of 387 nucleotides flanked by 106 nucleotides of 5'UTR and 376 nucleotides of 3'UTR, which contains a polyadenylation signal followed by a poly A tail. The bovine and rat cDNAs have 97 and 94% nucleotide positional identity to the human cDNA in the coding region and 98% in the proximal 376 nucleotides of the 3'UTR which includes the polyadenylation signal. A potential stem-forming sequence imbedded in a direct repeat is found centered at 261 nucleotides into the 3'UTR. Each of the cDNA clones could be transcribed and translated in vitro to yield H2A.Z protein. The mammalian H2A.Z cDNA coding sequences are approximately 80% similar to those in chicken and 75% to those in sea urchin.
Aligning science and policy to achieve evolutionarily enlightened conservation.

Science.gov (United States)

Cook, Carly N; Sgrò, Carla M

2017-06-01

There is increasing recognition among conservation scientists that long-term conservation outcomes could be improved through better integration of evolutionary theory into management practices. Despite concerns that the importance of key concepts emerging from evolutionary theory (i.e., evolutionary principles and processes) are not being recognized by managers, there has been little effort to determine the level of integration of evolutionary theory into conservation policy and practice. We assessed conservation policy at 3 scales (international, national, and provincial) on 3 continents to quantify the degree to which key evolutionary concepts, such as genetic diversity and gene flow, are being incorporated into conservation practice. We also evaluated the availability of clear guidance within the applied evolutionary biology literature as to how managers can change their management practices to achieve better conservation outcomes. Despite widespread recognition of the importance of maintaining genetic diversity, conservation policies provide little guidance about how this can be achieved in practice and other relevant evolutionary concepts, such as inbreeding depression, are mentioned rarely. In some cases the poor integration of evolutionary concepts into management reflects a lack of decision-support tools in the literature. Where these tools are available, such as risk-assessment frameworks, they are not being adopted by conservation policy makers, suggesting that the availability of a strong evidence base is not the only barrier to evolutionarily enlightened management. We believe there is a clear need for more engagement by evolutionary biologists with policy makers to develop practical guidelines that will help managers make changes to conservation practice. There is also an urgent need for more research to better understand the barriers to and opportunities for incorporating evolutionary theory into conservation practice. © 2016 Society for Conservation
Identification of evolutionarily conserved non-AUG-initiated N-terminal extensions in human coding sequences.

LENUS (Irish Health Repository)

Ivanov, Ivaylo P

2011-05-01

In eukaryotes, it is generally assumed that translation initiation occurs at the AUG codon closest to the messenger RNA 5\\' cap. However, in certain cases, initiation can occur at codons differing from AUG by a single nucleotide, especially the codons CUG, UUG, GUG, ACG, AUA and AUU. While non-AUG initiation has been experimentally verified for a handful of human genes, the full extent to which this phenomenon is utilized--both for increased coding capacity and potentially also for novel regulatory mechanisms--remains unclear. To address this issue, and hence to improve the quality of existing coding sequence annotations, we developed a methodology based on phylogenetic analysis of predicted 5\\' untranslated regions from orthologous genes. We use evolutionary signatures of protein-coding sequences as an indicator of translation initiation upstream of annotated coding sequences. Our search identified novel conserved potential non-AUG-initiated N-terminal extensions in 42 human genes including VANGL2, FGFR1, KCNN4, TRPV6, HDGF, CITED2, EIF4G3 and NTF3, and also affirmed the conservation of known non-AUG-initiated extensions in 17 other genes. In several instances, we have been able to obtain independent experimental evidence of the expression of non-AUG-initiated products from the previously published literature and ribosome profiling data.
Linkage disequilibrium of evolutionarily conserved regions in the human genome

Directory of Open Access Journals (Sweden)

Johnson Todd A

2006-12-01

Full Text Available Abstract Background The strong linkage disequilibrium (LD recently found in genic or exonic regions of the human genome demonstrated that LD can be increased by evolutionary mechanisms that select for functionally important loci. This suggests that LD might be stronger in regions conserved among species than in non-conserved regions, since regions exposed to natural selection tend to be conserved. To assess this hypothesis, we used genome-wide polymorphism data from the HapMap project and investigated LD within DNA sequences conserved between the human and mouse genomes. Results Unexpectedly, we observed that LD was significantly weaker in conserved regions than in non-conserved regions. To investigate why, we examined sequence features that may distort the relationship between LD and conserved regions. We found that interspersed repeats, and not other sequence features, were associated with the weak LD tendency in conserved regions. To appropriately understand the relationship between LD and conserved regions, we removed the effect of repetitive elements and found that the high degree of sequence conservation was strongly associated with strong LD in coding regions but not with that in non-coding regions. Conclusion Our work demonstrates that the degree of sequence conservation does not simply increase LD as predicted by the hypothesis. Rather, it implies that purifying selection changes the polymorphic patterns of coding sequences but has little influence on the patterns of functional units such as regulatory elements present in non-coding regions, since the former are generally restricted by the constraint of maintaining a functional protein product across multiple exons while the latter may exist more as individually isolated units.
Evolutionary growth process of highly conserved sequences in vertebrate genomes.

Science.gov (United States)

Ishibashi, Minaka; Noda, Akiko Ogura; Sakate, Ryuichi; Imanishi, Tadashi

2012-08-01

Genome sequence comparison between evolutionarily distant species revealed ultraconserved elements (UCEs) among mammals under strong purifying selection. Most of them were also conserved among vertebrates. Because they tend to be located in the flanking regions of developmental genes, they would have fundamental roles in creating vertebrate body plans. However, the evolutionary origin and selection mechanism of these UCEs remain unclear. Here we report that UCEs arose in primitive vertebrates, and gradually grew in vertebrate evolution. We searched for UCEs in two teleost fishes, Tetraodon nigroviridis and Oryzias latipes, and found 554 UCEs with 100% identity over 100 bps. Comparison of teleost and mammalian UCEs revealed 43 pairs of common, jawed-vertebrate UCEs (jUCE) with high sequence identities, ranging from 83.1% to 99.2%. Ten of them retain lower similarities to the Petromyzon marinus genome, and the substitution rates of four non-exonic jUCEs were reduced after the teleost-mammal divergence, suggesting that robust conservation had been acquired in the jawed vertebrate lineage. Our results indicate that prototypical UCEs originated before the divergence of jawed and jawless vertebrates and have been frozen as perfect conserved sequences in the jawed vertebrate lineage. In addition, our comparative sequence analyses of UCEs and neighboring regions resulted in a discovery of lineage-specific conserved sequences. They were added progressively to prototypical UCEs, suggesting step-wise acquisition of novel regulatory roles. Our results indicate that conserved non-coding elements (CNEs) consist of blocks with distinct evolutionary history, each having been frozen since different evolutionary era along the vertebrate lineage. Copyright © 2012 Elsevier B.V. All rights reserved.
Identification of evolutionarily conserved exons as regulated targets for the splicing activator tra2β in development.

Directory of Open Access Journals (Sweden)

Sushma Grellscheid

2011-12-01

Full Text Available Alternative splicing amplifies the information content of the genome, creating multiple mRNA isoforms from single genes. The evolutionarily conserved splicing activator Tra2β (Sfrs10 is essential for mouse embryogenesis and implicated in spermatogenesis. Here we find that Tra2β is up-regulated as the mitotic stem cell containing population of male germ cells differentiate into meiotic and post-meiotic cells. Using CLIP coupled to deep sequencing, we found that Tra2β binds a high frequency of exons and identified specific G/A rich motifs as frequent targets. Significantly, for the first time we have analysed the splicing effect of Sfrs10 depletion in vivo by generating a conditional neuronal-specific Sfrs10 knock-out mouse (Sfrs10(fl/fl; Nestin-Cre(tg/+. This mouse has defects in brain development and allowed correlation of genuine physiologically Tra2β regulated exons. These belonged to a novel class which were longer than average size and importantly needed multiple cooperative Tra2β binding sites for efficient splicing activation, thus explaining the observed splicing defects in the knockout mice. Regulated exons included a cassette exon which produces a meiotic isoform of the Nasp histone chaperone that helps monitor DNA double-strand breaks. We also found a previously uncharacterised poison exon identifying a new pathway of feedback control between vertebrate Tra2 proteins. Both Nasp-T and the Tra2a poison exon are evolutionarily conserved, suggesting they might control fundamental developmental processes. Tra2β protein isoforms lacking the RRM were able to activate specific target exons indicating an additional functional role as a splicing co-activator. Significantly the N-terminal RS1 domain conserved between flies and humans was essential for the splicing activator function of Tra2β. Versions of Tra2β lacking this N-terminal RS1 domain potently repressed the same target exons activated by full-length Tra2β protein.
In Silico Analysis of Gene Expression Network Components Underlying Pigmentation Phenotypes in the Python Identified Evolutionarily Conserved Clusters of Transcription Factor Binding Sites

Directory of Open Access Journals (Sweden)

Kristopher J. L. Irizarry

2016-01-01

Full Text Available Color variation provides the opportunity to investigate the genetic basis of evolution and selection. Reptiles are less studied than mammals. Comparative genomics approaches allow for knowledge gained in one species to be leveraged for use in another species. We describe a comparative vertebrate analysis of conserved regulatory modules in pythons aimed at assessing bioinformatics evidence that transcription factors important in mammalian pigmentation phenotypes may also be important in python pigmentation phenotypes. We identified 23 python orthologs of mammalian genes associated with variation in coat color phenotypes for which we assessed the extent of pairwise protein sequence identity between pythons and mouse, dog, horse, cow, chicken, anole lizard, and garter snake. We next identified a set of melanocyte/pigment associated transcription factors (CREB, FOXD3, LEF-1, MITF, POU3F2, and USF-1 that exhibit relatively conserved sequence similarity within their DNA binding regions across species based on orthologous alignments across multiple species. Finally, we identified 27 evolutionarily conserved clusters of transcription factor binding sites within ~200-nucleotide intervals of the 1500-nucleotide upstream regions of AIM1, DCT, MC1R, MITF, MLANA, OA1, PMEL, RAB27A, and TYR from Python bivittatus. Our results provide insight into pigment phenotypes in pythons.
Earthworms and Humans in Vitro: Characterizing Evolutionarily Conserved Stress and Immune Responses to Silver Nanoparticles

DEFF Research Database (Denmark)

Hayashi, Yuya; Engelmann, Péter; Foldbjerg, Rasmus

2012-01-01

Little is known about the potential threats of silver nanoparticles (AgNPs) to ecosystem health, with no detailed report existing on the stress and immune responses of soil invertebrates. Here we use earthworm primary cells, cross-referencing to human cell cultures with a particular emphasis on t...... in the coelomocytes and THP-1 cells. Our findings provide mechanistic clues on cellular innate immunity toward AgNPs that is likely to be evolutionarily conserved across the animal kingdom....
Lariat sequencing in a unicellular yeast identifies regulated alternative splicing of exons that are evolutionarily conserved with humans.

Science.gov (United States)

Awan, Ali R; Manfredo, Amanda; Pleiss, Jeffrey A

2013-07-30

Alternative splicing is a potent regulator of gene expression that vastly increases proteomic diversity in multicellular eukaryotes and is associated with organismal complexity. Although alternative splicing is widespread in vertebrates, little is known about the evolutionary origins of this process, in part because of the absence of phylogenetically conserved events that cross major eukaryotic clades. Here we describe a lariat-sequencing approach, which offers high sensitivity for detecting splicing events, and its application to the unicellular fungus, Schizosaccharomyces pombe, an organism that shares many of the hallmarks of alternative splicing in mammalian systems but for which no previous examples of exon-skipping had been demonstrated. Over 200 previously unannotated splicing events were identified, including examples of regulated alternative splicing. Remarkably, an evolutionary analysis of four of the exons identified here as subject to skipping in S. pombe reveals high sequence conservation and perfect length conservation with their homologs in scores of plants, animals, and fungi. Moreover, alternative splicing of two of these exons have been documented in multiple vertebrate organisms, making these the first demonstrations of identical alternative-splicing patterns in species that are separated by over 1 billion y of evolution.
Ecological interactions are evolutionarily conserved across the entire tree of life.

Science.gov (United States)

Gómez, José M; Verdú, Miguel; Perfectti, Francisco

2010-06-17

Ecological interactions are crucial to understanding both the ecology and the evolution of organisms. Because the phenotypic traits regulating species interactions are largely a legacy of their ancestors, it is widely assumed that ecological interactions are phylogenetically conserved, with closely related species interacting with similar partners. However, the existing empirical evidence is inadequate to appropriately evaluate the hypothesis of phylogenetic conservatism in ecological interactions, because it is both ecologically and taxonomically biased. In fact, most studies on the evolution of ecological interactions have focused on specialized organisms, such as some parasites or insect herbivores, belonging to a limited subset of the overall tree of life. Here we study the evolution of host use in a large and diverse group of interactions comprising both specialist and generalist acellular, unicellular and multicellular organisms. We show that, as previously found for specialized interactions, generalized interactions can be evolutionarily conserved. Significant phylogenetic conservatism of interaction patterns was equally likely to occur in symbiotic and non-symbiotic interactions, as well as in mutualistic and antagonistic interactions. Host-use differentiation among species was higher in phylogenetically conserved clades, irrespective of their generalization degree and taxonomic position within the tree of life. Our findings strongly suggest a shared pattern in the organization of biological systems through evolutionary time, mediated by marked conservatism of ecological interactions among taxa.
Identification of two evolutionarily conserved 5' cis-elements involved in regulating spatiotemporal expression of Nolz-1 during mouse embryogenesis.

Directory of Open Access Journals (Sweden)

Sunny Li-Yun Chang

Full Text Available Proper development of vertebrate embryos depends not only on the crucial funtions of key evolutionarily conserved transcriptional regulators, but also on the precisely spatiotemporal expression of these transcriptional regulators. The mouse Nolz-1/Znf503/Zfp503 gene is a mammalian member of the conserved zinc-finger containing NET family. The expression pattern of Nolz-1 in mouse embryos is highly correlated with that of its homologues in different species. To study the spatiotemporal regulation of Nolz-1, we first identified two evolutionarily conserved cis-elements, UREA and UREB, in 5' upstream regions of mouse Nolz-1 locus. We then generated UREA-LacZ and UREB-LacZ transgenic reporter mice to characterize the putative enhancer activity of UREA and UREB. The results indicated that both UREA and UREB contained tissue-specific enhancer activity for directing LacZ expression in selective tissue organs during mouse embryogensis. UREA directed LacZ expression preferentially in selective regions of developing central nervous system, including the forebrain, hindbrain and spinal cord, whereas UREB directed LacZ expression mainly in other developing tissue organs such as the Nolz-1 expressing branchial arches and its derivatives, the apical ectodermal ridge of limb buds and the urogenital tissues. Both UREA and UREB directed strong LacZ expression in the lateral plate mesoderm where endogenous Nolz-1 was also expressed. Despite that the LacZ expression pattern did not full recapitulated the endogenous Nolz-1 expression and some mismatched expression patterns were observed, co-expression of LacZ and Nolz-1 did occur in many cells of selective tissue organs, such as in the ventrolateral cortex and ventral spinal cord of UREA-LacZ embryos, and the urogenital tubes of UREB-LacZ embryos. Taken together, our study suggests that UREA and UREB may function as evolutionarily conserved cis-regulatory elements that coordinate with other cis-elements to regulate
Evolutionarily conserved histone methylation dynamics during seed life-cycle transitions.

Directory of Open Access Journals (Sweden)

Kerstin Müller

Full Text Available Plants have a remarkable ability to react to seasonal changes by synchronizing life-cycle transitions with environmental conditions. We addressed the question of how transcriptional re-programming occurs in response to an environmental cue that triggers the major life cycle transition from seed dormancy to germination and seedling growth. We elucidated an important mechanistic aspect of this process by following the chromatin dynamics of key regulatory genes with a focus on the two antagonistic marks, H3K4me3 and H3K27me3. Histone methylation patterns of major dormancy regulators changed during the transition to germination and seedling growth. We observed a switch from H3K4me3 and high transcription levels to silencing by the repressive H3K27me3 mark when dormancy was broken through exposure to moist chilling, underscoring that a functional PRC2 complex is necessary for this transition. Moreover, this reciprocal regulation by H3K4me3 and H3K27me3 is evolutionarily conserved from gymnosperms to angiosperms.
Evolutionarily conserved substrate substructures for automated annotation of enzyme superfamilies.

Directory of Open Access Journals (Sweden)

Ranyee A Chiang

2008-08-01

Full Text Available The evolution of enzymes affects how well a species can adapt to new environmental conditions. During enzyme evolution, certain aspects of molecular function are conserved while other aspects can vary. Aspects of function that are more difficult to change or that need to be reused in multiple contexts are often conserved, while those that vary may indicate functions that are more easily changed or that are no longer required. In analogy to the study of conservation patterns in enzyme sequences and structures, we have examined the patterns of conservation and variation in enzyme function by analyzing graph isomorphisms among enzyme substrates of a large number of enzyme superfamilies. This systematic analysis of substrate substructures establishes the conservation patterns that typify individual superfamilies. Specifically, we determined the chemical substructures that are conserved among all known substrates of a superfamily and the substructures that are reacting in these substrates and then examined the relationship between the two. Across the 42 superfamilies that were analyzed, substantial variation was found in how much of the conserved substructure is reacting, suggesting that superfamilies may not be easily grouped into discrete and separable categories. Instead, our results suggest that many superfamilies may need to be treated individually for analyses of evolution, function prediction, and guiding enzyme engineering strategies. Annotating superfamilies with these conserved and reacting substructure patterns provides information that is orthogonal to information provided by studies of conservation in superfamily sequences and structures, thereby improving the precision with which we can predict the functions of enzymes of unknown function and direct studies in enzyme engineering. Because the method is automated, it is suitable for large-scale characterization and comparison of fundamental functional capabilities of both characterized
Evolutionarily conserved substrate substructures for automated annotation of enzyme superfamilies.

Science.gov (United States)

Chiang, Ranyee A; Sali, Andrej; Babbitt, Patricia C

2008-08-01

The evolution of enzymes affects how well a species can adapt to new environmental conditions. During enzyme evolution, certain aspects of molecular function are conserved while other aspects can vary. Aspects of function that are more difficult to change or that need to be reused in multiple contexts are often conserved, while those that vary may indicate functions that are more easily changed or that are no longer required. In analogy to the study of conservation patterns in enzyme sequences and structures, we have examined the patterns of conservation and variation in enzyme function by analyzing graph isomorphisms among enzyme substrates of a large number of enzyme superfamilies. This systematic analysis of substrate substructures establishes the conservation patterns that typify individual superfamilies. Specifically, we determined the chemical substructures that are conserved among all known substrates of a superfamily and the substructures that are reacting in these substrates and then examined the relationship between the two. Across the 42 superfamilies that were analyzed, substantial variation was found in how much of the conserved substructure is reacting, suggesting that superfamilies may not be easily grouped into discrete and separable categories. Instead, our results suggest that many superfamilies may need to be treated individually for analyses of evolution, function prediction, and guiding enzyme engineering strategies. Annotating superfamilies with these conserved and reacting substructure patterns provides information that is orthogonal to information provided by studies of conservation in superfamily sequences and structures, thereby improving the precision with which we can predict the functions of enzymes of unknown function and direct studies in enzyme engineering. Because the method is automated, it is suitable for large-scale characterization and comparison of fundamental functional capabilities of both characterized and uncharacterized

An evolutionarily conserved glycine-tyrosine motif forms a folding core in outer membrane proteins.

Directory of Open Access Journals (Sweden)

Marcin Michalik

Full Text Available An intimate interaction between a pair of amino acids, a tyrosine and glycine on neighboring β-strands, has been previously reported to be important for the structural stability of autotransporters. Here, we show that the conservation of this interacting pair extends to nearly all major families of outer membrane β-barrel proteins, which are thought to have originated through duplication events involving an ancestral ββ hairpin. We analyzed the function of this motif using the prototypical outer membrane protein OmpX. Stopped-flow fluorescence shows that two folding processes occur in the millisecond time regime, the rates of which are reduced in the tyrosine mutant. Folding assays further demonstrate a reduction in the yield of folded protein for the mutant compared to the wild-type, as well as a reduction in thermal stability. Taken together, our data support the idea of an evolutionarily conserved 'folding core' that affects the folding, membrane insertion, and thermal stability of outer membrane protein β-barrels.
Evidence for an evolutionarily conserved interaction between cell wall biosynthesis and flowering in maize and sorghum

Directory of Open Access Journals (Sweden)

Thompson Karen J

2002-01-01

Full Text Available Abstract Background Factors that affect flowering vary among different plant species, and in the grasses in particular the exact mechanism behind this transition is not fully understood. The brown midrib (bm mutants of maize (Zea mays L., which have altered cell wall composition, have different flowering dynamics compared to their wild-type counterparts. This is indicative of a link between cell wall biogenesis and flowering. In order to test whether this relationship also exists in other grasses, the flowering dynamics in sorghum (Sorghum bicolor (L. Moench were investigated. Sorghum is evolutionarily closely related to maize, and a set of brown midrib (bmr mutants similar to the maize bm mutants is available, making sorghum a suitable choice for study in this context. Results We compared the flowering time (time to half-bloom of several different bmr sorghum lines and their wild-type counterparts. This revealed that the relationship between cell wall composition and flowering was conserved in sorghum. Specifically, the mutant bmr7 flowered significantly earlier than the corresponding wild-type control, whereas the mutants bmr2, bmr4, bmr6, bmr12, and bmr19 flowered later than their wild-type controls. Conclusion The change in flowering dynamics in several of the brown midrib sorghum lines provides evidence for an evolutionarily conserved mechanism that links cell wall biosynthesis to flowering dynamics. The availability of the sorghum bmr mutants expands the germplasm available to investigate this relationship in further detail.
An evolutionarily conserved sexual signature in the primate brain.

Directory of Open Access Journals (Sweden)

Björn Reinius

2008-06-01

Full Text Available The question of a potential biological sexual signature in the human brain is a heavily disputed subject. In order to provide further insight into this issue, we used an evolutionary approach to identify genes with sex differences in brain expression level among primates. We reasoned that expression patterns important to uphold key male and female characteristics may be conserved during evolution. We selected cortex for our studies because this specific brain region is responsible for many higher behavioral functions. We compared gene expression profiles in the occipital cortex of male and female humans (Homo sapiens, a great ape and cynomolgus macaques (Macaca fascicularis, an old world monkey, two catarrhine species that show abundant morphological sexual dimorphism, as well as in common marmosets (Callithrix Jacchus, a new world monkey which are relatively sexually monomorphic. We identified hundreds of genes with sex-biased expression patterns in humans and macaques, while fewer than ten were differentially expressed between the sexes in marmosets. In primates, a general rule is that many of the morphological and behavioral sexual dimorphisms seen in polygamous species, such as macaques, are typically less pronounced in monogamous species such as the marmosets. Our observations suggest that this correlation may also be reflected in the extent of sex-biased gene expression in the brain. We identified 85 genes with common sex-biased expression, in both human and macaque and 2 genes, X inactivation-specific transcript (XIST and Heat shock factor binding protein 1 (HSBP1, that were consistently sex-biased in the female direction in human, macaque, and marmoset. These observations imply a conserved signature of sexual gene expression dimorphism in cortex of primates. Further, we found that the coding region of female-biased genes is more evolutionarily constrained compared to the coding region of both male-biased and non sex-biased brain
Similarity-based gene detection: using COGs to find evolutionarily-conserved ORFs.

Science.gov (United States)

Powell, Bradford C; Hutchison, Clyde A

2006-01-19

Experimental verification of gene products has not kept pace with the rapid growth of microbial sequence information. However, existing annotations of gene locations contain sufficient information to screen for probable errors. Furthermore, comparisons among genomes become more informative as more genomes are examined. We studied all open reading frames (ORFs) of at least 30 codons from the genomes of 27 sequenced bacterial strains. We grouped the potential peptide sequences encoded from the ORFs by forming Clusters of Orthologous Groups (COGs). We used this grouping in order to find homologous relationships that would not be distinguishable from noise when using simple BLAST searches. Although COG analysis was initially developed to group annotated genes, we applied it to the task of grouping anonymous DNA sequences that may encode proteins. "Mixed COGs" of ORFs (clusters in which some sequences correspond to annotated genes and some do not) are attractive targets when seeking errors of gene prediction. Examination of mixed COGs reveals some situations in which genes appear to have been missed in current annotations and a smaller number of regions that appear to have been annotated as gene loci erroneously. This technique can also be used to detect potential pseudogenes or sequencing errors. Our method uses an adjustable parameter for degree of conservation among the studied genomes (stringency). We detail results for one level of stringency at which we found 83 potential genes which had not previously been identified, 60 potential pseudogenes, and 7 sequences with existing gene annotations that are probably incorrect. Systematic study of sequence conservation offers a way to improve existing annotations by identifying potentially homologous regions where the annotation of the presence or absence of a gene is inconsistent among genomes.
Similarity-based gene detection: using COGs to find evolutionarily-conserved ORFs

Directory of Open Access Journals (Sweden)

Hutchison Clyde A

2006-01-01

Full Text Available Abstract Background Experimental verification of gene products has not kept pace with the rapid growth of microbial sequence information. However, existing annotations of gene locations contain sufficient information to screen for probable errors. Furthermore, comparisons among genomes become more informative as more genomes are examined. We studied all open reading frames (ORFs of at least 30 codons from the genomes of 27 sequenced bacterial strains. We grouped the potential peptide sequences encoded from the ORFs by forming Clusters of Orthologous Groups (COGs. We used this grouping in order to find homologous relationships that would not be distinguishable from noise when using simple BLAST searches. Although COG analysis was initially developed to group annotated genes, we applied it to the task of grouping anonymous DNA sequences that may encode proteins. Results "Mixed COGs" of ORFs (clusters in which some sequences correspond to annotated genes and some do not are attractive targets when seeking errors of gene predicion. Examination of mixed COGs reveals some situations in which genes appear to have been missed in current annotations and a smaller number of regions that appear to have been annotated as gene loci erroneously. This technique can also be used to detect potential pseudogenes or sequencing errors. Our method uses an adjustable parameter for degree of conservation among the studied genomes (stringency. We detail results for one level of stringency at which we found 83 potential genes which had not previously been identified, 60 potential pseudogenes, and 7 sequences with existing gene annotations that are probably incorrect. Conclusion Systematic study of sequence conservation offers a way to improve existing annotations by identifying potentially homologous regions where the annotation of the presence or absence of a gene is inconsistent among genomes.
Comparative transcriptome analysis within the Lolium/Festuca species complex reveals high sequence conservation

DEFF Research Database (Denmark)

Czaban, Adrian; Sharma, Sapna; Byrne, Stephen

2015-01-01

species from the Lolium-Festuca complex, ranging from 52,166 to 72,133 transcripts per assembly. We have also predicted a set of proteins and validated it with a high-confidence protein database from three closely related species (H. vulgare, B. distachyon and O. sativa). We have obtained gene family...... clusters for the four species using OrthoMCL and analyzed their inferred phylogenetic relationships. Our results indicate that VRN2 is a candidate gene for differentiating vernalization and non-vernalization types in the Lolium-Festuca complex. Grouping of the gene families based on their BLAST identity...... enabled us to divide ortholog groups into those that are very conserved and those that are more evolutionarily relaxed. The ratio of the non-synonumous to synonymous substitutions enabled us to pinpoint protein sequences evolving in response to positive selection. These proteins may explain some...
Evolutionarily conserved mechanisms for the selection and maintenance of behavioural activity.

Science.gov (United States)

Fiore, Vincenzo G; Dolan, Raymond J; Strausfeld, Nicholas J; Hirth, Frank

2015-12-19

Survival and reproduction entail the selection of adaptive behavioural repertoires. This selection manifests as phylogenetically acquired activities that depend on evolved nervous system circuitries. Lorenz and Tinbergen already postulated that heritable behaviours and their reliable performance are specified by genetically determined programs. Here we compare the functional anatomy of the insect central complex and vertebrate basal ganglia to illustrate their role in mediating selection and maintenance of adaptive behaviours. Comparative analyses reveal that central complex and basal ganglia circuitries share comparable lineage relationships within clusters of functionally integrated neurons. These clusters are specified by genetic mechanisms that link birth time and order to their neuronal identities and functions. Their subsequent connections and associated functions are characterized by similar mechanisms that implement dimensionality reduction and transition through attractor states, whereby spatially organized parallel-projecting loops integrate and convey sensorimotor representations that select and maintain behavioural activity. In both taxa, these neural systems are modulated by dopamine signalling that also mediates memory-like processes. The multiplicity of similarities between central complex and basal ganglia suggests evolutionarily conserved computational mechanisms for action selection. We speculate that these may have originated from ancestral ground pattern circuitries present in the brain of the last common ancestor of insects and vertebrates. © 2015 The Authors.
An evolutionarily conserved phosphatidate phosphatase maintains lipid droplet number and endoplasmic reticulum morphology but not nuclear morphology

Directory of Open Access Journals (Sweden)

Anoop Narayana Pillai

2017-11-01

Full Text Available Phosphatidic acid phosphatases are involved in the biosynthesis of phospholipids and triacylglycerol, and also act as transcriptional regulators. Studies to ascertain their role in lipid metabolism and membrane biogenesis are restricted to Opisthokonta and Archaeplastida. Here, we report the role of phosphatidate phosphatase (PAH in Tetrahymena thermophila, belonging to the Alveolata clade. We identified two PAH homologs in Tetrahymena, TtPAH1 and TtPAH2. Loss of function of TtPAH1 results in reduced lipid droplet number and an increase in endoplasmic reticulum (ER content. It also results in more ER sheet structure as compared to wild-type Tetrahymena. Surprisingly, we did not observe a visible defect in the nuclear morphology of the ΔTtpah1 mutant. TtPAH1 rescued all known defects in the yeast pah1Δ strain and is conserved functionally between Tetrahymena and yeast. The homologous gene derived from Trypanosoma also rescued the defects of the yeast pah1Δ strain. Our results indicate that PAH, previously known to be conserved among Opisthokonts, is also present in a set of distant lineages. Thus, a phosphatase cascade is evolutionarily conserved and is functionally interchangeable across eukaryotic lineages.
G-quadruplex DNA sequences are evolutionarily conserved and associated with distinct genomic features in Saccharomyces cerevisiae.

Directory of Open Access Journals (Sweden)

John A Capra

2010-07-01

Full Text Available G-quadruplex DNA is a four-stranded DNA structure formed by non-Watson-Crick base pairing between stacked sets of four guanines. Many possible functions have been proposed for this structure, but its in vivo role in the cell is still largely unresolved. We carried out a genome-wide survey of the evolutionary conservation of regions with the potential to form G-quadruplex DNA structures (G4 DNA motifs across seven yeast species. We found that G4 DNA motifs were significantly more conserved than expected by chance, and the nucleotide-level conservation patterns suggested that the motif conservation was the result of the formation of G4 DNA structures. We characterized the association of conserved and non-conserved G4 DNA motifs in Saccharomyces cerevisiae with more than 40 known genome features and gene classes. Our comprehensive, integrated evolutionary and functional analysis confirmed the previously observed associations of G4 DNA motifs with promoter regions and the rDNA, and it identified several previously unrecognized associations of G4 DNA motifs with genomic features, such as mitotic and meiotic double-strand break sites (DSBs. Conserved G4 DNA motifs maintained strong associations with promoters and the rDNA, but not with DSBs. We also performed the first analysis of G4 DNA motifs in the mitochondria, and surprisingly found a tenfold higher concentration of the motifs in the AT-rich yeast mitochondrial DNA than in nuclear DNA. The evolutionary conservation of the G4 DNA motif and its association with specific genome features supports the hypothesis that G4 DNA has in vivo functions that are under evolutionary constraint.
Inter-progenitor pool wiring: An evolutionarily conserved strategy that expands neural circuit diversity.

Science.gov (United States)

Suzuki, Takumi; Sato, Makoto

2017-11-15

Diversification of neuronal types is key to establishing functional variations in neural circuits. The first critical step to generate neuronal diversity is to organize the compartmental domains of developing brains into spatially distinct neural progenitor pools. Neural progenitors in each pool then generate a unique set of diverse neurons through specific spatiotemporal specification processes. In this review article, we focus on an additional mechanism, 'inter-progenitor pool wiring', that further expands the diversity of neural circuits. After diverse types of neurons are generated in one progenitor pool, a fraction of these neurons start migrating toward a remote brain region containing neurons that originate from another progenitor pool. Finally, neurons of different origins are intermingled and eventually form complex but precise neural circuits. The developing cerebral cortex of mammalian brains is one of the best examples of inter-progenitor pool wiring. However, Drosophila visual system development has revealed similar mechanisms in invertebrate brains, suggesting that inter-progenitor pool wiring is an evolutionarily conserved strategy that expands neural circuit diversity. Here, we will discuss how inter-progenitor pool wiring is accomplished in mammalian and fly brain systems. Copyright © 2017 Elsevier Inc. All rights reserved.
Conservation and variability of dengue virus proteins: implications for vaccine design.

Directory of Open Access Journals (Sweden)

Asif M Khan

2008-08-01

Full Text Available Genetic variation and rapid evolution are hallmarks of RNA viruses, the result of high mutation rates in RNA replication and selection of mutants that enhance viral adaptation, including the escape from host immune responses. Variability is uneven across the genome because mutations resulting in a deleterious effect on viral fitness are restricted. RNA viruses are thus marked by protein sites permissive to multiple mutations and sites critical to viral structure-function that are evolutionarily robust and highly conserved. Identification and characterization of the historical dynamics of the conserved sites have relevance to multiple applications, including potential targets for diagnosis, and prophylactic and therapeutic purposes.We describe a large-scale identification and analysis of evolutionarily highly conserved amino acid sequences of the entire dengue virus (DENV proteome, with a focus on sequences of 9 amino acids or more, and thus immune-relevant as potential T-cell determinants. DENV protein sequence data were collected from the NCBI Entrez protein database in 2005 (9,512 sequences and again in 2007 (12,404 sequences. Forty-four (44 sequences (pan-DENV sequences, mainly those of nonstructural proteins and representing approximately 15% of the DENV polyprotein length, were identical in 80% or more of all recorded DENV sequences. Of these 44 sequences, 34 ( approximately 77% were present in >or=95% of sequences of each DENV type, and 27 ( approximately 61% were conserved in other Flaviviruses. The frequencies of variants of the pan-DENV sequences were low (0 to approximately 5%, as compared to variant frequencies of approximately 60 to approximately 85% in the non pan-DENV sequence regions. We further showed that the majority of the conserved sequences were immunologically relevant: 34 contained numerous predicted human leukocyte antigen (HLA supertype-restricted peptide sequences, and 26 contained T-cell determinants identified by
Comparison of C. elegans and C. briggsae genome sequences reveals extensive conservation of chromosome organization and synteny.

Directory of Open Access Journals (Sweden)

LaDeana W Hillier

2007-07-01

Full Text Available To determine whether the distinctive features of Caenorhabditis elegans chromosomal organization are shared with the C. briggsae genome, we constructed a single nucleotide polymorphism-based genetic map to order and orient the whole genome shotgun assembly along the six C. briggsae chromosomes. Although these species are of the same genus, their most recent common ancestor existed 80-110 million years ago, and thus they are more evolutionarily distant than, for example, human and mouse. We found that, like C. elegans chromosomes, C. briggsae chromosomes exhibit high levels of recombination on the arms along with higher repeat density, a higher fraction of intronic sequence, and a lower fraction of exonic sequence compared with chromosome centers. Despite extensive intrachromosomal rearrangements, 1:1 orthologs tend to remain in the same region of the chromosome, and colinear blocks of orthologs tend to be longer in chromosome centers compared with arms. More strikingly, the two species show an almost complete conservation of synteny, with 1:1 orthologs present on a single chromosome in one species also found on a single chromosome in the other. The conservation of both chromosomal organization and synteny between these two distantly related species suggests roles for chromosome organization in the fitness of an organism that are only poorly understood presently.
IAA-Ala Resistant3, an evolutionarily conserved target of miR167, mediates Arabidopsis root architecture changes during high osmotic stress

KAUST Repository

Kinoshita, Natsuko

2012-09-01

The functions of microRNAs and their target mRNAs in Arabidopsis thaliana development have been widely documented; however, roles of stress-responsive microRNAs and their targets are not as well understood. Using small RNA deep sequencing and ATH1 microarrays to profile mRNAs, we identified IAA-Ala Resistant3 (IAR3) as a new target of miR167a. As expected, IAR3 mRNA was cleaved at the miR167a complementary site and under high osmotic stress miR167a levels decreased, whereas IAR3 mRNA levels increased. IAR3 hydrolyzes an inactive form of auxin (indole-3-acetic acid [IAA]-alanine) and releases bioactive auxin (IAA), a central phytohormone for root development. In contrast with the wild type, iar3 mutants accumulated reduced IAA levels and did not display high osmotic stress-induced root architecture changes. Transgenic plants expressing a cleavage-resistant form of IAR3 mRNA accumulated high levels of IAR3 mRNAs and showed increased lateral root development compared with transgenic plants expressing wild-type IAR3. Expression of an inducible noncoding RNA to sequester miR167a by target mimicry led to an increase in IAR3 mRNA levels, further confirming the inverse relationship between the two partners. Sequence comparison revealed the miR167 target site on IAR3 mRNA is conserved in evolutionarily distant plant species. Finally, we showed that IAR3 is required for drought tolerance. © 2012 American Society of Plant Biologists. All rights reserved.
IAA-Ala Resistant3, an evolutionarily conserved target of miR167, mediates Arabidopsis root architecture changes during high osmotic stress

KAUST Repository

Kinoshita, Natsuko; Wang, Huan; Kasahara, Hiroyuki; Liu, Jun; MacPherson, Cameron R.; Machida, Yasunori; Kamiya, Yuji; Hannah, Matthew A.; Chuaa, Nam Hai

2012-01-01

The functions of microRNAs and their target mRNAs in Arabidopsis thaliana development have been widely documented; however, roles of stress-responsive microRNAs and their targets are not as well understood. Using small RNA deep sequencing and ATH1 microarrays to profile mRNAs, we identified IAA-Ala Resistant3 (IAR3) as a new target of miR167a. As expected, IAR3 mRNA was cleaved at the miR167a complementary site and under high osmotic stress miR167a levels decreased, whereas IAR3 mRNA levels increased. IAR3 hydrolyzes an inactive form of auxin (indole-3-acetic acid [IAA]-alanine) and releases bioactive auxin (IAA), a central phytohormone for root development. In contrast with the wild type, iar3 mutants accumulated reduced IAA levels and did not display high osmotic stress-induced root architecture changes. Transgenic plants expressing a cleavage-resistant form of IAR3 mRNA accumulated high levels of IAR3 mRNAs and showed increased lateral root development compared with transgenic plants expressing wild-type IAR3. Expression of an inducible noncoding RNA to sequester miR167a by target mimicry led to an increase in IAR3 mRNA levels, further confirming the inverse relationship between the two partners. Sequence comparison revealed the miR167 target site on IAR3 mRNA is conserved in evolutionarily distant plant species. Finally, we showed that IAR3 is required for drought tolerance. © 2012 American Society of Plant Biologists. All rights reserved.
Conserved hypothetical protein Rv1977 in Mycobacterium tuberculosis strains contains sequence polymorphisms and might be involved in ongoing immune evasion.

Science.gov (United States)

Jiang, Yi; Liu, Haican; Wang, Xuezhi; Li, Guilian; Qiu, Yan; Dou, Xiangfeng; Wan, Kanglin

2015-01-01

Host immune pressure and associated parasite immune evasion are key features of host-pathogen co-evolution. A previous study showed that human T cell epitopes of Mycobacterium tuberculosis are evolutionarily hyperconserved and thus it was deduced that M. tuberculosis lacks antigenic variation and immune evasion. Here, we selected 151 clinical Mycobacterium tuberculosis isolates from China, amplified gene encoding Rv1977 and compared the sequences. The results showed that Rv1977, a conserved hypothetical protein, is not conserved in M. tuberculosis strains and there are polymorphisms existed in the protein. Some mutations, especially one frameshift mutation, occurred in the antigen Rv1977, which is uncommon in M.tb strains and may lead to the protein function altering. Mutations and deletion in the gene all affect one of three T cell epitopes and the changed T cell epitope contained more than one variable position, which may suggest ongoing immune evasion.
Highly conserved non-coding elements on either side of SOX9 associated with Pierre Robin sequence.

Science.gov (United States)

Benko, Sabina; Fantes, Judy A; Amiel, Jeanne; Kleinjan, Dirk-Jan; Thomas, Sophie; Ramsay, Jacqueline; Jamshidi, Negar; Essafi, Abdelkader; Heaney, Simon; Gordon, Christopher T; McBride, David; Golzio, Christelle; Fisher, Malcolm; Perry, Paul; Abadie, Véronique; Ayuso, Carmen; Holder-Espinasse, Muriel; Kilpatrick, Nicky; Lees, Melissa M; Picard, Arnaud; Temple, I Karen; Thomas, Paul; Vazquez, Marie-Paule; Vekemans, Michel; Roest Crollius, Hugues; Hastie, Nicholas D; Munnich, Arnold; Etchevers, Heather C; Pelet, Anna; Farlie, Peter G; Fitzpatrick, David R; Lyonnet, Stanislas

2009-03-01

Pierre Robin sequence (PRS) is an important subgroup of cleft palate. We report several lines of evidence for the existence of a 17q24 locus underlying PRS, including linkage analysis results, a clustering of translocation breakpoints 1.06-1.23 Mb upstream of SOX9, and microdeletions both approximately 1.5 Mb centromeric and approximately 1.5 Mb telomeric of SOX9. We have also identified a heterozygous point mutation in an evolutionarily conserved region of DNA with in vitro and in vivo features of a developmental enhancer. This enhancer is centromeric to the breakpoint cluster and maps within one of the microdeletion regions. The mutation abrogates the in vitro enhancer function and alters binding of the transcription factor MSX1 as compared to the wild-type sequence. In the developing mouse mandible, the 3-Mb region bounded by the microdeletions shows a regionally specific chromatin decompaction in cells expressing Sox9. Some cases of PRS may thus result from developmental misexpression of SOX9 due to disruption of very-long-range cis-regulatory elements.
Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

Energy Technology Data Exchange (ETDEWEB)

Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

2003-12-31

Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.
The relationship of protein conservation and sequence length

Directory of Open Access Journals (Sweden)

Panchenko Anna R

2002-11-01

Full Text Available Abstract Background In general, the length of a protein sequence is determined by its function and the wide variance in the lengths of an organism's proteins reflects the diversity of specific functional roles for these proteins. However, additional evolutionary forces that affect the length of a protein may be revealed by studying the length distributions of proteins evolving under weaker functional constraints. Results We performed sequence comparisons to distinguish highly conserved and poorly conserved proteins from the bacterium Escherichia coli, the archaeon Archaeoglobus fulgidus, and the eukaryotes Saccharomyces cerevisiae, Drosophila melanogaster, and Homo sapiens. For all organisms studied, the conserved and nonconserved proteins have strikingly different length distributions. The conserved proteins are, on average, longer than the poorly conserved ones, and the length distributions for the poorly conserved proteins have a relatively narrow peak, in contrast to the conserved proteins whose lengths spread over a wider range of values. For the two prokaryotes studied, the poorly conserved proteins approximate the minimal length distribution expected for a diverse range of structural folds. Conclusions There is a relationship between protein conservation and sequence length. For all the organisms studied, there seems to be a significant evolutionary trend favoring shorter proteins in the absence of other, more specific functional constraints.
The evolutionarily conserved E3 ubiquitin ligase AtCHIP contributes to plant immunity

Directory of Open Access Journals (Sweden)

Xin eLi

2016-03-01

Full Text Available Plants possess a sophisticated immune system to recognize and respond to microbial threats in their environment. The level of immune signaling must be tightly regulated so that immune responses can be quickly activated in the presence of pathogens, while avoiding autoimmunity. HSP90s, along with their diverse array of co-chaperones, forms chaperone complexes that have been shown to play both positive and negative roles in regulating the accumulation of immune receptors and regulators. In this study, we examined the role of AtCHIP, an evolutionarily conserved E3 ligase that was known to interact with chaperones including HSP90s in multicellular organisms including fruit fly, C. elegans, plants and human. Atchip knockout mutants display enhanced disease susceptibility to a virulent oomycete pathogen, and overexpression of AtCHIP causes enhanced disease resistance at low temperature. Although CHIP was reported to target HSP90 for ubiquitination and degradation, accumulation of HSP90.3 was not affected in Atchip plants. In addition, protein accumulation of nucleotide-binding, leucine-rich repeat domain immune receptor (NLR SNC1 is not altered in Atchip mutant. Thus, while AtCHIP plays a role in immunity, it does not seem to regulate the turnover of HSP90 or SNC1. Further investigation is needed in order to determine the exact mechanism behind AtCHIP’s role in regulating plant immune responses.
Evolutionarily conserved transcription factor Apontic controls the G1/S progression by inducing cyclin e during eye development

KAUST Repository

Liu, Qingxin

2014-06-16

During Drosophila eye development, differentiation initiates in the posterior region of the eye disk and progresses anteriorly as a wave marked by the morphogenetic furrow (MF), which demarcates the boundary between anterior undifferentiated cells and posterior differentiated photoreceptors. However, the mechanism underlying the regulation of gene expression immediately before the onset of differentiation remains unclear. Here, we show that Apontic (Apt), which is an evolutionarily conserved transcription factor, is expressed in the differentiating cells posterior to the MF. Moreover, it directly induces the expression of cyclin E and is also required for the G1-to-S phase transition, which is known to be essential for the initiation of cell differentiation at the MF. These observations identify a pathway crucial for eye development, governed by a mechanism in which Cyclin E promotes the G1-to-S phase transition when regulated by Apt.

Evolutionarily conserved transcription factor Apontic controls the G1/S progression by inducing cyclin e during eye development

KAUST Repository

Liu, Qingxin; Wang, Xianfeng; Ikeo, Kazuho; Hirose, Susumu; Gehring, Walter Jakob; Gojobori, Takashi

2014-01-01

During Drosophila eye development, differentiation initiates in the posterior region of the eye disk and progresses anteriorly as a wave marked by the morphogenetic furrow (MF), which demarcates the boundary between anterior undifferentiated cells and posterior differentiated photoreceptors. However, the mechanism underlying the regulation of gene expression immediately before the onset of differentiation remains unclear. Here, we show that Apontic (Apt), which is an evolutionarily conserved transcription factor, is expressed in the differentiating cells posterior to the MF. Moreover, it directly induces the expression of cyclin E and is also required for the G1-to-S phase transition, which is known to be essential for the initiation of cell differentiation at the MF. These observations identify a pathway crucial for eye development, governed by a mechanism in which Cyclin E promotes the G1-to-S phase transition when regulated by Apt.
Evolutionarily conserved regions of the human c-myc protein can be uncoupled from transforming activity

International Nuclear Information System (INIS)

Sarid, J.; Halazonetis, T.D.; Murphy, W.; Leder, P.

1987-01-01

The myc family of oncogenes contains coding sequences that have been preserved in different species for over 400 million years. This conservation (which implies functional selection) is broadly represented throughout the C-terminal portion of the human c-myc protein but is largely restricted to three cluster of amino acid sequences in the N-terminal region. The authors have examined the role that the latter three regions of the c-myc protein might play in the transforming function of the c-myc gene. Several mutations, deletions and frameshifts, were introduced into the c-myc gene, and these mutant genes were tested for their ability to collaborate with the EJ-ras oncogene to transform rat embryo fibroblasts. Complete elimination of the first two N-terminal conserved segments abolished transforming activity. In contrast, genes altered in a portion of the second or the entire third conserved segment retained their transforming activity. Thus, the latter two segments are not required for the transformation process, suggesting that they serve another function related only to the normal expression of the c-myc gene
Conservation and diversification of Msx protein in metazoan evolution.

Science.gov (United States)

Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun

2008-01-01

Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. We speculate that selective loss of the conserved domains in Msx family
UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.

Science.gov (United States)

Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier

2016-01-04

The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
DNA Sequence-Mediated, Evolutionarily Rapid Redistribution of Meiotic Recombination Hotspots

Science.gov (United States)

Wahls, Wayne P.; Davidson, Mari K.

2011-01-01

Hotspots regulate the position and frequency of Spo11 (Rec12)-initiated meiotic recombination, but paradoxically they are suicidal and are somehow resurrected elsewhere in the genome. After the DNA sequence-dependent activation of hotspots was discovered in fission yeast, nearly two decades elapsed before the key realizations that (A) DNA site-dependent regulation is broadly conserved and (B) individual eukaryotes have multiple different DNA sequence motifs that activate hotspots. From our perspective, such findings provide a conceptually straightforward solution to the hotspot paradox and can explain other, seemingly complex features of meiotic recombination. We describe how a small number of single-base-pair substitutions can generate hotspots de novo and dramatically alter their distribution in the genome. This model also shows how equilibrium rate kinetics could maintain the presence of hotspots over evolutionary timescales, without strong selective pressures invoked previously, and explains why hotspots localize preferentially to intergenic regions and introns. The model is robust enough to account for all hotspots of humans and chimpanzees repositioned since their divergence from the latest common ancestor. PMID:22084420
Sequencing Conservation Actions Through Threat Assessments in the Southeastern United States

Science.gov (United States)

Robert D. Sutter; Christopher C. Szell

2006-01-01

The identification of conservation priorities is one of the leading issues in conservation biology. We present a project of The Nature Conservancy, called Sequencing Conservation Actions, which prioritizes conservation areas and identifies foci for crosscutting strategies at various geographic scales. We use the term âSequencingâ to mean an ordering of actions over...
Widespread Shortening of 3' Untranslated Regions and Increased Exon Inclusion Are Evolutionarily Conserved Features of Innate Immune Responses to Infection.

Directory of Open Access Journals (Sweden)

Athma A Pai

2016-09-01

Full Text Available The contribution of pre-mRNA processing mechanisms to the regulation of immune responses remains poorly studied despite emerging examples of their role as regulators of immune defenses. We sought to investigate the role of mRNA processing in the cellular responses of human macrophages to live bacterial infections. Here, we used mRNA sequencing to quantify gene expression and isoform abundances in primary macrophages from 60 individuals, before and after infection with Listeria monocytogenes and Salmonella typhimurium. In response to both bacteria we identified thousands of genes that significantly change isoform usage in response to infection, characterized by an overall increase in isoform diversity after infection. In response to both bacteria, we found global shifts towards (i the inclusion of cassette exons and (ii shorter 3' UTRs, with near-universal shifts towards usage of more upstream polyadenylation sites. Using complementary data collected in non-human primates, we show that these features are evolutionarily conserved among primates. Following infection, we identify candidate RNA processing factors whose expression is associated with individual-specific variation in isoform abundance. Finally, by profiling microRNA levels, we show that 3' UTRs with reduced abundance after infection are significantly enriched for target sites for particular miRNAs. These results suggest that the pervasive usage of shorter 3' UTRs is a mechanism for particular genes to evade repression by immune-activated miRNAs. Collectively, our results suggest that dynamic changes in RNA processing may play key roles in the regulation of innate immune responses.
Asy2/Mer2: an evolutionarily conserved mediator of meiotic recombination, pairing, and global chromosome compaction.

Science.gov (United States)

Tessé, Sophie; Bourbon, Henri-Marc; Debuchy, Robert; Budin, Karine; Dubois, Emeline; Liangran, Zhang; Antoine, Romain; Piolot, Tristan; Kleckner, Nancy; Zickler, Denise; Espagne, Eric

2017-09-15

Meiosis is the cellular program by which a diploid cell gives rise to haploid gametes for sexual reproduction. Meiotic progression depends on tight physical and functional coupling of recombination steps at the DNA level with specific organizational features of meiotic-prophase chromosomes. The present study reveals that every step of this coupling is mediated by a single molecule: Asy2/Mer2. We show that Mer2, identified so far only in budding and fission yeasts, is in fact evolutionarily conserved from fungi (Mer2/Rec15/Asy2/Bad42) to plants (PRD3/PAIR1) and mammals (IHO1). In yeasts, Mer2 mediates assembly of recombination-initiation complexes and double-strand breaks (DSBs). This role is conserved in the fungus Sordaria However, functional analysis of 13 mer2 mutants and successive localization of Mer2 to axis, synaptonemal complex (SC), and chromatin revealed, in addition, three further important functions. First, after DSB formation, Mer2 is required for pairing by mediating homolog spatial juxtaposition, with implications for crossover (CO) patterning/interference. Second, Mer2 participates in the transfer/maintenance and release of recombination complexes to/from the SC central region. Third, after completion of recombination, potentially dependent on SUMOylation, Mer2 mediates global chromosome compaction and post-recombination chiasma development. Thus, beyond its role as a recombinosome-axis/SC linker molecule, Mer2 has important functions in relation to basic chromosome structure. © 2017 Tessé et al.; Published by Cold Spring Harbor Laboratory Press.
Close Sequence Comparisons are Sufficient to Identify Humancis-Regulatory Elements

Energy Technology Data Exchange (ETDEWEB)

Prabhakar, Shyam; Poulin, Francis; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Couronne, Olivier; Pennacchio, Len A.

2005-12-01

Cross-species DNA sequence comparison is the primary method used to identify functional noncoding elements in human and other large genomes. However, little is known about the relative merits of evolutionarily close and distant sequence comparisons, due to the lack of a universal metric for sequence conservation, and also the paucity of empirically defined benchmark sets of cis-regulatory elements. To address this problem, we developed a general-purpose algorithm (Gumby) that detects slowly-evolving regions in primate, mammalian and more distant comparisons without requiring adjustment of parameters, and ranks conserved elements by P-value using Karlin-Altschul statistics. We benchmarked Gumby predictions against previously identified cis-regulatory elements at diverse genomic loci, and also tested numerous extremely conserved human-rodent sequences for transcriptional enhancer activity using reporter-gene assays in transgenic mice. Human regulatory elements were identified with acceptable sensitivity and specificity by comparison with 1-5 other eutherian mammals or 6 other simian primates. More distant comparisons (marsupial, avian, amphibian and fish) failed to identify many of the empirically defined functional noncoding elements. We derived an intuitive relationship between ancient and recent noncoding sequence conservation from whole genome comparative analysis, which explains some of these findings. Lastly, we determined that, in addition to strength of conservation, genomic location and/or density of surrounding conserved elements must also be considered in selecting candidate enhancers for testing at embryonic time points.
Highly conserved non-coding sequences are associated with vertebrate development.

Directory of Open Access Journals (Sweden)

Adam Woolfe

2005-01-01

Full Text Available In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data is presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH, in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development
Identification of evolutionarily conserved Momordica charantia microRNAs using computational approach and its utility in phylogeny analysis.

Science.gov (United States)

Thirugnanasambantham, Krishnaraj; Saravanan, Subramanian; Karikalan, Kulandaivelu; Bharanidharan, Rajaraman; Lalitha, Perumal; Ilango, S; HairulIslam, Villianur Ibrahim

2015-10-01

Momordica charantia (bitter gourd, bitter melon) is a monoecious Cucurbitaceae with anti-oxidant, anti-microbial, anti-viral and anti-diabetic potential. Molecular studies on this economically valuable plant are very essential to understand its phylogeny and evolution. MicroRNAs (miRNAs) are conserved, small, non-coding RNA with ability to regulate gene expression by bind the 3' UTR region of target mRNA and are evolved at different rates in different plant species. In this study we have utilized homology based computational approach and identified 27 mature miRNAs for the first time from this bio-medically important plant. The phylogenetic tree developed from binary data derived from the data on presence/absence of the identified miRNAs were noticed to be uncertain and biased. Most of the identified miRNAs were highly conserved among the plant species and sequence based phylogeny analysis of miRNAs resolved the above difficulties in phylogeny approach using miRNA. Predicted gene targets of the identified miRNAs revealed their importance in regulation of plant developmental process. Reported miRNAs held sequence conservation in mature miRNAs and the detailed phylogeny analysis of pre-miRNA sequences revealed genus specific segregation of clusters. Copyright © 2015 Elsevier Ltd. All rights reserved.
Protection from UV light is an evolutionarily conserved feature of the haematopoietic niche

Science.gov (United States)

Kapp, Friedrich G.; Perlin, Julie R.; Hagedorn, Elliott J.; Gansner, John M.; Schwarz, Daniel E.; O'Connell, Lauren A.; Johnson, Nicholas; Amemiya, Chris; Fisher, David E.; Wolfle, Ute; Trompouki, Eirini; Niemeyer, Charlotte M.; Driever, Wolfgang; Zon, Leonard I.

2018-01-01

Haematopoietic stem and progenitor cells (HSPCs) require a specific microenvironment, the haematopoietic niche, which regulates HSPC behaviour. The location of this niche varies across species, but the evolutionary pressures that drive HSPCs to different microenvironments remain unknown. The niche is located in the bone marrow in adult mammals, whereas it is found in other locations in non-mammalian vertebrates, for example, in the kidney marrow in teleost fish. Here we show that a melanocyte umbrella above the kidney marrow protects HSPCs against ultraviolet light in zebrafish. Because mutants that lack melanocytes have normal steady-state haematopoiesis under standard laboratory conditions, we hypothesized that melanocytes above the stem cell niche protect HSPCs against ultraviolet-light-induced DNA damage. Indeed, after ultraviolet-light irradiation, unpigmented larvae show higher levels of DNA damage in HSPCs, as indicated by staining of cyclobutane pyrimidine dimers and have reduced numbers of HSPCs, as shown by cmyb (also known as myb) expression. The umbrella of melanocytes associated with the haematopoietic niche is highly evolutionarily conserved in aquatic animals, including the sea lamprey, a basal vertebrate. During the transition from an aquatic to a terrestrial environment, HSPCs relocated into the bone marrow, which is protected from ultraviolet light by the cortical bone around the marrow. Our studies reveal that melanocytes above the haematopoietic niche protect HSPCs from ultraviolet-light-induced DNA damage in aquatic vertebrates and suggest that during the transition to terrestrial life, ultraviolet light was an evolutionary pressure affecting the location of the haematopoietic niche.
Accelerated Evolution of Conserved Noncoding Sequences in theHuman Genome

Energy Technology Data Exchange (ETDEWEB)

Prambhakar, Shyam; Noonan, James P.; Paabo, Svante; Rubin, EdwardM.

2006-07-06

Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detect"cryptic" functional elements, which are too weakly conserved amongmammals to distinguish from nonfunctional DNA. To address this problem,we explored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.
Possible conservation units of the sun bear (Helarctos malayanus) in Sarawak based on variation of mtDNA control region.

Science.gov (United States)

Onuma, Manabu; Suzuki, Masatsugu; Ohtaishi, Noriyuki

2006-11-01

The mitochondrial DNA control region of the sun bear (Helarctos malayanus) was sequenced using 21 DNA samples collected from confiscated sun bears to identify conservation units, such as evolutionarily significant units and management units, in Sarawak, Borneo Island. A total of 10 haplotypes were observed, indicating the presence of at least two lineages in the sun bear population in Sarawak. Presumably, these two lineages could represent evolutionarily significant units. However, the geographical distributions of the two lineages remained unknown due to the lack of information regarding the exact capture locations of the confiscated sun bears. It is essential to elucidate the geographical distributions of these lineages in order to create a proper conservation plan for the sun bears in Sarawak. Therefore, further studies examining the haplotype distributions using DNA samples from known localities are essential.
High-Throughput Sequencing Reveals Diverse Sets of Conserved, Nonconserved, and Species-Specific miRNAs in Jute

Directory of Open Access Journals (Sweden)

Md. Tariqul Islam

2015-01-01

Full Text Available MicroRNAs play a pivotal role in regulating a broad range of biological processes, acting by cleaving mRNAs or by translational repression. A group of plant microRNAs are evolutionarily conserved; however, others are expressed in a species-specific manner. Jute is an agroeconomically important fibre crop; nonetheless, no practical information is available for microRNAs in jute to date. In this study, Illumina sequencing revealed a total of 227 known microRNAs and 17 potential novel microRNA candidates in jute, of which 164 belong to 23 conserved families and the remaining 63 belong to 58 nonconserved families. Among a total of 81 identified microRNA families, 116 potential target genes were predicted for 39 families and 11 targets were predicted for 4 among the 17 identified novel microRNAs. For understanding better the functions of microRNAs, target genes were analyzed by Gene Ontology and their pathways illustrated by KEGG pathway analyses. The presence of microRNAs identified in jute was validated by stem-loop RT-PCR followed by end point PCR and qPCR for randomly selected 20 known and novel microRNAs. This study exhaustively identifies microRNAs and their target genes in jute which will ultimately pave the way for understanding their role in this crop and other crops.
Whole-Genome de novo Sequencing Of Quail And Grey Partridge

DEFF Research Database (Denmark)

Holm, Lars-Erik; Panitz, Frank; Burt, Dave

2011-01-01

The development in sequencing methods has made it possible to perform whole genome de novo sequencing of species without large commercial interests. Within the EU-financed QUANTOMICS project (KBBE-2A-222664), we have performed de novo sequencing of quail (Coturnix coturnix) and grey partridge...... (Perdix perdix) on a Genome Analyzer GAII (Illumina) using paired-end sequencing. The amount of generated sequences amounts to 8 to 9 Gb for each species. The analysis and assembly of the generated sequences is ongoing. Access to the whole genome sequence from these two species will enable enhanced...... comparative studies towards the chicken genome and will aid in identifying evolutionarily conserved sequences within the Galliformes. The obtained sequences from quail and partridge represent a beginning of generating the whole genome sequence for these species. The continuation of establishing the genome...
A computational tool to predict the evolutionarily conserved protein-protein interaction hot-spot residues from the structure of the unbound protein.

Science.gov (United States)

Agrawal, Neeraj J; Helk, Bernhard; Trout, Bernhardt L

2014-01-21

Identifying hot-spot residues - residues that are critical to protein-protein binding - can help to elucidate a protein's function and assist in designing therapeutic molecules to target those residues. We present a novel computational tool, termed spatial-interaction-map (SIM), to predict the hot-spot residues of an evolutionarily conserved protein-protein interaction from the structure of an unbound protein alone. SIM can predict the protein hot-spot residues with an accuracy of 36-57%. Thus, the SIM tool can be used to predict the yet unknown hot-spot residues for many proteins for which the structure of the protein-protein complexes are not available, thereby providing a clue to their functions and an opportunity to design therapeutic molecules to target these proteins. Copyright © 2013 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Detection of Weakly Conserved Ancestral Mammalian RegulatorySequences by Primate Comparisons

Energy Technology Data Exchange (ETDEWEB)

Wang, Qian-fei; Prabhakar, Shyam; Chanan, Sumita; Cheng,Jan-Fang; Rubin, Edward M.; Boffelli, Dario

2006-06-01

Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detectcryptic functional elements, which are too weakly conserved among mammalsto distinguish from nonfunctional DNA. To address this problem, weexplored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.
Inverse statistical physics of protein sequences: a key issues review.

Science.gov (United States)

Cocco, Simona; Feinauer, Christoph; Figliuzzi, Matteo; Monasson, Rémi; Weigt, Martin

2018-03-01

In the course of evolution, proteins undergo important changes in their amino acid sequences, while their three-dimensional folded structure and their biological function remain remarkably conserved. Thanks to modern sequencing techniques, sequence data accumulate at unprecedented pace. This provides large sets of so-called homologous, i.e. evolutionarily related protein sequences, to which methods of inverse statistical physics can be applied. Using sequence data as the basis for the inference of Boltzmann distributions from samples of microscopic configurations or observables, it is possible to extract information about evolutionary constraints and thus protein function and structure. Here we give an overview over some biologically important questions, and how statistical-mechanics inspired modeling approaches can help to answer them. Finally, we discuss some open questions, which we expect to be addressed over the next years.
Evolutionarily conserved morphogenetic movements at the vertebrate head–trunk interface coordinate the transport and assembly of hypopharyngeal structures

Science.gov (United States)

Lours-Calet, Corinne; Alvares, Lucia E.; El-Hanfy, Amira S.; Gandesha, Saniel; Walters, Esther H.; Sobreira, Débora Rodrigues; Wotton, Karl R.; Jorge, Erika C.; Lawson, Jennifer A.; Kelsey Lewis, A.; Tada, Masazumi; Sharpe, Colin; Kardon, Gabrielle; Dietrich, Susanne

2014-01-01

The vertebrate head–trunk interface (occipital region) has been heavily remodelled during evolution, and its development is still poorly understood. In extant jawed vertebrates, this region provides muscle precursors for the throat and tongue (hypopharyngeal/hypobranchial/hypoglossal muscle precursors, HMP) that take a stereotype path rostrally along the pharynx and are thought to reach their target sites via active migration. Yet, this projection pattern emerged in jawless vertebrates before the evolution of migratory muscle precursors. This suggests that a so far elusive, more basic transport mechanism must have existed and may still be traceable today. Here we show for the first time that all occipital tissues participate in well-conserved cell movements. These cell movements are spearheaded by the occipital lateral mesoderm and ectoderm that split into two streams. The rostrally directed stream projects along the floor of the pharynx and reaches as far rostrally as the floor of the mandibular arch and outflow tract of the heart. Notably, this stream leads and engulfs the later emerging HMP, neural crest cells and hypoglossal nerve. When we (i) attempted to redirect hypobranchial/hypoglossal muscle precursors towards various attractants, (ii) placed non-migratory muscle precursors into the occipital environment or (iii) molecularly or (iv) genetically rendered muscle precursors non-migratory, they still followed the trajectory set by the occipital lateral mesoderm and ectoderm. Thus, we have discovered evolutionarily conserved morphogenetic movements, driven by the occipital lateral mesoderm and ectoderm, that ensure cell transport and organ assembly at the head–trunk interface. PMID:24662046

Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.

Science.gov (United States)

Nagar, Anurag; Hahsler, Michael

2013-01-01

Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to
An Evolutionarily Conserved Role of Presenilin in Neuronal Protection in the Aging Drosophila Brain.

Science.gov (United States)

Kang, Jongkyun; Shin, Sarah; Perrimon, Norbert; Shen, Jie

2017-07-01

Mutations in the Presenilin genes are the major genetic cause of Alzheimer's disease. Presenilin and Nicastrin are essential components of γ-secretase, a multi-subunit protease that cleaves Type I transmembrane proteins. Genetic studies in mice previously demonstrated that conditional inactivation of Presenilin or Nicastrin in excitatory neurons of the postnatal forebrain results in memory deficits, synaptic impairment, and age-dependent neurodegeneration. The roles of Drosophila Presenilin ( Psn ) and Nicastrin ( Nct ) in the adult fly brain, however, are unknown. To knockdown (KD) Psn or Nct selectively in neurons of the adult brain, we generated multiple shRNA lines. Using a ubiquitous driver, these shRNA lines resulted in 80-90% reduction of mRNA and pupal lethality-a phenotype that is shared with Psn and Nct mutants carrying nonsense mutations. Furthermore, expression of these shRNAs in the wing disc caused notching wing phenotypes, which are also shared with Psn and Nct mutants. Similar to Nct , neuron-specific Psn KD using two independent shRNA lines led to early mortality and rough eye phenotypes, which were rescued by a fly Psn transgene. Interestingly, conditional KD (cKD) of Psn or Nct in adult neurons using the elav-Gal4 and tubulin-Gal80 ts system caused shortened lifespan, climbing defects, increases in apoptosis, and age-dependent neurodegeneration. Together, these findings demonstrate that, similar to their mammalian counterparts, Drosophila Psn and Nct are required for neuronal survival during aging and normal lifespan, highlighting an evolutionarily conserved role of Presenilin in neuronal protection in the aging brain. Copyright © 2017 by the Genetics Society of America.
Two estrogen response element sequences near the PCNA gene are not responsible for its estrogen-enhanced expression in MCF7 cells.

Science.gov (United States)

Wang, Cheng; Yu, Jie; Kallen, Caleb B

2008-01-01

The proliferating cell nuclear antigen (PCNA) is an essential component of DNA replication, cell cycle regulation, and epigenetic inheritance. High expression of PCNA is associated with poor prognosis in patients with breast cancer. The 5'-region of the PCNA gene contains two computationally-detected estrogen response element (ERE) sequences, one of which is evolutionarily conserved. Both of these sequences are of undocumented cis-regulatory function. We recently demonstrated that estradiol (E2) enhances PCNA mRNA expression in MCF7 breast cancer cells. MCF7 cells proliferate in response to E2. Here, we demonstrate that E2 rapidly enhanced PCNA mRNA and protein expression in a process that requires ERalpha as well as de novo protein synthesis. One of the two upstream ERE sequences was specifically bound by ERalpha-containing protein complexes, in vitro, in gel shift analysis. Yet, each ERE sequence, when cloned as a single copy, or when engineered as two tandem copies of the ERE-containing sequence, was not capable of activating a luciferase reporter construct in response to E2. In MCF7 cells, neither ERE-containing genomic region demonstrated E2-dependent recruitment of ERalpha by sensitive ChIP-PCR assays. We conclude that E2 enhances PCNA gene expression by an indirect process and that computational detection of EREs, even when evolutionarily conserved and when near E2-responsive genes, requires biochemical validation.
Sequence conservation and combinatorial complexity of Drosophila neural precursor cell enhancers

Directory of Open Access Journals (Sweden)

Kuzin Alexander

2008-08-01

Full Text Available Abstract Background The presence of highly conserved sequences within cis-regulatory regions can serve as a valuable starting point for elucidating the basis of enhancer function. This study focuses on regulation of gene expression during the early events of Drosophila neural development. We describe the use of EvoPrinter and cis-Decoder, a suite of interrelated phylogenetic footprinting and alignment programs, to characterize highly conserved sequences that are shared among co-regulating enhancers. Results Analysis of in vivo characterized enhancers that drive neural precursor gene expression has revealed that they contain clusters of highly conserved sequence blocks (CSBs made up of shorter shared sequence elements which are present in different combinations and orientations within the different co-regulating enhancers; these elements contain either known consensus transcription factor binding sites or consist of novel sequences that have not been functionally characterized. The CSBs of co-regulated enhancers share a large number of sequence elements, suggesting that a diverse repertoire of transcription factors may interact in a highly combinatorial fashion to coordinately regulate gene expression. We have used information gained from our comparative analysis to discover an enhancer that directs expression of the nervy gene in neural precursor cells of the CNS and PNS. Conclusion The combined use EvoPrinter and cis-Decoder has yielded important insights into the combinatorial appearance of fundamental sequence elements required for neural enhancer function. Each of the 30 enhancers examined conformed to a pattern of highly conserved blocks of sequences containing shared constituent elements. These data establish a basis for further analysis and understanding of neural enhancer function.
Properties of Sequence Conservation in Upstream Regulatory and Protein Coding Sequences among Paralogs in Arabidopsis thaliana

Science.gov (United States)

Richardson, Dale N.; Wiehe, Thomas

Whole genome duplication (WGD) has catalyzed the formation of new species, genes with novel functions, altered expression patterns, complexified signaling pathways and has provided organisms a level of genetic robustness. We studied the long-term evolution and interrelationships of 5’ upstream regulatory sequences (URSs), protein coding sequences (CDSs) and expression correlations (EC) of duplicated gene pairs in Arabidopsis. Three distinct methods revealed significant evolutionary conservation between paralogous URSs and were highly correlated with microarray-based expression correlation of the respective gene pairs. Positional information on exact matches between sequences unveiled the contribution of micro-chromosomal rearrangements on expression divergence. A three-way rank analysis of URS similarity, CDS divergence and EC uncovered specific gene functional biases. Transcription factor activity was associated with gene pairs exhibiting conserved URSs and divergent CDSs, whereas a broad array of metabolic enzymes was found to be associated with gene pairs showing diverged URSs but conserved CDSs.
An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data.

Science.gov (United States)

Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E; Greenwood, Alex D

2015-11-24

Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals.
An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data

Science.gov (United States)

Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E.; Greenwood, Alex D.

2015-01-01

Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals. PMID:26610552
Two estrogen response element sequences near the PCNA gene are not responsible for its estrogen-enhanced expression in MCF7 cells.

Directory of Open Access Journals (Sweden)

Cheng Wang

Full Text Available The proliferating cell nuclear antigen (PCNA is an essential component of DNA replication, cell cycle regulation, and epigenetic inheritance. High expression of PCNA is associated with poor prognosis in patients with breast cancer. The 5'-region of the PCNA gene contains two computationally-detected estrogen response element (ERE sequences, one of which is evolutionarily conserved. Both of these sequences are of undocumented cis-regulatory function. We recently demonstrated that estradiol (E2 enhances PCNA mRNA expression in MCF7 breast cancer cells. MCF7 cells proliferate in response to E2.Here, we demonstrate that E2 rapidly enhanced PCNA mRNA and protein expression in a process that requires ERalpha as well as de novo protein synthesis. One of the two upstream ERE sequences was specifically bound by ERalpha-containing protein complexes, in vitro, in gel shift analysis. Yet, each ERE sequence, when cloned as a single copy, or when engineered as two tandem copies of the ERE-containing sequence, was not capable of activating a luciferase reporter construct in response to E2. In MCF7 cells, neither ERE-containing genomic region demonstrated E2-dependent recruitment of ERalpha by sensitive ChIP-PCR assays.We conclude that E2 enhances PCNA gene expression by an indirect process and that computational detection of EREs, even when evolutionarily conserved and when near E2-responsive genes, requires biochemical validation.
An Evolutionarily Young Polar Bear (Ursus maritimus Endogenous Retrovirus Identified from Next Generation Sequence Data

Directory of Open Access Journals (Sweden)

Kyriakos Tsangaras

2015-11-01

Full Text Available Transcriptome analysis of polar bear (Ursus maritimus tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV. Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos and black bear (Ursus americanus but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals.
Genomic dissection of conserved transcriptional regulation in intestinal epithelial cells.

Directory of Open Access Journals (Sweden)

Colin R Lickwar

2017-08-01

Full Text Available The intestinal epithelium serves critical physiologic functions that are shared among all vertebrates. However, it is unknown how the transcriptional regulatory mechanisms underlying these functions have changed over the course of vertebrate evolution. We generated genome-wide mRNA and accessible chromatin data from adult intestinal epithelial cells (IECs in zebrafish, stickleback, mouse, and human species to determine if conserved IEC functions are achieved through common transcriptional regulation. We found evidence for substantial common regulation and conservation of gene expression regionally along the length of the intestine from fish to mammals and identified a core set of genes comprising a vertebrate IEC signature. We also identified transcriptional start sites and other putative regulatory regions that are differentially accessible in IECs in all 4 species. Although these sites rarely showed sequence conservation from fish to mammals, surprisingly, they drove highly conserved IEC expression in a zebrafish reporter assay. Common putative transcription factor binding sites (TFBS found at these sites in multiple species indicate that sequence conservation alone is insufficient to identify much of the functionally conserved IEC regulatory information. Among the rare, highly sequence-conserved, IEC-specific regulatory regions, we discovered an ancient enhancer upstream from her6/HES1 that is active in a distinct population of Notch-positive cells in the intestinal epithelium. Together, these results show how combining accessible chromatin and mRNA datasets with TFBS prediction and in vivo reporter assays can reveal tissue-specific regulatory information conserved across 420 million years of vertebrate evolution. We define an IEC transcriptional regulatory network that is shared between fish and mammals and establish an experimental platform for studying how evolutionarily distilled regulatory information commonly controls IEC development
Conservation patterns in different functional sequence categoriesof divergent Drosophila species

Energy Technology Data Exchange (ETDEWEB)

Papatsenko, Dmitri; Kislyuk, Andrey; Levine, Michael; Dubchak, Inna

2005-10-01

We have explored the distributions of fully conservedungapped blocks in genome-wide pairwise alignments of recently completedspecies of Drosophila: D.yakuba, D.ananassae, D.pseudoobscura, D.virilisand D.mojavensis. Based on these distributions we have found that nearlyevery functional sequence category possesses its own distinctiveconservation pattern, sometimes independent of the overall sequenceconservation level. In the coding and regulatory regions, the ungappedblocks were longer than in introns, UTRs and non-functional sequences. Atthe same time, the blocks in the coding regions carried 3N+2 signaturecharacteristic to synonymic substitutions in the 3rd codon positions.Larger block sizes in transcription regulatory regions can be explainedby the presence of conserved arrays of binding sites for transcriptionfactors. We also have shown that the longest ungapped blocks, or'ultraconserved' sequences, are associated with specific gene groups,including those encoding ion channels and components of the cytoskeleton.We discussed how restrained conservation patterns may help in mappingfunctional sequence categories and improving genomeannotation.
RNA expression in a cartilaginous fish cell line reveals ancient 3′ noncoding regions highly conserved in vertebrates

Science.gov (United States)

Forest, David; Nishikawa, Ryuhei; Kobayashi, Hiroshi; Parton, Angela; Bayne, Christopher J.; Barnes, David W.

2007-01-01

We have established a cartilaginous fish cell line [Squalus acanthias embryo cell line (SAE)], a mesenchymal stem cell line derived from the embryo of an elasmobranch, the spiny dogfish shark S. acanthias. Elasmobranchs (sharks and rays) first appeared >400 million years ago, and existing species provide useful models for comparative vertebrate cell biology, physiology, and genomics. Comparative vertebrate genomics among evolutionarily distant organisms can provide sequence conservation information that facilitates identification of critical coding and noncoding regions. Although these genomic analyses are informative, experimental verification of functions of genomic sequences depends heavily on cell culture approaches. Using ESTs defining mRNAs derived from the SAE cell line, we identified lengthy and highly conserved gene-specific nucleotide sequences in the noncoding 3′ UTRs of eight genes involved in the regulation of cell growth and proliferation. Conserved noncoding 3′ mRNA regions detected by using the shark nucleotide sequences as a starting point were found in a range of other vertebrate orders, including bony fish, birds, amphibians, and mammals. Nucleotide identity of shark and human in these regions was remarkably well conserved. Our results indicate that highly conserved gene sequences dating from the appearance of jawed vertebrates and representing potential cis-regulatory elements can be identified through the use of cartilaginous fish as a baseline. Because the expression of genes in the SAE cell line was prerequisite for their identification, this cartilaginous fish culture system also provides a physiologically valid tool to test functional hypotheses on the role of these ancient conserved sequences in comparative cell biology. PMID:17227856
Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes

DEFF Research Database (Denmark)

Siepel, Adam; Bejerano, Gill; Pedersen, Jakob Skou

2005-01-01

We have conducted a comprehensive search for conserved elements in vertebrate genomes, using genome-wide multiple alignments of five vertebrate species (human, mouse, rat, chicken, and Fugu rubripes). Parallel searches have been performed with multiple alignments of four insect species (three...... species of Drosophila and Anopheles gambiae), two species of Caenorhabditis, and seven species of Saccharomyces. Conserved elements were identified with a computer program called phastCons, which is based on a two-state phylogenetic hidden Markov model (phylo-HMM). PhastCons works by fitting a phylo......-HMM to the data by maximum likelihood, subject to constraints designed to calibrate the model across species groups, and then predicting conserved elements based on this model. The predicted elements cover roughly 3%-8% of the human genome (depending on the details of the calibration procedure) and substantially...
Peptomics, identification of novel cationic Arabidopsis peptides with conserved sequence motifs

DEFF Research Database (Denmark)

Olsen, Addie Nina; Mundy, John; Skriver, Karen

2002-01-01

Arabidopsis family of 34 genes. The predicted peptides are characterized by a conserved C-terminal sequence motif and additional primary structure conservation in a core region. The majority of these genes had not previously been annotated. A subset of the predicted peptides show high overall sequence...... similarity to Rapid Alkalinization Factor (RALF), a peptide isolated from tobacco. We therefore refer to this peptide family as RALFL for RALF-Like. RT-PCR analysis confirmed that several of the Arabidopsis genes are expressed and that their expression patterns vary. The identification of a large gene family...
Evolutionarily conserved 5'-3' exoribonuclease Xrn1 accumulates at plasma membrane-associated eisosomes in post-diauxic yeast.

Directory of Open Access Journals (Sweden)

Tomas Grousl

Full Text Available Regulation of gene expression on the level of translation and mRNA turnover is widely conserved evolutionarily. We have found that the main mRNA decay enzyme, exoribonuclease Xrn1, accumulates at the plasma membrane-associated eisosomes after glucose exhaustion in a culture of the yeast S. cerevisiae. Eisosomal localization of Xrn1 is not achieved in cells lacking the main component of eisosomes, Pil1, or Sur7, the protein accumulating at the membrane compartment of Can1 (MCC - the eisosome-organized plasma membrane microdomain. In contrast to the conditions of diauxic shift, when Xrn1 accumulates in processing bodies (P-bodies, or acute heat stress, in which these cytosolic accumulations of Xrn1 associate with eIF3a/Rpg1-containing stress granules, Xrn1 is not accompanied by other mRNA-decay machinery components when it accumulates at eisosomes in post-diauxic cells. It is important that Xrn1 is released from eisosomes after addition of fermentable substrate. We suggest that this spatial segregation of Xrn1 from the rest of the mRNA-decay machinery reflects a general regulatory mechanism, in which the key enzyme is kept separate from the rest of mRNA decay factors in resting cells but ready for immediate use when fermentable nutrients emerge and appropriate metabolism reprogramming is required. In particular, the localization of Xrn1 to the eisosome, together with previously published data, accents the relevance of this plasma membrane-associated compartment as a multipotent regulatory site.
WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences

Directory of Open Access Journals (Sweden)

Pesole Graziano

2007-02-01

Full Text Available Abstract Background This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely used given the ever increasing amount of genomic data available. Results We present an algorithm that identifies conserved transcription factor binding sites in a given sequence by comparing it to one or more homologs, adapting a framework we previously introduced for the discovery of sites in sequences from co-regulated genes. Differently from the most commonly used methods, the approach we present does not need or compute an alignment of the sequences investigated, nor resorts to descriptors of the binding specificity of known transcription factors. The main novel idea we introduce is a relative measure of conservation, assuming that true functional elements should present a higher level of conservation with respect to the rest of the sequence surrounding them. We present tests where we applied the algorithm to the identification of conserved annotated sites in homologous promoters, as well as in distal regions like enhancers. Conclusion Results of the tests show how the algorithm can provide fast and reliable predictions of conserved transcription factor binding sites regulating the transcription of a gene, with better performances than other available methods for the same task. We also show examples on how the algorithm can be successfully employed when promoter annotations of the genes investigated are missing, or when regulatory sites and regions are located far away from the genes.
The PRC2-binding long non-coding RNAs in human and mouse genomes are associated with predictive sequence features

Science.gov (United States)

Tu, Shiqi; Yuan, Guo-Cheng; Shao, Zhen

2017-01-01

Recently, long non-coding RNAs (lncRNAs) have emerged as an important class of molecules involved in many cellular processes. One of their primary functions is to shape epigenetic landscape through interactions with chromatin modifying proteins. However, mechanisms contributing to the specificity of such interactions remain poorly understood. Here we took the human and mouse lncRNAs that were experimentally determined to have physical interactions with Polycomb repressive complex 2 (PRC2), and systematically investigated the sequence features of these lncRNAs by developing a new computational pipeline for sequences composition analysis, in which each sequence is considered as a series of transitions between adjacent nucleotides. Through that, PRC2-binding lncRNAs were found to be associated with a set of distinctive and evolutionarily conserved sequence features, which can be utilized to distinguish them from the others with considerable accuracy. We further identified fragments of PRC2-binding lncRNAs that are enriched with these sequence features, and found they show strong PRC2-binding signals and are more highly conserved across species than the other parts, implying their functional importance.
Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

Science.gov (United States)

Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

2015-01-01

Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930
Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence.

Directory of Open Access Journals (Sweden)

Kacy L Gordon

2015-05-01

Full Text Available Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2 from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements.
In Vivo Enhancer Analysis Chromosome 16 Conserved NoncodingSequences

Energy Technology Data Exchange (ETDEWEB)

Pennacchio, Len A.; Ahituv, Nadav; Moses, Alan M.; Nobrega,Marcelo; Prabhakar, Shyam; Shoukry, Malak; Minovitsky, Simon; Visel,Axel; Dubchak, Inna; Holt, Amy; Lewis, Keith D.; Plajzer-Frick, Ingrid; Akiyama, Jennifer; De Val, Sarah; Afzal, Veena; Black, Brian L.; Couronne, Olivier; Eisen, Michael B.; Rubin, Edward M.

2006-02-01

The identification of enhancers with predicted specificitiesin vertebrate genomes remains a significant challenge that is hampered bya lack of experimentally validated training sets. In this study, weleveraged extreme evolutionary sequence conservation as a filter toidentify putative gene regulatory elements and characterized the in vivoenhancer activity of human-fish conserved and ultraconserved1 noncodingelements on human chromosome 16 as well as such elements from elsewherein the genome. We initially tested 165 of these extremely conservedsequences in a transgenic mouse enhancer assay and observed that 48percent (79/165) functioned reproducibly as tissue-specific enhancers ofgene expression at embryonic day 11.5. While driving expression in abroad range of anatomical structures in the embryo, the majority of the79 enhancers drove expression in various regions of the developingnervous system. Studying a set of DNA elements that specifically droveforebrain expression, we identified DNA signatures specifically enrichedin these elements and used these parameters to rank all ~;3,400human-fugu conserved noncoding elements in the human genome. The testingof the top predictions in transgenic mice resulted in a three-foldenrichment for sequences with forebrain enhancer activity. These datadramatically expand the catalogue of in vivo-characterized human geneenhancers and illustrate the future utility of such training sets for avariety of iological applications including decoding the regulatoryvocabulary of the human genome.

Computational Analysis of an Evolutionarily Conserved VertebrateMuscle Alternative Splicing Program

Energy Technology Data Exchange (ETDEWEB)

Das, Debopriya; Clark, Tyson A.; Schweitzer, Anthony; Marr,Henry; Yamamoto, Miki L.; Parra, Marilyn K.; Arribere, Josh; Minovitsky,Simon; Dubchak, Inna; Blume, John E.; Conboy, John G.

2006-06-15

A novel exon microarray format that probes gene expression with single exon resolution was employed to elucidate critical features of a vertebrate muscle alternative splicing program. A dataset of 56 microarray-defined, muscle-enriched exons and their flanking introns were examined computationally in order to investigate coordination of the muscle splicing program. Candidate intron regulatory motifs were required to meet several stringent criteria: significant over-representation near muscle-enriched exons, correlation with muscle expression, and phylogenetic conservation among genomes of several vertebrate orders. Three classes of regulatory motifs were identified in the proximal downstream intron, within 200nt of the target exons: UGCAUG, a specific binding site for Fox-1 related splicing factors; ACUAAC, a novel branchpoint-like element; and UG-/UGC-rich elements characteristic of binding sites for CELF splicing factors. UGCAUG was remarkably enriched, being present in nearly one-half of all cases. These studies suggest that Fox and CELF splicing factors play a major role in enforcing the muscle-specific alternative splicing program, facilitating expression of a set of unique isoforms of cytoskeletal proteins that are critical to muscle cell differentiation. Supplementary materials: There are four supplementary tables and one supplementary figure. The tables provide additional detailed information concerning the muscle-enriched datasets, and about over-represented oligonucleotide sequences in the flanking introns. The supplementary figure shows RT-PCR data confirming the muscle-enriched expression of exons predicted from the microarray analysis.
AlignMiner: a Web-based tool for detection of divergent regions in multiple sequence alignments of conserved sequences

Directory of Open Access Journals (Sweden)

Claros M Gonzalo

2010-06-01

Full Text Available Abstract Background Multiple sequence alignments are used to study gene or protein function, phylogenetic relations, genome evolution hypotheses and even gene polymorphisms. Virtually without exception, all available tools focus on conserved segments or residues. Small divergent regions, however, are biologically important for specific quantitative polymerase chain reaction, genotyping, molecular markers and preparation of specific antibodies, and yet have received little attention. As a consequence, they must be selected empirically by the researcher. AlignMiner has been developed to fill this gap in bioinformatic analyses. Results AlignMiner is a Web-based application for detection of conserved and divergent regions in alignments of conserved sequences, focusing particularly on divergence. It accepts alignments (protein or nucleic acid obtained using any of a variety of algorithms, which does not appear to have a significant impact on the final results. AlignMiner uses different scoring methods for assessing conserved/divergent regions, Entropy being the method that provides the highest number of regions with the greatest length, and Weighted being the most restrictive. Conserved/divergent regions can be generated either with respect to the consensus sequence or to one master sequence. The resulting data are presented in a graphical interface developed in AJAX, which provides remarkable user interaction capabilities. Users do not need to wait until execution is complete and can.even inspect their results on a different computer. Data can be downloaded onto a user disk, in standard formats. In silico and experimental proof-of-concept cases have shown that AlignMiner can be successfully used to designing specific polymerase chain reaction primers as well as potential epitopes for antibodies. Primer design is assisted by a module that deploys several oligonucleotide parameters for designing primers "on the fly". Conclusions AlignMiner can be used
The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

Directory of Open Access Journals (Sweden)

Roberts Richard J

2008-05-01

Full Text Available Abstract Background Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360, cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R. BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.
Whole-genome sequencing approaches for conservation biology: Advantages, limitations and practical recommendations.

Science.gov (United States)

Fuentes-Pardo, Angela P; Ruzzante, Daniel E

2017-10-01

Whole-genome resequencing (WGR) is a powerful method for addressing fundamental evolutionary biology questions that have not been fully resolved using traditional methods. WGR includes four approaches: the sequencing of individuals to a high depth of coverage with either unresolved or resolved haplotypes, the sequencing of population genomes to a high depth by mixing equimolar amounts of unlabelled-individual DNA (Pool-seq) and the sequencing of multiple individuals from a population to a low depth (lcWGR). These techniques require the availability of a reference genome. This, along with the still high cost of shotgun sequencing and the large demand for computing resources and storage, has limited their implementation in nonmodel species with scarce genomic resources and in fields such as conservation biology. Our goal here is to describe the various WGR methods, their pros and cons and potential applications in conservation biology. WGR offers an unprecedented marker density and surveys a wide diversity of genetic variations not limited to single nucleotide polymorphisms (e.g., structural variants and mutations in regulatory elements), increasing their power for the detection of signatures of selection and local adaptation as well as for the identification of the genetic basis of phenotypic traits and diseases. Currently, though, no single WGR approach fulfils all requirements of conservation genetics, and each method has its own limitations and sources of potential bias. We discuss proposed ways to minimize such biases. We envision a not distant future where the analysis of whole genomes becomes a routine task in many nonmodel species and fields including conservation biology. © 2017 John Wiley & Sons Ltd.
Interaction of MYC with host cell factor-1 is mediated by the evolutionarily conserved Myc box IV motif.

Science.gov (United States)

Thomas, L R; Foshage, A M; Weissmiller, A M; Popay, T M; Grieb, B C; Qualls, S J; Ng, V; Carboneau, B; Lorey, S; Eischen, C M; Tansey, W P

2016-07-07

The MYC family of oncogenes encodes a set of three related transcription factors that are overexpressed in many human tumors and contribute to the cancer-related deaths of more than 70,000 Americans every year. MYC proteins drive tumorigenesis by interacting with co-factors that enable them to regulate the expression of thousands of genes linked to cell growth, proliferation, metabolism and genome stability. One effective way to identify critical co-factors required for MYC function has been to focus on sequence motifs within MYC that are conserved throughout evolution, on the assumption that their conservation is driven by protein-protein interactions that are vital for MYC activity. In addition to their DNA-binding domains, MYC proteins carry five regions of high sequence conservation known as Myc boxes (Mb). To date, four of the Mb motifs (MbI, MbII, MbIIIa and MbIIIb) have had a molecular function assigned to them, but the precise role of the remaining Mb, MbIV, and the reason for its preservation in vertebrate Myc proteins, is unknown. Here, we show that MbIV is required for the association of MYC with the abundant transcriptional coregulator host cell factor-1 (HCF-1). We show that the invariant core of MbIV resembles the tetrapeptide HCF-binding motif (HBM) found in many HCF-interaction partners, and demonstrate that MYC interacts with HCF-1 in a manner indistinguishable from the prototypical HBM-containing protein VP16. Finally, we show that rationalized point mutations in MYC that disrupt interaction with HCF-1 attenuate the ability of MYC to drive tumorigenesis in mice. Together, these data expose a molecular function for MbIV and indicate that HCF-1 is an important co-factor for MYC.
Correlation between sequence conservation and structural thermodynamics of microRNA precursors from human, mouse, and chicken genomes

Directory of Open Access Journals (Sweden)

Wang Shengqi

2010-10-01

Full Text Available Abstract Background Previous studies have shown that microRNA precursors (pre-miRNAs have considerably more stable secondary structures than other native RNAs (tRNA, rRNA, and mRNA and artificial RNA sequences. However, pre-miRNAs with ultra stable secondary structures have not been investigated. It is not known if there is a tendency in pre-miRNA sequences towards or against ultra stable structures? Furthermore, the relationship between the structural thermodynamic stability of pre-miRNA and their evolution remains unclear. Results We investigated the correlation between pre-miRNA sequence conservation and structural stability as measured by adjusted minimum folding free energies in pre-miRNAs isolated from human, mouse, and chicken. The analysis revealed that conserved and non-conserved pre-miRNA sequences had structures with similar average stabilities. However, the relatively ultra stable and unstable pre-miRNAs were more likely to be non-conserved than pre-miRNAs with moderate stability. Non-conserved pre-miRNAs had more G+C than A+U nucleotides, while conserved pre-miRNAs contained more A+U nucleotides. Notably, the U content of conserved pre-miRNAs was especially higher than that of non-conserved pre-miRNAs. Further investigations showed that conserved and non-conserved pre-miRNAs exhibited different structural element features, even though they had comparable levels of stability. Conclusions We proposed that there is a correlation between structural thermodynamic stability and sequence conservation for pre-miRNAs from human, mouse, and chicken genomes. Our analyses suggested that pre-miRNAs with relatively ultra stable or unstable structures were less favoured by natural selection than those with moderately stable structures. Comparison of nucleotide compositions between non-conserved and conserved pre-miRNAs indicated the importance of U nucleotides in the pre-miRNA evolutionary process. Several characteristic structural elements were
Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

Directory of Open Access Journals (Sweden)

Maggi Giorgio P

2008-06-01

Full Text Available Abstract Background The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite the great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (thus dependent on the availability of annotated proteins. Results In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and the identification of potential gene loci. Our system does not require the availability of any annotated sequence thus is suitable for the analysis of new or poorly annotated genomes.
Deep sequencing discovery of novel and conserved microRNAs in trifoliate orange (Citrus trifoliata

Directory of Open Access Journals (Sweden)

Yu Huaping

2010-07-01

Full Text Available Abstract Background MicroRNAs (miRNAs play a critical role in post-transcriptional gene regulation and have been shown to control many genes involved in various biological and metabolic processes. There have been extensive studies to discover miRNAs and analyze their functions in model plant species, such as Arabidopsis and rice. Deep sequencing technologies have facilitated identification of species-specific or lowly expressed as well as conserved or highly expressed miRNAs in plants. Results In this research, we used Solexa sequencing to discover new microRNAs in trifoliate orange (Citrus trifoliata which is an important rootstock of citrus. A total of 13,106,753 reads representing 4,876,395 distinct sequences were obtained from a short RNA library generated from small RNA extracted from C. trifoliata flower and fruit tissues. Based on sequence similarity and hairpin structure prediction, we found that 156,639 reads representing 63 sequences from 42 highly conserved miRNA families, have perfect matches to known miRNAs. We also identified 10 novel miRNA candidates whose precursors were all potentially generated from citrus ESTs. In addition, five miRNA* sequences were also sequenced. These sequences had not been earlier described in other plant species and accumulation of the 10 novel miRNAs were confirmed by qRT-PCR analysis. Potential target genes were predicted for most conserved and novel miRNAs. Moreover, four target genes including one encoding IRX12 copper ion binding/oxidoreductase and three genes encoding NB-LRR disease resistance protein have been experimentally verified by detection of the miRNA-mediated mRNA cleavage in C. trifoliata. Conclusion Deep sequencing of short RNAs from C. trifoliata flowers and fruits identified 10 new potential miRNAs and 42 highly conserved miRNA families, indicating that specific miRNAs exist in C. trifoliata. These results show that regulatory miRNAs exist in agronomically important trifoliate orange
Remarkable sequence conservation of the last intron in the PKD1 gene.

Science.gov (United States)

Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P

2003-10-01

The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.
Extreme sequence divergence but conserved ligand-binding specificity in Streptococcus pyogenes M protein.

Directory of Open Access Journals (Sweden)

2006-05-01

Full Text Available Many pathogenic microorganisms evade host immunity through extensive sequence variability in a protein region targeted by protective antibodies. In spite of the sequence variability, a variable region commonly retains an important ligand-binding function, reflected in the presence of a highly conserved sequence motif. Here, we analyze the limits of sequence divergence in a ligand-binding region by characterizing the hypervariable region (HVR of Streptococcus pyogenes M protein. Our studies were focused on HVRs that bind the human complement regulator C4b-binding protein (C4BP, a ligand that confers phagocytosis resistance. A previous comparison of C4BP-binding HVRs identified residue identities that could be part of a binding motif, but the extended analysis reported here shows that no residue identities remain when additional C4BP-binding HVRs are included. Characterization of the HVR in the M22 protein indicated that two relatively conserved Leu residues are essential for C4BP binding, but these residues are probably core residues in a coiled-coil, implying that they do not directly contribute to binding. In contrast, substitution of either of two relatively conserved Glu residues, predicted to be solvent-exposed, had no effect on C4BP binding, although each of these changes had a major effect on the antigenic properties of the HVR. Together, these findings show that HVRs of M proteins have an extraordinary capacity for sequence divergence and antigenic variability while retaining a specific ligand-binding function.
Evolutionary Conservation of the Components in the TOR Signaling Pathways.

Science.gov (United States)

Tatebe, Hisashi; Shiozaki, Kazuhiro

2017-11-01

Target of rapamycin (TOR) is an evolutionarily conserved protein kinase that controls multiple cellular processes upon various intracellular and extracellular stimuli. Since its first discovery, extensive studies have been conducted both in yeast and animal species including humans. Those studies have revealed that TOR forms two structurally and physiologically distinct protein complexes; TOR complex 1 (TORC1) is ubiquitous among eukaryotes including animals, yeast, protozoa, and plants, while TOR complex 2 (TORC2) is conserved in diverse eukaryotic species other than plants. The studies have also identified two crucial regulators of mammalian TORC1 (mTORC1), Ras homolog enriched in brain (RHEB) and RAG GTPases. Of these, RAG regulates TORC1 in yeast as well and is conserved among eukaryotes with the green algae and land plants as apparent exceptions. RHEB is present in various eukaryotes but sporadically missing in multiple taxa. RHEB, in the budding yeast Saccharomyces cerevisiae , appears to be extremely divergent with concomitant loss of its function as a TORC1 regulator. In this review, we summarize the evolutionarily conserved functions of the key regulatory subunits of TORC1 and TORC2, namely RAPTOR, RICTOR, and SIN1. We also delve into the evolutionary conservation of RHEB and RAG and discuss the conserved roles of these GTPases in regulating TORC1.
Comparative analyses of six solanaceous transcriptomes reveal a high degree of sequence conservation and species-specific transcripts

Directory of Open Access Journals (Sweden)

Ouyang Shu

2005-09-01

Full Text Available Abstract Background The Solanaceae is a family of closely related species with diverse phenotypes that have been exploited for agronomic purposes. Previous studies involving a small number of genes suggested sequence conservation across the Solanaceae. The availability of large collections of Expressed Sequence Tags (ESTs for the Solanaceae now provides the opportunity to assess sequence conservation and divergence on a genomic scale. Results All available ESTs and Expressed Transcripts (ETs, 449,224 sequences for six Solanaceae species (potato, tomato, pepper, petunia, tobacco and Nicotiana benthamiana, were clustered and assembled into gene indices. Examination of gene ontologies revealed that the transcripts within the gene indices encode a similar suite of biological processes. Although the ESTs and ETs were derived from a variety of tissues, 55–81% of the sequences had significant similarity at the nucleotide level with sequences among the six species. Putative orthologs could be identified for 28–58% of the sequences. This high degree of sequence conservation was supported by expression profiling using heterologous hybridizations to potato cDNA arrays that showed similar expression patterns in mature leaves for all six solanaceous species. 16–19% of the transcripts within the six Solanaceae gene indices did not have matches among Solanaceae, Arabidopsis, rice or 21 other plant gene indices. Conclusion Results from this genome scale analysis confirmed a high level of sequence conservation at the nucleotide level of the coding sequence among Solanaceae. Additionally, the results indicated that part of the Solanaceae transcriptome is likely to be unique for each species.
Identification of microRNAs and their targets in Finger millet by high throughput sequencing.

Science.gov (United States)

Usha, S; Jyothi, M N; Sharadamma, N; Dixit, Rekha; Devaraj, V R; Nagesh Babu, R

2015-12-15

MicroRNAs are short non-coding RNAs which play an important role in regulating gene expression by mRNA cleavage or by translational repression. The majority of identified miRNAs were evolutionarily conserved; however, others expressed in a species-specific manner. Finger millet is an important cereal crop; nonetheless, no practical information is available on microRNAs to date. In this study, we have identified 95 conserved microRNAs belonging to 39 families and 3 novel microRNAs by high throughput sequencing. For the identified conserved and novel miRNAs a total of 507 targets were predicted. 11 miRNAs were validated and tissue specificity was determined by stem loop RT-qPCR, Northern blot. GO analyses revealed targets of miRNA were involved in wide range of regulatory functions. This study implies large number of known and novel miRNAs found in Finger millet which may play important role in growth and development. Copyright © 2015 Elsevier B.V. All rights reserved.
A CACGTG motif of the Antirrhinum majus chalcone synthase promoter is recognized by an evolutionarily conserved nuclear protein

International Nuclear Information System (INIS)

Staiger, D.; Kaulen, H.; Schell, J.

1989-01-01

In the chalcone synthase gene of Antirrhinum majus (snapdragon), 150 base pairs of the 5' flanking region contain cis-acting signals for UV light-induced expression. A nuclear factor, designated CG-1, specifically recognizes a hexameric motif with internal dyad symmetry, CACGTG, located within this light-responsive sequence. Binding of CG-1 is influenced by C-methylation of the CpG dinucleotide in the recognition sequence. CG-1 is a factor found in a variety of dicotyledonous plant species including Nicotiana tabacum, A. majus, Petunia hybrida, Arabidopsis thaliana, and Glycine max. CACGTG motifs contained within trans-acting factor recognition sites in various other plant promoters can interact with CG-1. In addition, the binding site of the human adenovirus major late transcription factor USF can compete for CG-1 binding to the chalcone synthase promoter. This suggests an evolutionary conservation of trans-acting factor recognition sites involved in divergent mechanisms of gene control. (author)
Conservation of AtTZF1, AtTZF2 and AtTZF3 homolog gene regulation by salt stress in evolutionarily distant plant species

Directory of Open Access Journals (Sweden)

Fabio eD'Orso

2015-06-01

Full Text Available Arginine-rich tandem zinc-finger proteins (RR-TZF participate in a wide range of plant developmental processes and adaptive responses to abiotic stress, such as cold, salt and drought. This study investigates the conservation of the genes AtTZF1-5 at the level of their sequences and expression across plant species. The genomic sequences of the two RR-TZF genes TdTZF1-A and TdTZF1-B were isolated in durum wheat and assigned to chromosomes 3A and 3B, respectively. Sequence comparisons revealed that they encode proteins that are highly homologous to AtTZF1, AtTZF2 and AtTZF3. The expression profiles of these RR-TZF durum wheat and Arabidopsis proteins support a common function in the regulation of seed germination and responses to abiotic stress. In particular, analysis of plants with attenuated and overexpressed AtTZF3 indicate that AtTZF3 is a negative regulator of seed germination under conditions of salt stress. Finally, comparative sequence analyses establish that the RR-TZF genes are encoded by lower plants, including the bryophyte Physcomitrella patens and the alga Chlamydomonas reinhardtii. The regulation of the Physcomitrella AtTZF1-2-3-like genes by salt stress strongly suggests that a subgroup of the RR-TZF proteins has a function that has been conserved throughout evolution.
The HMMER Web Server for Protein Sequence Similarity Search.

Science.gov (United States)

Prakash, Ananth; Jeffryes, Matt; Bateman, Alex; Finn, Robert D

2017-12-08

Protein sequence similarity search is one of the most commonly used bioinformatics methods for identifying evolutionarily related proteins. In general, sequences that are evolutionarily related share some degree of similarity, and sequence-search algorithms use this principle to identify homologs. The requirement for a fast and sensitive sequence search method led to the development of the HMMER software, which in the latest version (v3.1) uses a combination of sophisticated acceleration heuristics and mathematical and computational optimizations to enable the use of profile hidden Markov models (HMMs) for sequence analysis. The HMMER Web server provides a common platform by linking the HMMER algorithms to databases, thereby enabling the search for homologs, as well as providing sequence and functional annotation by linking external databases. This unit describes three basic protocols and two alternate protocols that explain how to use the HMMER Web server using various input formats and user defined parameters. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley & Sons, Inc.
A Simple Predictive Enhancer Syntax for Hindbrain Patterning Is Conserved in Vertebrate Genomes.

Directory of Open Access Journals (Sweden)

Joseph Grice

Full Text Available Determining the function of regulatory elements is fundamental for our understanding of development, disease and evolution. However, the sequence features that mediate these functions are often unclear and the prediction of tissue-specific expression patterns from sequence alone is non-trivial. Previous functional studies have demonstrated a link between PBX-HOX and MEIS/PREP binding interactions and hindbrain enhancer activity, but the defining grammar of these sites, if any exists, has remained elusive.Here, we identify a shared sequence signature (syntax within a heterogeneous set of conserved vertebrate hindbrain enhancers composed of spatially co-occurring PBX-HOX and MEIS/PREP transcription factor binding motifs. We use this syntax to accurately predict hindbrain enhancers in 89% of cases (67/75 predicted elements from a set of conserved non-coding elements (CNEs. Furthermore, mutagenesis of the sites abolishes activity or generates ectopic expression, demonstrating their requirement for segmentally restricted enhancer activity in the hindbrain. We refine and use our syntax to predict over 3,000 hindbrain enhancers across the human genome. These sequences tend to be located near developmental transcription factors and are enriched in known hindbrain activating elements, demonstrating the predictive power of this simple model.Our findings support the theory that hundreds of CNEs, and perhaps thousands of regions across the human genome, function to coordinate gene expression in the developing hindbrain. We speculate that deeply conserved sequences of this kind contributed to the co-option of new genes into the hindbrain gene regulatory network during early vertebrate evolution by linking patterns of hox expression to downstream genes involved in segmentation and patterning, and evolutionarily newer instances may have continued to contribute to lineage-specific elaboration of the hindbrain.
Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.

Science.gov (United States)

Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M

2010-12-15

Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.
Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi

Directory of Open Access Journals (Sweden)

Huynen Leon

2010-12-01

Full Text Available Abstract Background Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Results Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. Conclusions The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.
Metazoan Remaining Genes for Essential Amino Acid Biosynthesis: Sequence Conservation and Evolutionary Analyses

Directory of Open Access Journals (Sweden)

Igor R. Costa

2014-12-01

Full Text Available Essential amino acids (EAA consist of a group of nine amino acids that animals are unable to synthesize via de novo pathways. Recently, it has been found that most metazoans lack the same set of enzymes responsible for the de novo EAA biosynthesis. Here we investigate the sequence conservation and evolution of all the metazoan remaining genes for EAA pathways. Initially, the set of all 49 enzymes responsible for the EAA de novo biosynthesis in yeast was retrieved. These enzymes were used as BLAST queries to search for similar sequences in a database containing 10 complete metazoan genomes. Eight enzymes typically attributed to EAA pathways were found to be ubiquitous in metazoan genomes, suggesting a conserved functional role. In this study, we address the question of how these genes evolved after losing their pathway partners. To do this, we compared metazoan genes with their fungal and plant orthologs. Using phylogenetic analysis with maximum likelihood, we found that acetolactate synthase (ALS and betaine-homocysteine S-methyltransferase (BHMT diverged from the expected Tree of Life (ToL relationships. High sequence conservation in the paraphyletic group Plant-Fungi was identified for these two genes using a newly developed Python algorithm. Selective pressure analysis of ALS and BHMT protein sequences showed higher non-synonymous mutation ratios in comparisons between metazoans/fungi and metazoans/plants, supporting the hypothesis that these two genes have undergone non-ToL evolution in animals.

Genes with stable DNA methylation levels show higher evolutionary conservation than genes with fluctuant DNA methylation levels.

Science.gov (United States)

Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai

2015-11-24

Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.
Phyloscan: locating transcription-regulating binding sites in mixed aligned and unaligned sequence data.

Science.gov (United States)

Palumbo, Michael J; Newberg, Lee A

2010-07-01

The transcription of a gene from its DNA template into an mRNA molecule is the first, and most heavily regulated, step in gene expression. Especially in bacteria, regulation is typically achieved via the binding of a transcription factor (protein) or small RNA molecule to the chromosomal region upstream of a regulated gene. The protein or RNA molecule recognizes a short, approximately conserved sequence within a gene's promoter region and, by binding to it, either enhances or represses expression of the nearby gene. Since the sought-for motif (pattern) is short and accommodating to variation, computational approaches that scan for binding sites have trouble distinguishing functional sites from look-alikes. Many computational approaches are unable to find the majority of experimentally verified binding sites without also finding many false positives. Phyloscan overcomes this difficulty by exploiting two key features of functional binding sites: (i) these sites are typically more conserved evolutionarily than are non-functional DNA sequences; and (ii) these sites often occur two or more times in the promoter region of a regulated gene. The website is free and open to all users, and there is no login requirement. Address: (http://bayesweb.wadsworth.org/phyloscan/).
Stem loop sequences specific to transposable element IS605 are found linked to lipoprotein genes in Borrelia plasmids.

Directory of Open Access Journals (Sweden)

Nicholas Delihas

Full Text Available BACKGROUND: Plasmids of Borrelia species are dynamic structures that contain a large number of repetitive genes, gene fragments, and gene fusions. In addition, the transposable element IS605/200 family, as well as degenerate forms of this IS element, are prevalent. In Helicobacter pylori, flanking regions of the IS605 transposase gene contain sequences that fold into identical small stem loops. These function in transposition at the single-stranded DNA level. METHODOLOGY/PRINCIPAL FINDINGS: In work reported here, bioinformatics techniques were used to scan Borrelia plasmid genomes for IS605 transposable element specific stem loop sequences. Two variant stem loop motifs are found in the left and right flanking regions of the transposase gene. Both motifs appear to have dispersed in plasmid genomes and are found "free-standing" and phylogenetically conserved without the associated IS605 transposase gene or the adjacent flanking sequence. Importantly, IS605 specific stem loop sequences are also found at the 3' ends of lipoprotein genes (PFam12 and PFam60, however the left and right sequences appear to develop their own evolutionary patterns. The lipoprotein gene-linked left stem loop sequences maintain the IS605 stem loop motif in orthologs but only at the RNA level. These show mutations whereby variants fold into phylogenetically conserved RNA-type stem loops that contain the wobble non-Watson-Crick G-U base-pairing. The right flanking sequence is associated with the family lipoprotein-1 genes. A comparison of homologs shows that the IS605 stem loop motif rapidly dissipates, but a more elaborate secondary structure appears to develop in its place. CONCLUSIONS/SIGNIFICANCE: Stem loop sequences specific to the transposable element IS605 are present in plasmid regions devoid of a transposase gene and significantly, are found linked to lipoprotein genes in Borrelia plasmids. These sequences are evolutionarily conserved and/or structurally developed in
HMMerThread: detecting remote, functional conserved domains in entire genomes by combining relaxed sequence-database searches with fold recognition.

Directory of Open Access Journals (Sweden)

Charles Richard Bradshaw

Full Text Available Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10, a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in
Conserved antigenic sites between MERS-CoV and Bat-coronavirus are revealed through sequence analysis.

Science.gov (United States)

Sharmin, Refat; Islam, Abul B M M K

2016-01-01

MERS-CoV is a newly emerged human coronavirus reported closely related with HKU4 and HKU5 Bat coronaviruses. Bat and MERS corona-viruses are structurally related. Therefore, it is of interest to estimate the degree of conserved antigenic sites among them. It is of importance to elucidate the shared antigenic-sites and extent of conservation between them to understand the evolutionary dynamics of MERS-CoV. Multiple sequence alignment of the spike (S), membrane (M), enveloped (E) and nucleocapsid (N) proteins was employed to identify the sequence conservation among MERS and Bat (HKU4, HKU5) coronaviruses. We used various in silico tools to predict the conserved antigenic sites. We found that MERS-CoV shared 30 % of its S protein antigenic sites with HKU4 and 70 % with HKU5 bat-CoV. Whereas 100 % of its E, M and N protein's antigenic sites are found to be conserved with those in HKU4 and HKU5. This sharing suggests that in case of pathogenicity MERS-CoV is more closely related to HKU5 bat-CoV than HKU4 bat-CoV. The conserved epitopes indicates their evolutionary relationship and ancestry of pathogenicity.
Sequence recombination and conservation of Varroa destructor virus-1 and deformed wing virus in field collected honey bees (Apis mellifera.

Directory of Open Access Journals (Sweden)

Hui Wang

Full Text Available We sequenced small (s RNAs from field collected honeybees (Apis mellifera and bumblebees (Bombuspascuorum using the Illumina technology. The sRNA reads were assembled and resulting contigs were used to search for virus homologues in GenBank. Matches with Varroadestructor virus-1 (VDV1 and Deformed wing virus (DWV genomic sequences were obtained for A. mellifera but not B. pascuorum. Further analyses suggested that the prevalent virus population was composed of VDV-1 and a chimera of 5'-DWV-VDV1-DWV-3'. The recombination junctions in the chimera genomes were confirmed by using RT-PCR, cDNA cloning and Sanger sequencing. We then focused on conserved short fragments (CSF, size > 25 nt in the virus genomes by using GenBank sequences and the deep sequencing data obtained in this study. The majority of CSF sites confirmed conservation at both between-species (GenBank sequences and within-population (dataset of this study levels. However, conserved nucleotide positions in the GenBank sequences might be variable at the within-population level. High mutation rates (Pi>10% were observed at a number of sites using the deep sequencing data, suggesting that sequence conservation might not always be maintained at the population level. Virus-host interactions and strategies for developing RNAi treatments against VDV1/DWV infections are discussed.
Relationships between residue Voronoi volume and sequence conservation in proteins.

Science.gov (United States)

Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung

2018-02-01

Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume-van der Waals volume)/Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
Sequence and phylogenetic analysis of chicken anaemia virus obtained from backyard and commercial chickens in Nigeria.

Science.gov (United States)

Oluwayelu, D O; Todd, D; Olaleye, O D

2008-12-01

This work reports the first molecular analysis study of chicken anaemia virus (CAV) in backyard chickens in Africa using molecular cloning and sequence analysis to characterize CAV strains obtained from commercial chickens and Nigerian backyard chickens. Partial VP1 gene sequences were determined for three CAVs from commercial chickens and for six CAV variants present in samples from a backyard chicken. Multiple alignment analysis revealed that the 6% and 4% nucleotide diversity obtained respectively for the commercial and backyard chicken strains translated to only 2% amino acid diversity for each breed. Overall, the amino acid composition of Nigerian CAVs was found to be highly conserved. Since the partial VP1 gene sequence of two backyard chicken cloned CAV strains (NGR/CI-8 and NGR/CI-9) were almost identical and evolutionarily closely related to the commercial chicken strains NGR-1, and NGR-4 and NGR-5, respectively, we concluded that CAV infections had crossed the farm boundary.
A Global Trend towards the Loss of Evolutionarily Unique Species in Mangrove Ecosystems.

Directory of Open Access Journals (Sweden)

Barnabas H Daru

Full Text Available The mangrove biome stands out as a distinct forest type at the interface between terrestrial, estuarine, and near-shore marine ecosystems. However, mangrove species are increasingly threatened and experiencing range contraction across the globe that requires urgent conservation action. Here, we assess the spatial distribution of mangrove species richness and evolutionary diversity, and evaluate potential predictors of global declines and risk of extinction. We found that human pressure, measured as the number of different uses associated with mangroves, correlated strongly, but negatively, with extinction probability, whereas species ages were the best predictor of global decline, explaining 15% of variation in extinction risk. Although the majority of mangrove species are categorised by the IUCN as Least Concern, our finding that the more threatened species also tend to be those that are more evolutionarily unique is of concern because their extinction would result in a greater loss of phylogenetic diversity. Finally, we identified biogeographic regions that are relatively species-poor but rich in evolutionary history, and suggest these regions deserve greater conservation priority. Our study provides phylogenetic information that is important for developing a unified management plan for mangrove ecosystems worldwide.
A Global Trend towards the Loss of Evolutionarily Unique Species in Mangrove Ecosystems.

Science.gov (United States)

Daru, Barnabas H; Yessoufou, Kowiyou; Mankga, Ledile T; Davies, T Jonathan

2013-01-01

The mangrove biome stands out as a distinct forest type at the interface between terrestrial, estuarine, and near-shore marine ecosystems. However, mangrove species are increasingly threatened and experiencing range contraction across the globe that requires urgent conservation action. Here, we assess the spatial distribution of mangrove species richness and evolutionary diversity, and evaluate potential predictors of global declines and risk of extinction. We found that human pressure, measured as the number of different uses associated with mangroves, correlated strongly, but negatively, with extinction probability, whereas species ages were the best predictor of global decline, explaining 15% of variation in extinction risk. Although the majority of mangrove species are categorised by the IUCN as Least Concern, our finding that the more threatened species also tend to be those that are more evolutionarily unique is of concern because their extinction would result in a greater loss of phylogenetic diversity. Finally, we identified biogeographic regions that are relatively species-poor but rich in evolutionary history, and suggest these regions deserve greater conservation priority. Our study provides phylogenetic information that is important for developing a unified management plan for mangrove ecosystems worldwide.
Discovery and profiling of novel and conserved microRNAs during flower development in Carya cathayensis via deep sequencing.

Science.gov (United States)

Wang, Zheng Jia; Huang, Jian Qin; Huang, You Jun; Li, Zheng; Zheng, Bing Song

2012-08-01

Hickory (Carya cathayensis Sarg.) is an economically important woody plant in China, but its long juvenile phase delays yield. MicroRNAs (miRNAs) are critical regulators of genes and important for normal plant development and physiology, including flower development. We used Solexa technology to sequence two small RNA libraries from two floral differentiation stages in hickory to identify miRNAs related to flower development. We identified 39 conserved miRNA sequences from 114 loci belonging to 23 families as well as two novel and ten potential novel miRNAs belonging to nine families. Moreover, 35 conserved miRNA*s and two novel miRNA*s were detected. Twenty miRNA sequences from 49 loci belonging to 11 families were differentially expressed; all were up-regulated at the later stage of flower development in hickory. Quantitative real-time PCR of 12 conserved miRNA sequences, five novel miRNA families, and two novel miRNA*s validated that all were expressed during hickory flower development, and the expression patterns were similar to those detected with Solexa sequencing. Finally, a total of 146 targets of the novel and conserved miRNAs were predicted. This study identified a diverse set of miRNAs that were closely related to hickory flower development and that could help in plant floral induction.
B and T Cell Epitope-Based Peptides Predicted from Evolutionarily Conserved and Whole Protein Sequences of Ebola Virus as Vaccine Targets.

Science.gov (United States)

Yasmin, T; Nabi, A H M Nurun

2016-05-01

Ebola virus (EBV) has become a serious threat to public health. Different approaches were applied to predict continuous and discontinuous B cell epitopes as well as T cell epitopes from the sequence-based and available three-dimensional structural analyses of each protein of EBV. Peptides '(79) VPSATKRWGFRSGVPP(94) ' from GP1 and '(515) LHYWTTQDEGAAIGLA(530) ' from GP2 of Ebola were found to be the consensus peptidic sequences predicted as linear B cell epitope of which the latter contains a region (519) TTQDEG(524) that fulfilled all the criteria of accessibility, hydrophilicity, flexibility and beta turn region for becoming an ideal B cell epitope. Different nonamers as T cell epitopes were obtained that interacted with different numbers of MHC class I and class II alleles with a binding affinity of <100 nm. Interestingly, these alleles also bound to the MHC class I alleles mostly prevalent in African and South Asian regions. Of these, 'LANETTQAL' and 'FLYDRLAST' nonamers were predicted to be the most potent T cell epitopes and they, respectively, interacted with eight and twelve class I alleles that covered 63.79% and 54.16% of world population, respectively. These nonamers were found to be the core sequences of 15mer peptides that interacted with the most common class II allele, HLA-DRB1*01:01. They were further validated for their binding to specific class I alleles using docking technique. Thus, these predicted epitopes may be used as vaccine targets against EBV and can be validated in model hosts to verify their efficacy as vaccine. © 2016 The Foundation for the Scandinavian Journal of Immunology.
Sequence analysis and molecular characterization of Wnt4 gene in metacestodes of Taenia solium.

Science.gov (United States)

Hou, Junling; Luo, Xuenong; Wang, Shuai; Yin, Cai; Zhang, Shaohua; Zhu, Xueliang; Dou, Yongxi; Cai, Xuepeng

2014-04-01

Wnt proteins are a family of secreted glycoproteins that are evolutionarily conserved and considered to be involved in extensive developmental processes in metazoan organisms. The characterization of wnt genes may improve understanding the parasite's development. In the present study, a wnt4 gene encoding 491amino acids was amplified from cDNA of metacestodes of Taenia solium using reverse transcription PCR (RT-PCR). Bioinformatics tools were used for sequence analysis. The conserved domain of the wnt gene family was predicted. The expression profile of Wnt4 was investigated using real-time PCR. Wnt4 expression was found to be dramatically increased in scolex evaginated cysticerci when compared to invaginated cysticerci. In situ hybridization showed that wnt4 gene was distributed in the posterior end of the worm along the primary body axis in evaginated cysticerci. These findings indicated that wnt4 may take part in the process of cysticerci evagination and play a role in scolex/bladder development of cysticerci of T. solium.
Evolutionarily Conserved, Growth Plate Zone-Specific Regulation of the Matrilin-1 Promoter: L-Sox5/Sox6 and Nfi Factors Bound near TATA Finely Tune Activation by Sox9 ▿

Science.gov (United States)

Nagy, Andrea; Kénesi, Erzsébet; Rentsendorj, Otgonchimeg; Molnár, Annamária; Szénási, Tibor; Sinkó, Ildikó; Zvara, Ágnes; Thottathil Oommen, Sajit; Barta, Endre; Puskás, László G.; Lefebvre, Veronique; Kiss, Ibolya

2011-01-01

To help uncover the mechanisms underlying the staggered expression of cartilage-specific genes in the growth plate, we dissected the transcriptional mechanisms driving expression of the matrilin-1 gene (Matn1). We show that a unique assembly of evolutionarily conserved cis-acting elements in the Matn1 proximal promoter restricts expression to the proliferative and prehypertrophic zones of the growth plate. These elements functionally interact with distal elements and likewise are capable of restricting the domain of activity of a pancartilaginous Col2a1 enhancer. The proximal elements include a Pe1 element binding the chondrogenic L-Sox5, Sox6, and Sox9 proteins, a SI element binding Nfi proteins, and an initiator Ine element binding the Sox trio and other factors. Sox9 binding to Pe1 is indispensable for functional interaction with the distal promoter. Binding of L-Sox5/Sox6 to Ine and Nfib to SI modulates Sox9 transactivation in a protein dose-dependent manner, possibly to enhance Sox9 activity in early stages of chondrogenesis and repress it at later stages. Hence, our data suggest a novel model whereby Sox and Nfi proteins bind to conserved Matn1 proximal elements and functionally interact with each other to finely tune gene expression in specific zones of the cartilage growth plate. PMID:21173167
Computational evidence for hundreds of non-conserved plant microRNAs

DEFF Research Database (Denmark)

Lindow, Morten; Krogh, Anders Stærmose

2005-01-01

Background MicroRNAs (miRNA) are small (20-25 nt) non-coding RNA molecules that regulate gene expression through interaction with mRNA in plants and metazoans. A few hundred miRNAs are known or predicted, and most of those are evolutionarily conserved. In general plant miRNA are different from...
A ChIP-Seq benchmark shows that sequence conservation mainly improves detection of strong transcription factor binding sites.

Directory of Open Access Journals (Sweden)

Tony Håndstad

Full Text Available BACKGROUND: Transcription factors are important controllers of gene expression and mapping transcription factor binding sites (TFBS is key to inferring transcription factor regulatory networks. Several methods for predicting TFBS exist, but there are no standard genome-wide datasets on which to assess the performance of these prediction methods. Also, it is believed that information about sequence conservation across different genomes can generally improve accuracy of motif-based predictors, but it is not clear under what circumstances use of conservation is most beneficial. RESULTS: Here we use published ChIP-seq data and an improved peak detection method to create comprehensive benchmark datasets for prediction methods which use known descriptors or binding motifs to detect TFBS in genomic sequences. We use this benchmark to assess the performance of five different prediction methods and find that the methods that use information about sequence conservation generally perform better than simpler motif-scanning methods. The difference is greater on high-affinity peaks and when using short and information-poor motifs. However, if the motifs are specific and information-rich, we find that simple motif-scanning methods can perform better than conservation-based methods. CONCLUSIONS: Our benchmark provides a comprehensive test that can be used to rank the relative performance of transcription factor binding site prediction methods. Moreover, our results show that, contrary to previous reports, sequence conservation is better suited for predicting strong than weak transcription factor binding sites.
Sequence and phylogenetic analysis of chicken anaemia virus obtained from backyard and commercial chickens in Nigeria : research communication

Directory of Open Access Journals (Sweden)

D.O. Oluwayelu

2008-09-01

Full Text Available This work reports the first molecular analysis study of chicken anaemia virus (CAV in backyard chickens in Africa using molecular cloning and sequence analysis to characterize CAV strains obtained from commercial chickens and Nigerian backyard chickens. Partial VP1 gene sequences were determined for three CAVs from commercial chickens and for six CAV variants present in samples from a backyard chicken. Multiple alignment analysis revealed that the 6 % and 4 % nucleotide diversity obtained respectively for the commercial and backyard chicken strains translated to only 2 % amino acid diversity for each breed. Overall, the amino acid composition of Nigerian CAVs was found to be highly conserved. Since the partial VP1 gene sequence of two backyard chicken cloned CAV strains (NGR/Cl-8 and NGR/Cl-9 were almost identical and evolutionarily closely related to the commercial chicken strains NGR-1, and NGR-4 and NGR-5, respectively, we concluded that CAV infections had crossed the farm boundary.
Sequence analysis of the L protein of the Ebola 2014 outbreak: Insight into conserved regions and mutations.

Science.gov (United States)

Ayub, Gohar; Waheed, Yasir

2016-06-01

The 2014 Ebola outbreak was one of the largest that have occurred; it started in Guinea and spread to Nigeria, Liberia and Sierra Leone. Phylogenetic analysis of the current virus species indicated that this outbreak is the result of a divergent lineage of the Zaire ebolavirus. The L protein of Ebola virus (EBOV) is the catalytic subunit of the RNA‑dependent RNA polymerase complex, which, with VP35, is key for the replication and transcription of viral RNA. Earlier sequence analysis demonstrated that the L protein of all non‑segmented negative‑sense (NNS) RNA viruses consists of six domains containing conserved functional motifs. The aim of the present study was to analyze the presence of these motifs in 2014 EBOV isolates, highlight their function and how they may contribute to the overall pathogenicity of the isolates. For this purpose, 81 2014 EBOV L protein sequences were aligned with 475 other NNS RNA viruses, including Paramyxoviridae and Rhabdoviridae viruses. Phylogenetic analysis of all EBOV outbreak L protein sequences was also performed. Analysis of the amino acid substitutions in the 2014 EBOV outbreak was conducted using sequence analysis. The alignment demonstrated the presence of previously conserved motifs in the 2014 EBOV isolates and novel residues. Notably, all the mutations identified in the 2014 EBOV isolates were tolerant, they were pathogenic with certain examples occurring within previously determined functional conserved motifs, possibly altering viral pathogenicity, replication and virulence. The phylogenetic analysis demonstrated that all sequences with the exception of the 2014 EBOV sequences were clustered together. The 2014 EBOV outbreak has acquired a great number of mutations, which may explain the reasons behind this unprecedented outbreak. Certain residues critical to the function of the polymerase remain conserved and may be targets for the development of antiviral therapeutic agents.
Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

Science.gov (United States)

Fauteux, François; Strömvik, Martina V

2009-01-01

Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs
Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

Directory of Open Access Journals (Sweden)

Fauteux François

2009-10-01

Full Text Available Abstract Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP gene promoters from three plant families, namely Brassicaceae (mustards, Fabaceae (legumes and Poaceae (grasses using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L. Heynh., soybean (Glycine max (L. Merr. and rice (Oryza sativa L. respectively. We have identified three conserved motifs (two RY-like and one ACGT-like in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination

Sequence and structural analysis of the chitinase insertion domain reveals two conserved motifs involved in chitin-binding.

Directory of Open Access Journals (Sweden)

Hai Li

2010-01-01

Full Text Available Chitinases are prevalent in life and are found in species including archaea, bacteria, fungi, plants, and animals. They break down chitin, which is the second most abundant carbohydrate in nature after cellulose. Hence, they are important for maintaining a balance between carbon and nitrogen trapped as insoluble chitin in biomass. Chitinases are classified into two families, 18 and 19 glycoside hydrolases. In addition to a catalytic domain, which is a triosephosphate isomerase barrel, many family 18 chitinases contain another module, i.e., chitinase insertion domain. While numerous studies focus on the biological role of the catalytic domain in chitinase activity, the function of the chitinase insertion domain is not completely understood. Bioinformatics offers an important avenue in which to facilitate understanding the role of residues within the chitinase insertion domain in chitinase function.Twenty-seven chitinase insertion domain sequences, which include four experimentally determined structures and span five kingdoms, were aligned and analyzed using a modified sequence entropy parameter. Thirty-two positions with conserved residues were identified. The role of these conserved residues was explored by conducting a structural analysis of a number of holo-enzymes. Hydrogen bonding and van der Waals calculations revealed a distinct subset of four conserved residues constituting two sequence motifs that interact with oligosaccharides. The other conserved residues may be key to the structure, folding, and stability of this domain.Sequence and structural studies of the chitinase insertion domains conducted within the framework of evolution identified four conserved residues which clearly interact with the substrates. Furthermore, evolutionary studies propose a link between the appearance of the chitinase insertion domain and the function of family 18 chitinases in the subfamily A.
Evolutionarily significant units of the critically endangered leaf frog Pithecopus ayeaye (Anura, Phyllomedusidae) are not effectively preserved by the Brazilian protected areas network.

Science.gov (United States)

de Magalhães, Rafael Félix; Lemes, Priscila; Camargo, Arley; Oliveira, Ubirajara; Brandão, Reuber Albuquerque; Thomassen, Hans; Garcia, Paulo Christiano de Anchietta; Leite, Felipe Sá Fortes; Santos, Fabrício Rodrigues

2017-11-01

Protected areas (PAs) are essential for biodiversity conservation, but their coverage is considered inefficient for the preservation of all species. Many species are subdivided into evolutionarily significant units (ESUs) and the effectiveness of PAs in protecting them needs to be investigated. We evaluated the usefulness of the Brazilian PAs network in protecting ESUs of the critically endangered Pithecopus ayeaye through ongoing climate change. This species occurs in a threatened mountaintop ecosystem known as campos rupestres . We used multilocus DNA sequences to delimit geographic clusters, which were further validated as ESUs with a coalescent approach. Ecological niche modeling was used to estimate spatial changes in ESUs' potential distributions, and a gap analysis was carried out to evaluate the effectiveness of the Brazilian PAs network to protect P. ayeaye in the face of climate changes. We tested the niche overlap between ESUs to gain insights for potential management alternatives for the species. Pithecopus ayeaye contains at least three ESUs isolated in distinct mountain regions, and one of them is not protected by any PA. There are no climatic niche differences between the units, and only 4% of the suitable potential area of the species is protected in present and future projections. The current PAs are not effective in preserving the intraspecific diversity of P. ayeaye in its present and future range distributions. The genetic structure of P. ayeaye could represent a typical pattern in campos rupestres endemics, which should be considered for evaluating its conservation status.
The evolutionarily conserved transcription factor PRDM12 controls sensory neuron development and pain perception.

Science.gov (United States)

Nagy, Vanja; Cole, Tiffany; Van Campenhout, Claude; Khoung, Thang M; Leung, Calvin; Vermeiren, Simon; Novatchkova, Maria; Wenzel, Daniel; Cikes, Domagoj; Polyansky, Anton A; Kozieradzki, Ivona; Meixner, Arabella; Bellefroid, Eric J; Neely, G Gregory; Penninger, Josef M

2015-01-01

PR homology domain-containing member 12 (PRDM12) belongs to a family of conserved transcription factors implicated in cell fate decisions. Here we show that PRDM12 is a key regulator of sensory neuronal specification in Xenopus. Modeling of human PRDM12 mutations that cause hereditary sensory and autonomic neuropathy (HSAN) revealed remarkable conservation of the mutated residues in evolution. Expression of wild-type human PRDM12 in Xenopus induced the expression of sensory neuronal markers, which was reduced using various human PRDM12 mutants. In Drosophila, we identified Hamlet as the functional PRDM12 homolog that controls nociceptive behavior in sensory neurons. Furthermore, expression analysis of human patient fibroblasts with PRDM12 mutations uncovered possible downstream target genes. Knockdown of several of these target genes including thyrotropin-releasing hormone degrading enzyme (TRHDE) in Drosophila sensory neurons resulted in altered cellular morphology and impaired nociception. These data show that PRDM12 and its functional fly homolog Hamlet are evolutionary conserved master regulators of sensory neuronal specification and play a critical role in pain perception. Our data also uncover novel pathways in multiple species that regulate evolutionary conserved nociception.
Conservation, diversification and expansion of C2H2 zinc finger proteins in the Arabidopsis thaliana genome

Directory of Open Access Journals (Sweden)

Böhm Siegfried

2004-07-01

Full Text Available Background The classical C2H2 zinc finger domain is involved in a wide range of functions and can bind to DNA, RNA and proteins. The comparison of zinc finger proteins in several eukaryotes has shown that there is a lot of lineage specific diversification and expansion. Although the number of characterized plant proteins that carry the classical C2H2 zinc finger motifs is growing, a systematic classification and analysis of a plant genome zinc finger gene set is lacking. Results We found through in silico analysis 176 zinc finger proteins in Arabidopsis thaliana that hence constitute the most abundant family of putative transcriptional regulators in this plant. Only a minority of 33 A. thaliana zinc finger proteins are conserved in other eukaryotes. In contrast, the majority of these proteins (81% are plant specific. They are derived from extensive duplication events and form expanded families. We assigned the proteins to different subgroups and families and focused specifically on the two largest and evolutionarily youngest families (A1 and C1 that are suggested to be primarily involved in transcriptional regulation. The newly defined family A1 (24 members comprises proteins with tandemly arranged zinc finger domains. Family C1 (64 members, earlier described as the EPF-family in Petunia, comprises proteins with one isolated or two to five dispersed fingers and a mostly invariant QALGGH motif in the zinc finger helices. Based on the amino acid pattern in these helices we could describe five different signature sequences prevalent in C1 zinc finger domains. We also found a number of non-finger domains that are conserved in these families. Conclusions Our analysis of the few evolutionarily conserved zinc finger proteins of A. thaliana suggests that most of them could be involved in ancient biological processes like RNA metabolism and chromatin-remodeling. In contrast, the majority of the unique A. thaliana zinc finger proteins are known or
A unique genomic sequence in the Wolf-Hirschhorn syndrome [WHS] region of humans is conserved in the great apes.

Science.gov (United States)

Tarzami, S T; Kringstein, A M; Conte, R A; Verma, R S

1996-10-01

The Wolf-Hirschhorn syndrome (WHS) is caused by a partial deletion in the short arm of chromosome 4 band 16.3 (4p 16.3). A unique-sequence human DNA probe (39 kb) localized within this region has been used to search for sequence homology in the apes' equivalent chromosome 3 by FISH-technique. The WHS loci are conserved in higher primates at the expected position. Nevertheless, a control probe, which detects alphoid sequences of the pericentromeric region of humans, is diverged in chimpanzee, gorilla, and orangutan. The conservation of WHS loci and divergence of DNA alphoid sequences have further added to the controversy concerning human descent.
Sample sequencing of vascular plants demonstrates widespread conservation and divergence of microRNAs.

Science.gov (United States)

Chávez Montes, Ricardo A; de Fátima Rosas-Cárdenas, Flor; De Paoli, Emanuele; Accerbi, Monica; Rymarquis, Linda A; Mahalingam, Gayathri; Marsch-Martínez, Nayelli; Meyers, Blake C; Green, Pamela J; de Folter, Stefan

2014-04-23

Small RNAs are pivotal regulators of gene expression that guide transcriptional and post-transcriptional silencing mechanisms in eukaryotes, including plants. Here we report a comprehensive atlas of sRNA and miRNA from 3 species of algae and 31 representative species across vascular plants, including non-model plants. We sequence and quantify sRNAs from 99 different tissues or treatments across species, resulting in a data set of over 132 million distinct sequences. Using miRBase mature sequences as a reference, we identify the miRNA sequences present in these libraries. We apply diverse profiling methods to examine critical sRNA and miRNA features, such as size distribution, tissue-specific regulation and sequence conservation between species, as well as to predict putative new miRNA sequences. We also develop database resources, computational analysis tools and a dedicated website, http://smallrna.udel.edu/. This study provides new insights on plant sRNAs and miRNAs, and a foundation for future studies.
[Three regions of Rpb10 mini-subunit of nuclear RNA polymerases are strictly conserved in all eukaryotes].

Science.gov (United States)

Shpakovskiĭ, G V; Lebedenko, E N

1996-12-01

The rpb10+ cDNA from the fission yeast Schizosaccharomyces pombe was cloned using two independent approaches (PCR and genetic suppression). The cloned cDNA encoded the Rpb10 subunit common for all three RNA polymerases. Comparison of the deduced amino acid sequence of the Sz. pombe Rbp10 subunit (71 amino acid residues) with those of the homologous subunits of RNA polymerases I, II, and III from Saccharomyces cerevisiae and Home sapiens revealed that heptapeptides RCFT/SCGK (residues 6-12), RYCCRRM (residues 43-49), and HVDLIEK (residues 53-59) were evolutionarily the most conserved structural motifs of these subunits. It is shown that the Rbp10 subunit from Sz. pombe can substitute its homolog (ABC10 beta) in the baker's yeast S. cerevisiae.
A belief-based evolutionarily stable strategy

OpenAIRE

Deng, Xinyang; Wang, Zhen; Liu, Qi; Deng, Yong; Mahadevan, Sankaran

2014-01-01

As an equilibrium refinement of the Nash equilibrium, evolutionarily stable strategy (ESS) is a key concept in evolutionary game theory and has attracted growing interest. An ESS can be either a pure strategy or a mixed strategy. Even though the randomness is allowed in mixed strategy, the selection probability of pure strategy in a mixed strategy may fluctuate due to the impact of many factors. The fluctuation can lead to more uncertainty. In this paper, such uncertainty involved in mixed st...
Hydra meiosis reveals unexpected conservation of structural synaptonemal complex proteins across metazoans.

Science.gov (United States)

Fraune, Johanna; Alsheimer, Manfred; Volff, Jean-Nicolas; Busch, Karoline; Fraune, Sebastian; Bosch, Thomas C G; Benavente, Ricardo

2012-10-09

The synaptonemal complex (SC) is a key structure of meiosis, mediating the stable pairing (synapsis) of homologous chromosomes during prophase I. Its remarkable tripartite structure is evolutionarily well conserved and can be found in almost all sexually reproducing organisms. However, comparison of the different SC protein components in the common meiosis model organisms Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, and Mus musculus revealed no sequence homology. This discrepancy challenged the hypothesis that the SC arose only once in evolution. To pursue this matter we focused on the evolution of SYCP1 and SYCP3, the two major structural SC proteins of mammals. Remarkably, our comparative bioinformatic and expression studies revealed that SYCP1 and SYCP3 are also components of the SC in the basal metazoan Hydra. In contrast to previous assumptions, we therefore conclude that SYCP1 and SYCP3 form monophyletic groups of orthologous proteins across metazoans.
Towards rationally redesigning bacterial signaling systems using information encoded in abundant sequence data

Science.gov (United States)

Cheng, Ryan; Morcos, Faruck; Levine, Herbert; Onuchic, Jose

2014-03-01

An important challenge in biology is to distinguish the subset of residues that allow bacterial two-component signaling (TCS) proteins to preferentially interact with their correct TCS partner such that they can bind and transfer signal. Detailed knowledge of this information would allow one to search sequence-space for mutations that can systematically tune the signal transmission between TCS partners as well as re-encode a TCS protein to preferentially transfer signals to a non-partner. Motivated by the notion that this detailed information is found in sequence data, we explore the mutual sequence co-evolution between signaling partners to infer how mutations can positively or negatively alter their interaction. Using Direct Coupling Analysis (DCA) for determining evolutionarily conserved interprotein interactions, we apply a DCA-based metric to quantify mutational changes in the interaction between TCS proteins and demonstrate that it accurately correlates with experimental mutagenesis studies probing the mutational change in the in vitro phosphotransfer. Our methodology serves as a potential framework for the rational design of TCS systems as well as a framework for the system-level study of protein-protein interactions in sequence-rich systems. This research has been supported by the NSF INSPIRE award MCB-1241332 and by the CTBP sponsored by the NSF (Grant PHY-1308264).
An evolutionarily conserved gene family encodes proton-selective ion channels.

Science.gov (United States)

Tu, Yu-Hsiang; Cooper, Alexander J; Teng, Bochuan; Chang, Rui B; Artiga, Daniel J; Turner, Heather N; Mulhall, Eric M; Ye, Wenlei; Smith, Andrew D; Liman, Emily R

2018-03-02

Ion channels form the basis for cellular electrical signaling. Despite the scores of genetically identified ion channels selective for other monatomic ions, only one type of proton-selective ion channel has been found in eukaryotic cells. By comparative transcriptome analysis of mouse taste receptor cells, we identified Otopetrin1 (OTOP1), a protein required for development of gravity-sensing otoconia in the vestibular system, as forming a proton-selective ion channel. We found that murine OTOP1 is enriched in acid-detecting taste receptor cells and is required for their zinc-sensitive proton conductance. Two related murine genes, Otop2 and Otop3 , and a Drosophila ortholog also encode proton channels. Evolutionary conservation of the gene family and its widespread tissue distribution suggest a broad role for proton channels in physiology and pathophysiology. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Selaginella moellendoffii telomeres: conserved and unique features in an ancient land plant lineage

Directory of Open Access Journals (Sweden)

Eugene V Shakirov

2012-07-01

Full Text Available Telomeres, the essential terminal regions of linear eukaryotic chromosomes, consist of G-rich DNA repeats bound by a plethora of associated proteins. While the general pathways of telomere maintenance are evolutionarily conserved, individual telomere complex components show remarkable variation between eukaryotic lineages and even within closely related species. The recent genome sequencing of the lycophyte Selaginella moellendoffii and the availability of an ever-increasing number of flowering plant genomes provides a unique opportunity to evaluate the molecular and functional evolution of telomere components from the early evolving non-seed plants to the more developmentally advanced angiosperms. Here we analyzed telomere sequence in S. moellendorffii and found it to consist of TTTAGGG repeats, typical of most plants. Telomere tracts in S. moellendorffii range from 1-5.5 kb, closely resembling Arabidopsis thaliana. We identified several S. moellendorffii genes encoding sequence homologues of proteins involved in telomere maintenance in other organisms, including CST complex components and the telomere-binding proteins POT1 and TRFL. Notable sequence similarities and differences were uncovered among the telomere-related genes in some of the plant lineages. Taken together, the data indicate that comparative analysis of the telomere complex in early diverging land plants such as S. moellendorffii and green algae will yield important insights into the evolution of telomeres and their protein constituents.
Evolutionarily stable learning schedules and cumulative culture in discrete generation models.

Science.gov (United States)

Aoki, Kenichi; Wakano, Joe Yuichiro; Lehmann, Laurent

2012-06-01

Individual learning (e.g., trial-and-error) and social learning (e.g., imitation) are alternative ways of acquiring and expressing the appropriate phenotype in an environment. The optimal choice between using individual learning and/or social learning may be dictated by the life-stage or age of an organism. Of special interest is a learning schedule in which social learning precedes individual learning, because such a schedule is apparently a necessary condition for cumulative culture. Assuming two obligatory learning stages per discrete generation, we obtain the evolutionarily stable learning schedules for the three situations where the environment is constant, fluctuates between generations, or fluctuates within generations. During each learning stage, we assume that an organism may target the optimal phenotype in the current environment by individual learning, and/or the mature phenotype of the previous generation by oblique social learning. In the absence of exogenous costs to learning, the evolutionarily stable learning schedules are predicted to be either pure social learning followed by pure individual learning ("bang-bang" control) or pure individual learning at both stages ("flat" control). Moreover, we find for each situation that the evolutionarily stable learning schedule is also the one that optimizes the learned phenotype at equilibrium. Copyright © 2012 Elsevier Inc. All rights reserved.
High-throughput sequencing, characterization and detection of new and conserved cucumber miRNAs.

Directory of Open Access Journals (Sweden)

Germán Martínez

Full Text Available Micro RNAS (miRNAs are a class of endogenous small non coding RNAs involved in the post-transcriptional regulation of gene expression. In plants, a great number of conserved and specific miRNAs, mainly arising from model species, have been identified to date. However less is known about the diversity of these regulatory RNAs in vegetal species with agricultural and/or horticultural importance. Here we report a combined approach of bioinformatics prediction, high-throughput sequencing data and molecular methods to analyze miRNAs populations in cucumber (Cucumis sativus plants. A set of 19 conserved and 6 known but non-conserved miRNA families were found in our cucumber small RNA dataset. We also identified 7 (3 with their miRNA* strand not previously described miRNAs, candidates to be cucumber-specific. To validate their description these new C. sativus miRNAs were detected by northern blot hybridization. Additionally, potential targets for most conserved and new miRNAs were identified in cucumber genome.In summary, in this study we have identified, by first time, conserved, known non-conserved and new miRNAs arising from an agronomically important species such as C. sativus. The detection of this complex population of regulatory small RNAs suggests that similarly to that observe in other plant species, cucumber miRNAs may possibly play an important role in diverse biological and metabolic processes.
Structure-Based Sequence Alignment of the Transmembrane Domains of All Human GPCRs: Phylogenetic, Structural and Functional Implications

Science.gov (United States)

Cvicek, Vaclav; Goddard, William A.; Abrol, Ravinder

2016-01-01

The understanding of G-protein coupled receptors (GPCRs) is undergoing a revolution due to increased information about their signaling and the experimental determination of structures for more than 25 receptors. The availability of at least one receptor structure for each of the GPCR classes, well separated in sequence space, enables an integrated superfamily-wide analysis to identify signatures involving the role of conserved residues, conserved contacts, and downstream signaling in the context of receptor structures. In this study, we align the transmembrane (TM) domains of all experimental GPCR structures to maximize the conserved inter-helical contacts. The resulting superfamily-wide GpcR Sequence-Structure (GRoSS) alignment of the TM domains for all human GPCR sequences is sufficient to generate a phylogenetic tree that correctly distinguishes all different GPCR classes, suggesting that the class-level differences in the GPCR superfamily are encoded at least partly in the TM domains. The inter-helical contacts conserved across all GPCR classes describe the evolutionarily conserved GPCR structural fold. The corresponding structural alignment of the inactive and active conformations, available for a few GPCRs, identifies activation hot-spot residues in the TM domains that get rewired upon activation. Many GPCR mutations, known to alter receptor signaling and cause disease, are located at these conserved contact and activation hot-spot residue positions. The GRoSS alignment places the chemosensory receptor subfamilies for bitter taste (TAS2R) and pheromones (Vomeronasal, VN1R) in the rhodopsin family, known to contain the chemosensory olfactory receptor subfamily. The GRoSS alignment also enables the quantification of the structural variability in the TM regions of experimental structures, useful for homology modeling and structure prediction of receptors. Furthermore, this alignment identifies structurally and functionally important residues in all human GPCRs
H2B ubiquitination: Conserved molecular mechanism, diverse physiologic functions of the E3 ligase during meiosis.

Science.gov (United States)

Wang, Liying; Cao, Chunwei; Wang, Fang; Zhao, Jianguo; Li, Wei

2017-09-03

RNF20/Bre1 mediated H2B ubiquitination (H2Bub) has various physiologic functions. Recently, we found that H2Bub participates in meiotic recombination by promoting chromatin relaxation during meiosis. We then analyzed the phylogenetic relationships among the E3 ligase for H2Bub, its E2 Rad6 and their partner WW domain-containing adaptor with a coiled-coil (WAC) or Lge1, and found that the molecular mechanism underlying H2Bub is evolutionarily conserved from yeast to mammals. However, RNF20 has diverse physiologic functions in different organisms, which might be caused by the evolutionary divergency of their domain/motif architectures. In the current extra view, we not only elucidate the evolutionarily conserved molecular mechanism underlying H2Bub, but also discuss the diverse physiologic functions of RNF20 during meiosis.
Synaptotagmin gene content of the sequenced genomes

Directory of Open Access Journals (Sweden)

Craxton Molly

2004-07-01

Full Text Available Abstract Background Synaptotagmins exist as a large gene family in mammals. There is much interest in the function of certain family members which act crucially in the regulated synaptic vesicle exocytosis required for efficient neurotransmission. Knowledge of the functions of other family members is relatively poor and the presence of Synaptotagmin genes in plants indicates a role for the family as a whole which is wider than neurotransmission. Identification of the Synaptotagmin genes within completely sequenced genomes can provide the entire Synaptotagmin gene complement of each sequenced organism. Defining the detailed structures of all the Synaptotagmin genes and their encoded products can provide a useful resource for functional studies and a deeper understanding of the evolution of the gene family. The current rapid increase in the number of sequenced genomes from different branches of the tree of life, together with the public deposition of evolutionarily diverse transcript sequences make such studies worthwhile. Results I have compiled a detailed list of the Synaptotagmin genes of Caenorhabditis, Anopheles, Drosophila, Ciona, Danio, Fugu, Mus, Homo, Arabidopsis and Oryza by examining genomic and transcript sequences from public sequence databases together with some transcript sequences obtained by cDNA library screening and RT-PCR. I have compared all of the genes and investigated the relationship between plant Synaptotagmins and their non-Synaptotagmin counterparts. Conclusions I have identified and compared 98 Synaptotagmin genes from 10 sequenced genomes. Detailed comparison of transcript sequences reveals abundant and complex variation in Synaptotagmin gene expression and indicates the presence of Synaptotagmin genes in all animals and land plants. Amino acid sequence comparisons indicate patterns of conservation and diversity in function. Phylogenetic analysis shows the origin of Synaptotagmins in multicellular eukaryotes and their
Genome-wide discovery and differential regulation of conserved and novel microRNAs in chickpea via deep sequencing.

Science.gov (United States)

Jain, Mukesh; Chevala, V V S Narayana; Garg, Rohini

2014-11-01

MicroRNAs (miRNAs) are essential components of complex gene regulatory networks that orchestrate plant development. Although several genomic resources have been developed for the legume crop chickpea, miRNAs have not been discovered until now. For genome-wide discovery of miRNAs in chickpea (Cicer arietinum), we sequenced the small RNA content from seven major tissues/organs employing Illumina technology. About 154 million reads were generated, which represented more than 20 million distinct small RNA sequences. We identified a total of 440 conserved miRNAs in chickpea based on sequence similarity with known miRNAs in other plants. In addition, 178 novel miRNAs were identified using a miRDeep pipeline with plant-specific scoring. Some of the conserved and novel miRNAs with significant sequence similarity were grouped into families. The chickpea miRNAs targeted a wide range of mRNAs involved in diverse cellular processes, including transcriptional regulation (transcription factors), protein modification and turnover, signal transduction, and metabolism. Our analysis revealed several miRNAs with differential spatial expression. Many of the chickpea miRNAs were expressed in a tissue-specific manner. The conserved and differential expression of members of the same miRNA family in different tissues was also observed. Some of the same family members were predicted to target different chickpea mRNAs, which suggested the specificity and complexity of miRNA-mediated developmental regulation. This study, for the first time, reveals a comprehensive set of conserved and novel miRNAs along with their expression patterns and putative targets in chickpea, and provides a framework for understanding regulation of developmental processes in legumes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology.
JDet: interactive calculation and visualization of function-related conservation patterns in multiple sequence alignments and structures.

Science.gov (United States)

Muth, Thilo; García-Martín, Juan A; Rausell, Antonio; Juan, David; Valencia, Alfonso; Pazos, Florencio

2012-02-15

We have implemented in a single package all the features required for extracting, visualizing and manipulating fully conserved positions as well as those with a family-dependent conservation pattern in multiple sequence alignments. The program allows, among other things, to run different methods for extracting these positions, combine the results and visualize them in protein 3D structures and sequence spaces. JDet is a multiplatform application written in Java. It is freely available, including the source code, at http://csbg.cnb.csic.es/JDet. The package includes two of our recently developed programs for detecting functional positions in protein alignments (Xdet and S3Det), and support for other methods can be added as plug-ins. A help file and a guided tutorial for JDet are also available.
How conserved are the conserved 16S-rRNA regions?

Directory of Open Access Journals (Sweden)

Marcel Martinez-Porchas

2017-02-01

Full Text Available The 16S rRNA gene has been used as master key for studying prokaryotic diversity in almost every environment. Despite the claim of several researchers to have the best universal primers, the reality is that no primer has been demonstrated to be truly universal. This suggests that conserved regions of the gene may not be as conserved as expected. The aim of this study was to evaluate the conservation degree of the so-called conserved regions flanking the hypervariable regions of the 16S rRNA gene. Data contained in SILVA database (release 123 were used for the study. Primers reported as matches of each conserved region were assembled to form contigs; sequences sizing 12 nucleotides (12-mers were extracted from these contigs and searched into the entire set of SILVA sequences. Frequency analysis shown that extreme regions, 1 and 10, registered the lowest frequencies. 12-mer frequencies revealed segments of contigs that were not as conserved as expected (≤90%. Fragments corresponding to the primer contigs 3, 4, 5b and 6a were recovered from all sequences in SILVA database. Nucleotide frequency analysis in each consensus demonstrated that only a small fraction of these so-called conserved regions is truly conserved in non-redundant sequences. It could be concluded that conserved regions of the 16S rRNA gene exhibit considerable variation that has to be considered when using this gene as biomarker.

Phylogenetically-informed priorities for amphibian conservation.

Science.gov (United States)

Isaac, Nick J B; Redding, David W; Meredith, Helen M; Safi, Kamran

2012-01-01

The amphibian decline and extinction crisis demands urgent action to prevent further large numbers of species extinctions. Lists of priority species for conservation, based on a combination of species' threat status and unique contribution to phylogenetic diversity, are one tool for the direction and catalyzation of conservation action. We describe the construction of a near-complete species-level phylogeny of 5713 amphibian species, which we use to create a list of evolutionarily distinct and globally endangered species (EDGE list) for the entire class Amphibia. We present sensitivity analyses to test the robustness of our priority list to uncertainty in species' phylogenetic position and threat status. We find that both sources of uncertainty have only minor impacts on our 'top 100' list of priority species, indicating the robustness of the approach. By contrast, our analyses suggest that a large number of Data Deficient species are likely to be high priorities for conservation action from the perspective of their contribution to the evolutionary history.
Phylogenetically-informed priorities for amphibian conservation.

Directory of Open Access Journals (Sweden)

Nick J B Isaac

Full Text Available The amphibian decline and extinction crisis demands urgent action to prevent further large numbers of species extinctions. Lists of priority species for conservation, based on a combination of species' threat status and unique contribution to phylogenetic diversity, are one tool for the direction and catalyzation of conservation action. We describe the construction of a near-complete species-level phylogeny of 5713 amphibian species, which we use to create a list of evolutionarily distinct and globally endangered species (EDGE list for the entire class Amphibia. We present sensitivity analyses to test the robustness of our priority list to uncertainty in species' phylogenetic position and threat status. We find that both sources of uncertainty have only minor impacts on our 'top 100' list of priority species, indicating the robustness of the approach. By contrast, our analyses suggest that a large number of Data Deficient species are likely to be high priorities for conservation action from the perspective of their contribution to the evolutionary history.
T-cell recognition is shaped by epitope sequence conservation in the host proteome and microbiome

DEFF Research Database (Denmark)

Bresciani, Anne Gøther; Paul, Sinu; Schommer, Nina

2016-01-01

or allergen with the conservation of its sequence in the human proteome or the healthy human microbiome. Indeed, performing such comparisons on large sets of validated T-cell epitopes, we found that epitopes that are similar with self-antigens above a certain threshold showed lower immunogenicity, presumably...... as a result of negative selection of T cells capable of recognizing such peptides. Moreover, we also found a reduced level of immune recognition for epitopes conserved in the commensal microbiome, presumably as a result of peripheral tolerance. These findings indicate that the existence (and potentially...
A conserved Oct4/POUV-dependent network links adhesion and migration to progenitor maintenance

DEFF Research Database (Denmark)

Livigni, Alessandra; Peradziryi, Hanna; Sharov, Alexei A

2013-01-01

BACKGROUND: The class V POU domain transcription factor Oct4 (Pou5f1) is a pivotal regulator of embryonic stem cell (ESC) self-renewal and reprogramming of somatic cells to induced pluripotent stem (iPS) cells. Oct4 is also an important evolutionarily conserved regulator of progenitor cell differ...
Sequence, structure and function relationships in flaviviruses as assessed by evolutive aspects of its conserved non-structural protein domains.

Science.gov (United States)

da Fonseca, Néli José; Lima Afonso, Marcelo Querino; Pedersolli, Natan Gonçalves; de Oliveira, Lucas Carrijo; Andrade, Dhiego Souto; Bleicher, Lucas

2017-10-28

Flaviviruses are responsible for serious diseases such as dengue, yellow fever, and zika fever. Their genomes encode a polyprotein which, after cleavage, results in three structural and seven non-structural proteins. Homologous proteins can be studied by conservation and coevolution analysis as detected in multiple sequence alignments, usually reporting positions which are strictly necessary for the structure and/or function of all members in a protein family or which are involved in a specific sub-class feature requiring the coevolution of residue sets. This study provides a complete conservation and coevolution analysis on all flaviviruses non-structural proteins, with results mapped on all well-annotated available sequences. A literature review on the residues found in the analysis enabled us to compile available information on their roles and distribution among different flaviviruses. Also, we provide the mapping of conserved and coevolved residues for all sequences currently in SwissProt as a supplementary material, so that particularities in different viruses can be easily analyzed. Copyright © 2017 Elsevier Inc. All rights reserved.
Relaxed selection against accidental binding of transcription factors with conserved chromatin contexts.

Science.gov (United States)

Babbitt, G A

2010-10-15

The spurious (or nonfunctional) binding of transcription factors (TF) to the wrong locations on DNA presents a formidable challenge to genomes given the relatively low ceiling for sequence complexity within the short lengths of most binding motifs. The high potential for the occurrence of random motifs and subsequent nonfunctional binding of many transcription factors should theoretically lead to natural selection against the occurrence of spurious motif throughout the genome. However, because of the active role that chromatin can influence over eukaryotic gene regulation, it may also be expected that many supposed spurious binding sites could escape purifying selection if (A) they simply occur in regions of high nucleosome occupancy or (B) their surrounding chromatin was dynamically involved in their identity and function. We compared nucleosome occupancy and the presence/absence of functionally conserved chromatin context to the strength of selection against spurious binding of various TF binding motifs in Saccharomyces yeast. While we find no direct relationship with nucleosome occupancy, we find strong evidence that transcription factors spatially associated with evolutionarily conserved chromatin states are under relaxed selection against accidental binding. Transcription factors (with/without) a conserved chromatin context were found to occur on average, (87.7%/49.3%) of their expected frequencies. Functional binding motifs with conserved chromatin contexts were also significantly shorter in length and more often clustered. These results indicate a role of chromatin context dependency in relaxing selection against spurious binding in nearly half of all TF binding motifs throughout the yeast genome. 2010 Elsevier B.V. All rights reserved.
Assessment of genetic diversity in the critically endangered Australian corroboree frogs, Pseudophryne corroboree and Pseudophryne pengilleyi, identifies four evolutionarily significant units for conservation.

Science.gov (United States)

Morgan, Matthew J; Hunter, David; Pietsch, Rod; Osborne, William; Keogh, J Scott

2008-08-01

The iconic and brightly coloured Australian northern corroboree frog, Pseudophryne pengilleyi, and the southern corroboree frog, Pseudophryne corroboree are critically endangered and may be extinct in the wild within 3 years. We have assembled samples that cover the current range of both species and applied hypervariable microsatellite markers and mitochondrial DNA sequences to assess the levels and patterns of genetic variation. The four loci used in the study were highly variable, the total number of alleles observed ranged from 13 to 30 and the average number of alleles per locus was 19. Expected heterozygosity of the four microsatellite loci across all populations was high and varied between 0.830 and 0.935. Bayesian clustering analyses in STRUCTURE strongly supported four genetically distinct populations, which correspond exactly to the four main allopatric geographical regions in which the frogs are currently found. Individual analyses performed on the separate regions showed that breeding sites within these four regions could not be separated into distinct populations. Twelve mtND2 haplotypes were identified from 66 individuals from throughout the four geographical regions. A statistical parsimony network of mtDNA haplotypes shows two distinct groups, which correspond to the two species of corroboree frog, but with most of the haplotype diversity distributed in P. pengilleyi. These results demonstrate an unexpectedly high level of genetic diversity in both species. Our data have important implications for how the genetic diversity is managed in the future. The four evolutionarily significant units must be protected and maintained in captive breeding programmes for as long as it is possible to do.
Asymmetrical distribution of non-conserved regulatory sequences at PHOX2B is reflected at the ENCODE loci and illuminates a possible genome-wide trend

Directory of Open Access Journals (Sweden)

McCallion Andrew S

2009-01-01

Full Text Available Abstract Background Transcriptional regulatory elements are central to development and interspecific phenotypic variation. Current regulatory element prediction tools rely heavily upon conservation for prediction of putative elements. Recent in vitro observations from the ENCODE project combined with in vivo analyses at the zebrafish phox2b locus suggests that a significant fraction of regulatory elements may fall below commonly applied metrics of conservation. We propose to explore these observations in vivo at the human PHOX2B locus, and also evaluate the potential evidence for genome-wide applicability of these observations through a novel analysis of extant data. Results Transposon-based transgenic analysis utilizing a tiling path proximal to human PHOX2B in zebrafish recapitulates the observations at the zebrafish phox2b locus of both conserved and non-conserved regulatory elements. Analysis of human sequences conserved with previously identified zebrafish phox2b regulatory elements demonstrates that the orthologous sequences exhibit overlapping regulatory control. Additionally, analysis of non-conserved sequences scattered over 135 kb 5' to PHOX2B, provides evidence of non-conserved regulatory elements positively biased with close proximity to the gene. Furthermore, we provide a novel analysis of data from the ENCODE project, finding a non-uniform distribution of regulatory elements consistent with our in vivo observations at PHOX2B. These observations remain largely unchanged when one accounts for the sequence repeat content of the assayed intervals, when the intervals are sub-classified by biological role (developmental versus non-developmental, or by gene density (gene desert versus non-gene desert. Conclusion While regulatory elements frequently display evidence of evolutionary conservation, a fraction appears to be undetected by current metrics of conservation. In vivo observations at the PHOX2B locus, supported by our analyses of in
Towards the Development of an Evolutionarily Valid Domain-Specific Risk-Taking Scale

Directory of Open Access Journals (Sweden)

Daniel J. Kruger

2007-07-01

Full Text Available From an evolutionary perspective, human risk-taking behaviors should be viewed in relation to evolutionarily recurrent survival and reproductive problems. In response to recent calls for domain-specific measures of risk-taking, we emphasize the need of evolutionarily valid domains. We report on two studies designed to validate a scale of risky behaviors in domains selected from research and theory in evolutionary psychology and biology, corresponding to reoccurring challenges in the ancestral environment. Behaviors were framed in situations which people would have some chance of encountering in modern times. We identify five domains of risk-taking: between-group competition, within-group competition, mating and resource allocation for mate attraction, environmental risks, and fertility risks.
A Potential Tool for Swift Fox (Vulpes velox) Conservation: Individuality of Long-Range Barking Sequences

DEFF Research Database (Denmark)

Darden, Safi-Kirstine Klem; Dabelsteen, Torben; Pedersen, Simon Boel

2003-01-01

Vocal individuality has been found in a number canid species. This natural variation can have applications in several aspects of species conservation, from behavioral studies to estimating population density or abundance. The swift fox (Vulpes velox) is a North American canid listed as endangered...... in Canada and extirpated, endangered, or threatened in parts of the United States. The barking sequence is a long-range vocalization in the species' vocal repertoire. It consists of a series of barks and is most common during the mating season. We analyzed barking sequences recorded in a standardized...
The Number, Organization, and Size of Polymorphic Membrane Protein Coding Sequences as well as the Most Conserved Pmp Protein Differ within and across Chlamydia Species.

Science.gov (United States)

Van Lent, Sarah; Creasy, Heather Huot; Myers, Garry S A; Vanrompay, Daisy

2016-01-01

Variation is a central trait of the polymorphic membrane protein (Pmp) family. The number of pmp coding sequences differs between Chlamydia species, but it is unknown whether the number of pmp coding sequences is constant within a Chlamydia species. The level of conservation of the Pmp proteins has previously only been determined for Chlamydia trachomatis. As different Pmp proteins might be indispensible for the pathogenesis of different Chlamydia species, this study investigated the conservation of Pmp proteins both within and across C. trachomatis,C. pneumoniae,C. abortus, and C. psittaci. The pmp coding sequences were annotated in 16 C. trachomatis, 6 C. pneumoniae, 2 C. abortus, and 16 C. psittaci genomes. The number and organization of polymorphic membrane coding sequences differed within and across the analyzed Chlamydia species. The length of coding sequences of pmpA,pmpB, and pmpH was conserved among all analyzed genomes, while the length of pmpE/F and pmpG, and remarkably also of the subtype pmpD, differed among the analyzed genomes. PmpD, PmpA, PmpH, and PmpA were the most conserved Pmp in C. trachomatis,C. pneumoniae,C. abortus, and C. psittaci, respectively. PmpB was the most conserved Pmp across the 4 analyzed Chlamydia species. © 2016 S. Karger AG, Basel.
The disequilibrium of nucleosomes distribution along chromosomes plays a functional and evolutionarily role in regulating gene expression

KAUST Repository

Cui, Peng

2011-08-19

To further understand the relationship between nucleosome-space occupancy (NO) and global transcriptional activity in mammals, we acquired a set of genome-wide nucleosome distribution and transcriptome data from the mouse cerebrum and testis based on ChIP (H3)-seq and RNA-seq, respectively. We identified a nearly consistent NO patterns among three mouse tissues-cerebrum, testis, and ESCs-and found, through clustering analysis for transcriptional activation, that the NO variations among chromosomes are closely associated with distinct expression levels between house-keeping (HK) genes and tissue-specific (TS) genes. Both TS and HK genes form clusters albeit the obvious majority. This feature implies that NO patterns, i.e. nucleosome binding and clustering, are coupled with gene clustering that may be functionally and evolutionarily conserved in regulating gene expression among different cell types. © 2011 Cui et al.
The disequilibrium of nucleosomes distribution along chromosomes plays a functional and evolutionarily role in regulating gene expression.

Directory of Open Access Journals (Sweden)

Peng Cui

Full Text Available To further understand the relationship between nucleosome-space occupancy (NO and global transcriptional activity in mammals, we acquired a set of genome-wide nucleosome distribution and transcriptome data from the mouse cerebrum and testis based on ChIP (H3-seq and RNA-seq, respectively. We identified a nearly consistent NO patterns among three mouse tissues--cerebrum, testis, and ESCs--and found, through clustering analysis for transcriptional activation, that the NO variations among chromosomes are closely associated with distinct expression levels between house-keeping (HK genes and tissue-specific (TS genes. Both TS and HK genes form clusters albeit the obvious majority. This feature implies that NO patterns, i.e. nucleosome binding and clustering, are coupled with gene clustering that may be functionally and evolutionarily conserved in regulating gene expression among different cell types.
Development of a dedicated peptide tandem mass spectral library for conservation science.

Science.gov (United States)

Fremout, Wim; Dhaenens, Maarten; Saverwyns, Steven; Sanyova, Jana; Vandenabeele, Peter; Deforce, Dieter; Moens, Luc

2012-05-30

In recent years, the use of liquid chromatography tandem mass spectrometry (LC-MS/MS) on tryptic digests of cultural heritage objects has attracted much attention. It allows for unambiguous identification of peptides and proteins, and even in complex mixtures species-specific identification becomes feasible with minimal sample consumption. Determination of the peptides is commonly based on theoretical cleavage of known protein sequences and on comparison of the expected peptide fragments with those found in the MS/MS spectra. In this approach, complex computer programs, such as Mascot, perform well identifying known proteins, but fail when protein sequences are unknown or incomplete. Often, when trying to distinguish evolutionarily well preserved collagens of different species, Mascot lacks the required specificity. Complementary and often more accurate information on the proteins can be obtained using a reference library of MS/MS spectra of species-specific peptides. Therefore, a library dedicated to various sources of proteins in works of art was set up, with an initial focus on collagen rich materials. This paper discusses the construction and the advantages of this spectral library for conservation science, and its application on a number of samples from historical works of art. Copyright © 2012 Elsevier B.V. All rights reserved.
Some AFLP amplicons are highly conserved DNA sequences mapping to the same linkage groups in two F2 populations of carrot

Directory of Open Access Journals (Sweden)

Santos Carlos A.F.

2002-01-01

Full Text Available Amplified fragment length polymorphism (AFLP is a fast and reliable tool to generate a large number of DNA markers. In two unrelated F2 populations of carrot (Daucus carota L., Brasilia x HCM and B493 x QAL (wild carrot, it was hypothesized that DNA 1 digested with the same restriction endonuclease enzymes and amplified with the same primer combination and 2 sharing the same position in polyacrylamide gels should be conserved sequences. To test this hypothesis AFLP fragments from polyacrylamide gels were eluted, reamplified, separated in agarose gels, purified, cloned and sequenced. Among thirty-one paired fragments from each F2 population, twenty-six had identity greater than 91% and five presented identity of 24% to 44%. Among the twenty-six conserved AFLPs only one mapped to different linkage groups in the two populations while four of the five less-conserved bands mapped to different linkage groups. Of eight SCAR (sequence characterized amplified regions primers tested, one conserved AFLP resulted in co-dominant markers in both populations. Screening among 14 carrot inbreds or cultivars with three AFLP-SCAR primers revealed clear and polymorphic PCR products, with similar molecular sizes on agarose gels. The development of co-dominant markers based on conserved AFLP fragments will be useful to detect seed mixtures among hybrids, to improve and to merge linkage maps and to study diversity and phylogenetic relationships.
Hydra meiosis reveals unexpected conservation of structural synaptonemal complex proteins across metazoans

OpenAIRE

Fraune, Johanna; Alsheimer, Manfred; Volff, Jean-Nicolas; Busch, Karoline; Fraune, Sebastian; Bosch, Thomas C. G.; Benavente, Ricardo

2012-01-01

The synaptonemal complex (SC) is a key structure of meiosis, mediating the stable pairing (synapsis) of homologous chromosomes during prophase I. Its remarkable tripartite structure is evolutionarily well conserved and can be found in almost all sexually reproducing organisms. However, comparison of the different SC protein components in the common meiosis model organisms Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, and Mus musculus revealed...
Effects of temperature and mass conservation on the typical chemical sequences of hydrogen oxidation

Science.gov (United States)

Nicholson, Schuyler B.; Alaghemandi, Mohammad; Green, Jason R.

2018-01-01

Macroscopic properties of reacting mixtures are necessary to design synthetic strategies, determine yield, and improve the energy and atom efficiency of many chemical processes. The set of time-ordered sequences of chemical species are one representation of the evolution from reactants to products. However, only a fraction of the possible sequences is typical, having the majority of the joint probability and characterizing the succession of chemical nonequilibrium states. Here, we extend a variational measure of typicality and apply it to atomistic simulations of a model for hydrogen oxidation over a range of temperatures. We demonstrate an information-theoretic methodology to identify typical sequences under the constraints of mass conservation. Including these constraints leads to an improved ability to learn the chemical sequence mechanism from experimentally accessible data. From these typical sequences, we show that two quantities defining the variational typical set of sequences—the joint entropy rate and the topological entropy rate—increase linearly with temperature. These results suggest that, away from explosion limits, data over a narrow range of thermodynamic parameters could be sufficient to extrapolate these typical features of combustion chemistry to other conditions.
PDL1 Signals through Conserved Sequence Motifs to Overcome Interferon-Mediated Cytotoxicity

Directory of Open Access Journals (Sweden)

Maria Gato-Cañas

2017-08-01

Full Text Available PDL1 blockade produces remarkable clinical responses, thought to occur by T cell reactivation through prevention of PDL1-PD1 T cell inhibitory interactions. Here, we find that PDL1 cell-intrinsic signaling protects cancer cells from interferon (IFN cytotoxicity and accelerates tumor progression. PDL1 inhibited IFN signal transduction through a conserved class of sequence motifs that mediate crosstalk with IFN signaling. Abrogation of PDL1 expression or antibody-mediated PDL1 blockade strongly sensitized cancer cells to IFN cytotoxicity through a STAT3/caspase-7-dependent pathway. Moreover, somatic mutations found in human carcinomas within these PDL1 sequence motifs disrupted motif regulation, resulting in PDL1 molecules with enhanced protective activities from type I and type II IFN cytotoxicity. Overall, our results reveal a mode of action of PDL1 in cancer cells as a first line of defense against IFN cytotoxicity.
MicroRNA genes preferentially expressed in dendritic cells contain sites for conserved transcription factor binding motifs in their promoters

Directory of Open Access Journals (Sweden)

Huynen Martijn A

2011-06-01

Full Text Available Abstract Background MicroRNAs (miRNAs play a fundamental role in the regulation of gene expression by translational repression or target mRNA degradation. Regulatory elements in miRNA promoters are less well studied, but may reveal a link between their expression and a specific cell type. Results To explore this link in myeloid cells, miRNA expression profiles were generated from monocytes and dendritic cells (DCs. Differences in miRNA expression among monocytes, DCs and their stimulated progeny were observed. Furthermore, putative promoter regions of miRNAs that are significantly up-regulated in DCs were screened for Transcription Factor Binding Sites (TFBSs based on TFBS motif matching score, the degree to which those TFBSs are over-represented in the promoters of the up-regulated miRNAs, and the extent of conservation of the TFBSs in mammals. Conclusions Analysis of evolutionarily conserved TFBSs in DC promoters revealed preferential clustering of sites within 500 bp upstream of the precursor miRNAs and that many mRNAs of cognate TFs of the conserved TFBSs were indeed expressed in the DCs. Taken together, our data provide evidence that selected miRNAs expressed in DCs have evolutionarily conserved TFBSs relevant to DC biology in their promoters.
An Evolutionarily Conserved Mechanism for Intrinsic and Transferable Polymyxin Resistance

Directory of Open Access Journals (Sweden)

Yongchang Xu

2018-04-01

Full Text Available Polymyxins, a family of cationic antimicrobial cyclic peptides, act as a last line of defense against severe infections by Gram-negative pathogens with carbapenem resistance. In addition to the intrinsic resistance to polymyxin E (colistin conferred by Neisseria eptA, the plasmid-borne mobilized colistin resistance gene mcr-1 has been disseminated globally since the first discovery in Southern China, in late 2015. However, the molecular mechanisms for both intrinsic and transferable resistance to colistin remain largely unknown. Here, we aim to address this gap in the knowledge of these proteins. Structural and functional analyses of EptA and MCR-1 and -2 have defined a conserved 12-residue cavity that is required for the entry of the lipid substrate, phosphatidylethanolamine (PE. The in vitro and in vivo data together have allowed us to visualize the similarities in catalytic activity shared by EptA and MCR-1 and -2. The expression of either EptA or MCR-1 or -2 is shown to remodel the surface of enteric bacteria (e.g., Escherichia coli, Salmonella enterica, Klebsiella pneumoniae, etc., rendering them resistant to colistin. The parallels in the PE substrate-binding cavities among EptA, MCR-1, and MCR-2 provide a comprehensive understanding of both intrinsic and transferable colistin resistance. Domain swapping between EptA and MCR-1 and -2 reveals that the two domains (transmembrane [TM] region and phosphoethanolamine [PEA] transferase are not functionally exchangeable. Taken together, the results represent a common mechanism for intrinsic and transferable PEA resistance to polymyxin, a last-resort antibiotic against multidrug-resistant pathogens.

Rapid detection, classification and accurate alignment of up to a million or more related protein sequences.

Science.gov (United States)

Neuwald, Andrew F

2009-08-01

The patterns of sequence similarity and divergence present within functionally diverse, evolutionarily related proteins contain implicit information about corresponding biochemical similarities and differences. A first step toward accessing such information is to statistically analyze these patterns, which, in turn, requires that one first identify and accurately align a very large set of protein sequences. Ideally, the set should include many distantly related, functionally divergent subgroups. Because it is extremely difficult, if not impossible for fully automated methods to align such sequences correctly, researchers often resort to manual curation based on detailed structural and biochemical information. However, multiply-aligning vast numbers of sequences in this way is clearly impractical. This problem is addressed using Multiply-Aligned Profiles for Global Alignment of Protein Sequences (MAPGAPS). The MAPGAPS program uses a set of multiply-aligned profiles both as a query to detect and classify related sequences and as a template to multiply-align the sequences. It relies on Karlin-Altschul statistics for sensitivity and on PSI-BLAST (and other) heuristics for speed. Using as input a carefully curated multiple-profile alignment for P-loop GTPases, MAPGAPS correctly aligned weakly conserved sequence motifs within 33 distantly related GTPases of known structure. By comparison, the sequence- and structurally based alignment methods hmmalign and PROMALS3D misaligned at least 11 and 23 of these regions, respectively. When applied to a dataset of 65 million protein sequences, MAPGAPS identified, classified and aligned (with comparable accuracy) nearly half a million putative P-loop GTPase sequences. A C++ implementation of MAPGAPS is available at http://mapgaps.igs.umaryland.edu. Supplementary data are available at Bioinformatics online.
Comparative analysis of evolutionarily conserved motifs of epidermal growth factor receptor 2 (HER2) predicts novel potential therapeutic epitopes

DEFF Research Database (Denmark)

Deng, Xiaohong; Zheng, Xuxu; Yang, Huanming

2014-01-01

druggable epitopes/targets. We employed the PROSITE Scan to detect structurally conserved motifs and PRINTS to search for linearly conserved motifs of ECD HER2. We found that the epitopes recognized by trastuzumab and pertuzumab are located in the predicted conserved motifs of ECD HER2, supporting our...
Optimal packaging of FIV genomic RNA depends upon a conserved long-range interaction and a palindromic sequence within gag.

Science.gov (United States)

Rizvi, Tahir A; Kenyon, Julia C; Ali, Jahabar; Aktar, Suriya J; Phillip, Pretty S; Ghazawi, Akela; Mustafa, Farah; Lever, Andrew M L

2010-10-15

The feline immunodeficiency virus (FIV) is a lentivirus that is related to human immunodeficiency virus (HIV), causing a similar pathology in cats. It is a potential small animal model for AIDS and the FIV-based vectors are also being pursued for human gene therapy. Previous studies have mapped the FIV packaging signal (ψ) to two or more discontinuous regions within the 5' 511 nt of the genomic RNA and structural analyses have determined its secondary structure. The 5' and 3' sequences within ψ region interact through extensive long-range interactions (LRIs), including a conserved heptanucleotide interaction between R/U5 and gag. Other secondary structural elements identified include a conserved 150 nt stem-loop (SL2) and a small palindromic stem-loop within gag open reading frame that might act as a viral dimerization initiation site. We have performed extensive mutational analysis of these sequences and structures and ascertained their importance in FIV packaging using a trans-complementation assay. Disrupting the conserved heptanucleotide LRI to prevent base pairing between R/U5 and gag reduced packaging by 2.8-5.5 fold. Restoration of pairing using an alternative, non-wild type (wt) LRI sequence restored RNA packaging and propagation to wt levels, suggesting that it is the structure of the LRI, rather than its sequence, that is important for FIV packaging. Disrupting the palindrome within gag reduced packaging by 1.5-3-fold, but substitution with a different palindromic sequence did not restore packaging completely, suggesting that the sequence of this region as well as its palindromic nature is important. Mutation of individual regions of SL2 did not have a pronounced effect on FIV packaging, suggesting that either it is the structure of SL2 as a whole that is necessary for optimal packaging, or that there is redundancy within this structure. The mutational analysis presented here has further validated the previously predicted RNA secondary structure of FIV
Inferring Pongo conservation units: a perspective based on microsatellite and mitochondrial DNA analyses.

Science.gov (United States)

Kanthaswamy, Sreetharan; Kurushima, Jennifer D; Smith, David Glenn

2006-10-01

In order to define evolutionarily significant and management units (ESUs and MUs) among subpopulations of Sumatran (Pongo pygmaeus abelii) and Bornean (P. p. pygmaeus) orangutans we determined their genetic relationships. We analyzed partial sequences of four mitochondrial genes and nine autosomal microsatellite loci of 70 orangutans to test two hypotheses regarding the population structure within Borneo and the genetic distinction between Bornean and Sumatran orangutans. Our data show Bornean orangutans consist of two genetic clusters-the western and eastern clades. Each taxon exhibits relatively distinct mtDNA and nuclear genetic distributions that are likely attributable to genetic drift. These groups, however, do not warrant designations as separate conservation MUs because they demonstrate no demographic independence and only moderate genetic differentiation. Our findings also indicate relatively high levels of overall genetic diversity within Borneo, suggesting that observed habitat fragmentation and erosion during the last three decades had limited influence on genetic variability. Because the mtDNA of Bornean and Sumatran orangutans are not strictly reciprocally monophyletic, we recommend treating these populations as separate MUs and discontinuing inter-island translocation of animals unless absolutely necessary.
Detecting remote sequence homology in disordered proteins: discovery of conserved motifs in the N-termini of Mononegavirales phosphoproteins.

Directory of Open Access Journals (Sweden)

David Karlin

Full Text Available Paramyxovirinae are a large group of viruses that includes measles virus and parainfluenza viruses. The viral Phosphoprotein (P plays a central role in viral replication. It is composed of a highly variable, disordered N-terminus and a conserved C-terminus. A second viral protein alternatively expressed, the V protein, also contains the N-terminus of P, fused to a zinc finger. We suspected that, despite their high variability, the N-termini of P/V might all be homologous; however, using standard approaches, we could previously identify sequence conservation only in some Paramyxovirinae. We now compared the N-termini using sensitive sequence similarity search programs, able to detect residual similarities unnoticeable by conventional approaches. We discovered that all Paramyxovirinae share a short sequence motif in their first 40 amino acids, which we called soyuz1. Despite its short length (11-16aa, several arguments allow us to conclude that soyuz1 probably evolved by homologous descent, unlike linear motifs. Conservation across such evolutionary distances suggests that soyuz1 plays a crucial role and experimental data suggest that it binds the viral nucleoprotein to prevent its illegitimate self-assembly. In some Paramyxovirinae, the N-terminus of P/V contains a second motif, soyuz2, which might play a role in blocking interferon signaling. Finally, we discovered that the P of related Mononegavirales contain similarly overlooked motifs in their N-termini, and that their C-termini share a previously unnoticed structural similarity suggesting a common origin. Our results suggest several testable hypotheses regarding the replication of Mononegavirales and suggest that disordered regions with little overall sequence similarity, common in viral and eukaryotic proteins, might contain currently overlooked motifs (intermediate in length between linear motifs and disordered domains that could be detected simply by comparing orthologous proteins.
An artificial intelligence approach fit for tRNA gene studies in the era of big sequence data.

Science.gov (United States)

Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi

2017-09-12

Unsupervised data mining capable of extracting a wide range of knowledge from big data without prior knowledge or particular models is a timely application in the era of big sequence data accumulation in genome research. By handling oligonucleotide compositions as high-dimensional data, we have previously modified the conventional self-organizing map (SOM) for genome informatics and established BLSOM, which can analyze more than ten million sequences simultaneously. Here, we develop BLSOM specialized for tRNA genes (tDNAs) that can cluster (self-organize) more than one million microbial tDNAs according to their cognate amino acid solely depending on tetra- and pentanucleotide compositions. This unsupervised clustering can reveal combinatorial oligonucleotide motifs that are responsible for the amino acid-dependent clustering, as well as other functionally and structurally important consensus motifs, which have been evolutionarily conserved. BLSOM is also useful for identifying tDNAs as phylogenetic markers for special phylotypes. When we constructed BLSOM with 'species-unknown' tDNAs from metagenomic sequences plus 'species-known' microbial tDNAs, a large portion of metagenomic tDNAs self-organized with species-known tDNAs, yielding information on microbial communities in environmental samples. BLSOM can also enhance accuracy in the tDNA database obtained from big sequence data. This unsupervised data mining should become important for studying numerous functionally unclear RNAs obtained from a wide range of organisms.
Genomic diversity guides conservation strategies among rare terrestrial orchid species when taxonomy remains uncertain.

Science.gov (United States)

Ahrens, Collin W; Supple, Megan A; Aitken, Nicola C; Cantrill, David J; Borevitz, Justin O; James, Elizabeth A

2017-06-01

Species are often used as the unit for conservation, but may not be suitable for species complexes where taxa are difficult to distinguish. Under such circumstances, it may be more appropriate to consider species groups or populations as evolutionarily significant units (ESUs). A population genomic approach was employed to investigate the diversity within and among closely related species to create a more robust, lineage-specific conservation strategy for a nationally endangered terrestrial orchid and its relatives from south-eastern Australia. Four putative species were sampled from a total of 16 populations in the Victorian Volcanic Plain (VVP) bioregion and one population of a sub-alpine outgroup in south-eastern Australia. Morphological measurements were taken in situ along with leaf material for genotyping by sequencing (GBS) and microsatellite analyses. Species could not be differentiated using morphological measurements. Microsatellite and GBS markers confirmed the outgroup as distinct, but only GBS markers provided resolution of population genetic structure. The nationally endangered Diuris basaltica was indistinguishable from two related species ( D. chryseopsis and D. behrii ), while the state-protected D. gregaria showed genomic differentiation. Genomic diversity identified among the four Diuris species suggests that conservation of this taxonomically complex group will be best served by considering them as one ESU rather than separately aligned with species as currently recognized. This approach will maximize evolutionary potential among all species during increased isolation and environmental change. The methods used here can be applied generally to conserve evolutionary processes for groups where taxonomic uncertainty hinders the use of species as conservation units. © The Author 2017. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Sequence imputation of HPV16 genomes for genetic association studies.

Directory of Open Access Journals (Sweden)

Benjamin Smith

Full Text Available Human Papillomavirus type 16 (HPV16 causes over half of all cervical cancer and some HPV16 variants are more oncogenic than others. The genetic basis for the extraordinary oncogenic properties of HPV16 compared to other HPVs is unknown. In addition, we neither know which nucleotides vary across and within HPV types and lineages, nor which of the single nucleotide polymorphisms (SNPs determine oncogenicity.A reference set of 62 HPV16 complete genome sequences was established and used to examine patterns of evolutionary relatedness amongst variants using a pairwise identity heatmap and HPV16 phylogeny. A BLAST-based algorithm was developed to impute complete genome data from partial sequence information using the reference database. To interrogate the oncogenic risk of determined and imputed HPV16 SNPs, odds-ratios for each SNP were calculated in a case-control viral genome-wide association study (VWAS using biopsy confirmed high-grade cervix neoplasia and self-limited HPV16 infections from Guanacaste, Costa Rica.HPV16 variants display evolutionarily stable lineages that contain conserved diagnostic SNPs. The imputation algorithm indicated that an average of 97.5±1.03% of SNPs could be accurately imputed. The VWAS revealed specific HPV16 viral SNPs associated with variant lineages and elevated odds ratios; however, individual causal SNPs could not be distinguished with certainty due to the nature of HPV evolution.Conserved and lineage-specific SNPs can be imputed with a high degree of accuracy from limited viral polymorphic data due to the lack of recombination and the stochastic mechanism of variation accumulation in the HPV genome. However, to determine the role of novel variants or non-lineage-specific SNPs by VWAS will require direct sequence analysis. The investigation of patterns of genetic variation and the identification of diagnostic SNPs for lineages of HPV16 variants provides a valuable resource for future studies of HPV16
Structure-Related Roles for the Conservation of the HIV-1 Fusion Peptide Sequence Revealed by Nuclear Magnetic Resonance.

Science.gov (United States)

Serrano, Soraya; Huarte, Nerea; Rujas, Edurne; Andreu, David; Nieva, José L; Jiménez, María Angeles

2017-10-17

Despite extensive characterization of the human immunodeficiency virus type 1 (HIV-1) hydrophobic fusion peptide (FP), the structure-function relationships underlying its extraordinary degree of conservation remain poorly understood. Specifically, the fact that the tandem repeat of the FLGFLG tripeptide is absolutely conserved suggests that high hydrophobicity may not suffice to unleash FP function. Here, we have compared the nuclear magnetic resonance (NMR) structures adopted in nonpolar media by two FP surrogates, wtFP-tag and scrFP-tag, which had equal hydrophobicity but contained wild-type and scrambled core sequences LFLGFLG and FGLLGFL, respectively. In addition, these peptides were tagged at their C-termini with an epitope sequence that folded independently, thereby allowing Western blot detection without interfering with FP structure. We observed similar α-helical FP conformations for both specimens dissolved in the low-polarity medium 25% (v/v) 1,1,1,3,3,3-hexafluoro-2-propanol (HFIP), but important differences in contact with micelles of the membrane mimetic dodecylphosphocholine (DPC). Thus, whereas wtFP-tag preserved a helix displaying a Gly-rich ridge, the scrambled sequence lost in great part the helical structure upon being solubilized in DPC. Western blot analyses further revealed the capacity of wtFP-tag to assemble trimers in membranes, whereas membrane oligomers were not observed in the case of the scrFP-tag sequence. We conclude that, beyond hydrophobicity, preserving sequence order is an important feature for defining the secondary structures and oligomeric states adopted by the HIV FP in membranes.
Evolutionary conservation of vertebrate notochord genes in the ascidian Ciona intestinalis.

Science.gov (United States)

Kugler, Jamie E; Passamaneck, Yale J; Feldman, Taya G; Beh, Jeni; Regnier, Todd W; Di Gregorio, Anna

2008-11-01

To reconstruct a minimum complement of notochord genes evolutionarily conserved across chordates, we scanned the Ciona intestinalis genome using the sequences of 182 genes reported to be expressed in the notochord of different vertebrates and identified 139 candidate notochord genes. For 66 of these Ciona genes expression data were already available, hence we analyzed the expression of the remaining 73 genes and found notochord expression for 20. The predicted products of the newly identified notochord genes range from the transcription factors Ci-XBPa and Ci-miER1 to extracellular matrix proteins. We examined the expression of the newly identified notochord genes in embryos ectopically expressing Ciona Brachyury (Ci-Bra) and in embryos expressing a repressor form of this transcription factor in the notochord, and we found that while a subset of the genes examined are clearly responsive to Ci-Bra, other genes are not affected by alterations in its levels. We provide a first description of notochord genes that are not evidently influenced by the ectopic expression of Ci-Bra and we propose alternative regulatory mechanisms that might control their transcription. Copyright 2008 Wiley-Liss, Inc.
Multi-species sequence comparison reveals conservation of ghrelin gene-derived splice variants encoding a truncated ghrelin peptide.

Science.gov (United States)

Seim, Inge; Jeffery, Penny L; Thomas, Patrick B; Walpole, Carina M; Maugham, Michelle; Fung, Jenny N T; Yap, Pei-Yi; O'Keeffe, Angela J; Lai, John; Whiteside, Eliza J; Herington, Adrian C; Chopin, Lisa K

2016-06-01

The peptide hormone ghrelin is a potent orexigen produced predominantly in the stomach. It has a number of other biological actions, including roles in appetite stimulation, energy balance, the stimulation of growth hormone release and the regulation of cell proliferation. Recently, several ghrelin gene splice variants have been described. Here, we attempted to identify conserved alternative splicing of the ghrelin gene by cross-species sequence comparisons. We identified a novel human exon 2-deleted variant and provide preliminary evidence that this splice variant and in1-ghrelin encode a C-terminally truncated form of the ghrelin peptide, termed minighrelin. These variants are expressed in humans and mice, demonstrating conservation of alternative splicing spanning 90 million years. Minighrelin appears to have similar actions to full-length ghrelin, as treatment with exogenous minighrelin peptide stimulates appetite and feeding in mice. Forced expression of the exon 2-deleted preproghrelin variant mirrors the effect of the canonical preproghrelin, stimulating cell proliferation and migration in the PC3 prostate cancer cell line. This is the first study to characterise an exon 2-deleted preproghrelin variant and to demonstrate sequence conservation of ghrelin gene-derived splice variants that encode a truncated ghrelin peptide. This adds further impetus for studies into the alternative splicing of the ghrelin gene and the function of novel ghrelin peptides in vertebrates.
NOA36 Protein Contains a Highly Conserved Nucleolar Localization Signal Capable of Directing Functional Proteins to the Nucleolus, in Mammalian Cells

Science.gov (United States)

de Melo, Ivan S.; Jimenez-Nuñez, Maria D.; Iglesias, Concepción; Campos-Caro, Antonio; Moreno-Sanchez, David; Ruiz, Felix A.; Bolívar, Jorge

2013-01-01

NOA36/ZNF330 is an evolutionarily well-preserved protein present in the nucleolus and mitochondria of mammalian cells. We have previously reported that the pro-apoptotic activity of this protein is mediated by a characteristic cysteine-rich domain. We now demonstrate that the nucleolar localization of NOA36 is due to a highly-conserved nucleolar localization signal (NoLS) present in residues 1–33. This NoLS is a sequence containing three clusters of two or three basic amino acids. We fused the amino terminal of NOA36 to eGFP in order to characterize this putative NoLS. We show that a cluster of three lysine residues at positions 3 to 5 within this sequence is critical for the nucleolar localization. We also demonstrate that the sequence as found in human is capable of directing eGFP to the nucleolus in several mammal, fish and insect cells. Moreover, this NoLS is capable of specifically directing the cytosolic yeast enzyme polyphosphatase to the target of the nucleolus of HeLa cells, wherein its enzymatic activity was detected. This NoLS could therefore serve as a very useful tool as a nucleolar marker and for directing particular proteins to the nucleolus in distant animal species. PMID:23516598
Analysis of 90 Mb of the potato genome reveals conservation of gene structures and order with tomato but divergence in repetitive sequence composition

Directory of Open Access Journals (Sweden)

O'Brien Kimberly

2008-06-01

Full Text Available Abstract Background The Solanaceae family contains a number of important crop species including potato (Solanum tuberosum which is grown for its underground storage organ known as a tuber. Albeit the 4th most important food crop in the world, other than a collection of ~220,000 Expressed Sequence Tags, limited genomic sequence information is currently available for potato and advances in potato yield and nutrition content would be greatly assisted through access to a complete genome sequence. While morphologically diverse, Solanaceae species such as potato, tomato, pepper, and eggplant share not only genes but also gene order thereby permitting highly informative comparative genomic analyses. Results In this study, we report on analysis 89.9 Mb of potato genomic sequence representing 10.2% of the genome generated through end sequencing of a potato bacterial artificial chromosome (BAC clone library (87 Mb and sequencing of 22 potato BAC clones (2.9 Mb. The GC content of potato is very similar to Solanum lycopersicon (tomato and other dicotyledonous species yet distinct from the monocotyledonous grass species, Oryza sativa. Parallel analyses of repetitive sequences in potato and tomato revealed substantial differences in their abundance, 34.2% in potato versus 46.3% in tomato, which is consistent with the increased genome size per haploid genome of these two Solanum species. Specific classes and types of repetitive sequences were also differentially represented between these two species including a telomeric-related repetitive sequence, ribosomal DNA, and a number of unclassified repetitive sequences. Comparative analyses between tomato and potato at the gene level revealed a high level of conservation of gene content, genic feature, and gene order although discordances in synteny were observed. Conclusion Genomic level analyses of potato and tomato confirm that gene sequence and gene order are conserved between these solanaceous species and that
An evolutionarily conserved intronic region controls the spatiotemporal expression of the transcription factor Sox10

Directory of Open Access Journals (Sweden)

Pavan William J

2008-10-01

Full Text Available Abstract Background A major challenge lies in understanding the complexities of gene regulation. Mutation of the transcription factor SOX10 is associated with several human diseases. The disease phenotypes reflect the function of SOX10 in diverse tissues including the neural crest, central nervous system and otic vesicle. As expected, the SOX10 expression pattern is complex and highly dynamic, but little is known of the underlying mechanisms regulating its spatiotemporal pattern. SOX10 expression is highly conserved between all vertebrates characterised. Results We have combined in vivo testing of DNA fragments in zebrafish and computational comparative genomics to identify the first regulatory regions of the zebrafish sox10 gene. Both approaches converged on the 3' end of the conserved 1st intron as being critical for spatial patterning of sox10 in the embryo. Importantly, we have defined a minimal region crucial for this function. We show that this region contains numerous binding sites for transcription factors known to be essential in early neural crest induction, including Tcf/Lef, Sox and FoxD3. We show that the identity and relative position of these binding sites are conserved between zebrafish and mammals. A further region, partially required for oligodendrocyte expression, lies in the 5' region of the same intron and contains a putative CSL binding site, consistent with a role for Notch signalling in sox10 regulation. Furthermore, we show that β-catenin, Notch signalling and Sox9 can induce ectopic sox10 expression in early embryos, consistent with regulatory roles predicted from our transgenic and computational results. Conclusion We have thus identified two major sites of sox10 regulation in vertebrates and provided evidence supporting a role for at least three factors in driving sox10 expression in neural crest, otic epithelium and oligodendrocyte domains.
Expression and genomic analysis of midasin, a novel and highly conserved AAA protein distantly related to dynein

Directory of Open Access Journals (Sweden)

Gibbons I R

2002-07-01

Full Text Available Abstract Background The largest open reading frame in the Saccharomyces genome encodes midasin (MDN1p, YLR106p, an AAA ATPase of 560 kDa that is essential for cell viability. Orthologs of midasin have been identified in the genome projects for Drosophila, Arabidopsis, and Schizosaccharomyces pombe. Results Midasin is present as a single-copy gene encoding a well-conserved protein of ~600 kDa in all eukaryotes for which data are available. In humans, the gene maps to 6q15 and encodes a predicted protein of 5596 residues (632 kDa. Sequence alignments of midasin from humans, yeast, Giardia and Encephalitozoon indicate that its domain structure comprises an N-terminal domain (35 kDa, followed by an AAA domain containing six tandem AAA protomers (~30 kDa each, a linker domain (260 kDa, an acidic domain (~70 kDa containing 35–40% aspartate and glutamate, and a carboxy-terminal M-domain (30 kDa that possesses MIDAS sequence motifs and is homologous to the I-domain of integrins. Expression of hemagglutamin-tagged midasin in yeast demonstrates a polypeptide of the anticipated size that is localized principally in the nucleus. Conclusions The highly conserved structure of midasin in eukaryotes, taken in conjunction with its nuclear localization in yeast, suggests that midasin may function as a nuclear chaperone and be involved in the assembly/disassembly of macromolecular complexes in the nucleus. The AAA domain of midasin is evolutionarily related to that of dynein, but it appears to lack a microtubule-binding site.
Analysis of C. elegans NR2E nuclear receptors defines three conserved clades and ligand-independent functions

Directory of Open Access Journals (Sweden)

Weber Katherine P

2012-06-01

Full Text Available Abstract Background The nuclear receptors (NRs are an important class of transcription factors that are conserved across animal phyla. Canonical NRs consist of a DNA-binding domain (DBD and ligand-binding domain (LBD. While most animals have 20–40 NRs, nematodes of the genus Caenorhabditis have experienced a spectacular proliferation and divergence of NR genes. The LBDs of evolutionarily-conserved Caenorhabditis NRs have diverged sharply from their Drosophila and vertebrate orthologs, while the DBDs have been strongly conserved. The NR2E family of NRs play critical roles in development, especially in the nervous system. In this study, we explore the phylogenetics and function of the NR2E family of Caenorhabditis elegans, using an in vivo assay to test LBD function. Results Phylogenetic analysis reveals that the NR2E family of NRs consists of three broadly-conserved clades of orthologous NRs. In C. elegans, these clades are defined by nhr-67, fax-1 and nhr-239. The vertebrate orthologs of nhr-67 and fax-1 are Tlx and PNR, respectively. While the nhr-239 clade includes orthologs in insects (Hr83, an echinoderm, and a hemichordate, the gene appears to have been lost from vertebrate lineages. The C. elegans and C. briggsae nhr-239 genes have an apparently-truncated and highly-diverged LBD region. An additional C. elegans NR2E gene, nhr-111, appears to be a recently-evolved paralog of fax-1; it is present in C. elegans, but not C. briggsae or other animals with completely-sequenced genomes. Analysis of the relatively unstudied nhr-111 and nhr-239 genes demonstrates that they are both expressed—nhr-111 very broadly and nhr-239 in a small subset of neurons. Analysis of the FAX-1 LBD in an in vivo assay revealed that it is not required for at least some developmental functions. Conclusions Our analysis supports three conserved clades of NR2E receptors, only two of which are represented in vertebrates, indicating three ancestral NR2E genes in the
FAM20: an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells

Directory of Open Access Journals (Sweden)

Cobos Everardo

2005-01-01

Full Text Available Abstract Background Hematopoiesis is a complex developmental process controlled by a large number of factors that regulate stem cell renewal, lineage commitment and differentiation. Secreted proteins, including the hematopoietic growth factors, play critical roles in these processes and have important biological and clinical significance. We have employed representational difference analysis to identify genes that are differentially expressed during experimentally induced myeloid differentiation in the murine EML hematopoietic stem cell line. Results One identified clone encoded a previously unidentified protein of 541 amino acids that contains an amino terminal signal sequence but no other characterized domains. This protein is a member of family of related proteins that has been named family with sequence similarity 20 (FAM20 with three members (FAM20A, FAM20B and FAM20C in mammals. Evolutionary comparisons revealed the existence of a single FAM20 gene in the simple vertebrate Ciona intestinalis and the invertebrate worm Caenorhabditis elegans and two genes in two insect species, Drosophila melanogaster and Anopheles gambiae. Six FAM20 family members were identified in the genome of the pufferfish, Fugu rubripes and five members in the zebrafish, Danio rerio. The mouse Fam20a protein was ectopically expressed in a mammalian cell line and found to be a bona fide secreted protein and efficient secretion was dependent on the integrity of the signal sequence. Expression analysis revealed that the Fam20a gene was indeed differentially expressed during hematopoietic differentiation and that the other two family members (Fam20b and Fam20c were also expressed during hematcpoiesis but that their mRNA levels did not vary significantly. Likewise FAM20A was expressed in more limited set of human tissues than the other two family members. Conclusions The FAM20 family represents a new family of secreted proteins with potential functions in regulating
A belief-based evolutionarily stable strategy.

Science.gov (United States)

Deng, Xinyang; Wang, Zhen; Liu, Qi; Deng, Yong; Mahadevan, Sankaran

2014-11-21

As an equilibrium refinement of the Nash equilibrium, evolutionarily stable strategy (ESS) is a key concept in evolutionary game theory and has attracted growing interest. An ESS can be either a pure strategy or a mixed strategy. Even though the randomness is allowed in mixed strategy, the selection probability of pure strategy in a mixed strategy may fluctuate due to the impact of many factors. The fluctuation can lead to more uncertainty. In this paper, such uncertainty involved in mixed strategy has been further taken into consideration: a belief strategy is proposed in terms of Dempster-Shafer evidence theory. Furthermore, based on the proposed belief strategy, a belief-based ESS has been developed. The belief strategy and belief-based ESS can reduce to the mixed strategy and mixed ESS, which provide more realistic and powerful tools to describe interactions among agents. Copyright © 2014 Elsevier Ltd. All rights reserved.
Polymorphic human (CTAT)n microsatellite provides a conserved linkage marker for mouse mutants causing cleft palate, vestibular defects, obesity and ataxia

Energy Technology Data Exchange (ETDEWEB)

Griffith, A.J.; Burgess, D.L.; Kohrman, D. [Univ. of MIchigan, Ann Arbor, MI (United States)] [and others

1994-09-01

The Twirler mutation (Tw) causing cleft palate {plus_minus} cleft lip, vestibular defects and obesity is located within 0.5 cM of an ataxia locus (ax) on mouse chromosome 18. We identified a transgene-induced insertional mutation with vestibular and craniofacial defects that appears to be a new allele of Twirler. Mouse DNA flanking the transgene insertion site was isolated from a cosmid library. An evolutionarily conserved, zoo blot positive cosmid subclone was used to probe a human {lambda} genomic library. From the sequence of a highly homologous human {lambda} clone, we designed STS primers and screened a human P1 library. DNA from two positive P1 clones was hybridized with simple sequence probes, and a (CTAT){sub 12} repeat was detected. Analysis of 62 CEPH parents with primers flanking the repeat identified six alleles containing 9 to 14 copies of the repeat, at frequencies of 0.17, 0.17, 0.17, 0.27, 0.15 and 0.07, respectively. The observed heterozygosity was 49/62 with a calculated PIC value of 0.76. This polymorphic microsatellite marker, designated Umi3, was mapped to the predicted conserved human linkage group by analysis of somatic cell hybrid panels. The anticipated short distance between Umi3 and the disease genes will facilitate detection of linkage in small families. We would like to type appropriate human pedigrees with Umi3 in order to identify patients with inherited disorders homologous to the mouse mutations Twirler and ataxia.
Functional region prediction with a set of appropriate homologous sequences-an index for sequence selection by integrating structure and sequence information with spatial statistics

Science.gov (United States)

2012-01-01

Background The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. Results We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence

Modeling coding-sequence evolution within the context of residue solvent accessibility.

Science.gov (United States)

Scherrer, Michael P; Meyer, Austin G; Wilke, Claus O

2012-09-12

Protein structure mediates site-specific patterns of sequence divergence. In particular, residues in the core of a protein (solvent-inaccessible residues) tend to be more evolutionarily conserved than residues on the surface (solvent-accessible residues). Here, we present a model of sequence evolution that explicitly accounts for the relative solvent accessibility of each residue in a protein. Our model is a variant of the Goldman-Yang 1994 (GY94) model in which all model parameters can be functions of the relative solvent accessibility (RSA) of a residue. We apply this model to a data set comprised of nearly 600 yeast genes, and find that an evolutionary-rate ratio ω that varies linearly with RSA provides a better model fit than an RSA-independent ω or an ω that is estimated separately in individual RSA bins. We further show that the branch length t and the transition-transverion ratio κ also vary with RSA. The RSA-dependent GY94 model performs better than an RSA-dependent Muse-Gaut 1994 (MG94) model in which the synonymous and non-synonymous rates individually are linear functions of RSA. Finally, protein core size affects the slope of the linear relationship between ω and RSA, and gene expression level affects both the intercept and the slope. Structure-aware models of sequence evolution provide a significantly better fit than traditional models that neglect structure. The linear relationship between ω and RSA implies that genes are better characterized by their ω slope and intercept than by just their mean ω.
Modeling coding-sequence evolution within the context of residue solvent accessibility

Directory of Open Access Journals (Sweden)

Scherrer Michael P

2012-09-01

Full Text Available Abstract Background Protein structure mediates site-specific patterns of sequence divergence. In particular, residues in the core of a protein (solvent-inaccessible residues tend to be more evolutionarily conserved than residues on the surface (solvent-accessible residues. Results Here, we present a model of sequence evolution that explicitly accounts for the relative solvent accessibility of each residue in a protein. Our model is a variant of the Goldman-Yang 1994 (GY94 model in which all model parameters can be functions of the relative solvent accessibility (RSA of a residue. We apply this model to a data set comprised of nearly 600 yeast genes, and find that an evolutionary-rate ratio ω that varies linearly with RSA provides a better model fit than an RSA-independent ω or an ω that is estimated separately in individual RSA bins. We further show that the branch length t and the transition-transverion ratio κ also vary with RSA. The RSA-dependent GY94 model performs better than an RSA-dependent Muse-Gaut 1994 (MG94 model in which the synonymous and non-synonymous rates individually are linear functions of RSA. Finally, protein core size affects the slope of the linear relationship between ω and RSA, and gene expression level affects both the intercept and the slope. Conclusions Structure-aware models of sequence evolution provide a significantly better fit than traditional models that neglect structure. The linear relationship between ω and RSA implies that genes are better characterized by their ω slope and intercept than by just their mean ω.
An Evolutionarily Conserved Mechanism for Intrinsic and Transferable Polymyxin Resistance.

Science.gov (United States)

Xu, Yongchang; Wei, Wenhui; Lei, Sheng; Lin, Jingxia; Srinivas, Swaminath; Feng, Youjun

2018-04-10

Polymyxins, a family of cationic antimicrobial cyclic peptides, act as a last line of defense against severe infections by Gram-negative pathogens with carbapenem resistance. In addition to the intrinsic resistance to polymyxin E (colistin) conferred by Neisseria eptA , the plasmid-borne mobilized colistin resistance gene mcr-1 has been disseminated globally since the first discovery in Southern China, in late 2015. However, the molecular mechanisms for both intrinsic and transferable resistance to colistin remain largely unknown. Here, we aim to address this gap in the knowledge of these proteins. Structural and functional analyses of EptA and MCR-1 and -2 have defined a conserved 12-residue cavity that is required for the entry of the lipid substrate, phosphatidylethanolamine (PE). The in vitro and in vivo data together have allowed us to visualize the similarities in catalytic activity shared by EptA and MCR-1 and -2. The expression of either EptA or MCR-1 or -2 is shown to remodel the surface of enteric bacteria (e.g., Escherichia coli , Salmonella enterica , Klebsiella pneumoniae , etc.), rendering them resistant to colistin. The parallels in the PE substrate-binding cavities among EptA, MCR-1, and MCR-2 provide a comprehensive understanding of both intrinsic and transferable colistin resistance. Domain swapping between EptA and MCR-1 and -2 reveals that the two domains (transmembrane [TM] region and p hospho e thanol a mine [PEA] transferase) are not functionally exchangeable. Taken together, the results represent a common mechanism for intrinsic and transferable PEA resistance to polymyxin, a last-resort antibiotic against multidrug-resistant pathogens. IMPORTANCE EptA and MCR-1 and -2 remodel the outer membrane, rendering bacteria resistant to colistin, a final resort against carbapenem-resistant pathogens. Structural and functional analyses of EptA and MCR-1 and -2 reveal parallel PE lipid substrate-recognizing cavities, which explains intrinsic and
Packaging signals in two single-stranded RNA viruses imply a conserved assembly mechanism and geometry of the packaged genome.

Science.gov (United States)

Dykeman, Eric C; Stockley, Peter G; Twarock, Reidun

2013-09-09

The current paradigm for assembly of single-stranded RNA viruses is based on a mechanism involving non-sequence-specific packaging of genomic RNA driven by electrostatic interactions. Recent experiments, however, provide compelling evidence for sequence specificity in this process both in vitro and in vivo. The existence of multiple RNA packaging signals (PSs) within viral genomes has been proposed, which facilitates assembly by binding coat proteins in such a way that they promote the protein-protein contacts needed to build the capsid. The binding energy from these interactions enables the confinement or compaction of the genomic RNAs. Identifying the nature of such PSs is crucial for a full understanding of assembly, which is an as yet untapped potential drug target for this important class of pathogens. Here, for two related bacterial viruses, we determine the sequences and locations of their PSs using Hamiltonian paths, a concept from graph theory, in combination with bioinformatics and structural studies. Their PSs have a common secondary structure motif but distinct consensus sequences and positions within the respective genomes. Despite these differences, the distributions of PSs in both viruses imply defined conformations for the packaged RNA genomes in contact with the protein shell in the capsid, consistent with a recent asymmetric structure determination of the MS2 virion. The PS distributions identified moreover imply a preferred, evolutionarily conserved assembly pathway with respect to the RNA sequence with potentially profound implications for other single-stranded RNA viruses known to have RNA PSs, including many animal and human pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
Seasonal Sex Ratio Trend in the European Kestrel : An Evolutionarily Stable Strategy Analysis

NARCIS (Netherlands)

Pen, I.R.; Weissing, F.J.; Daan, S.

We present an evolutionarily stable strategy (ESS) model to analyze selection on seasonal variation in the brood sex ratio, as observed in several species of raptorial birds. The model is specifically tailored to the life history of the European kestrel, and it reflects the maturation time
Intermediary metabolism in protists: a sequence-based view of facultative anaerobic metabolism in evolutionarily diverse eukaryotes.

Science.gov (United States)

Ginger, Michael L; Fritz-Laylin, Lillian K; Fulton, Chandler; Cande, W Zacheus; Dawson, Scott C

2010-12-01

Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2-3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H(2) in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. Copyright © 2010 Elsevier GmbH. All rights reserved.
Evolutionary conservation of nuclear and nucleolar targeting sequences in yeast ribosomal protein S6A

International Nuclear Information System (INIS)

Lipsius, Edgar; Walter, Korden; Leicher, Torsten; Phlippen, Wolfgang; Bisotti, Marc-Angelo; Kruppa, Joachim

2005-01-01

Over 1 billion years ago, the animal kingdom diverged from the fungi. Nevertheless, a high sequence homology of 62% exists between human ribosomal protein S6 and S6A of Saccharomyces cerevisiae. To investigate whether this similarity in primary structure is mirrored in corresponding functional protein domains, the nuclear and nucleolar targeting signals were delineated in yeast S6A and compared to the known human S6 signals. The complete sequence of S6A and cDNA fragments was fused to the 5'-end of the LacZ gene, the constructs were transiently expressed in COS cells, and the subcellular localization of the fusion proteins was detected by indirect immunofluorescence. One bipartite and two monopartite nuclear localization signals as well as two nucleolar binding domains were identified in yeast S6A, which are located at homologous regions in human S6 protein. Remarkably, the number, nature, and position of these targeting signals have been conserved, albeit their amino acid sequences have presumably undergone a process of co-evolution with their corresponding rRNAs
Optimizing multiple sequence alignments using a genetic algorithm based on three objectives: structural information, non-gaps percentage and totally conserved columns.

Science.gov (United States)

Ortuño, Francisco M; Valenzuela, Olga; Rojas, Fernando; Pomares, Hector; Florido, Javier P; Urquiza, Jose M; Rojas, Ignacio

2013-09-01

Multiple sequence alignments (MSAs) are widely used approaches in bioinformatics to carry out other tasks such as structure predictions, biological function analyses or phylogenetic modeling. However, current tools usually provide partially optimal alignments, as each one is focused on specific biological features. Thus, the same set of sequences can produce different alignments, above all when sequences are less similar. Consequently, researchers and biologists do not agree about which is the most suitable way to evaluate MSAs. Recent evaluations tend to use more complex scores including further biological features. Among them, 3D structures are increasingly being used to evaluate alignments. Because structures are more conserved in proteins than sequences, scores with structural information are better suited to evaluate more distant relationships between sequences. The proposed multiobjective algorithm, based on the non-dominated sorting genetic algorithm, aims to jointly optimize three objectives: STRIKE score, non-gaps percentage and totally conserved columns. It was significantly assessed on the BAliBASE benchmark according to the Kruskal-Wallis test (P algorithm also outperforms other aligners, such as ClustalW, Multiple Sequence Alignment Genetic Algorithm (MSA-GA), PRRP, DIALIGN, Hidden Markov Model Training (HMMT), Pattern-Induced Multi-sequence Alignment (PIMA), MULTIALIGN, Sequence Alignment Genetic Algorithm (SAGA), PILEUP, Rubber Band Technique Genetic Algorithm (RBT-GA) and Vertical Decomposition Genetic Algorithm (VDGA), according to the Wilcoxon signed-rank test (P 0.05) with the advantage of being able to use less structures. Structural information is included within the objective function to evaluate more accurately the obtained alignments. The source code is available at http://www.ugr.es/~fortuno/MOSAStrE/MO-SAStrE.zip.
In Vivo Characterization of a Vertebrate Ultra-conserved Enhancer

Energy Technology Data Exchange (ETDEWEB)

Poulin, Francis; Nobrega, Marcelo A.; Plajzer-Frick, Ingrid; Holt, Amy; Afzal, Veena; Rubin, Edward M.; Pennacchio, Len

2004-10-01

Genomic sequence comparisons between human, mouse and pufferfish (Takifugu rubripes (Fugu))have revealed a set of extremely conserved noncoding sequences. While this high degree of sequence conservation suggests severe evolutionary constraint and predicts a lack of tolerance to change in order to retain in vivo functionality, such elements have been minimally explored experimentally. In this study, we describe the in-depth characterization of an ancient conserved enhancer, Dc2 located near the dachshund gene, which displays a human-Fugu identity of 84 percent over 424 basepairs (bp). In addition to this large overall conservation, we find that Dc2 is characterized by the presence of a large block of sequence (144 bp) that is completely identical between human, mouse, chicken, zebrafish and Fugu. Through the testing of reporter vector constructs in transgenic mice, we observed that the 424 bp Dc2 conserved element is necessary and sufficient for brain tissue enhancer activity. In vivo analyses also revealed that the 144 bp 100 percent conserved sequence is necessary, but not sufficient, to replicate Dc2 enhancer function. However, the introduction of two separate 16 bp insertions into the highly conserved enhancer core did not cause any detectable modification of its in vivo activity. Our observations indicate that the 144 bp 100 percent conserved element is tolerant of change at least at the resolution of this transgenic mouse assay and suggest that purifying selection on Dc2 sequence might not be as strong as we predicted or that some unknown property also constrains this highly conserved enhancer sequence.
Base Composition Characteristics of Mammalian miRNAs

Directory of Open Access Journals (Sweden)

Bin Wang

2013-01-01

Full Text Available MicroRNAs (miRNAs are short RNA sequences that repress protein synthesis by either inhibiting the translation of messenger RNA (mRNA or increasing mRNA degradation. Endogenous miRNAs have been found in various organisms, including animals, plants, and viruses. Mammalian miRNAs are evolutionarily conserved, are scattered throughout chromosomes, and play an important role in the immune response and the onset of cancer. For this study, the author explored the base composition characteristics of miRNA genes from the six mammalian species that contain the largest number of known miRNAs. It was found that mammalian miRNAs are evolutionarily conserved and GU-rich. Interestingly, in the miRNA sequences investigated, A residues are clearly the most frequent occupants of positions 2 and 3 of the 5′ end of miRNAs. Unlike G and U residues that may pair with C/U and A/G, respectively, A residues can only pair with U residues of target mRNAs, which may augment the recognition specificity of the 5′ seed region.
Sequence and conformational preferences at termini of α-helices in membrane proteins: role of the helix environment.

Science.gov (United States)

Shelar, Ashish; Bansal, Manju

2014-12-01

α-Helices are amongst the most common secondary structural elements seen in membrane proteins and are packed in the form of helix bundles. These α-helices encounter varying external environments (hydrophobic, hydrophilic) that may influence the sequence preferences at their N and C-termini. The role of the external environment in stabilization of the helix termini in membrane proteins is still unknown. Here we analyze α-helices in a high-resolution dataset of integral α-helical membrane proteins and establish that their sequence and conformational preferences differ from those in globular proteins. We specifically examine these preferences at the N and C-termini in helices initiating/terminating inside the membrane core as well as in linkers connecting these transmembrane helices. We find that the sequence preferences and structural motifs at capping (Ncap and Ccap) and near-helical (N' and C') positions are influenced by a combination of features including the membrane environment and the innate helix initiation and termination property of residues forming structural motifs. We also find that a large number of helix termini which do not form any particular capping motif are stabilized by formation of hydrogen bonds and hydrophobic interactions contributed from the neighboring helices in the membrane protein. We further validate the sequence preferences obtained from our analysis with data from an ultradeep sequencing study that identifies evolutionarily conserved amino acids in the rat neurotensin receptor. The results from our analysis provide insights for the secondary structure prediction, modeling and design of membrane proteins. © 2014 Wiley Periodicals, Inc.
NFAT5 regulates HIV-1 in primary monocytes via a highly conserved long terminal repeat site.

Directory of Open Access Journals (Sweden)

Shahin Ranjbar

2006-12-01

Full Text Available To replicate, HIV-1 capitalizes on endogenous cellular activation pathways resulting in recruitment of key host transcription factors to its viral enhancer. RNA interference has been a powerful tool for blocking key checkpoints in HIV-1 entry into cells. Here we apply RNA interference to HIV-1 transcription in primary macrophages, a major reservoir of the virus, and specifically target the transcription factor NFAT5 (nuclear factor of activated T cells 5, which is the most evolutionarily divergent NFAT protein. By molecularly cloning and sequencing isolates from multiple viral subtypes, and performing DNase I footprinting, electrophoretic mobility shift, and promoter mutagenesis transfection assays, we demonstrate that NFAT5 functionally interacts with a specific enhancer binding site conserved in HIV-1, HIV-2, and multiple simian immunodeficiency viruses. Using small interfering RNA to ablate expression of endogenous NFAT5 protein, we show that the replication of three major HIV-1 viral subtypes (B, C, and E is dependent upon NFAT5 in human primary differentiated macrophages. Our results define a novel host factor-viral enhancer interaction that reveals a new regulatory role for NFAT5 and defines a functional DNA motif conserved across HIV-1 subtypes and representative simian immunodeficiency viruses. Inhibition of the NFAT5-LTR interaction may thus present a novel therapeutic target to suppress HIV-1 replication and progression of AIDS.
Transcriptional activation signals found in the Epstein-Barr virus (EBV) latency C promoter are conserved in the latency C promoter sequences from baboon and Rhesus monkey EBV-like lymphocryptoviruses (cercopithicine herpesviruses 12 and 15).

Science.gov (United States)

Fuentes-Pananá, E M; Swaminathan, S; Ling, P D

1999-01-01

The Epstein-Barr virus (EBV) EBNA2 protein is a transcriptional activator that controls viral latent gene expression and is essential for EBV-driven B-cell immortalization. EBNA2 is expressed from the viral C promoter (Cp) and regulates its own expression by activating Cp through interaction with the cellular DNA binding protein CBF1. Through regulation of Cp and EBNA2 expression, EBV controls the pattern of latent protein expression and the type of latency established. To gain further insight into the important regulatory elements that modulate Cp usage, we isolated and sequenced the Cp regions corresponding to nucleotides 10251 to 11479 of the EBV genome (-1079 to +144 relative to the transcription initiation site) from the EBV-like lymphocryptoviruses found in baboons (herpesvirus papio; HVP) and Rhesus macaques (RhEBV). Sequence comparison of the approximately 1,230-bp Cp regions from these primate viruses revealed that EBV and HVP Cp sequences are 64% conserved, EBV and RhEBV Cp sequences are 66% conserved, and HVP and RhEBV Cp sequences are 65% conserved relative to each other. Approximately 50% of the residues are conserved among all three sequences, yet all three viruses have retained response elements for glucocorticoids, two positionally conserved CCAAT boxes, and positionally conserved TATA boxes. The putative EBNA2 100-bp enhancers within these promoters contain 54 conserved residues, and the binding sites for CBF1 and CBF2 are well conserved. Cp usage in the HVP- and RhEBV-transformed cell lines was detected by S1 nuclease protection analysis. Transient-transfection analysis showed that promoters of both HVP and RhEBV are responsive to EBNA2 and that they bind CBF1 and CBF2 in gel mobility shift assays. These results suggest that similar mechanisms for regulation of latent gene expression are conserved among the EBV-related lymphocryptoviruses found in nonhuman primates.
SNPs in Multi-Species Conserved Sequences (MCS as useful markers in association studies: a practical approach

Directory of Open Access Journals (Sweden)

Pericak-Vance Margaret A

2007-08-01

Full Text Available Abstract Background Although genes play a key role in many complex diseases, the specific genes involved in most complex diseases remain largely unidentified. Their discovery will hinge on the identification of key sequence variants that are conclusively associated with disease. While much attention has been focused on variants in protein-coding DNA, variants in noncoding regions may also play many important roles in complex disease by altering gene regulation. Since the vast majority of noncoding genomic sequence is of unknown function, this increases the challenge of identifying "functional" variants that cause disease. However, evolutionary conservation can be used as a guide to indicate regions of noncoding or coding DNA that are likely to have biological function, and thus may be more likely to harbor SNP variants with functional consequences. To help bias marker selection in favor of such variants, we devised a process that prioritizes annotated SNPs for genotyping studies based on their location within Multi-species Conserved Sequences (MCSs and used this process to select SNPs in a region of linkage to a complex disease. This allowed us to evaluate the utility of the chosen SNPs for further association studies. Previously, a region of chromosome 1q43 was linked to Multiple Sclerosis (MS in a genome-wide screen. We chose annotated SNPs in the region based on location within MCSs (termed MCS-SNPs. We then obtained genotypes for 478 MCS-SNPs in 989 individuals from MS families. Results Analysis of our MCS-SNP genotypes from the 1q43 region and comparison to HapMap data confirmed that annotated SNPs in MCS regions are frequently polymorphic and show subtle signatures of selective pressure, consistent with previous reports of genome-wide variation in conserved regions. We also present an online tool that allows MCS data to be directly exported to the UCSC genome browser so that MCS-SNPs can be easily identified within genomic regions of
Conservation Below The Species Level: Suitable Evolutionarily Significant Units Among Mountain Vipers (The Montivipera Raddei Complex) in Iran.

Science.gov (United States)

Behrooz, Roozbeh; Kaboli, Mohammad; Arnal, Véronique; Nazarizadeh, Masoud; Asadi, Atefeh; Salmanian, Amin; Ahmadi, Mohsen; Montgelard, Claudine

2018-02-01

Northern and western mountains of Iran are among the most important biodiversity and endemism hot spots for reptiles in the Middle East. Among herpetofauna, the montivipers represent an emblematic and fragmented endemic group for which estimating their level of genetic differentiation and defining conservation priorities is urgently needed. Here, we present the most comprehensive phylogenetic study on the Montivipera raddei species group comprising all five known taxa, among which three are endemic to Iran. Based on two mitochondrial genes, phylogenetic and phylogeographic analyses revealed three major lineages each presenting very contrasting distribution area. The Iranian montivipers are highly structured in clades showing low genetic diversity and corresponding to high altitude summits. Molecular dating revealed the role of Quaternary paleo-climatic oscillations and altitudinal movements of montivipers in shaping genetic diversity and differentiation of these sky-island taxa. In addition, the best scenario of historical biogeography allowed identifying three possible refugial areas in Iran most likely arising by vicariance. Based on our mitochondrial results and pending additional data, we recognize three candidate species among the Montivipera raddei complex: M. raddei, M.latifii and M. kuhrangica that are coherent with their geographical distribution. We propose that the most appropriate Evolutionary Significant Units for conservation of the montivipers are represented by thirteen units among which six are recognized as high priority. Finally, we suggest some recommendations to the IUCN as well as to the Iranian conservation policies with respect to conservation prioritization. © The American Genetic Association 2018. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Mouse Nkrp1-Clr gene cluster sequence and expression analyses reveal conservation of tissue-specific MHC-independent immunosurveillance.

Directory of Open Access Journals (Sweden)

Qiang Zhang

Full Text Available The Nkrp1 (Klrb1-Clr (Clec2 genes encode a receptor-ligand system utilized by NK cells as an MHC-independent immunosurveillance strategy for innate immune responses. The related Ly49 family of MHC-I receptors displays extreme allelic polymorphism and haplotype plasticity. In contrast, previous BAC-mapping and aCGH studies in the mouse suggest the neighboring and related Nkrp1-Clr cluster is evolutionarily stable. To definitively compare the relative evolutionary rate of Nkrp1-Clr vs. Ly49 gene clusters, the Nkrp1-Clr gene clusters from two Ly49 haplotype-disparate inbred mouse strains, BALB/c and 129S6, were sequenced. Both Nkrp1-Clr gene cluster sequences are highly similar to the C57BL/6 reference sequence, displaying the same gene numbers and order, complete pseudogenes, and gene fragments. The Nkrp1-Clr clusters contain a strikingly dissimilar proportion of repetitive elements compared to the Ly49 clusters, suggesting that certain elements may be partly responsible for the highly disparate Ly49 vs. Nkrp1 evolutionary rate. Focused allelic polymorphisms were found within the Nkrp1b/d (Klrb1b, Nkrp1c (Klrb1c, and Clr-c (Clec2f genes, suggestive of possible immune selection. Cell-type specific transcription of Nkrp1-Clr genes in a large panel of tissues/organs was determined. Clr-b (Clec2d and Clr-g (Clec2i showed wide expression, while other Clr genes showed more tissue-specific expression patterns. In situ hybridization revealed specific expression of various members of the Clr family in leukocytes/hematopoietic cells of immune organs, various tissue-restricted epithelial cells (including intestinal, kidney tubular, lung, and corneal progenitor epithelial cells, as well as myocytes. In summary, the Nkrp1-Clr gene cluster appears to evolve more slowly relative to the related Ly49 cluster, and likely regulates innate immunosurveillance in a tissue-specific manner.
Conserved PCR primer set designing for closely-related species to complete mitochondrial genome sequencing using a sliding window-based PSO algorithm.

Directory of Open Access Journals (Sweden)

Cheng-Hong Yang

Full Text Available BACKGROUND: Complete mitochondrial (mt genome sequencing is becoming increasingly common for phylogenetic reconstruction and as a model for genome evolution. For long template sequencing, i.e., like the entire mtDNA, it is essential to design primers for Polymerase Chain Reaction (PCR amplicons which are partly overlapping each other. The presented chromosome walking strategy provides the overlapping design to solve the problem for unreliable sequencing data at the 5' end and provides the effective sequencing. However, current algorithms and tools are mostly focused on the primer design for a local region in the genomic sequence. Accordingly, it is still challenging to provide the primer sets for the entire mtDNA. METHODOLOGY/PRINCIPAL FINDINGS: The purpose of this study is to develop an integrated primer design algorithm for entire mt genome in general, and for the common primer sets for closely-related species in particular. We introduce ClustalW to generate the multiple sequence alignment needed to find the conserved sequences in closely-related species. These conserved sequences are suitable for designing the common primers for the entire mtDNA. Using a heuristic algorithm particle swarm optimization (PSO, all the designed primers were computationally validated to fit the common primer design constraints, such as the melting temperature, primer length and GC content, PCR product length, secondary structure, specificity, and terminal limitation. The overlap requirement for PCR amplicons in the entire mtDNA is satisfied by defining the overlapping region with the sliding window technology. Finally, primer sets were designed within the overlapping region. The primer sets for the entire mtDNA sequences were successfully demonstrated in the example of two closely-related fish species. The pseudo code for the primer design algorithm is provided. CONCLUSIONS/SIGNIFICANCE: In conclusion, it can be said that our proposed sliding window-based PSO
Identifying all moiety conservation laws in genome-scale metabolic networks.

Science.gov (United States)

De Martino, Andrea; De Martino, Daniele; Mulet, Roberto; Pagnani, Andrea

2014-01-01

The stoichiometry of a metabolic network gives rise to a set of conservation laws for the aggregate level of specific pools of metabolites, which, on one hand, pose dynamical constraints that cross-link the variations of metabolite concentrations and, on the other, provide key insight into a cell's metabolic production capabilities. When the conserved quantity identifies with a chemical moiety, extracting all such conservation laws from the stoichiometry amounts to finding all non-negative integer solutions of a linear system, a programming problem known to be NP-hard. We present an efficient strategy to compute the complete set of integer conservation laws of a genome-scale stoichiometric matrix, also providing a certificate for correctness and maximality of the solution. Our method is deployed for the analysis of moiety conservation relationships in two large-scale reconstructions of the metabolism of the bacterium E. coli, in six tissue-specific human metabolic networks, and, finally, in the human reactome as a whole, revealing that bacterial metabolism could be evolutionarily designed to cover broader production spectra than human metabolism. Convergence to the full set of moiety conservation laws in each case is achieved in extremely reduced computing times. In addition, we uncover a scaling relation that links the size of the independent pool basis to the number of metabolites, for which we present an analytical explanation.
Identifying all moiety conservation laws in genome-scale metabolic networks.

Directory of Open Access Journals (Sweden)

Andrea De Martino

Full Text Available The stoichiometry of a metabolic network gives rise to a set of conservation laws for the aggregate level of specific pools of metabolites, which, on one hand, pose dynamical constraints that cross-link the variations of metabolite concentrations and, on the other, provide key insight into a cell's metabolic production capabilities. When the conserved quantity identifies with a chemical moiety, extracting all such conservation laws from the stoichiometry amounts to finding all non-negative integer solutions of a linear system, a programming problem known to be NP-hard. We present an efficient strategy to compute the complete set of integer conservation laws of a genome-scale stoichiometric matrix, also providing a certificate for correctness and maximality of the solution. Our method is deployed for the analysis of moiety conservation relationships in two large-scale reconstructions of the metabolism of the bacterium E. coli, in six tissue-specific human metabolic networks, and, finally, in the human reactome as a whole, revealing that bacterial metabolism could be evolutionarily designed to cover broader production spectra than human metabolism. Convergence to the full set of moiety conservation laws in each case is achieved in extremely reduced computing times. In addition, we uncover a scaling relation that links the size of the independent pool basis to the number of metabolites, for which we present an analytical explanation.
Conserved regulatory modules in the Sox9 testis-specific enhancer predict roles for SOX, TCF/LEF, Forkhead, DMRT, and GATA proteins in vertebrate sex determination.

Science.gov (United States)

Bagheri-Fam, Stefan; Sinclair, Andrew H; Koopman, Peter; Harley, Vincent R

2010-03-01

While the primary sex determining switch varies between vertebrate species, a key downstream event in testicular development, namely the male-specific up-regulation of Sox9, is conserved. To date, only two sex determining switch genes have been identified, Sry in mammals and the Dmrt1-related gene Dmy (Dmrt1bY) in the medaka fish Oryzias latipes. In mice, Sox9 expression is evidently up-regulated by SRY and maintained by SOX9 both of which directly activate the core 1.3 kb testis-specific enhancer of Sox9 (TESCO). How Sox9 expression is up-regulated and maintained in species without Sry (i.e. non-mammalian species) is not understood. In this study, we have undertaken an in-depth comparative genomics approach and show that TESCO contains an evolutionarily conserved region (ECR) of 180 bp which is present in marsupials, monotremes, birds, reptiles and amphibians. The ECR contains highly conserved modules that predict regulatory roles for SOX, TCF/LEF, Forkhead, DMRT, and GATA proteins in vertebrate sex determination/differentiation. Our data suggest that tetrapods share common aspects of Sox9 regulation in the testis, despite having different sex determining switch mechanisms. They also suggest that Sox9 autoregulation is an ancient mechanism shared by all tetrapods, raising the possibility that in mammals, SRY evolved by mimicking this regulation. The validation of ECR regulatory sequences conserved from human to frogs will provide new insights into vertebrate sex determination. Copyright 2009 Elsevier Ltd. All rights reserved.

The complete nucleotide sequence, genome organization, and origin of human adenovirus type 11

International Nuclear Information System (INIS)

Stone, Daniel; Furthmann, Anne; Sandig, Volker; Lieber, Andre

2003-01-01

The complete DNA sequence and transcription map of human adenovirus type 11 are reported here. This is the first published sequence for a subgenera B human adenovirus and demonstrates a genome organization highly similar to those of other human adenoviruses. All of the genes from the early, intermediate, and late regions are present in the expected locations of the genome for a human adenovirus. The genome size is 34,794 bp in length and has a GC content of 48.9%. Sequence alignment with genomes of groups A (Ad12), C (Ad5), D (Ad17), E (Simian adenovirus 25), and F (Ad40) revealed homologies of 64, 54, 68, 75, and 52%, respectively. Detailed genomic analysis demonstrated that Ads 11 and 35 are highly conserved in all areas except the hexon hypervariable regions and fiber. Similarly, comparison of Ad11 with subgroup E SAV25 revealed poor homology between fibers but high homology in proteins encoded by all other areas of the genome. We propose an evolutionary model in which functional viruses can be reconstituted following fiber substitution from one serotype to another. According to this model either the Ad11 genome is a derivative of Ad35, from which the fiber was substituted with Ad7, or the Ad35 genome is the product of a fiber substitution from Ad21 into the Ad11 genome. This model also provides a possible explanation for the origin of group E Ads, which are evolutionarily derived from a group C fiber substitution into a group B genome
Determination of 5 '-leader sequences from radically disparate strains of porcine reproductive and respiratory syndrome virus reveals the presence of highly conserved sequence motifs

DEFF Research Database (Denmark)

Oleksiewicz, M.B.; Bøtner, Anette; Nielsen, Jens

1999-01-01

We determined the untranslated 5'-leader sequence for three different isolates of porcine reproductive and respiratory syndrome virus (PRRSV): pathogenic European- and American-types, as well as an American-type vaccine strain. 5'-leader from European- and American-type PRRSV differed in length...... (220 and 190 nt, respectively), and exhibited only approximately 50% nucleotide homology. Nevertheless, highly conserved areas were identified in the leader of all 3 PRRSV isolates, which constitute candidate motifs for binding of protein(s) involved in viral replication. These comparative data provide...
The interplay of sequence conservation and T cell immune recognition

DEFF Research Database (Denmark)

Bresciani, Anne Gøther; Sette, Alessandro; Greenbaum, Jason

2014-01-01

examined the hypothesis that conservation of a peptide in bacteria that are part of the healthy human microbiome leads to a reduced level of immunogenicity due to tolerization of T cells to the commensal bacteria. This was done by comparing experimentally characterized T cell epitope recognition data from...... the Immune Epitope Database with their conservation in the human microbiome. Indeed, we did see a lower immunogenicity for conserved peptides conserved. While many aspects how this conservation comparison is done require further optimization, this is a first step towards a better understanding T cell...... recognition of peptides in bacterial pathogens is influenced by their conservation in commensal bacteria. If the further work proves that this approach is successful, the degree of overlap of a peptide with the human proteome or microbiome could be added to the arsenal of tools available to assess peptide...
Domain architecture conservation in orthologs

Science.gov (United States)

2011-01-01

Background As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. However, ortholog identification methods do not take into account changes in domain architecture, which are likely to modify a protein's function. By domain architecture we refer to the sequential arrangement of domains along a protein sequence. To assess the level of domain architecture conservation among orthologs, we carried out a large-scale study of such events between human and 40 other species spanning the entire evolutionary range. We designed a score to measure domain architecture similarity and used it to analyze differences in domain architecture conservation between orthologs and paralogs relative to the conservation of primary sequence. We also statistically characterized the extents of different types of domain swapping events across pairs of orthologs and paralogs. Results The analysis shows that orthologs exhibit greater domain architecture conservation than paralogous homologs, even when differences in average sequence divergence are compensated for, for homologs that have diverged beyond a certain threshold. We interpret this as an indication of a stronger selective pressure on orthologs than paralogs to retain the domain architecture required for the proteins to perform a specific function. In general, orthologs as well as the closest paralogous homologs have very similar domain architectures, even at large evolutionary separation. The most common domain architecture changes observed in both ortholog and paralog pairs involved insertion/deletion of new domains, while domain shuffling and segment duplication/deletion were very infrequent. Conclusions On the whole, our results support the hypothesis that function conservation between orthologs demands higher domain architecture conservation than other types of homologs, relative to primary sequence conservation. This supports the
Position-specific prediction of methylation sites from sequence conservation based on information theory.

Science.gov (United States)

Shi, Yinan; Guo, Yanzhi; Hu, Yayun; Li, Menglong

2015-07-23

Protein methylation plays vital roles in many biological processes and has been implicated in various human diseases. To fully understand the mechanisms underlying methylation for use in drug design and work in methylation-related diseases, an initial but crucial step is to identify methylation sites. The use of high-throughput bioinformatics methods has become imperative to predict methylation sites. In this study, we developed a novel method that is based only on sequence conservation to predict protein methylation sites. Conservation difference profiles between methylated and non-methylated peptides were constructed by the information entropy (IE) in a wider neighbor interval around the methylation sites that fully incorporated all of the environmental information. Then, the distinctive neighbor residues were identified by the importance scores of information gain (IG). The most representative model was constructed by support vector machine (SVM) for Arginine and Lysine methylation, respectively. This model yielded a promising result on both the benchmark dataset and independent test set. The model was used to screen the entire human proteome, and many unknown substrates were identified. These results indicate that our method can serve as a useful supplement to elucidate the mechanism of protein methylation and facilitate hypothesis-driven experimental design and validation.
Impaired mitotic progression and preimplantation lethality in mice lacking OMCG1, a new evolutionarily conserved nuclear protein

DEFF Research Database (Denmark)

Artus, Jérôme; Vandormael-Pournin, Sandrine; Frödin, Morten

2005-01-01

While highly conserved through evolution, the cell cycle has been extensively modified to adapt to new developmental programs. Recently, analyses of mouse mutants revealed that several important cell cycle regulators are either dispensable for development or have a tissue- or cell-type-specific f...
The influence of DNA sequence on epigenome-induced pathologies

Directory of Open Access Journals (Sweden)

Meagher Richard B

2012-07-01

Full Text Available Abstract Clear cause-and-effect relationships are commonly established between genotype and the inherited risk of acquiring human and plant diseases and aberrant phenotypes. By contrast, few such cause-and-effect relationships are established linking a chromatin structure (that is, the epitype with the transgenerational risk of acquiring a disease or abnormal phenotype. It is not entirely clear how epitypes are inherited from parent to offspring as populations evolve, even though epigenetics is proposed to be fundamental to evolution and the likelihood of acquiring many diseases. This article explores the hypothesis that, for transgenerationally inherited chromatin structures, “genotype predisposes epitype”, and that epitype functions as a modifier of gene expression within the classical central dogma of molecular biology. Evidence for the causal contribution of genotype to inherited epitypes and epigenetic risk comes primarily from two different kinds of studies discussed herein. The first and direct method of research proceeds by the examination of the transgenerational inheritance of epitype and the penetrance of phenotype among genetically related individuals. The second approach identifies epitypes that are duplicated (as DNA sequences are duplicated and evolutionarily conserved among repeated patterns in the DNA sequence. The body of this article summarizes particularly robust examples of these studies from humans, mice, Arabidopsis, and other organisms. The bulk of the data from both areas of research support the hypothesis that genotypes predispose the likelihood of displaying various epitypes, but for only a few classes of epitype. This analysis suggests that renewed efforts are needed in identifying polymorphic DNA sequences that determine variable nucleosome positioning and DNA methylation as the primary cause of inherited epigenome-induced pathologies. By contrast, there is very little evidence that DNA sequence directly
CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation.

Science.gov (United States)

Nikulova, Anna A; Favorov, Alexander V; Sutormin, Roman A; Makeev, Vsevolod J; Mironov, Andrey A

2012-07-01

Identification of transcriptional regulatory regions and tracing their internal organization are important for understanding the eukaryotic cell machinery. Cis-regulatory modules (CRMs) of higher eukaryotes are believed to possess a regulatory 'grammar', or preferred arrangement of binding sites, that is crucial for proper regulation and thus tends to be evolutionarily conserved. Here, we present a method CORECLUST (COnservative REgulatory CLUster STructure) that predicts CRMs based on a set of positional weight matrices. Given regulatory regions of orthologous and/or co-regulated genes, CORECLUST constructs a CRM model by revealing the conserved rules that describe the relative location of binding sites. The constructed model may be consequently used for the genome-wide prediction of similar CRMs, and thus detection of co-regulated genes, and for the investigation of the regulatory grammar of the system. Compared with related methods, CORECLUST shows better performance at identification of CRMs conferring muscle-specific gene expression in vertebrates and early-developmental CRMs in Drosophila.
Positioning of centrioles is a conserved readout of Frizzled planar cell polarity signalling.

Science.gov (United States)

Carvajal-Gonzalez, Jose Maria; Roman, Angel-Carlos; Mlodzik, Marek

2016-03-29

Planar cell polarity (PCP) signalling is a well-conserved developmental pathway regulating cellular orientation during development. An evolutionarily conserved pathway readout is not established and, moreover, it is thought that PCP mediated cellular responses are tissue-specific. A key PCP function in vertebrates is to regulate coordinated centriole/cilia positioning, a function that has not been associated with PCP in Drosophila. Here we report instructive input of Frizzled-PCP (Fz/PCP) signalling into polarized centriole positioning in Drosophila wings. We show that centrioles are polarized in pupal wing cells as a readout of PCP signalling, with both gain and loss-of-function Fz/PCP signalling affecting centriole polarization. Importantly, loss or gain of centrioles does not affect Fz/PCP establishment, implicating centriolar positioning as a conserved PCP-readout, likely downstream of PCP-regulated actin polymerization. Together with vertebrate data, these results suggest a unifying model of centriole/cilia positioning as a common downstream effect of PCP signalling from flies to mammals.
Human T-cell recognition of synthetic peptides representing conserved and variant sequences from the merozoite surface protein 2 of Plasmodium falciparum

DEFF Research Database (Denmark)

Theander, T G; Hviid, L; Dodoo, D

1997-01-01

Merozoite surface protein 2 (MSP2) is a malaria vaccine candidate currently undergoing clinical trials. We analyzed the peripheral blood mononuclear cell (PBMC) response to synthetic peptides corresponding to conserved and variant regions of the FCQ-27 allelic form of MSP2 in Ghanaian individuals....... The findings are encouraging for the development of a vaccine based on these T-epitope containing regions of MSP2, as the peptides were broadly recognized suggesting that they can bind to diverse HLA alleles and also because they include conserved MSP2 sequences. Immunisation with a vaccine construct...
Mapping the transcription start points of the Staphylococcus aureus eap, emp, and vwb promoters reveals a conserved octanucleotide sequence that is essential for expression of these genes.

Science.gov (United States)

Harraghy, Niamh; Homerova, Dagmar; Herrmann, Mathias; Kormanec, Jan

2008-01-01

Mapping the transcription start points of the eap, emp, and vwb promoters revealed a conserved octanucleotide sequence (COS). Deleting this sequence abolished the expression of eap, emp, and vwb. However, electrophoretic mobility shift assays gave no evidence that this sequence was a binding site for SarA or SaeR, known regulators of eap and emp.
The Poxvirus C7L Host Range Factor Superfamily

OpenAIRE

Liu, Jia; Rothenburg, Stefan; McFadden, Grant

2012-01-01

Host range factors, expressed by the poxvirus family, determine the host tropism of species, tissue, and cell specificity. C7L family members exist in the genomes of most sequenced mammalian poxviruses, suggesting an evolutionarily conserved effort adapting to the hosts. In general, C7L orthologs influence the host tropism in mammalian cell culture, and for some poxviruses it is essential for the complete viral life cycle in vitro and in vivo. The C7L family members lack obvious sequence homo...
Spatially conserved regulatory elements identified within human and mouse Cd247 gene using high-throughput sequencing data from the ENCODE project

DEFF Research Database (Denmark)

Pundhir, Sachin; Hannibal, Tine Dahlbæk; Bang-Berthelsen, Claus Heiner

2014-01-01

. In this study, we have utilized the wealth of high-throughput sequencing data produced during the Encyclopedia of DNA Elements (ENCODE) project to identify spatially conserved regulatory elements within the Cd247 gene from human and mouse. We show the presence of two transcription factor binding sites...
Fenced and Fragmented: Conservation Value of Managed Metapopulations

Science.gov (United States)

Miller, Susan M.; Harper, Cindy K.; Bloomer, Paulette; Hofmeyr, Jennifer; Funston, Paul J.

2015-01-01

Population fragmentation is threatening biodiversity worldwide. Species that once roamed vast areas are increasingly being conserved in small, isolated areas. Modern management approaches must adapt to ensure the continued survival and conservation value of these populations. In South Africa, a managed metapopulation approach has been adopted for several large carnivore species, all protected in isolated, relatively small, reserves that are fenced. As far as possible these approaches are based on natural metapopulation structures. In this network, over the past 25 years, African lions (Panthera leo) were reintroduced into 44 fenced reserves with little attention given to maintaining genetic diversity. To examine the situation, we investigated the current genetic provenance and diversity of these lions. We found that overall genetic diversity was similar to that in a large national park, and included a mixture of four different southern African evolutionarily significant units (ESUs). This mixing of ESUs, while not ideal, provides a unique opportunity to study the impact of mixing ESUs over the long term. We propose a strategic managed metapopulation plan to ensure the maintenance of genetic diversity and improve the long-term conservation value of these lions. This managed metapopulation approach could be applied to other species under similar ecological constraints around the globe. PMID:26699333
Sequence diversity and evolution of antimicrobial peptides in invertebrates.

Science.gov (United States)

Tassanakajon, Anchalee; Somboonwiwat, Kunlaya; Amparyup, Piti

2015-02-01

Antimicrobial peptides (AMPs) are evolutionarily ancient molecules that act as the key components in the invertebrate innate immunity against invading pathogens. Several AMPs have been identified and characterized in invertebrates, and found to display considerable diversity in their amino acid sequence, structure and biological activity. AMP genes appear to have rapidly evolved, which might have arisen from the co-evolutionary arms race between host and pathogens, and enabled organisms to survive in different microbial environments. Here, the sequence diversity of invertebrate AMPs (defensins, cecropins, crustins and anti-lipopolysaccharide factors) are presented to provide a better understanding of the evolution pattern of these peptides that play a major role in host defense mechanisms. Copyright © 2014 Elsevier Ltd. All rights reserved.
The evolutionarily conserved mediator subunit MDT-15/MED15 links protective innate immune responses and xenobiotic detoxification.

Directory of Open Access Journals (Sweden)

Read Pukkila-Worley

2014-05-01

Full Text Available Metazoans protect themselves from environmental toxins and virulent pathogens through detoxification and immune responses. We previously identified a small molecule xenobiotic toxin that extends survival of Caenorhabditis elegans infected with human bacterial pathogens by activating the conserved p38 MAP kinase PMK-1 host defense pathway. Here we investigate the cellular mechanisms that couple activation of a detoxification response to innate immunity. From an RNAi screen of 1,420 genes expressed in the C. elegans intestine, we identified the conserved Mediator subunit MDT-15/MED15 and 28 other gene inactivations that abrogate the induction of PMK-1-dependent immune effectors by this small molecule. We demonstrate that MDT-15/MED15 is required for the xenobiotic-induced expression of p38 MAP kinase PMK-1-dependent immune genes and protection from Pseudomonas aeruginosa infection. We also show that MDT-15 controls the induction of detoxification genes and functions to protect the host from bacteria-derived phenazine toxins. These data define a central role for MDT-15/MED15 in the coordination of xenobiotic detoxification and innate immune responses.
The evolutionarily conserved mediator subunit MDT-15/MED15 links protective innate immune responses and xenobiotic detoxification.

Science.gov (United States)

Pukkila-Worley, Read; Feinbaum, Rhonda L; McEwan, Deborah L; Conery, Annie L; Ausubel, Frederick M

2014-05-01

Metazoans protect themselves from environmental toxins and virulent pathogens through detoxification and immune responses. We previously identified a small molecule xenobiotic toxin that extends survival of Caenorhabditis elegans infected with human bacterial pathogens by activating the conserved p38 MAP kinase PMK-1 host defense pathway. Here we investigate the cellular mechanisms that couple activation of a detoxification response to innate immunity. From an RNAi screen of 1,420 genes expressed in the C. elegans intestine, we identified the conserved Mediator subunit MDT-15/MED15 and 28 other gene inactivations that abrogate the induction of PMK-1-dependent immune effectors by this small molecule. We demonstrate that MDT-15/MED15 is required for the xenobiotic-induced expression of p38 MAP kinase PMK-1-dependent immune genes and protection from Pseudomonas aeruginosa infection. We also show that MDT-15 controls the induction of detoxification genes and functions to protect the host from bacteria-derived phenazine toxins. These data define a central role for MDT-15/MED15 in the coordination of xenobiotic detoxification and innate immune responses.
Cytoplasmic protein binding to highly conserved sequences in the 3' untranslated region of mouse protamine 2 mRNA, a translationally regulated transcript of male germ cells

International Nuclear Information System (INIS)

Kwon, Y.K.; Hecht, N.B.

1991-01-01

The expression of the protamines, the predominant nuclear proteins of mammalian spermatozoa, is regulated translationally during male germ-cell development. The 3' untranslated region (UTR) of protamine 1 mRNA has been reported to control its time of translation. To understand the mechanisms controlling translation of the protamine mRNAs, we have sought to identify cis elements of the 3' UTR of protamine 2 mRNA that are recognized by cytoplasmic factors. From gel retardation assays, two sequence elements are shown to form specific RNA-protein complexes. Protein binding sites of the two complexes were determined by RNase T1 mapping, by blocking the putative binding sites with antisense oligonucleotides, and by competition assays. The sequences of these elements, located between nucleotides + 537 and + 572 in protamine 2 mRNA, are highly conserved among postmeiotic translationally regulated nuclear proteins of the mammalian testis. Two closely linked protein binding sites were detected. UV-crosslinking studies revealed that a protein of about 18 kDa binds to one of the conserved sequences. These data demonstrate specific protein binding to a highly conserved 3' UTR of translationally regulated testicular mRNA
BlockLogo: Visualization of peptide and sequence motif conservation

DEFF Research Database (Denmark)

Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian

2013-01-01

BlockLogo is a web-server application for the visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, se...
The drug target genes show higher evolutionary conservation than non-target genes.

Science.gov (United States)

Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie

2016-01-26

Although evidence indicates that drug target genes share some common evolutionary features, there have been few studies analyzing evolutionary features of drug targets from an overall level. Therefore, we conducted an analysis which aimed to investigate the evolutionary characteristics of drug target genes. We compared the evolutionary conservation between human drug target genes and non-target genes by combining both the evolutionary features and network topological properties in human protein-protein interaction network. The evolution rate, conservation score and the percentage of orthologous genes of 21 species were included in our study. Meanwhile, four topological features including the average shortest path length, betweenness centrality, clustering coefficient and degree were considered for comparison analysis. Then we got four results as following: compared with non-drug target genes, 1) drug target genes had lower evolutionary rates; 2) drug target genes had higher conservation scores; 3) drug target genes had higher percentages of orthologous genes and 4) drug target genes had a tighter network structure including higher degrees, betweenness centrality, clustering coefficients and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in evolutionary conservation of drug targets.

On Nash Equilibrium and Evolutionarily Stable States That Are Not Characterised by the Folk Theorem.

Directory of Open Access Journals (Sweden)

Jiawei Li

Full Text Available In evolutionary game theory, evolutionarily stable states are characterised by the folk theorem because exact solutions to the replicator equation are difficult to obtain. It is generally assumed that the folk theorem, which is the fundamental theory for non-cooperative games, defines all Nash equilibria in infinitely repeated games. Here, we prove that Nash equilibria that are not characterised by the folk theorem do exist. By adopting specific reactive strategies, a group of players can be better off by coordinating their actions in repeated games. We call it a type-k equilibrium when a group of k players coordinate their actions and they have no incentive to deviate from their strategies simultaneously. The existence and stability of the type-k equilibrium in general games is discussed. This study shows that the sets of Nash equilibria and evolutionarily stable states have greater cardinality than classic game theory has predicted in many repeated games.
On Nash Equilibrium and Evolutionarily Stable States That Are Not Characterised by the Folk Theorem

Science.gov (United States)

Li, Jiawei; Kendall, Graham

2015-01-01

In evolutionary game theory, evolutionarily stable states are characterised by the folk theorem because exact solutions to the replicator equation are difficult to obtain. It is generally assumed that the folk theorem, which is the fundamental theory for non-cooperative games, defines all Nash equilibria in infinitely repeated games. Here, we prove that Nash equilibria that are not characterised by the folk theorem do exist. By adopting specific reactive strategies, a group of players can be better off by coordinating their actions in repeated games. We call it a type-k equilibrium when a group of k players coordinate their actions and they have no incentive to deviate from their strategies simultaneously. The existence and stability of the type-k equilibrium in general games is discussed. This study shows that the sets of Nash equilibria and evolutionarily stable states have greater cardinality than classic game theory has predicted in many repeated games. PMID:26288088
The effects of sequence and type of chemotherapy and radiation therapy on cosmesis and complications after breast conservation therapy

International Nuclear Information System (INIS)

Markiewicz, Deborah A.; Schultz, Delray J.; Haas, Jonathan A.; Harris, Eleanor E. R.; Fox, Kevin R.; Glick, John H.; Solin, Lawrence J.

1996-01-01

Purpose: Chemotherapy plays an increasingly important role in the treatment of both node-negative and node-positive breast cancer patients, but the optimal sequencing of chemotherapy and radiation therapy is not well established. The purpose of this study is to evaluate the interaction of sequence and type of chemotherapy and hormonal therapy given with radiation therapy on the cosmetic outcome and the incidence of complications of Stage I and II breast cancer patients treated with breast-conserving therapy. Methods and Materials: The records of 1053 Stage I and II breast cancer patients treated with curative intent with breast-conserving surgery, axillary dissection, and radiation therapy between 1977-1991 were reviewed. Median follow-up after treatment was 6.7 years. Two hundred fourteen patients received chemotherapy alone, 141 patients received hormonal therapy alone, 86 patients received both, and 612 patients received no adjuvant therapy. Patients who received chemotherapy ± hormonal therapy were grouped according to sequence of chemotherapy: (a) concurrent = concurrent chemotherapy with radiation therapy followed by chemotherapy; (b) sequential = radiation followed by chemotherapy or chemotherapy followed by radiation; and (c) sandwich = chemotherapy followed by concurrent chemotherapy and radiation followed by chemotherapy. Compared to node negative patients, node-positive patients more commonly received chemotherapy (77 vs. 9%, p < 0.0001) and/or hormonal therapy (40 vs. 14%, p < 0.0001). Among patients who received chemotherapy, the majority (243 patients) received concurrent chemotherapy and radiation therapy with two cycles of cytoxan and 5-fluorouracil (5-FU) administered during radiation followed by six cycles of chemotherapy with cytoxan, 5-fluorouracil and either methotrexate(CMF) or doxorubicin(CAF). For analysis of cosmesis, patients included were relapse free with 3 years minimum follow-up. Results: The use of chemotherapy had an adverse effect
Sequence conservation between porcine and human LRRK2

DEFF Research Database (Denmark)

Larsen, Knud; Madsen, Lone Bruhn

2009-01-01

Leucine-rich repeat kinase 2 (LRRK2) is a member of the ROCO protein superfamily (Ras of complex proteins (Roc) with a C-terminal Roc domain). Mutations in the LRRK2 gene lead to autosomal dominant Parkinsonism. We have cloned the porcine LRRK2 cDNA in an attempt to characterize conserved...... and expression patterns are conserved across species. The porcine LRRK2 gene was mapped to chromosome 5q25. The results obtained suggest that the LRRK2 gene might be of particular interest in our attempt to generate a transgenic porcine model for Parkinson's disease...
MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing

DEFF Research Database (Denmark)

Lindgreen, Stinus; Gardner, Paul P; Krogh, Anders

2007-01-01

function that considers sequence conservation, covariation and basepairing probabilities. The results show that the method is very competitive to similar programs available today, both in terms of accuracy and computational efficiency. AVAILABILITY: Source code available from http://mastr.binf.ku.dk/......MOTIVATION: As more non-coding RNAs are discovered, the importance of methods for RNA analysis increases. Since the structure of ncRNA is intimately tied to the function of the molecule, programs for RNA structure prediction are necessary tools in this growing field of research. Furthermore......, it is known that RNA structure is often evolutionarily more conserved than sequence. However, few existing methods are capable of simultaneously considering multiple sequence alignment and structure prediction. RESULT: We present a novel solution to the problem of simultaneous structure prediction...
A novel fragile X syndrome mutation reveals a conserved role for the carboxy-terminus in FMRP localization and function.

Science.gov (United States)

Okray, Zeynep; de Esch, Celine E F; Van Esch, Hilde; Devriendt, Koen; Claeys, Annelies; Yan, Jiekun; Verbeeck, Jelle; Froyen, Guy; Willemsen, Rob; de Vrij, Femke M S; Hassan, Bassem A

2015-04-01

Loss of function of the FMR1 gene leads to fragile X syndrome (FXS), the most common form of intellectual disability. The loss of FMR1 function is usually caused by epigenetic silencing of the FMR1 promoter leading to expansion and subsequent methylation of a CGG repeat in the 5' untranslated region. Very few coding sequence variations have been experimentally characterized and shown to be causal to the disease. Here, we describe a novel FMR1 mutation and reveal an unexpected nuclear export function for the C-terminus of FMRP. We screened a cohort of patients with typical FXS symptoms who tested negative for CGG repeat expansion in the FMR1 locus. In one patient, we identified a guanine insertion in FMR1 exon 15. This mutation alters the open reading frame creating a short novel C-terminal sequence, followed by a stop codon. We find that this novel peptide encodes a functional nuclear localization signal (NLS) targeting the patient FMRP to the nucleolus in human cells. We also reveal an evolutionarily conserved nuclear export function associated with the endogenous C-terminus of FMRP. In vivo analyses in Drosophila demonstrate that a patient-mimetic mutation alters the localization and function of Dfmrp in neurons, leading to neomorphic neuronal phenotypes. © 2015 The Authors. Published under the terms of the CC BY 4.0 license.
Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity.

Science.gov (United States)

King, Brian R; Aburdene, Maurice; Thompson, Alex; Warres, Zach

2014-01-01

Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.
TRANSAT-- method for detecting the conserved helices of functional RNA structures, including transient, pseudo-knotted and alternative structures.

Science.gov (United States)

Wiebe, Nicholas J P; Meyer, Irmtraud M

2010-06-24

The prediction of functional RNA structures has attracted increased interest, as it allows us to study the potential functional roles of many genes. RNA structure prediction methods, however, assume that there is a unique functional RNA structure and also do not predict functional features required for in vivo folding. In order to understand how functional RNA structures form in vivo, we require sophisticated experiments or reliable prediction methods. So far, there exist only a few, experimentally validated transient RNA structures. On the computational side, there exist several computer programs which aim to predict the co-transcriptional folding pathway in vivo, but these make a range of simplifying assumptions and do not capture all features known to influence RNA folding in vivo. We want to investigate if evolutionarily related RNA genes fold in a similar way in vivo. To this end, we have developed a new computational method, Transat, which detects conserved helices of high statistical significance. We introduce the method, present a comprehensive performance evaluation and show that Transat is able to predict the structural features of known reference structures including pseudo-knotted ones as well as those of known alternative structural configurations. Transat can also identify unstructured sub-sequences bound by other molecules and provides evidence for new helices which may define folding pathways, supporting the notion that homologous RNA sequence not only assume a similar reference RNA structure, but also fold similarly. Finally, we show that the structural features predicted by Transat differ from those assuming thermodynamic equilibrium. Unlike the existing methods for predicting folding pathways, our method works in a comparative way. This has the disadvantage of not being able to predict features as function of time, but has the considerable advantage of highlighting conserved features and of not requiring a detailed knowledge of the cellular
Mammals on the EDGE: conservation priorities based on threat and phylogeny.

Directory of Open Access Journals (Sweden)

Nick J B Isaac

2007-03-01

Full Text Available Conservation priority setting based on phylogenetic diversity has frequently been proposed but rarely implemented. Here, we define a simple index that measures the contribution made by different species to phylogenetic diversity and show how the index might contribute towards species-based conservation priorities. We describe procedures to control for missing species, incomplete phylogenetic resolution and uncertainty in node ages that make it possible to apply the method in poorly known clades. We also show that the index is independent of clade size in phylogenies of more than 100 species, indicating that scores from unrelated taxonomic groups are likely to be comparable. Similar scores are returned under two different species concepts, suggesting that the index is robust to taxonomic changes. The approach is applied to a near-complete species-level phylogeny of the Mammalia to generate a global priority list incorporating both phylogenetic diversity and extinction risk. The 100 highest-ranking species represent a high proportion of total mammalian diversity and include many species not usually recognised as conservation priorities. Many species that are both evolutionarily distinct and globally endangered (EDGE species do not benefit from existing conservation projects or protected areas. The results suggest that global conservation priorities may have to be reassessed in order to prevent a disproportionately large amount of mammalian evolutionary history becoming extinct in the near future.
An effective approach for annotation of protein families with low sequence similarity and conserved motifs: identifying GDSL hydrolases across the plant kingdom.

Science.gov (United States)

Vujaklija, Ivan; Bielen, Ana; Paradžik, Tina; Biđin, Siniša; Goldstein, Pavle; Vujaklija, Dušica

2016-02-18

The massive accumulation of protein sequences arising from the rapid development of high-throughput sequencing, coupled with automatic annotation, results in high levels of incorrect annotations. In this study, we describe an approach to decrease annotation errors of protein families characterized by low overall sequence similarity. The GDSL lipolytic family comprises proteins with multifunctional properties and high potential for pharmaceutical and industrial applications. The number of proteins assigned to this family has increased rapidly over the last few years. In particular, the natural abundance of GDSL enzymes reported recently in plants indicates that they could be a good source of novel GDSL enzymes. We noticed that a significant proportion of annotated sequences lack specific GDSL motif(s) or catalytic residue(s). Here, we applied motif-based sequence analyses to identify enzymes possessing conserved GDSL motifs in selected proteomes across the plant kingdom. Motif-based HMM scanning (Viterbi decoding-VD and posterior decoding-PD) and the here described PD/VD protocol were successfully applied on 12 selected plant proteomes to identify sequences with GDSL motifs. A significant number of identified GDSL sequences were novel. Moreover, our scanning approach successfully detected protein sequences lacking at least one of the essential motifs (171/820) annotated by Pfam profile search (PfamA) as GDSL. Based on these analyses we provide a curated list of GDSL enzymes from the selected plants. CLANS clustering and phylogenetic analysis helped us to gain a better insight into the evolutionary relationship of all identified GDSL sequences. Three novel GDSL subfamilies as well as unreported variations in GDSL motifs were discovered in this study. In addition, analyses of selected proteomes showed a remarkable expansion of GDSL enzymes in the lycophyte, Selaginella moellendorffii. Finally, we provide a general motif-HMM scanner which is easily accessible through
Genome Analysis of Conserved Dehydrin Motifs in Vascular Plants

Directory of Open Access Journals (Sweden)

Ahmad A. Malik

2017-05-01

Full Text Available Dehydrins, a large family of abiotic stress proteins, are defined by the presence of a mostly conserved motif known as the K-segment, and may also contain two other conserved motifs known as the Y-segment and S-segment. Using the dehydrin literature, we developed a sequence motif definition of the K-segment, which we used to create a large dataset of dehydrin sequences by searching the Pfam00257 dehydrin dataset and the Phytozome 10 sequences of vascular plants. A comprehensive analysis of these sequences reveals that lysine residues are highly conserved in the K-segment, while the amino acid type is often conserved at other positions. Despite the Y-segment name, the central tyrosine is somewhat conserved, but can be substituted with two other small aromatic amino acids (phenylalanine or histidine. The S-segment contains a series of serine residues, but in some proteins is also preceded by a conserved LHR sequence. In many dehydrins containing all three of these motifs the S-segment is linked to the K-segment by a GXGGRRKK motif (where X can be any amino acid, suggesting a functional linkage between these two motifs. An analysis of the sequences shows that the dehydrin architecture and several biochemical properties (isoelectric point, molecular mass, and hydrophobicity score are dependent on each other, and that some dehydrin architectures are overexpressed during certain abiotic stress, suggesting that they may be optimized for a specific abiotic stress while others are involved in all forms of dehydration stress (drought, cold, and salinity.
Identification of an evolutionarily conserved extracellular threonine residue critical for surface expression and its potential coupling of adjacent voltage-sensing and gating domains in voltage-gated potassium channels.

Science.gov (United States)

Mckeown, Lynn; Burnham, Matthew P; Hodson, Charlotte; Jones, Owen T

2008-10-31

The dynamic expression of voltage-gated potassium channels (Kvs) at the cell surface is a fundamental factor controlling membrane excitability. In exploring possible mechanisms controlling Kv surface expression, we identified a region in the extracellular linker between the first and second of the six (S1-S6) transmembrane-spanning domains of the Kv1.4 channel, which we hypothesized to be critical for its biogenesis. Using immunofluorescence microscopy, flow cytometry, patch clamp electrophysiology, and mutagenesis, we identified a single threonine residue at position 330 within the Kv1.4 S1-S2 linker that is absolutely required for cell surface expression. Mutation of Thr-330 to an alanine, aspartate, or lysine prevented surface expression. However, surface expression occurred upon co-expression of mutant and wild type Kv1.4 subunits or mutation of Thr-330 to a serine. Mutation of the corresponding residue (Thr-211) in Kv3.1 to alanine also caused intracellular retention, suggesting that the conserved threonine plays a generalized role in surface expression. In support of this idea, sequence comparisons showed conservation of the critical threonine in all Kv families and in organisms across the evolutionary spectrum. Based upon the Kv1.2 crystal structure, further mutagenesis, and the partial restoration of surface expression in an electrostatic T330K bridging mutant, we suggest that Thr-330 hydrogen bonds to equally conserved outer pore residues, which may include a glutamate at position 502 that is also critical for surface expression. We propose that Thr-330 serves to interlock the voltage-sensing and gating domains of adjacent monomers, thereby yielding a structure competent for the surface expression of functional tetramers.
Human developmental enhancers conserved between deuterostomes and protostomes.

Directory of Open Access Journals (Sweden)

Shoa L Clarke

Full Text Available The identification of homologies, whether morphological, molecular, or genetic, is fundamental to our understanding of common biological principles. Homologies bridging the great divide between deuterostomes and protostomes have served as the basis for current models of animal evolution and development. It is now appreciated that these two clades share a common developmental toolkit consisting of conserved transcription factors and signaling pathways. These patterning genes sometimes show common expression patterns and genetic interactions, suggesting the existence of similar or even conserved regulatory apparatus. However, previous studies have found no regulatory sequence conserved between deuterostomes and protostomes. Here we describe the first such enhancers, which we call bilaterian conserved regulatory elements (Bicores. Bicores show conservation of sequence and gene synteny. Sequence conservation of Bicores reflects conserved patterns of transcription factor binding sites. We predict that Bicores act as response elements to signaling pathways, and we show that Bicores are developmental enhancers that drive expression of transcriptional repressors in the vertebrate central nervous system. Although the small number of identified Bicores suggests extensive rewiring of cis-regulation between the protostome and deuterostome clades, additional Bicores may be revealed as our understanding of cis-regulatory logic and sample of bilaterian genomes continue to grow.
Identification of four evolutionarily related G protein-coupled receptors from the malaria mosquito Anopheles gambiae

DEFF Research Database (Denmark)

Belmont, Martin; Cazzamali, Giuseppe; Williamson, Michael

2006-01-01

The mosquito Anopheles gambiae is an important vector for malaria, which is one of the most serious human parasitic diseases in the world, causing up to 2.7 million deaths yearly. To contribute to our understanding of A. gambiae and to the transmission of malaria, we have now cloned four evolutio......The mosquito Anopheles gambiae is an important vector for malaria, which is one of the most serious human parasitic diseases in the world, causing up to 2.7 million deaths yearly. To contribute to our understanding of A. gambiae and to the transmission of malaria, we have now cloned four...... evolutionarily related G protein-coupled receptors (GPCRs) from this mosquito and expressed them in Chinese hamster ovary cells. After screening of a library of thirty-three insect or other invertebrate neuropeptides and eight biogenic amines, we could identify (de-orphanize) three of these GPCRs as...... relationship to the A. gambiae and other insect AKH receptors suggested that it is a receptor for an AKH-like peptide. This is the first published report on evolutionarily related AKH, corazonin, and CCAP receptors in mosquitoes....
Conservation implications of anthropogenic impacts on visual communication and camouflage.

Science.gov (United States)

Delhey, Kaspar; Peters, Anne

2017-02-01

Anthropogenic environmental impacts can disrupt the sensory environment of animals and affect important processes from mate choice to predator avoidance. Currently, these effects are best understood for auditory and chemosensory modalities, and recent reviews highlight their importance for conservation. We examined how anthropogenic changes to the visual environment (ambient light, transmission, and backgrounds) affect visual communication and camouflage and considered the implications of these effects for conservation. Human changes to the visual environment can increase predation risk by affecting camouflage effectiveness, lead to maladaptive patterns of mate choice, and disrupt mutualistic interactions between pollinators and plants. Implications for conservation are particularly evident for disrupted camouflage due to its tight links with survival. The conservation importance of impaired visual communication is less documented. The effects of anthropogenic changes on visual communication and camouflage may be severe when they affect critical processes such as pollination or species recognition. However, when impaired mate choice does not lead to hybridization, the conservation consequences are less clear. We suggest that the demographic effects of human impacts on visual communication and camouflage will be particularly strong when human-induced modifications to the visual environment are evolutionarily novel (i.e., very different from natural variation); affected species and populations have low levels of intraspecific (genotypic and phenotypic) variation and behavioral, sensory, or physiological plasticity; and the processes affected are directly related to survival (camouflage), species recognition, or number of offspring produced, rather than offspring quality or attractiveness. Our findings suggest that anthropogenic effects on the visual environment may be of similar importance relative to conservation as anthropogenic effects on other sensory modalities
Further results on universal properties in conservative dynamical systems

Energy Technology Data Exchange (ETDEWEB)

Benettin, G [Padua Univ. (Italy). Ist. di Fisica; Galgani, L; Giorgilli, A [Milan Univ. (Italy). Ist. di Fisica; Milan Univ. (Italy). Ist. di Matematica)

1980-10-11

In conservative dynamical systems depending on a parameter, sequences of period-doubling bifurcations can be observed by varying the parameter, starting from a stable fixed point. These sequences are analogous to those already known for dissipative systems. The paper shows some new results obtained for two-dimensional conservative mappings.
Positive selection in the SLC11A1 gene in the family Equidae

DEFF Research Database (Denmark)

Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan

2016-01-01

Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes...... a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans...... and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence...
The First Myriapod Genome Sequence Reveals Conservative Arthropod Gene Content and Genome Organisation in the Centipede Strigamia maritima

Science.gov (United States)

Chipman, Ariel D.; Ferrier, David E. K.; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S. T.; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C.; Alonso, Claudio R.; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C. J.; Blankenburg, Kerstin P.; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K.; Du Pasquier, Louis; Duncan, Elizabeth J.; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D.; Extavour, Cassandra G.; Francisco, Liezl; Gabaldón, Toni; Gillis, William J.; Goodwin-Horn, Elizabeth A.; Green, Jack E.; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J. P.; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H. L.; Hunn, Julia P.; Hunnekuhl, Vera S.; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N.; Jiggins, Francis M.; Jones, Tamsin E.; Kaiser, Tobias S.; Kalra, Divya; Kenny, Nathan J.; Korchina, Viktoriya; Kovar, Christie L.; Kraus, F. Bernhard; Lapraz, François; Lee, Sandra L.; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N.; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J.; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H.; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C.; Robertson, Helen E.; Robertson, Hugh M.; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E.; Schurko, Andrew M.; Siggens, Kenneth W.; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J.; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M.; Willis, Judith H.; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M.; Worley, Kim C.; Gibbs, Richard A.; Akam, Michael; Richards, Stephen

2014-01-01

Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific
Strong minor groove base conservation in sequence logos implies DNA distortion or base flipping during replication and transcription initiation | Center for Cancer Research

Science.gov (United States)

Dubbed "Tom's T" by Dhruba Chattoraj, the unusually conserved thymine at position +7 in bacteriophage P1 plasmid RepA DNA binding sites rises above repressor and acceptor sequence logos. The T appears to represent base flipping prior to helix opening in this DNA replication initation protein.
Short sequence motifs, overrepresented in mammalian conservednon-coding sequences

Energy Technology Data Exchange (ETDEWEB)

Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

2007-02-21

Background: A substantial fraction of non-coding DNAsequences of multicellular eukaryotes is under selective constraint. Inparticular, ~;5 percent of the human genome consists of conservednon-coding sequences (CNSs). CNSs differ from other genomic sequences intheir nucleotide composition and must play important functional roles,which mostly remain obscure.Results: We investigated relative abundancesof short sequence motifs in all human CNSs present in the human/mousewhole-genome alignments vs. three background sets of sequences: (i)weakly conserved or unconserved non-coding sequences (non-CNSs); (ii)near-promoter sequences (located between nucleotides -500 and -1500,relative to a start of transcription); and (iii) random sequences withthe same nucleotide composition as that of CNSs. When compared tonon-CNSs and near-promoter sequences, CNSs possess an excess of AT-richmotifs, often containing runs of identical nucleotides. In contrast, whencompared to random sequences, CNSs contain an excess of GC-rich motifswhich, however, lack CpG dinucleotides. Thus, abundance of short sequencemotifs in human CNSs, taken as a whole, is mostly determined by theiroverall compositional properties and not by overrepresentation of anyspecific short motifs. These properties are: (i) high AT-content of CNSs,(ii) a tendency, probably due to context-dependent mutation, of A's andT's to clump, (iii) presence of short GC-rich regions, and (iv) avoidanceof CpG contexts, due to their hypermutability. Only a small number ofshort motifs, overrepresented in all human CNSs are similar to bindingsites of transcription factors from the FOX family.Conclusion: Human CNSsas a whole appear to be too broad a class of sequences to possess strongfootprints of any short sequence-specific functions. Such footprintsshould be studied at the level of functional subclasses of CNSs, such asthose which flank genes with a particular pattern of expression. Overallproperties of CNSs are affected by

The evolutionarily conserved leprecan gene: its regulation by Brachyury and its role in the developing Ciona notochord.

Science.gov (United States)

Dunn, Matthew P; Di Gregorio, Anna

2009-04-15

In Ciona intestinalis, leprecan was identified as a target of the notochord-specific transcription factor Ciona Brachyury (Ci-Bra) (Takahashi, H., Hotta, K., Erives, A., Di Gregorio, A., Zeller, R.W., Levine, M., Satoh, N., 1999. Brachyury downstream notochord differentiation in the ascidian embryo. Genes Dev. 13, 1519-1523). By screening approximately 14 kb of the Ci-leprecan locus for cis-regulatory activity, we have identified a 581-bp minimal notochord-specific cis-regulatory module (CRM) whose activity depends upon T-box binding sites located at the 3'-end of its sequence. These sites are specifically bound in vitro by a GST-Ci-Bra fusion protein, and mutations that abolish binding in vitro result in loss or decrease of regulatory activity in vivo. Serial deletions of the 581-bp notochord CRM revealed that this sequence is also able to direct expression in muscle cells through the same T-box sites that are utilized by Ci-Bra in the notochord, which are also bound in vitro by the muscle-specific T-box activators Ci-Tbx6b and Ci-Tbx6c. Additionally, we created plasmids aimed to interfere with the function of Ci-leprecan and categorized the resulting phenotypes, which consist of variable dislocations of notochord cells along the anterior-posterior axis. Together, these observations provide mechanistic insights generally applicable to T-box transcription factors and their target sequences, as well as a first set of clues on the function of Leprecan in early chordate development.
Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

Science.gov (United States)

Catania, Francesco; Lynch, Michael

2010-05-04

In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.
CodonLogo: a sequence logo-based viewer for codon patterns.

Science.gov (United States)

Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V

2012-07-15

Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

KAUST Repository

Guturu, H.

2013-11-11

Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and \\'through-DNA\\' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.
Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

KAUST Repository

Guturu, H.; Doxey, A. C.; Wenger, A. M.; Bejerano, G.

2013-01-01

Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and 'through-DNA' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.
Conservation of HIV-1 T cell epitopes across time and clades

DEFF Research Database (Denmark)

Levitz, Lauren; Koita, Ousmane A; Sangare, Kotou

2012-01-01

HIV genomic sequence variability has complicated efforts to generate an effective globally relevant vaccine. Regions of the viral genome conserved in sequence and across time may represent the "Achilles' heel" of HIV. In this study, highly conserved T-cell epitopes were selected using immunoinfor...
CONREAL web server: identification and visualization of conserved transcription factor binding sites

NARCIS (Netherlands)

Berezikov, E.; Guryev, V.; Cuppen, E.

2005-01-01

The use of orthologous sequences and phylogenetic footprinting approaches have become popular for the recognition of conserved and potentially functional sequences. Several algorithms have been developed for the identification of conserved transcription factor binding sites (TFBSs), which are
cDNA cloning and sequencing of human fibrillarin, a conserved nucleolar protein recognized by autoimmune antisera

International Nuclear Information System (INIS)

Aris, J.P.; Blobel, G.

1991-01-01

The authors have isolated a 1.1-kilobase cDNA clone that encodes human fibrillarin by screening a hepatoma library in parallel with DNA probes derived from the fibrillarin genes of Saccharomyces cerevisiae (NOP1) and Xenopus laevis. RNA blot analysis indicates that the corresponding mRNA is ∼1,300 nucleotides in length. Human fibrillarin expressed in vitro migrates on SDS gels as a 36-kDa protein that is specifically immunoprecipitated by antisera from humans with scleroderma autoimmune disease. Human fibrillarin contains an amino-terminal repetitive domain ∼75-80 amino acids in length that is rich in glycine and arginine residues and is similar to amino-terminal domains in the yeast and Xenopus fibrillarins. The occurrence of a putative RNA-binding domain and an RNP consensus sequence within the protein is consistent with the association of fibrillarin with small nucleolar RNAs. Protein sequence alignments show that 67% of amino acids from human fibrillarin are identical to those in yeast fibrillarin and that 81% are identical to those in Xenopus fibrillarin. This identity suggests the evolutionary conservation of an important function early in the pathway for ribosome biosynthesis
Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium

Directory of Open Access Journals (Sweden)

Lynch Michael

2010-05-01

Full Text Available Abstract Background In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa remains a virtually unexplored issue. Results By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions Our observations 1 shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2 are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3 reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.
Data from "Crossing to safety: Dispersal, colonization and mate choice in evolutionarily distinct populations of Steller sea lions, Eumetopias jubatus."

Data.gov (United States)

National Oceanic and Atmospheric Administration, Department of Commerce — Data sets used to support analysis published by O'Corry-Crowe et al (2014) Crossing to safety: Dispersal, colonization and mate choice in evolutionarily distinct...
A general pipeline for the development of anchor markers for comparative genomics in plants

Directory of Open Access Journals (Sweden)

Stougaard Jens

2006-08-01

Full Text Available Abstract Background Complete or near-complete genomic sequence information is presently only available for a few plant species representing a large phylogenetic diversity among plants. In order to effectively transfer this information to species lacking sequence information, comparative genomic tools need to be developed. Molecular markers permitting cross-species mapping along co-linear genomic regions are central to comparative genomics. These "anchor" markers, defining unique loci in genetic linkage maps of multiple species, are gene-based and possess a number of features that make them relatively sparse. To identify potential anchor marker sequences more efficiently, we have established an automated bioinformatic pipeline that combines multi-species Expressed Sequence Tags (EST and genome sequence data. Results Taking advantage of sequence data from related species, the pipeline identifies evolutionarily conserved sequences that are likely to define unique orthologous loci in most species of the same phylogenetic clade. The key features are the identification of evolutionarily conserved sequences followed by automated design of intron-flanking Polymerase Chain Reaction (PCR primer pairs. Polymorphisms can subsequently be identified by size- or sequence variation of PCR products, amplified from mapping parents or populations. We illustrate our procedure in legumes and grasses and exemplify its application in legumes, where model plant studies and the genome- and EST-sequence data available have a potential impact on the breeding of crop species and on our understanding of the evolution of this large and diverse family. Conclusion We provide a database of 459 candidate anchor loci which have the potential to serve as map anchors in more than 18,000 legume species, a number of which are of agricultural importance. For grasses, the database contains 1335 candidate anchor loci. Based on this database, we have evaluated 76 candidate anchor loci
BLAST and FASTA similarity searching for multiple sequence alignment.

Science.gov (United States)

Pearson, William R

2014-01-01

BLAST, FASTA, and other similarity searching programs seek to identify homologous proteins and DNA sequences based on excess sequence similarity. If two sequences share much more similarity than expected by chance, the simplest explanation for the excess similarity is common ancestry-homology. The most effective similarity searches compare protein sequences, rather than DNA sequences, for sequences that encode proteins, and use expectation values, rather than percent identity, to infer homology. The BLAST and FASTA packages of sequence comparison programs provide programs for comparing protein and DNA sequences to protein databases (the most sensitive searches). Protein and translated-DNA comparisons to protein databases routinely allow evolutionary look back times from 1 to 2 billion years; DNA:DNA searches are 5-10-fold less sensitive. BLAST and FASTA can be run on popular web sites, but can also be downloaded and installed on local computers. With local installation, target databases can be customized for the sequence data being characterized. With today's very large protein databases, search sensitivity can also be improved by searching smaller comprehensive databases, for example, a complete protein set from an evolutionarily neighboring model organism. By default, BLAST and FASTA use scoring strategies target for distant evolutionary relationships; for comparisons involving short domains or queries, or searches that seek relatively close homologs (e.g. mouse-human), shallower scoring matrices will be more effective. Both BLAST and FASTA provide very accurate statistical estimates, which can be used to reliably identify protein sequences that diverged more than 2 billion years ago.
Comparative biology of the pentraxin protein family: evolutionarily conserved component of innate immune system.

Science.gov (United States)

Armstrong, Peter B

2015-01-01

The immune system is based on the actions of the collection of specialized immune defense cells and their secreted proteins and peptides that defend the host against infection by parasites. Parasites are organisms that live part or all of their lives in close physical association with the host and extract nutrients from the host and, by releasing toxins and virulence factors, cause disease with the potential for injury and premature death of that host. Parasites of the metazoa can be viruses, eubacteria, fungi, protozoans, and other metazoans. The immune system operates to kill or eliminate parasites and eliminate or detoxify their toxins and virulence factors. Although some of the elements of immune systems are specific to a particular phylum of metazoans, others show extensive evolutionary conservation, being present in several or all major phyla of the metazoa. The pentraxins display this latter character in their roles in immune defense. Pentraxins have been documented in vertebrates, nonvertebrate chordates, arthropods, and mollusks and may be present in other taxa of metazoans. Presumably the pentraxins appeared early in the evolution of metazoa, prior to their evolutionary divergence in the Precambrian epoch into many phyla present today, and have been preserved for the 542 million years since that explosive evolutionary radiation. The fidelity with which these phyla have preserved the pentraxins suggests that the functions of these proteins are important for survival of the members of these diverse taxa of animals. Copyright © 2015 Elsevier Inc. All rights reserved.
Evolutionarily costly courtship displays in a wolf spider: a test of viability indicator theory

OpenAIRE

Chad D. Hoefler; Matthew H. Persons; Ann L. Rypstra

2008-01-01

The costs of secondary sexual traits are crucial to our understanding of sexual selection. Although it is broadly accepted that sexual traits are indirectly or directly costly to express, few studies have quantified such costs. Thus, it is unclear if costs are evolutionarily meaningful and to what degree. Costs play a key role in viability indicator models, which assume that 1) the expression of sexual traits reduces the fitness of the trait bearer, 2) sexual trait expression is dependent on ...
Phylogeny as a proxy for ecology in seagrass amphipods: which traits are most conserved?

Directory of Open Access Journals (Sweden)

Rebecca J Best

Full Text Available Increasingly, studies of community assembly and ecosystem function combine trait data and phylogenetic relationships to gain novel insight into the ecological and evolutionary constraints on community dynamics. However, the key to interpreting these two types of information is an understanding of the extent to which traits are phylogenetically conserved. In this study, we develop the necessary framework for community phylogenetics approaches in a system of marine crustacean herbivores that play an important role in the ecosystem functioning of seagrass systems worldwide. For 16 species of amphipods and isopods, we (1 reconstructed phylogenetic relationships using COI, 16S, and 18S sequences and Bayesian analyses, (2 measured traits that are potentially important for assembling species between and within habitats, and (3 compared the degree to which each of these traits are evolutionarily conserved. Despite poor phylogenetic resolution for the order Amphipoda as a whole, we resolved almost all of the topology for the species in our system, and used a sampling of ultrametric trees from the posterior distribution to account for remaining uncertainty in topology and branch lengths. We found that traits varied widely in their degree of phylogenetic signal. Body mass, fecundity, and tube building showed very strong phylogenetic signal, and temperature tolerance and feeding traits showed much less. As such, the degree of signal was not predictable based on whether the trait is related to environmental filtering or to resource partitioning. Further, we found that even with strong phylogenetic signal in body size, (which may have large impacts on ecosystem function, the predictive relationship between phylogenetic diversity and ecosystem function is not straightforward. We show that patterns of phylogenetic diversity in communities of seagrass mesograzers could lead to a variety of interpretations and predictions, and that detailed study of trait
Evolutionarily conserved sites in yeast tropomyosin function in cell polarity, transport and contractile ring formation

Directory of Open Access Journals (Sweden)

Susanne Cranz-Mileva

2015-08-01

Full Text Available Tropomyosin is a coiled-coil protein that binds and regulates actin filaments. The tropomyosin gene in Schizosaccharomyces pombe, cdc8, is required for formation of actin cables, contractile rings, and polar localization of actin patches. The roles of conserved residues were investigated in gene replacement mutants. The work validates an evolution-based approach to identify tropomyosin functions in living cells and sites of potential interactions with other proteins. A cdc8 mutant with near-normal actin affinity affects patch polarization and vacuole fusion, possibly by affecting Myo52p, a class V myosin, function. The presence of labile residual cell attachments suggests a delay in completion of cell division and redistribution of cell patches following cytokinesis. Another mutant with a mild phenotype is synthetic negative with GFP-fimbrin, inferring involvement of the mutated tropomyosin sites in interaction between the two proteins. Proteins that assemble in the contractile ring region before actin do so in a mutant cdc8 strain that cannot assemble condensed actin rings, yet some cells can divide. Of general significance, LifeAct-GFP negatively affects the actin cytoskeleton, indicating caution in its use as a biomarker for actin filaments.
Evaluating, Comparing, and Interpreting Protein Domain Hierarchies

Science.gov (United States)

2014-01-01

Abstract Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108
Evolutionary conservation and changes in insect TRP channels.

Science.gov (United States)

Matsuura, Hironori; Sokabe, Takaaki; Kohno, Keigo; Tominaga, Makoto; Kadowaki, Tatsuhiko

2009-09-10

TRP (Transient Receptor Potential) channels respond to diverse stimuli and thus function as the primary integrators of varied sensory information. They are also activated by various compounds and secondary messengers to mediate cell-cell interactions as well as to detect changes in the local environment. Their physiological roles have been primarily characterized only in mice and fruit flies, and evolutionary studies are limited. To understand the evolution of insect TRP channels and the mechanisms of integrating sensory inputs in insects, we have identified and compared TRP channel genes in Drosophila melanogaster, Bombyx mori, Tribolium castaneum, Apis mellifera, Nasonia vitripennis, and Pediculus humanus genomes as part of genome sequencing efforts. All the insects examined have 2 TRPV, 1 TRPN, 1 TRPM, 3 TRPC, and 1 TRPML subfamily members, demonstrating that these channels have the ancient origins in insects. The common pattern also suggests that the mechanisms for detecting mechanical and visual stimuli and maintaining lysosomal functions may be evolutionarily well conserved in insects. However, a TRPP channel, the most ancient TRP channel, is missing in B. mori, A. mellifera, and N. vitripennis. Although P. humanus and D. melanogaster contain 4 TRPA subfamily members, the other insects have 5 TRPA subfamily members. T. castaneum, A. mellifera, and N. vitripennis contain TRPA5 channels, which have been specifically retained or gained in Coleoptera and Hymenoptera. Furthermore, TRPA1, which functions for thermotaxis in Drosophila, is missing in A. mellifera and N. vitripennis; however, they have other Hymenoptera-specific TRPA channels (AmHsTRPA and NvHsTRPA). NvHsTRPA expressed in HEK293 cells is activated by temperature increase, demonstrating that HsTRPAs function as novel thermal sensors in Hymenoptera. The total number of insect TRP family members is 13-14, approximately half that of mammalian TRP family members. As shown for mammalian TRP channels, this
Evolutionary conservation and changes in insect TRP channels

Directory of Open Access Journals (Sweden)

Tominaga Makoto

2009-09-01

Full Text Available Abstract Background TRP (Transient Receptor Potential channels respond to diverse stimuli and thus function as the primary integrators of varied sensory information. They are also activated by various compounds and secondary messengers to mediate cell-cell interactions as well as to detect changes in the local environment. Their physiological roles have been primarily characterized only in mice and fruit flies, and evolutionary studies are limited. To understand the evolution of insect TRP channels and the mechanisms of integrating sensory inputs in insects, we have identified and compared TRP channel genes in Drosophila melanogaster, Bombyx mori, Tribolium castaneum, Apis mellifera, Nasonia vitripennis, and Pediculus humanus genomes as part of genome sequencing efforts. Results All the insects examined have 2 TRPV, 1 TRPN, 1 TRPM, 3 TRPC, and 1 TRPML subfamily members, demonstrating that these channels have the ancient origins in insects. The common pattern also suggests that the mechanisms for detecting mechanical and visual stimuli and maintaining lysosomal functions may be evolutionarily well conserved in insects. However, a TRPP channel, the most ancient TRP channel, is missing in B. mori, A. mellifera, and N. vitripennis. Although P. humanus and D. melanogaster contain 4 TRPA subfamily members, the other insects have 5 TRPA subfamily members. T. castaneum, A. mellifera, and N. vitripennis contain TRPA5 channels, which have been specifically retained or gained in Coleoptera and Hymenoptera. Furthermore, TRPA1, which functions for thermotaxis in Drosophila, is missing in A. mellifera and N. vitripennis; however, they have other Hymenoptera-specific TRPA channels (AmHsTRPA and NvHsTRPA. NvHsTRPA expressed in HEK293 cells is activated by temperature increase, demonstrating that HsTRPAs function as novel thermal sensors in Hymenoptera. Conclusion The total number of insect TRP family members is 13-14, approximately half that of mammalian TRP
Lanthanum-Based Metal-Organic Frameworks for Specific Detection of Sudan Virus RNA Conservative Sequences down to Single-Base Mismatch.

Science.gov (United States)

Yang, Shui-Ping; Zhao, Wei; Hu, Pei-Pei; Wu, Ke-Yang; Jiang, Zhi-Hong; Bai, Li-Ping; Li, Min-Min; Chen, Jin-Xiang

2017-12-18

Reactions of La(NO 3 ) 3 ·6H 2 O with the polar, tritopic quaternized carboxylate ligands N-carboxymethyl-3,5-dicarboxylpyridinium bromide (H 3 CmdcpBr) and N-(4-carboxybenzyl)-3,5-dicarboxylpyridinium bromide (H 3 CbdcpBr) afford two water-stable metal-organic frameworks (MOFs) of {[La 4 (Cmdcp) 6 (H 2 O) 9 ]} n (1, 3D) and {[La 2 (Cbdcp) 3 (H 2 O) 10 ]} n (2, 2D). MOFs 1 and 2 absorb the carboxyfluorescein (FAM)-tagged probe DNA (P-DNA) and quench the fluorescence of FAM via a photoinduced electron transfer (PET) process. The nonemissive P-DNA@MOF hybrids thus formed in turn function as sensing platforms to distinguish conservative linear, single-stranded RNA sequences of Sudan virus with high selectivity and low detection limits of 112 and 67 pM, respectively (at a signal-to-noise ratio of 3). These hybrids also exhibit high specificity and discriminate down to single-base mismatch RNA sequences.

Comparative anatomy of the human APRT gene and enzyme: nucleotide sequence divergence and conservation of a nonrandom CpG dinucleotide arrangement

International Nuclear Information System (INIS)

Broderick, T.P.; Schaff, D.A.; Bertino, A.M.; Dush, M.K.; Tischfield, J.A.; Stambrook, P.J.

1987-01-01

The functional human adenine phosphoribosyltransferase (APRT) gene is <2.6 kilobases in length and contains five exons. The amino acid sequences of APRTs have been highly conserved throughout evolution. The human enzyme is 82%, 90%, and 40% identical to the mouse, hamster, and Escherichia coli enzymes, respectively. The promoter region of the human APRT gene, like that of several other housekeeping genes, lacks TATA and CCAAT boxes but contains five GC boxes that are potential binding sites for the Sp1 transcription factor. The distal three, however, are dispensable for gene expression. Comparison between human and mouse APRT gene nucleotide sequences reveals a high degree of homology within protein coding regions but an absence of significant homology in 5' flanking, 3' untranslated, and intron sequences, except for similarly positioned GC boxes in the promoter region and a 26-base-pair region in intron 3. This 26-base-pair sequence is 92% identical with a similarly positioned sequence in the mouse gene and is also found in intron 3 of the hamster gene, suggesting that its retention may be a consequence of stringent selection. The positions of all introns have been precisely retained in the human and both rodent genes. Retention of an elevated CpG dinucleotide content, despite loss of sequence homology, suggests that there may be selection for CpG dinucleotides in these regions and that their maintenance may be important for APRT gene function
The highly conserved codon following the slippery sequence supports -1 frameshift efficiency at the HIV-1 frameshift site.

Directory of Open Access Journals (Sweden)

Suneeth F Mathew

Full Text Available HIV-1 utilises -1 programmed ribosomal frameshifting to translate structural and enzymatic domains in a defined proportion required for replication. A slippery sequence, U UUU UUA, and a stem-loop are well-defined RNA features modulating -1 frameshifting in HIV-1. The GGG glycine codon immediately following the slippery sequence (the 'intercodon' contributes structurally to the start of the stem-loop but has no defined role in current models of the frameshift mechanism, as slippage is inferred to occur before the intercodon has reached the ribosomal decoding site. This GGG codon is highly conserved in natural isolates of HIV. When the natural intercodon was replaced with a stop codon two different decoding molecules-eRF1 protein or a cognate suppressor tRNA-were able to access and decode the intercodon prior to -1 frameshifting. This implies significant slippage occurs when the intercodon is in the (perhaps distorted ribosomal A site. We accommodate the influence of the intercodon in a model of frame maintenance versus frameshifting in HIV-1.
Origin and spread of photosynthesis based upon conserved sequence features in key bacteriochlorophyll biosynthesis proteins.

Science.gov (United States)

Gupta, Radhey S

2012-11-01

The origin of photosynthesis and how this capability has spread to other bacterial phyla remain important unresolved questions. I describe here a number of conserved signature indels (CSIs) in key proteins involved in bacteriochlorophyll (Bchl) biosynthesis that provide important insights in these regards. The proteins BchL and BchX, which are essential for Bchl biosynthesis, are derived by gene duplication in a common ancestor of all phototrophs. More ancient gene duplication gave rise to the BchX-BchL proteins and the NifH protein of the nitrogenase complex. The sequence alignment of NifH-BchX-BchL proteins contain two CSIs that are uniquely shared by all NifH and BchX homologs, but not by any BchL homologs. These CSIs and phylogenetic analysis of NifH-BchX-BchL protein sequences strongly suggest that the BchX homologs are ancestral to BchL and that the Bchl-based anoxygenic photosynthesis originated prior to the chlorophyll (Chl)-based photosynthesis in cyanobacteria. Another CSI in the BchX-BchL sequence alignment that is uniquely shared by all BchX homologs and the BchL sequences from Heliobacteriaceae, but absent in all other BchL homologs, suggests that the BchL homologs from Heliobacteriaceae are primitive in comparison to all other photosynthetic lineages. Several other identified CSIs in the BchN homologs are commonly shared by all proteobacterial homologs and a clade consisting of the marine unicellular Cyanobacteria (Clade C). These CSIs in conjunction with the results of phylogenetic analyses and pair-wise sequence similarity on the BchL, BchN, and BchB proteins, where the homologs from Clade C Cyanobacteria and Proteobacteria exhibited close relationship, provide strong evidence that these two groups have incurred lateral gene transfers. Additionally, phylogenetic analyses and several CSIs in the BchL-N-B proteins that are uniquely shared by all Chlorobi and Chloroflexi homologs provide evidence that the genes for these proteins have also been
Conserved Transcriptional Regulatory Programs Underlying Rice and Barley Germination

Science.gov (United States)

Lin, Li; Tian, Shulan; Kaeppler, Shawn; Liu, Zongrang; An, Yong-Qiang (Charles)

2014-01-01

Germination is a biological process important to plant development and agricultural production. Barley and rice diverged 50 million years ago, but share a similar germination process. To gain insight into the conservation of their underlying gene regulatory programs, we compared transcriptomes of barley and rice at start, middle and end points of germination, and revealed that germination regulated barley and rice genes (BRs) diverged significantly in expression patterns and/or protein sequences. However, BRs with higher protein sequence similarity tended to have more conserved expression patterns. We identified and characterized 316 sets of conserved barley and rice genes (cBRs) with high similarity in both protein sequences and expression patterns, and provided a comprehensive depiction of the transcriptional regulatory program conserved in barley and rice germination at gene, pathway and systems levels. The cBRs encoded proteins involved in a variety of biological pathways and had a wide range of expression patterns. The cBRs encoding key regulatory components in signaling pathways often had diverse expression patterns. Early germination up-regulation of cell wall metabolic pathway and peroxidases, and late germination up-regulation of chromatin structure and remodeling pathways were conserved in both barley and rice. Protein sequence and expression pattern of a gene change quickly if it is not subjected to a functional constraint. Preserving germination-regulated expression patterns and protein sequences of those cBRs for 50 million years strongly suggests that the cBRs are functionally significant and equivalent in germination, and contribute to the ancient characteristics of germination preserved in barley and rice. The functional significance and equivalence of the cBR genes predicted here can serve as a foundation to further characterize their biological functions and facilitate bridging rice and barley germination research with greater confidence. PMID
Solexa sequencing identification of conserved and novel microRNAs in backfat of Large White and Chinese Meishan pigs.

Directory of Open Access Journals (Sweden)

Chen Chen

Full Text Available The domestic pig (Sus scrofa, an important species in animal production industry, is a right model for studying adipogenesis and fat deposition. In order to expand the repertoire of porcine miRNAs and further explore potential regulatory miRNAs which have influence on adipogenesis, high-throughput Solexa sequencing approach was adopted to identify miRNAs in backfat of Large White (lean type pig and Meishan pigs (Chinese indigenous fatty pig. We identified 215 unique miRNAs comprising 75 known pre-miRNAs, of which 49 miRNA*s were first identified in our study, 73 miRNAs were overlapped in both libraries, and 140 were novelly predicted miRNAs, and 215 unique miRNAs were collectively corresponding to 235 independent genomic loci. Furthermore, we analyzed the sequence variations, seed edits and phylogenetic development of the miRNAs. 17 miRNAs were widely conserved from vertebrates to invertebrates, suggesting that these miRNAs may serve as potential evolutional biomarkers. 9 conserved miRNAs with significantly differential expressions were determined. The expression of miR-215, miR-135, miR-224 and miR-146b was higher in Large White pigs, opposite to the patterns shown by miR-1a, miR-133a, miR-122, miR-204 and miR-183. Almost all novel miRNAs could be considered pig-specific except ssc-miR-1343, miR-2320, miR-2326, miR-2411 and miR-2483 which had homologs in Bos taurus, among which ssc-miR-1343, miR-2320, miR-2411 and miR-2483 were validated in backfat tissue by stem-loop qPCR. Our results displayed a high level of concordance between the qPCR and Solexa sequencing method in 9 of 10 miRNAs comparisons except for miR-1a. Moreover, we found 2 miRNAs, miR-135 and miR-183, may exert impacts on porcine backfat development through WNT signaling pathway. In conclusion, our research develops porcine miRNAs and should be beneficial to study the adipogenesis and fat deposition of different pig breeds based on miRNAs.
Structural Conservation Despite Huge Sequence Diversity Allows EPCR Binding by the PfEMP1 Family Implicated in Severe Childhood Malaria

DEFF Research Database (Denmark)

Lau, Clinton K.Y.; Turner, Louise; Jespersen, Jakob S.

2015-01-01

with severe childhood malaria. We combine crystal structures of CIDRa1:EPCR complexes with analysis of 885 CIDRa1 sequences, showing that the EPCR-binding surfaces of CIDRa1 domains are conserved in shape and bonding potential, despite dramatic sequence diversity. Additionally, these domains mimic features...... of the natural EPCR ligand and can block this ligand interaction. Using peptides corresponding to the EPCR-binding region, antibodies can be purified from individuals in malaria-endemic regions that block EPCR binding of diverse CIDRa1 variants. This highlights the extent to which such a surface protein family......The PfEMP1 family of surface proteins is central for Plasmodium falciparum virulence and must retain the ability to bind to host receptors while also diversifying to aid immune evasion. The interaction between CIDRa1 domains of PfEMP1 and endothelial protein C receptor (EPCR) is associated...
Description and physical localization of the bovine survival of motor neuron gene (SMN).

Science.gov (United States)

Pietrowski, D; Goldammer, T; Meinert, S; Schwerin, M; Förster, M

1998-01-01

Proximal spinal muscular atrophy (SMA) is an autosomal recessive disease in humans and other mammals, characterized by degeneration of anterior horn cells of the spinal cord. In humans, the survival of motor neuron gene (SMN) has been recognized as the SMA-determining gene and has been mapped to 5q13. In cattle, SMA is a recurrent, inherited disease that plays an important economic role in breeding programs of Brown Swiss stock. Now we have identified the full- length cDNA sequence of the bovine SMN gene. Molecular analysis and characterization of the sequence documents 85% identity to its human counterpart and three evolutionarily conserved domains in different species. Physical mapping data reveals that bovine SMN is localized to chromosome region 20q12-->q13, supporting the conserved synteny of this chromosomal region between humans and cattle.
Relative Stabilities of Conserved and Non-Conserved Structures in the OB-Fold Superfamily

Directory of Open Access Journals (Sweden)

Andrei T. Alexandrescu

2009-05-01

Full Text Available The OB-fold is a diverse structure superfamily based on a β-barrel motif that is often supplemented with additional non-conserved secondary structures. Previous deletion mutagenesis and NMR hydrogen exchange studies of three OB-fold proteins showed that the structural stabilities of sites within the conserved β-barrels were larger than sites in non-conserved segments. In this work we examined a database of 80 representative domain structures currently classified as OB-folds, to establish the basis of this effect. Residue-specific values were obtained for the number of Cα-Cα distance contacts, sequence hydrophobicities, crystallographic B-factors, and theoretical B-factors calculated from a Gaussian Network Model. All four parameters point to a larger average flexibility for the non-conserved structures compared to the conserved β-barrels. The theoretical B-factors and contact densities show the highest sensitivity.Our results suggest a model of protein structure evolution in which novel structural features develop at the periphery of conserved motifs. Core residues are more resistant to structural changes during evolution since their substitution would disrupt a larger number of interactions. Similar factors are likely to account for the differences in stability to unfolding between conserved and non-conserved structures.
Inhibition of Hepatitis C Virus in Mice by a Small Interfering RNA Targeting a Highly Conserved Sequence in Viral IRES Pseudoknot.

Directory of Open Access Journals (Sweden)

Jae-Su Moon

Full Text Available The hepatitis C virus (HCV internal ribosome entry site (IRES that directs cap-independent viral translation is a primary target for small interfering RNA (siRNA-based HCV antiviral therapy. However, identification of potent siRNAs against HCV IRES by bioinformatics-based siRNA design is a challenging task given the complexity of HCV IRES secondary and tertiary structures and association with multiple proteins, which can also dynamically change the structure of this cis-acting RNA element. In this work, we utilized siRNA tiling approach whereby siRNAs were tiled with overlapping sequences that were shifted by one or two nucleotides over the HCV IRES stem-loop structures III and IV spanning nucleotides (nts 277-343. Based on their antiviral activity, we mapped a druggable region (nts 313-343 where the targets of potent siRNAs were enriched. siIE22, which showed the greatest anti-HCV potency, targeted a highly conserved sequence across diverse HCV genotypes, locating within the IRES subdomain IIIf involved in pseudoknot formation. Stepwise target shifting toward the 5' or 3' direction by 1 or 2 nucleotides reduced the antiviral potency of siIE22, demonstrating the importance of siRNA accessibility to this highly structured and sequence-conserved region of HCV IRES for RNA interference. Nanoparticle-mediated systemic delivery of the stability-improved siIE22 derivative gs_PS1 siIE22, which contains a single phosphorothioate linkage on the guide strand, reduced the serum HCV genome titer by more than 4 log10 in a xenograft mouse model for HCV replication without generation of resistant variants. Our results provide a strategy for identifying potent siRNA species against a highly structured RNA target and offer a potential pan-HCV genotypic siRNA therapy that might be beneficial for patients resistant to current treatment regimens.
Comparative analysis of function and interaction of transcription factors in nematodes: Extensive conservation of orthology coupled to rapid sequence evolution

Directory of Open Access Journals (Sweden)

Singh Rama S

2008-08-01

Full Text Available Abstract Background Much of the morphological diversity in eukaryotes results from differential regulation of gene expression in which transcription factors (TFs play a central role. The nematode Caenorhabditis elegans is an established model organism for the study of the roles of TFs in controlling the spatiotemporal pattern of gene expression. Using the fully sequenced genomes of three Caenorhabditid nematode species as well as genome information from additional more distantly related organisms (fruit fly, mouse, and human we sought to identify orthologous TFs and characterized their patterns of evolution. Results We identified 988 TF genes in C. elegans, and inferred corresponding sets in C. briggsae and C. remanei, containing 995 and 1093 TF genes, respectively. Analysis of the three gene sets revealed 652 3-way reciprocal 'best hit' orthologs (nematode TF set, approximately half of which are zinc finger (ZF-C2H2 and ZF-C4/NHR types and HOX family members. Examination of the TF genes in C. elegans and C. briggsae identified the presence of significant tandem clustering on chromosome V, the majority of which belong to ZF-C4/NHR family. We also found evidence for lineage-specific duplications and rapid evolution of many of the TF genes in the two species. A search of the TFs conserved among nematodes in Drosophila melanogaster, Mus musculus and Homo sapiens revealed 150 reciprocal orthologs, many of which are associated with important biological processes and human diseases. Finally, a comparison of the sequence, gene interactions and function indicates that nematode TFs conserved across phyla exhibit significantly more interactions and are enriched in genes with annotated mutant phenotypes compared to those that lack orthologs in other species. Conclusion Our study represents the first comprehensive genome-wide analysis of TFs across three nematode species and other organisms. The findings indicate substantial conservation of transcription
Conserved properties of Drosophila Insomniac link sleep regulation and synaptic function.

Science.gov (United States)

Li, Qiuling; Kellner, David A; Hatch, Hayden A M; Yumita, Tomohiro; Sanchez, Sandrine; Machold, Robert P; Frank, C Andrew; Stavropoulos, Nicholas

2017-05-01

Sleep is an ancient animal behavior that is regulated similarly in species ranging from flies to humans. Various genes that regulate sleep have been identified in invertebrates, but whether the functions of these genes are conserved in mammals remains poorly explored. Drosophila insomniac (inc) mutants exhibit severely shortened and fragmented sleep. Inc protein physically associates with the Cullin-3 (Cul3) ubiquitin ligase, and neuronal depletion of Inc or Cul3 strongly curtails sleep, suggesting that Inc is a Cul3 adaptor that directs the ubiquitination of neuronal substrates that impact sleep. Three proteins similar to Inc exist in vertebrates-KCTD2, KCTD5, and KCTD17-but are uncharacterized within the nervous system and their functional conservation with Inc has not been addressed. Here we show that Inc and its mouse orthologs exhibit striking biochemical and functional interchangeability within Cul3 complexes. Remarkably, KCTD2 and KCTD5 restore sleep to inc mutants, indicating that they can substitute for Inc in vivo and engage its neuronal targets relevant to sleep. Inc and its orthologs localize similarly within fly and mammalian neurons and can traffic to synapses, suggesting that their substrates may include synaptic proteins. Consistent with such a mechanism, inc mutants exhibit defects in synaptic structure and physiology, indicating that Inc is essential for both sleep and synaptic function. Our findings reveal that molecular functions of Inc are conserved through ~600 million years of evolution and support the hypothesis that Inc and its orthologs participate in an evolutionarily conserved ubiquitination pathway that links synaptic function and sleep regulation.
Simple connection between conservation laws in the Korteweg--de Vriesand sine-Gordon systems

International Nuclear Information System (INIS)

Chodos, A.

1980-01-01

An infinite sequence of conserved quantities follows from the Lax representation in both the Korteweg--de Vries and sine-Gordon systems. We show that these two sequences are related by a simple substitution. In an appendix, two different methods of deriving conservation laws from the Lax representation are presented
Allelic barley MLA immune receptors recognize sequence-unrelated avirulence effectors of the powdery mildew pathogen.

Science.gov (United States)

Lu, Xunli; Kracher, Barbara; Saur, Isabel M L; Bauer, Saskia; Ellwood, Simon R; Wise, Roger; Yaeno, Takashi; Maekawa, Takaki; Schulze-Lefert, Paul

2016-10-18

Disease-resistance genes encoding intracellular nucleotide-binding domain and leucine-rich repeat proteins (NLRs) are key components of the plant innate immune system and typically detect the presence of isolate-specific avirulence (AVR) effectors from pathogens. NLR genes define the fastest-evolving gene family of flowering plants and are often arranged in gene clusters containing multiple paralogs, contributing to copy number and allele-specific NLR variation within a host species. Barley mildew resistance locus a (Mla) has been subject to extensive functional diversification, resulting in allelic resistance specificities each recognizing a cognate, but largely unidentified, AVR a gene of the powdery mildew fungus, Blumeria graminis f. sp. hordei (Bgh). We applied a transcriptome-wide association study among 17 Bgh isolates containing different AVR a genes and identified AVR a1 and AVR a13 , encoding candidate-secreted effectors recognized by Mla1 and Mla13 alleles, respectively. Transient expression of the effector genes in barley leaves or protoplasts was sufficient to trigger Mla1 or Mla13 allele-specific cell death, a hallmark of NLR receptor-mediated immunity. AVR a1 and AVR a13 are phylogenetically unrelated, demonstrating that certain allelic MLA receptors evolved to recognize sequence-unrelated effectors. They are ancient effectors because corresponding loci are present in wheat powdery mildew. AVR A1 recognition by barley MLA1 is retained in transgenic Arabidopsis, indicating that AVR A1 directly binds MLA1 or that its recognition involves an evolutionarily conserved host target of AVR A1 Furthermore, analysis of transcriptome-wide sequence variation among the Bgh isolates provides evidence for Bgh population structure that is partially linked to geographic isolation.
Identification and characterization of putative conserved IAM ...

African Journals Online (AJOL)

Available putative AMI sequences from a wide array of monocot and dicot plants were identified and the phylogenetic tree was constructed and analyzed. We identified in this tree, a clade that contained sequences from species across the plant kingdom suggesting that AMI is conserved and may have a primary role in plant ...
Towards high-throughput phenotyping of complex patterned behaviors in rodents: focus on mouse self-grooming and its sequencing.

Science.gov (United States)

Kyzar, Evan; Gaikwad, Siddharth; Roth, Andrew; Green, Jeremy; Pham, Mimi; Stewart, Adam; Liang, Yiqing; Kobla, Vikrant; Kalueff, Allan V

2011-12-01

Increasingly recognized in biological psychiatry, rodent self-grooming is a complex patterned behavior with evolutionarily conserved cephalo-caudal progression. While grooming is traditionally assessed by the latency, frequency and duration, its sequencing represents another important domain sensitive to various experimental manipulations. Such behavioral complexity requires novel objective approaches to quantify rodent grooming, in addition to time-consuming and highly variable manual observation. The present study combined modern behavior-recognition video-tracking technologies (CleverSys, Inc.) with manual observation to characterize in-depth spontaneous (novelty-induced) and artificial (water-induced) self-grooming in adult male C57BL/6J mice. We specifically focused on individual episodes of grooming (paw licking, head washing, body/leg washing, and tail/genital grooming), their duration and transitions between episodes. Overall, the frequency, duration and transitions detected using the automated approach significantly correlated with manual observations (R=0.51-0.7, pgrooming, also indicating that behavior-recognition tools can be applied to characterize both the amount and sequential organization (patterning) of rodent grooming. Together with further refinement and methodological advancement, this approach will foster high-throughput neurophenotyping of grooming, with multiple applications in drug screening and testing of genetically modified animals. Copyright © 2011 Elsevier B.V. All rights reserved.
The putative Leishmania telomerase RNA (LeishTER undergoes trans-splicing and contains a conserved template sequence.

Directory of Open Access Journals (Sweden)

Elton J R Vasconcelos

Full Text Available Telomerase RNAs (TERs are highly divergent between species, varying in size and sequence composition. Here, we identify a candidate for the telomerase RNA component of Leishmania genus, which includes species that cause leishmaniasis, a neglected tropical disease. Merging a thorough computational screening combined with RNA-seq evidence, we mapped a non-coding RNA gene localized in a syntenic locus on chromosome 25 of five Leishmania species that shares partial synteny with both Trypanosoma brucei TER locus and a putative TER candidate-containing locus of Crithidia fasciculata. Using target-driven molecular biology approaches, we detected a ∼2,100 nt transcript (LeishTER that contains a 5' spliced leader (SL cap, a putative 3' polyA tail and a predicted C/D box snoRNA domain. LeishTER is expressed at similar levels in the logarithmic and stationary growth phases of promastigote forms. A 5'SL capped LeishTER co-immunoprecipitated and co-localized with the telomerase protein component (TERT in a cell cycle-dependent manner. Prediction of its secondary structure strongly suggests the existence of a bona fide single-stranded template sequence and a conserved C[U/C]GUCA motif-containing helix II, representing the template boundary element. This study paves the way for further investigations on the biogenesis of parasite TERT ribonucleoproteins (RNPs and its role in parasite telomere biology.
Structural and functional analysis of mouse Msx1 gene promoter: sequence conservation with human MSX1 promoter points at potential regulatory elements.

Science.gov (United States)

Gonzalez, S M; Ferland, L H; Robert, B; Abdelhay, E

1998-06-01

Vertebrate Msx genes are related to one of the most divergent homeobox genes of Drosophila, the muscle segment homeobox (msh) gene, and are expressed in a well-defined pattern at sites of tissue interactions. This pattern of expression is conserved in vertebrates as diverse as quail, zebrafish, and mouse in a range of sites including neural crest, appendages, and craniofacial structures. In the present work, we performed structural and functional analyses in order to identify potential cis-acting elements that may be regulating Msx1 gene expression. To this end, a 4.9-kb segment of the 5'-flanking region was sequenced and analyzed for transcription-factor binding sites. Four regions showing a high concentration of these sites were identified. Transfection assays with fragments of regulatory sequences driving the expression of the bacterial lacZ reporter gene showed that a region of 4 kb upstream of the transcription start site contains positive and negative elements responsible for controlling gene expression. Interestingly, a fragment of 130 bp seems to contain the minimal elements necessary for gene expression, as its removal completely abolishes gene expression in cultured cells. These results are reinforced by comparison of this region with the human Msx1 gene promoter, which shows extensive conservation, including many consensus binding sites, suggesting a regulatory role for them.
Evolutionary conservation of P-selectin glycoprotein ligand-1 primary structure and function

Directory of Open Access Journals (Sweden)

Schapira Marc

2007-09-01

Full Text Available Abstract Background P-selectin glycoprotein ligand-1 (PSGL-1 plays a critical role in recruiting leukocytes in inflammatory lesions by mediating leukocyte rolling on selectins. Core-2 O-glycosylation of a N-terminal threonine and sulfation of at least one tyrosine residue of PSGL-1 are required for L- and P-selectin binding. Little information is available on the intra- and inter-species evolution of PSGL-1 primary structure. In addition, the evolutionary conservation of selectin binding site on PSGL-1 has not been previously examined in detail. Therefore, we performed multiple sequence alignment of PSGL-1 amino acid sequences of 14 mammals (human, chimpanzee, rhesus monkey, bovine, pig, rat, tree-shrew, bushbaby, mouse, bat, horse, cat, sheep and dog and examined mammalian PSGL-1 interactions with human selectins. Results A signal peptide was predicted in each sequence and a propeptide cleavage site was found in 9/14 species. PSGL-1 N-terminus is poorly conserved. However, each species exhibits at least one tyrosine sulfation site and, except in horse and dog, a T [D/E]PP [D/E] motif associated to the core-2 O-glycosylation of a N-terminal threonine. A mucin-like domain of 250–280 amino acids long was disclosed in all studied species. It lies between the conserved N-terminal O-glycosylated threonine (Thr-57 in human and the transmembrane domain, and contains a central region exhibiting a variable number of decameric repeats (DR. Interspecies and intraspecies polymorphisms were observed. Transmembrane and cytoplasmic domain sequences are well conserved. The moesin binding residues that serve as adaptor between PSGL-1 and Syk, and are involved in regulating PSGL-1-dependent rolling on P-selectin are perfectly conserved in all analyzed mammalian sequences. Despite a poor conservation of PSGL-1 N-terminal sequence, CHO cells co-expressing human glycosyltransferases and human, bovine, pig or rat PSGL-1 efficiently rolled on human L- or P
Conservation genetics of Iberian raptors

Directory of Open Access Journals (Sweden)

Martinez–Cruz, B.

2011-12-01

Full Text Available In this paper I provide an overview of conservation genetics and describe the management actions in the wild that can benefit from conservation genetic studies. I describe the genetic factors of risk for the survival of wild species, the consequences of loss of genetic diversity, inbreeding and outbreeding depression, and the use of genetic tools to delimitate units of conservation. Then I introduce the most common applications of conservation genetics in the management of wild populations. In a second part of the paper I review the conservation genetic studies carried on the Iberian raptors. I introduce several studies on the Spanish imperial eagle, the bearded vulture, the black vulture and the red kite that were carried out using autosomal microsatellite markers and mitochondrial DNA (mtDNA sequencing. I describe studies on the lesser kestrel and Egyptian vulture that additionally applied major histocompatibility complex (MHC markers, with the purpose of incorporating the study of non–neutral variation. For every species I explain how these studies can be and/or are applied in the strategy of conservation in the wild.
Species delimitation of common reef corals in the genus Pocillopora using nucleotide sequence phylogenies, population genetics and symbiosis ecology.

Science.gov (United States)

Pinzón, Jorge H; LaJeunesse, Todd C

2011-01-01

Stony corals in the genus Pocillopora are among the most common and widely distributed of Indo-Pacific corals and, as such, are often the subject of physiological and ecological research. In the far Tropical Eastern Pacific (TEP), they are major constituents of shallow coral communities, exhibiting considerable variability in colony shape and branch morphology and marked differences in response to thermal stress. Numerous intermediates occur between morphospecies that may relate to extensive hybridization. The diversity of the Pocillopora genus in the TEP was analysed genetically using nuclear ribosomal (ITS2) and mitochondrial (ORF) sequences, and population genetic markers (seven microsatellite loci). The resident dinoflagellate endosymbiont (Symbiodinium sp.) in each sample was also characterized using sequences of the internal transcribed spacer 2 (ITS2) rDNA and the noncoding region of the chloroplast psbA minicircle. From these analyses, three symbiotically distinct, reproductively isolated, nonhybridizing, evolutionarily divergent animal lineages were identified. Designated types 1, 2 and 3, these groupings were incongruent with traditional morphospecies classification. Type 1 was abundant and widespread throughout the TEP; type 2 was restricted to the Clipperton Atoll; and type 3 was found only in Panama and the Galapagos Islands. Each type harboured a different Symbiodinium'species lineage' in Clade C, and only type 1 associated with the 'stress-tolerant'Symbiodinium glynni (D1). The accurate delineation of species and implementation of a proper taxonomy may profoundly improve our assessment of Pocillopora's reproductive biology, biogeographic distributions, and resilience to climate warming, information that must be considered when planning for the conservation of reef corals. © 2010 Blackwell Publishing Ltd.

An ancient spliceosomal intron in the ribosomal protein L7a gene (Rpl7a of Giardia lamblia

Directory of Open Access Journals (Sweden)

Gray Michael W

2005-08-01

Full Text Available Abstract Background Only one spliceosomal-type intron has previously been identified in the unicellular eukaryotic parasite, Giardia lamblia (a diplomonad. This intron is only 35 nucleotides in length and is unusual in possessing a non-canonical 5' intron boundary sequence, CT, instead of GT. Results We have identified a second spliceosomal-type intron in G. lamblia, in the ribosomal protein L7a gene (Rpl7a, that possesses a canonical GT 5' intron boundary sequence. A comparison of the two known Giardia intron sequences revealed extensive nucleotide identity at both the 5' and 3' intron boundaries, similar to the conserved sequence motifs recently identified at the boundaries of spliceosomal-type introns in Trichomonas vaginalis (a parabasalid. Based on these observations, we searched the partial G. lamblia genome sequence for these conserved features and identified a third spliceosomal intron, in an unassigned open reading frame. Our comprehensive analysis of the Rpl7a intron in other eukaryotic taxa demonstrates that it is evolutionarily conserved and is an ancient eukaryotic intron. Conclusion An analysis of the phylogenetic distribution and properties of the Rpl7a intron suggests its utility as a phylogenetic marker to evaluate particular eukaryotic groupings. Additionally, analysis of the G. lamblia introns has provided further insight into some of the conserved and unique features possessed by the recently identified spliceosomal introns in related organisms such as T. vaginalis and Carpediemonas membranifera.
Analysis of Pteridium ribosomal RNA sequences by rapid direct sequencing.

Science.gov (United States)

Tan, M K

1991-08-01

A total of 864 bases from 5 regions interspersed in the 18S and 26S rRNA molecules from various clones of Pteridium covering the general geographical distribution of the genus was analysed using a rapid rRNA sequencing technique. No base difference has been detected amongst the three major lineages, two of which apparently separated before the breakup of the ancient supercontinent, Pangaea. These regions of the rRNA sequences have thus been conserved for at least 160 million years and are here compared with other eukaryotic, especially plant rRNAs.
Universal sequence map (USM of arbitrary discrete sequences

Directory of Open Access Journals (Sweden)

Almeida Jonas S

2002-02-01

Full Text Available Abstract Background For over a decade the idea of representing biological sequences in a continuous coordinate space has maintained its appeal but not been fully realized. The basic idea is that any sequence of symbols may define trajectories in the continuous space conserving all its statistical properties. Ideally, such a representation would allow scale independent sequence analysis – without the context of fixed memory length. A simple example would consist on being able to infer the homology between two sequences solely by comparing the coordinates of any two homologous units. Results We have successfully identified such an iterative function for bijective mappingψ of discrete sequences into objects of continuous state space that enable scale-independent sequence analysis. The technique, named Universal Sequence Mapping (USM, is applicable to sequences with an arbitrary length and arbitrary number of unique units and generates a representation where map distance estimates sequence similarity. The novel USM procedure is based on earlier work by these and other authors on the properties of Chaos Game Representation (CGR. The latter enables the representation of 4 unit type sequences (like DNA as an order free Markov Chain transition table. The properties of USM are illustrated with test data and can be verified for other data by using the accompanying web-based tool:http://bioinformatics.musc.edu/~jonas/usm/. Conclusions USM is shown to enable a statistical mechanics approach to sequence analysis. The scale independent representation frees sequence analysis from the need to assume a memory length in the investigation of syntactic rules.
Sequence analysis of cereal sucrose synthase genes and isolation ...

African Journals Online (AJOL)

SERVER

2007-10-18

Oct 18, 2007 ... sequencing of sucrose synthase gene fragment from sor- ghum using primers designed at their conserved exons. MATERIALS AND METHODS. Multiple sequence alignment. Sucrose synthase gene sequences of various cereals like rice, maize, and barley were accessed from NCBI Genbank database.
DNA Barcoding: Amplification and sequence analysis of rbcl and matK genome regions in three divergent plant species

Directory of Open Access Journals (Sweden)

Javed Iqbal Wattoo

2016-11-01

Full Text Available Background: DNA barcoding is a novel method of species identification based on nucleotide diversity of conserved sequences. The establishment and refining of plant DNA barcoding systems is more challenging due to high genetic diversity among different species. Therefore, targeting the conserved nuclear transcribed regions would be more reliable for plant scientists to reveal genetic diversity, species discrimination and phylogeny. Methods: In this study, we amplified and sequenced the chloroplast DNA regions (matk+rbcl of Solanum nigrum, Euphorbia helioscopia and Dalbergia sissoo to study the functional annotation, homology modeling and sequence analysis to allow a more efficient utilization of these sequences among different plant species. These three species represent three families; Solanaceae, Euphorbiaceae and Fabaceae respectively. Biological sequence homology and divergence of amplified sequences was studied using Basic Local Alignment Tool (BLAST. Results: Both primers (matk+rbcl showed good amplification in three species. The sequenced regions reveled conserved genome information for future identification of different medicinal plants belonging to these species. The amplified conserved barcodes revealed different levels of biological homology after sequence analysis. The results clearly showed that the use of these conserved DNA sequences as barcode primers would be an accurate way for species identification and discrimination. Conclusion: The amplification and sequencing of conserved genome regions identified a novel sequence of matK in native species of Solanum nigrum. The findings of the study would be applicable in medicinal industry to establish DNA based identification of different medicinal plant species to monitor adulteration.
The utility of transcriptomics in fish conservation.

Science.gov (United States)

Connon, Richard E; Jeffries, Ken M; Komoroske, Lisa M; Todgham, Anne E; Fangue, Nann A

2018-01-29

There is growing recognition of the need to understand the mechanisms underlying organismal resilience (i.e. tolerance, acclimatization) to environmental change to support the conservation management of sensitive and economically important species. Here, we discuss how functional genomics can be used in conservation biology to provide a cellular-level understanding of organismal responses to environmental conditions. In particular, the integration of transcriptomics with physiological and ecological research is increasingly playing an important role in identifying functional physiological thresholds predictive of compensatory responses and detrimental outcomes, transforming the way we can study issues in conservation biology. Notably, with technological advances in RNA sequencing, transcriptome-wide approaches can now be applied to species where no prior genomic sequence information is available to develop species-specific tools and investigate sublethal impacts that can contribute to population declines over generations and undermine prospects for long-term conservation success. Here, we examine the use of transcriptomics as a means of determining organismal responses to environmental stressors and use key study examples of conservation concern in fishes to highlight the added value of transcriptome-wide data to the identification of functional response pathways. Finally, we discuss the gaps between the core science and policy frameworks and how thresholds identified through transcriptomic evaluations provide evidence that can be more readily used by resource managers. © 2018. Published by The Company of Biologists Ltd.
Comparative sequence analysis of Solanum and Arabidopsis in a hot spot for pathogen resistance on potato chromosome V reveals a patchwork of conserved and rapidly evolving genome segments

Directory of Open Access Journals (Sweden)

Bruggmann Rémy

2007-05-01

Full Text Available Abstract Background Quantitative phenotypic variation of agronomic characters in crop plants is controlled by environmental and genetic factors (quantitative trait loci = QTL. To understand the molecular basis of such QTL, the identification of the underlying genes is of primary interest and DNA sequence analysis of the genomic regions harboring QTL is a prerequisite for that. QTL mapping in potato (Solanum tuberosum has identified a region on chromosome V tagged by DNA markers GP21 and GP179, which contains a number of important QTL, among others QTL for resistance to late blight caused by the oomycete Phytophthora infestans and to root cyst nematodes. Results To obtain genomic sequence for the targeted region on chromosome V, two local BAC (bacterial artificial chromosome contigs were constructed and sequenced, which corresponded to parts of the homologous chromosomes of the diploid, heterozygous genotype P6/210. Two contiguous sequences of 417,445 and 202,781 base pairs were assembled and annotated. Gene-by-gene co-linearity was disrupted by non-allelic insertions of retrotransposon elements, stretches of diverged intergenic sequences, differences in gene content and gene order. The latter was caused by inversion of a 70 kbp genomic fragment. These features were also found in comparison to orthologous sequence contigs from three homeologous chromosomes of Solanum demissum, a wild tuber bearing species. Functional annotation of the sequence identified 48 putative open reading frames (ORF in one contig and 22 in the other, with an average of one ORF every 9 kbp. Ten ORFs were classified as resistance-gene-like, 11 as F-box-containing genes, 13 as transposable elements and three as transcription factors. Comparing potato to Arabidopsis thaliana annotated proteins revealed five micro-syntenic blocks of three to seven ORFs with A. thaliana chromosomes 1, 3 and 5. Conclusion Comparative sequence analysis revealed highly conserved collinear regions
Isolation, sequence identification and tissue expression profile of a ...

African Journals Online (AJOL)

The complete expressed sequence tag (CDS) sequence of Banna mini-pig inbred line (BMI) ribokinase gene (RBKS) was amplified using the reverse transcription-polymerase chain reaction (RT-PCR) based on the conserved sequence information of the cattle or other mammals and known highly homologous swine ESTs.
Total sequence decomposition distinguishes functional modules, "molegos" in apurinic/apyrimidinic endonucleases

Directory of Open Access Journals (Sweden)

Braun Werner

2002-11-01

Full Text Available Abstract Background Total sequence decomposition, using the web-based MASIA tool, identifies areas of conservation in aligned protein sequences. By structurally annotating these motifs, the sequence can be parsed into individual building blocks, molecular legos ("molegos", that can eventually be related to function. Here, the approach is applied to the apurinic/apyrimidinic endonuclease (APE DNA repair proteins, essential enzymes that have been highly conserved throughout evolution. The APEs, DNase-1 and inositol 5'-polyphosphate phosphatases (IPP form a superfamily that catalyze metal ion based phosphorolysis, but recognize different substrates. Results MASIA decomposition of APE yielded 12 sequence motifs, 10 of which are also structurally conserved within the family and are designated as molegos. The 12 motifs include all the residues known to be essential for DNA cleavage by APE. Five of these molegos are sequentially and structurally conserved in DNase-1 and the IPP family. Correcting the sequence alignment to match the residues at the ends of two of the molegos that are absolutely conserved in each of the three families greatly improved the local structural alignment of APEs, DNase-1 and synaptojanin. Comparing substrate/product binding of molegos common to DNase-1 showed that those distinctive for APEs are not directly involved in cleavage, but establish protein-DNA interactions 3' to the abasic site. These additional bonds enhance both specific binding to damaged DNA and the processivity of APE1. Conclusion A modular approach can improve structurally predictive alignments of homologous proteins with low sequence identity and reveal residues peripheral to the traditional "active site" that control the specificity of enzymatic activity.
Combining Amplification Typing of L1 Active Subfamilies (ATLAS) with High-Throughput Sequencing.

Science.gov (United States)

Rahbari, Raheleh; Badge, Richard M

2016-01-01

With the advent of new generations of high-throughput sequencing technologies, the catalog of human genome variants created by retrotransposon activity is expanding rapidly. However, despite these advances in describing L1 diversity and the fact that L1 must retrotranspose in the germline or prior to germline partitioning to be evolutionarily successful, direct assessment of de novo L1 retrotransposition in the germline or early embryogenesis has not been achieved for endogenous L1 elements. A direct study of de novo L1 retrotransposition into susceptible loci within sperm DNA (Freeman et al., Hum Mutat 32(8):978-988, 2011) suggested that the rate of L1 retrotransposition in the germline is much lower than previously estimated (ATLAS L1 display technique (Badge et al., Am J Hum Genet 72(4):823-838, 2003) to investigate de novo L1 retrotransposition in human genomes. In this chapter, we describe how we combined a high-coverage ATLAS variant with high-throughput sequencing, achieving 11-25× sequence depth per single amplicon, to study L1 retrotransposition in whole genome amplified (WGA) DNAs.
In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data

Science.gov (United States)

Tanaka, Mizuki; Sakai, Yoshifumi; Yamada, Osamu; Shintani, Takahiro; Gomi, Katsuya

2011-01-01

To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15–30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them. PMID:21586533
Deletion of a coordinate regulator of type 2 cytokine expression in mice

Energy Technology Data Exchange (ETDEWEB)

Mohrs, Markus; Blankespoor, Catherine M.; Wang, Zhi-En; Loots, Gaby G.; Hadeiba, Husein; Shinkai, Kanade; Rubin, Edward M.; Locksley, Richard M.

2001-07-30

Mechanisms underlying the differentiation of stable T helper subsets will be important in understanding how discrete types of immunity develop in response to different pathogens. An evolutionarily conserved {approx}400 base pair non-coding sequence in the IL-4/IL-13 intergenic region, designated CNS-1, was deleted in mice. The capacity to develop Th2 cells was compromised in vitro and in vivo in the absence of CNS-1. Despite the profound effect in T cells, mast cells from CNS-1-deleted mice maintained their capacity to produce IL-4. A T cell-specific element critical for optimal expression of type 2 cytokines may represent evolution of a regulatory sequence exploited by adaptive immunity.
Comparative and functional characterization of intragenic tandem repeats in 10 Aspergillus genomes.

Science.gov (United States)

Gibbons, John G; Rokas, Antonis

2009-03-01

Intragenic tandem repeats (ITRs) are consecutive repeats of three or more nucleotides found in coding regions. ITRs are the underlying cause of several human genetic diseases and have been associated with phenotypic variation, including pathogenesis, in several clades of the tree of life. We have examined the evolution and functional role of ITRs in 10 genomes spanning the fungal genus Aspergillus, a clade of relevance to medicine, agriculture, and industry. We identified several hundred ITRs in each of the species examined. ITR content varied extensively between species, with an average 79% of ITRs unique to a given species. For the fraction of conserved ITR regions, sequence comparisons within species and between close relatives revealed that they were highly variable. ITR-containing proteins were evolutionarily less conserved, compositionally distinct, and overrepresented for domains associated with cell-surface localization and function relative to the rest of the proteome. Furthermore, ITRs were preferentially found in proteins involved in transcription, cellular communication, and cell-type differentiation but were underrepresented in proteins involved in metabolism and energy. Importantly, although ITRs were evolutionarily labile, their functional associations appeared. To be remarkably conserved across eukaryotes. Fungal ITRs likely participate in a variety of developmental processes and cell-surface-associated functions, suggesting that their contribution to fungal lifestyle and evolution may be more general than previously assumed.
Evolutionarily conserved bias of amino-acid usage refines the definition of PDZ-binding motif

Directory of Open Access Journals (Sweden)

Launey Thomas

2011-06-01

Full Text Available Abstract Background The interactions between PDZ (PSD-95, Dlg, ZO-1 domains and PDZ-binding motifs play central roles in signal transductions within cells. Proteins with PDZ domains bind to PDZ-binding motifs almost exclusively when the motifs are located at the carboxyl (C- terminal ends of their binding partners. However, it remains little explored whether PDZ-binding motifs show any preferential location at the C-terminal ends of proteins, at genome-level. Results Here, we examined the distribution of the type-I (x-x-S/T-x-I/L/V or type-II (x-x-V-x-I/V PDZ-binding motifs in proteins encoded in the genomes of five different species (human, mouse, zebrafish, fruit fly and nematode. We first established that these PDZ-binding motifs are indeed preferentially present at their C-terminal ends. Moreover, we found specific amino acid (AA bias for the 'x' positions in the motifs at the C-terminal ends. In general, hydrophilic AAs were favored. Our genomics-based findings confirm and largely extend the results of previous interaction-based studies, allowing us to propose refined consensus sequences for all of the examined PDZ-binding motifs. An ontological analysis revealed that the refined motifs are functionally relevant since a large fraction of the proteins bearing the motif appear to be involved in signal transduction. Furthermore, co-precipitation experiments confirmed two new protein interactions predicted by our genomics-based approach. Finally, we show that influenza virus pathogenicity can be correlated with PDZ-binding motif, with high-virulence viral proteins bearing a refined PDZ-binding motif. Conclusions Our refined definition of PDZ-binding motifs should provide important clues for identifying functional PDZ-binding motifs and proteins involved in signal transduction.
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

Science.gov (United States)

Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

2015-01-01

This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

Directory of Open Access Journals (Sweden)

Nathan D. Olson

2015-03-01

Full Text Available This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1 identity of biologically conserved position, (2 ratio of 16S rRNA gene copies featuring identified variants, and (3 the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies.
Sequence of a cDNA encoding turtle high mobility group 1 protein.

Science.gov (United States)

Zheng, Jifang; Hu, Bi; Wu, Duansheng

2005-07-01

In order to understand sequence information about turtle HMG1 gene, a cDNA encoding HMG1 protein of the Chinese soft-shell turtle (Pelodiscus sinensis) was amplified by RT-PCR from kidney total RNA, and was cloned, sequenced and analyzed. The results revealed that the open reading frame (ORF) of turtle HMG1 cDNA is 606 bp long. The ORF codifies 202 amino acid residues, from which two DNA-binding domains and one polyacidic region are derived. The DNA-binding domains share higher amino acid identity with homologues sequences of chicken (96.5%) and mammalian (74%) than homologues sequence of rainbow trout (67%). The polyacidic region shows 84.6% amino acid homology with the equivalent region of chicken HMG1 cDNA. Turtle HMG1 protein contains 3 Cys residues located at completely conserved positions. Conservation in sequence and structure suggests that the functions of turtle HMG1 cDNA may be highly conserved during evolution. To our knowledge, this is the first report of HMG1 cDNA sequence in any reptilian.
Mitochondrial genome sequences illuminate maternal lineages of conservation concern in a rare carnivore

Science.gov (United States)

Brian J. Knaus; Richard Cronn; Aaron Liston; Kristine Pilgrim; Michael K. Schwartz

2011-01-01

Science-based wildlife management relies on genetic information to infer population connectivity and identify conservation units. The most commonly used genetic marker for characterizing animal biodiversity and identifying maternal lineages is the mitochondrial genome. Mitochondrial genotyping figures prominently in conservation and management plans, with much of the...
Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

Directory of Open Access Journals (Sweden)

Graner Andreas

2008-10-01

Full Text Available Abstract Background Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR index can be generated to map repetitive regions in genomic sequences. Results We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences regions in uncharacterised genomic sequences. The restriction that a particular
The identification and functional annotation of RNA structures conserved in vertebrates

DEFF Research Database (Denmark)

Seemann, Ernst Stefan; Mirza, Aashiq Hussain; Hansen, Claus

2017-01-01

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for Conserved RNA Structures (CRSs), leveraging structure-b......-structured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality.......Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for Conserved RNA Structures (CRSs), leveraging structure......-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ~516k human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (i) co-localize consistently with binding sites of the same RNA binding proteins...

Functional annotation by sequence-weighted structure alignments: statistical analysis and case studies from the Protein 3000 structural genomics project in Japan.

Science.gov (United States)

Standley, Daron M; Toh, Hiroyuki; Nakamura, Haruki

2008-09-01

A method to functionally annotate structural genomics targets, based on a novel structural alignment scoring function, is proposed. In the proposed score, position-specific scoring matrices are used to weight structurally aligned residue pairs to highlight evolutionarily conserved motifs. The functional form of the score is first optimized for discriminating domains belonging to the same Pfam family from domains belonging to different families but the same CATH or SCOP superfamily. In the optimization stage, we consider four standard weighting functions as well as our own, the "maximum substitution probability," and combinations of these functions. The optimized score achieves an area of 0.87 under the receiver-operating characteristic curve with respect to identifying Pfam families within a sequence-unique benchmark set of domain pairs. Confidence measures are then derived from the benchmark distribution of true-positive scores. The alignment method is next applied to the task of functionally annotating 230 query proteins released to the public as part of the Protein 3000 structural genomics project in Japan. Of these queries, 78 were found to align to templates with the same Pfam family as the query or had sequence identities > or = 30%. Another 49 queries were found to match more distantly related templates. Within this group, the template predicted by our method to be the closest functional relative was often not the most structurally similar. Several nontrivial cases are discussed in detail. Finally, 103 queries matched templates at the fold level, but not the family or superfamily level, and remain functionally uncharacterized. 2008 Wiley-Liss, Inc.
Identification of conserved regulatory elements by comparative genome analysis

Directory of Open Access Journals (Sweden)

Jareborg Niclas

2003-05-01

Full Text Available Abstract Background For genes that have been successfully delineated within the human genome sequence, most regulatory sequences remain to be elucidated. The annotation and interpretation process requires additional data resources and significant improvements in computational methods for the detection of regulatory regions. One approach of growing popularity is based on the preferential conservation of functional sequences over the course of evolution by selective pressure, termed 'phylogenetic footprinting'. Mutations are more likely to be disruptive if they appear in functional sites, resulting in a measurable difference in evolution rates between functional and non-functional genomic segments. Results We have devised a flexible suite of methods for the identification and visualization of conserved transcription-factor-binding sites. The system reports those putative transcription-factor-binding sites that are both situated in conserved regions and located as pairs of sites in equivalent positions in alignments between two orthologous sequences. An underlying collection of metazoan transcription-factor-binding profiles was assembled to facilitate the study. This approach results in a significant improvement in the detection of transcription-factor-binding sites because of an increased signal-to-noise ratio, as demonstrated with two sets of promoter sequences. The method is implemented as a graphical web application, ConSite, which is at the disposal of the scientific community at http://www.phylofoot.org/. Conclusions Phylogenetic footprinting dramatically improves the predictive selectivity of bioinformatic approaches to the analysis of promoter sequences. ConSite delivers unparalleled performance using a novel database of high-quality binding models for metazoan transcription factors. With a dynamic interface, this bioinformatics tool provides broad access to promoter analysis with phylogenetic footprinting.
A Trichosporonales genome tree based on 27 haploid and three evolutionarily conserved 'natural' hybrid genomes.

Science.gov (United States)

Takashima, Masako; Sriswasdi, Sira; Manabe, Ri-Ichiroh; Ohkuma, Moriya; Sugita, Takashi; Iwasaki, Wataru

2018-01-01

To construct a backbone tree consisting of basidiomycetous yeasts, draft genome sequences from 25 species of Trichosporonales (Tremellomycetes, Basidiomycota) were generated. In addition to the hybrid genomes of Trichosporon coremiiforme and Trichosporon ovoides that we described previously, we identified an interspecies hybrid genome in Cutaneotrichosporon mucoides (formerly Trichosporon mucoides). This hybrid genome had a gene retention rate of ~55%, and its closest haploid relative was Cutaneotrichosporon dermatis. After constructing the C. mucoides subgenomes, we generated a phylogenetic tree using genome data from the 27 haploid species and the subgenome data from the three hybrid genome species. It was a high-quality tree with 100% bootstrap support for all of the branches. The genome-based tree provided superior resolution compared with previous multi-gene analyses. Although our backbone tree does not include all Trichosporonales genera (e.g. Cryptotrichosporon), it will be valuable for future analyses of genome data. Interest in interspecies hybrid fungal genomes has recently increased because they may provide a basis for new technologies. The three Trichosporonales hybrid genomes described in this study are different from well-characterized hybrid genomes (e.g. those of Saccharomyces pastorianus and Saccharomyces bayanus) because these hybridization events probably occurred in the distant evolutionary past. Hence, they will be useful for studying genome stability following hybridization and speciation events. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Structural and Sequence Similarities of Hydra Xeroderma Pigmentosum A Protein to Human Homolog Suggest Early Evolution and Conservation

Directory of Open Access Journals (Sweden)

Apurva Barve

2013-01-01

Full Text Available Xeroderma pigmentosum group A (XPA is a protein that binds to damaged DNA, verifies presence of a lesion, and recruits other proteins of the nucleotide excision repair (NER pathway to the site. Though its homologs from yeast, Drosophila, humans, and so forth are well studied, XPA has not so far been reported from protozoa and lower animal phyla. Hydra is a fresh-water cnidarian with a remarkable capacity for regeneration and apparent lack of organismal ageing. Cnidarians are among the first metazoa with a defined body axis, tissue grade organisation, and nervous system. We report here for the first time presence of XPA gene in hydra. Putative protein sequence of hydra XPA contains nuclear localization signal and bears the zinc-finger motif. It contains two conserved Pfam domains and various characterized features of XPA proteins like regions for binding to excision repair cross-complementing protein-1 (ERCC1 and replication protein A 70 kDa subunit (RPA70 proteins. Hydra XPA shows a high degree of similarity with vertebrate homologs and clusters with deuterostomes in phylogenetic analysis. Homology modelling corroborates the very close similarity between hydra and human XPA. The protein thus most likely functions in hydra in the same manner as in other animals, indicating that it arose early in evolution and has been conserved across animal phyla.
Genome-wide discovery of novel and conserved microRNAs in white shrimp (Litopenaeus vannamei).

Science.gov (United States)

Xi, Qian-Yun; Xiong, Yuan-Yan; Wang, Yuan-Mei; Cheng, Xiao; Qi, Qi-En; Shu, Gang; Wang, Song-Bo; Wang, Li-Na; Gao, Ping; Zhu, Xiao-Tong; Jiang, Qing-Yan; Zhang, Yong-Liang; Liu, Li

2015-01-01

Of late years, a large amount of conserved and species-specific microRNAs (miRNAs) have been performed on identification from species which are economically important but lack a full genome sequence. In this study, Solexa deep sequencing and cross-species miRNA microarray were used to detect miRNAs in white shrimp. We identified 239 conserved miRNAs, 14 miRNA* sequences and 20 novel miRNAs by bioinformatics analysis from 7,561,406 high-quality reads representing 325,370 distinct sequences. The all 20 novel miRNAs were species-specific in white shrimp and not homologous in other species. Using the conserved miRNAs from the miRBase database as a query set to search for homologs from shrimp expressed sequence tags (ESTs), 32 conserved computationally predicted miRNAs were discovered in shrimp. In addition, using microarray analysis in the shrimp fed with Panax ginseng polysaccharide complex, 151 conserved miRNAs were identified, 18 of which were significant up-expression, while 49 miRNAs were significant down-expression. In particular, qRT-PCR analysis was also performed for nine miRNAs in three shrimp tissues such as muscle, gill and hepatopancreas. Results showed that these miRNAs expression are tissue specific. Combining results of the three methods, we detected 20 novel and 394 conserved miRNAs. Verification with quantitative reverse transcription (qRT-PCR) and Northern blot showed a high confidentiality of data. The study provides the first comprehensive specific miRNA profile of white shrimp, which includes useful information for future investigations into the function of miRNAs in regulation of shrimp development and immunology.
Comparative molecular analysis of evolutionarily distant glyceraldehyde-3-phosphate dehydrogenase from Sardina pilchardus and Octopus vulgaris.

Science.gov (United States)

Baibai, Tarik; Oukhattar, Laila; Mountassif, Driss; Assobhei, Omar; Serrano, Aurelio; Soukri, Abdelaziz

2010-12-01

The NAD(+)-dependent cytosolic glyceraldehyde-3-phosphate dehydrogenase (GAPDH, EC 1.2.1.12), which is recognized as a key to central carbon metabolism in glycolysis and gluconeogenesis and as an important allozymic polymorphic biomarker, was purified from muscles of two marine species: the skeletal muscle of Sardina pilchardus Walbaum (Teleost, Clupeida) and the incompressible arm muscle of Octopus vulgaris (Mollusca, Cephalopoda). Comparative biochemical studies have revealed that they differ in their subunit molecular masses and in pI values. Partial cDNA sequences corresponding to an internal region of the GapC genes from Sardina and Octopus were obtained by polymerase chain reaction using degenerate primers designed from highly conserved protein motifs. Alignments of the deduced amino acid sequences were used to establish the 3D structures of the active site of two enzymes as well as the phylogenetic relationships of the sardine and octopus enzymes. These two enzymes are the first two GAPDHs characterized so far from teleost fish and cephalopod, respectively. Interestingly, phylogenetic analyses indicated that the sardina GAPDH is in a cluster with the archetypical enzymes from other vertebrates, while the octopus GAPDH comes together with other molluscan sequences in a distant basal assembly closer to bacterial and fungal orthologs, thus suggesting their different evolutionary scenarios.
Sequence walkers: a graphical method to display how binding proteins interact with DNA or RNA sequences | Center for Cancer Research

Science.gov (United States)

A graphical method is presented for displaying how binding proteins and other macromolecules interact with individual bases of nucleotide sequences. Characters representing the sequence are either oriented normally and placed above a line indicating favorable contact, or upside-down and placed below the line indicating unfavorable contact. The positive or negative height of each letter shows the contribution of that base to the average sequence conservation of the binding site, as represented by a sequence logo.
Zebrafish (Danio rerio) androgen receptor: sequence homology and up-regulation by the fungicide vinclozolin.

Science.gov (United States)

Smolinsky, Amanda N; Doughman, Jennifer M; Kratzke, Liên-Thành C; Lassiter, Christopher S

2010-03-01

Steroid hormones regulate gene expression in organisms by binding to receptor proteins. These hormones include the androgens, which signal through androgen receptors (ARs). Endocrine disrupters (EDCs) are chemicals in the environment that adversely affect organisms by binding to nuclear receptors, including ARs. Vinclozolin, a fungicide used on fruit and vegetable crops, is a known anti-androgen, a type of EDC that blocks signals from testosterone and its derivatives. In order to better understand the effects of EDCs, further research on androgen receptors and other hormone signaling pathways is necessary. In this study, we demonstrate the evolutionary conservation between the genomic structure of the human and zebrafish ar genes and find that ar mRNA expression increases in zebrafish embryos exposed to vinclozolin, which may be evolutionarily conserved as well. At 48 and 72 h post-fertilization, vinclozolin-treated embryos express ar mRNA 8-fold higher than the control level. These findings suggest that zebrafish embryos attempt to compensate for the presence of an anti-androgen by increasing the number of androgen receptors available.
Sequence of human protamine 2 cDNA

Energy Technology Data Exchange (ETDEWEB)

Domenjoud, L; Fronia, C; Uhde, F; Engel, W [Universitaet Goettingen (West Germany)

1988-08-11

The authors report the cloning and sequencing of a cDNA clone for human protamine 2 (hp2), isolated from a human testis cDNA library cloned in the vector {lambda}-gt11. A 66mer oligonucleotide, that corresponds to an amino acid sequence which is highly conserved between hp2 and mouse protamine 2 (mp2) served as hybridization probe. The homology between the amino acid sequence deduced from our cDNA and the published amino acid sequence for hp2 is 100%.
Comparative analysis of catfish BAC end sequences with the zebrafish genome

Directory of Open Access Journals (Sweden)

Abernathy Jason

2009-12-01

Full Text Available Abstract Background Comparative mapping is a powerful tool to transfer genomic information from sequenced genomes to closely related species for which whole genome sequence data are not yet available. However, such an approach is still very limited in catfish, the most important aquaculture species in the United States. This project was initiated to generate additional BAC end sequences and demonstrate their applications in comparative mapping in catfish. Results We reported the generation of 43,000 BAC end sequences and their applications for comparative genome analysis in catfish. Using these and the additional 20,000 existing BAC end sequences as a resource along with linkage mapping and existing physical map, conserved syntenic regions were identified between the catfish and zebrafish genomes. A total of 10,943 catfish BAC end sequences (17.3% had significant BLAST hits to the zebrafish genome (cutoff value ≤ e-5, of which 3,221 were unique gene hits, providing a platform for comparative mapping based on locations of these genes in catfish and zebrafish. Genetic linkage mapping of microsatellites associated with contigs allowed identification of large conserved genomic segments and construction of super scaffolds. Conclusion BAC end sequences and their associated polymorphic markers are great resources for comparative genome analysis in catfish. Highly conserved chromosomal regions were identified to exist between catfish and zebrafish. However, it appears that the level of conservation at local genomic regions are high while a high level of chromosomal shuffling and rearrangements exist between catfish and zebrafish genomes. Orthologous regions established through comparative analysis should facilitate both structural and functional genome analysis in catfish.
Indole: An evolutionarily conserved influencer of behavior across kingdoms.

Science.gov (United States)

Tomberlin, Jeffery K; Crippen, Tawni L; Wu, Guoyao; Griffin, Ashleigh S; Wood, Thomas K; Kilner, Rebecca M

2017-02-01

Indole is a key environmental cue that is used by many organisms. Based on its biochemistry, we suggest indole is used so universally, and by such different organisms, because it derives from the metabolism of tryptophan, a resource essential for many species yet rare in nature. These properties make it a valuable, environmental cue for resources almost universally important for promoting fitness. We then describe how indole is used to coordinate actions within organisms, to influence the behavior of conspecifics and can even be used to change the behavior of species that belong to other kingdoms. Drawing on the evolutionary framework that has been developed for understanding animal communication, we show how this is diversely achieved by indole acting as a cue, a manipulative signal, and an honest signal, as well as how indole can be used synergistically to amplify information conveyed by other molecules. Clarifying these distinct functions of indole identifies patterns that transcend different kingdoms of organisms. © 2016 WILEY Periodicals, Inc.
Evaluation of the conserve flavin reductase gene from three ...

African Journals Online (AJOL)

STORAGESEVER

2009-12-15

Dec 15, 2009 ... means of PCR technique. The nucleic acid sequences of the PCR primers were designed using conserved nucleic acid sequences of the flavin reductase enzyme from. Rhodococcus sp. strain IGTS8. The oligonucleotide primers were as follows: 5'-GAA TTC ATG TCT GAC. AAG CCG AAT GCC-3' (forward) ...
The dinoflagellates Durinskia baltica and Kryptoperidinium foliaceum retain functionally overlapping mitochondria from two evolutionarily distinct lineages

Directory of Open Access Journals (Sweden)

Keeling Patrick J

2007-09-01

Full Text Available Abtract Background The dinoflagellates Durinskia baltica and Kryptoperidinium foliaceum are distinguished by the presence of a tertiary plastid derived from a diatom endosymbiont. The diatom is fully integrated with the host cell cycle and is so altered in structure as to be difficult to recognize it as a diatom, and yet it retains a number of features normally lost in tertiary and secondary endosymbionts, most notably mitochondria. The dinoflagellate host is also reported to retain mitochondrion-like structures, making these cells unique in retaining two evolutionarily distinct mitochondria. This redundancy raises the question of whether the organelles share any functions in common or have distributed functions between them. Results We show that both host and endosymbiont mitochondrial genomes encode genes for electron transport proteins. We have characterized cytochrome c oxidase 1 (cox1, cytochrome oxidase 2 (cox2, cytochrome oxidase 3 (cox3, cytochrome b (cob, and large subunit of ribosomal RNA (LSUrRNA of endosymbiont mitochondrial ancestry, and cox1 and cob of host mitochondrial ancestry. We show that all genes are transcribed and that those ascribed to the host mitochondrial genome are extensively edited at the RNA level, as expected for a dinoflagellate mitochondrion-encoded gene. We also found evidence for extensive recombination in the host mitochondrial genes and that recombination products are also transcribed, as expected for a dinoflagellate. Conclusion Durinskia baltica and K. foliaceum retain two mitochondria from evolutionarily distinct lineages, and the functions of these organelles are at least partially overlapping, since both express genes for proteins in electron transport.
A Conserved Circular Network of Coregulated Lipids Modulates Innate Immune Responses.

Science.gov (United States)

Köberlin, Marielle S; Snijder, Berend; Heinz, Leonhard X; Baumann, Christoph L; Fauster, Astrid; Vladimer, Gregory I; Gavin, Anne-Claude; Superti-Furga, Giulio

2015-07-02

Lipid composition affects the biophysical properties of membranes that provide a platform for receptor-mediated cellular signaling. To study the regulatory role of membrane lipid composition, we combined genetic perturbations of sphingolipid metabolism with the quantification of diverse steps in Toll-like receptor (TLR) signaling and mass spectrometry-based lipidomics. Membrane lipid composition was broadly affected by these perturbations, revealing a circular network of coregulated sphingolipids and glycerophospholipids. This evolutionarily conserved network architecture simultaneously reflected membrane lipid metabolism, subcellular localization, and adaptation mechanisms. Integration of the diverse TLR-induced inflammatory phenotypes with changes in lipid abundance assigned distinct functional roles to individual lipid species organized across the network. This functional annotation accurately predicted the inflammatory response of cells derived from patients suffering from lipid storage disorders, based solely on their altered membrane lipid composition. The analytical strategy described here empowers the understanding of higher-level organization of membrane lipid function in diverse biological systems. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Perception Enhancement using Visual Attributes in Sequence Motif Visualization

OpenAIRE

Oon, Yin; Lee, Nung; Kok, Wei

2016-01-01

Sequence logo is a well-accepted scientific method to visualize the conservation characteristics of biological sequence motifs. Previous studies found that using sequence logo graphical representation for scientific evidence reports or arguments could seriously cause biases and misinterpretation by users. This study investigates on the visual attributes performance of a sequence logo in helping users to perceive and interpret the information based on preattentive theories and Gestalt principl...
Cell type-specific termination of transcription by transposable element sequences.

Science.gov (United States)

Conley, Andrew B; Jordan, I King

2012-09-30

Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription
De novo prediction of structured RNAs from genomic sequences

DEFF Research Database (Denmark)

Gorodkin, Jan; Hofacker, Ivo L.; Þórarinsson, Elfar

2010-01-01

currently available, because evolutionary conservation highlights functionally important regions. Conserved secondary structure, rather than primary sequence, is the hallmark of many functionally important RNAs, because compensatory substitutions in base-paired regions preserve structure. Unfortunately...
Combining protein sequence, structure, and dynamics: A novel approach for functional evolution analysis of PAS domain superfamily.

Science.gov (United States)

Dong, Zheng; Zhou, Hongyu; Tao, Peng

2018-02-01

PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.
High throughput sequencing of small RNA component of leaves and inflorescence revealed conserved and novel miRNAs as well as phasiRNA loci in chickpea.

Science.gov (United States)

Srivastava, Sangeeta; Zheng, Yun; Kudapa, Himabindu; Jagadeeswaran, Guru; Hivrale, Vandana; Varshney, Rajeev K; Sunkar, Ramanjulu

2015-06-01

Among legumes, chickpea (Cicer arietinum L.) is the second most important crop after soybean. MicroRNAs (miRNAs) play important roles by regulating target gene expression important for plant development and tolerance to stress conditions. Additionally, recently discovered phased siRNAs (phasiRNAs), a new class of small RNAs, are abundantly produced in legumes. Nevertheless, little is known about these regulatory molecules in chickpea. The small RNA population was sequenced from leaves and flowers of chickpea to identify conserved and novel miRNAs as well as phasiRNAs/phasiRNA loci. Bioinformatics analysis revealed 157 miRNA loci for the 96 highly conserved and known miRNA homologs belonging to 38 miRNA families in chickpea. Furthermore, 20 novel miRNAs belonging to 17 miRNA families were identified. Sequence analysis revealed approximately 60 phasiRNA loci. Potential target genes likely to be regulated by these miRNAs were predicted and some were confirmed by modified 5' RACE assay. Predicted targets are mostly transcription factors that might be important for developmental processes, and others include superoxide dismutases, plantacyanin, laccases and F-box proteins that could participate in stress responses and protein degradation. Overall, this study provides an inventory of miRNA-target gene interactions for chickpea, useful for the comparative analysis of small RNAs among legumes. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Phylogenetic analysis reveals conservation and diversification of micro RNA166 genes among diverse plant species.

Science.gov (United States)

Barik, Suvakanta; SarkarDas, Shabari; Singh, Archita; Gautam, Vibhav; Kumar, Pramod; Majee, Manoj; Sarkar, Ananda K

2014-01-01

Similar to the majority of the microRNAs, mature miR166s are derived from multiple members of MIR166 genes (precursors) and regulate various aspects of plant development by negatively regulating their target genes (Class III HD-ZIP). The evolutionary conservation or functional diversification of miRNA166 family members remains elusive. Here, we show the phylogenetic relationships among MIR166 precursor and mature sequences from three diverse model plant species. Despite strong conservation, some mature miR166 sequences, such as ppt-miR166m, have undergone sequence variation. Critical sequence variation in ppt-miR166m has led to functional diversification, as it targets non-HD-ZIPIII gene transcript (s). MIR166 precursor sequences have diverged in a lineage specific manner, and both precursors and mature osa-miR166i/j are highly conserved. Interestingly, polycistronic MIR166s were present in Physcomitrella and Oryza but not in Arabidopsis. The nature of cis-regulatory motifs on the upstream promoter sequences of MIR166 genes indicates their possible contribution to the functional variation observed among miR166 species. Copyright © 2013 Elsevier Inc. All rights reserved.

Phylogeny based discovery of regulatory elements

Directory of Open Access Journals (Sweden)

Cohen Barak A

2006-05-01

Full Text Available Abstract Background Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequences. Results We developed a probabilistic framework that combines an HKY85 model, which assigns probabilities to different base substitutions between species, and weight matrix models of transcription factor binding sites, which describe the probabilities of observing particular nucleotides at specific positions in the binding site. The method incorporates the phylogenies of the species under consideration and takes into account the position specific variation of transcription factor binding sites. Using our framework we assessed the suitability of alignments of genomic sequences from commonly used species as substrates for comparative genomic approaches to regulatory motif finding. We then applied this technique to Saccharomyces cerevisiae and related species by examining all possible six base pair DNA sequences (hexamers and identifying sequences that are conserved in a significant number of promoters. By combining similar conserved hexamers we reconstructed known cis-regulatory motifs and made predictions of previously unidentified motifs. We tested one prediction experimentally, finding it to be a regulatory element involved in the transcriptional response to glucose. Conclusion The experimental validation of a regulatory element prediction missed by other large-scale motif finding studies demonstrates that our approach is a useful addition to the current suite of tools for finding regulatory motifs.
Conserved upstream open reading frames in higher plants

Directory of Open Access Journals (Sweden)

Schultz Carolyn J

2008-07-01

Full Text Available Abstract Background Upstream open reading frames (uORFs can down-regulate the translation of the main open reading frame (mORF through two broad mechanisms: ribosomal stalling and reducing reinitiation efficiency. In distantly related plants, such as rice and Arabidopsis, it has been found that conserved uORFs are rare in these transcriptomes with approximately 100 loci. It is unclear how prevalent conserved uORFs are in closely related plants. Results We used a homology-based approach to identify conserved uORFs in five cereals (monocots that could potentially regulate translation. Our approach used a modified reciprocal best hit method to identify putative orthologous sequences that were then analysed by a comparative R-nomics program called uORFSCAN to find conserved uORFs. Conclusion This research identified new genes that may be controlled at the level of translation by conserved uORFs. We report that conserved uORFs are rare (
ASAP: Amplification, sequencing & annotation of plastomes

Directory of Open Access Journals (Sweden)

Folta Kevin M

2005-12-01

Full Text Available Abstract Background Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA is being used currently to amplify the chloroplast genome from purified chloroplast DNA and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate ~1 – 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera. Results 100% coverage of the inverted repeat region was obtained from Arabidopsis, tobacco, orange, strawberry, peach, lettuce, tomato and Amaranthus. Over 80% coverage was obtained from distant species, including Ginkgo, loblolly pine and Equisetum. Sequence from the inverted repeat region of strawberry and peach plastome was obtained, annotated and analyzed. Additionally, a polymorphic region identified from gel electrophoresis was sequenced from tomato and Amaranthus. Sequence analysis revealed large deletions in these species relative to tobacco plastome thus exhibiting the utility of this method for structural and
Tissue-specific expression of aryl hydrocarbon receptor and putative developmental regulatory modules in Baltic salmon yolk-sac fry

Energy Technology Data Exchange (ETDEWEB)

Vuori, Kristiina A. [Centre of Excellence in Evolutionary Genetics and Physiology, Department of Biology, University of Turku, FI-20014 Turku (Finland)], E-mail: kristiina.vuori@utu.fi; Nordlund, Eija [Department of Information Technology, University of Turku, and Turku Centre for Computer Science (TUCS), FI-20014 Turku (Finland); Kallio, Jenny [Centre of Excellence in Evolutionary Genetics and Physiology, Department of Biology, University of Turku, FI-20014 Turku (Finland); Salakoski, Tapio [Department of Information Technology, University of Turku, and Turku Centre for Computer Science (TUCS), FI-20014 Turku (Finland); Nikinmaa, Mikko [Centre of Excellence in Evolutionary Genetics and Physiology, Department of Biology, University of Turku, FI-20014 Turku (Finland)

2008-04-08

The aryl hydrocarbon receptor (AhR) is an ancient protein that is conserved in vertebrates and invertebrates, indicating its important function throughout evolution. AhR has been studied largely because of its role in toxicology-gene expression via AhR is induced by many aromatic hydrocarbons in mammals. Recently, however, it has become clear that AhR is involved in various aspects of development such as cell proliferation and differentiation, and cell motility and migration. The mechanisms by which AhR regulates these various functions remain poorly understood. Across-species comparative studies of AhR in invertebrates, non-mammalian vertebrates and mammals may help to reveal the multiple functions of AhR. Here, we have studied AhR during larval development of Baltic salmon (Salmon salar). Our results indicate that AhR protein is expressed in nervous system, liver and muscle tissues. We also present putative regulatory modules and module-matching genes, produced by chromatin immunoprecipitation (ChIP) cloning and in silico analysis, which may be associated with evolutionarily conserved functions of AhR during development. For example, the module NFKB-AHRR-CREB found from salmon ChIP sequences is present in human ULK3 (regulating formation of granule cell axons in mouse and axon outgrowth in Caernohabditis elegans) and SRGAP1 (GTPase-activating protein involved in the Slit/Robo pathway) promoters. We suggest that AhR may have an evolutionarily conserved role in neuronal development and nerve cell targeting, and in Wnt signaling pathway.
Tissue-specific expression of aryl hydrocarbon receptor and putative developmental regulatory modules in Baltic salmon yolk-sac fry

International Nuclear Information System (INIS)

Vuori, Kristiina A.; Nordlund, Eija; Kallio, Jenny; Salakoski, Tapio; Nikinmaa, Mikko

2008-01-01

The aryl hydrocarbon receptor (AhR) is an ancient protein that is conserved in vertebrates and invertebrates, indicating its important function throughout evolution. AhR has been studied largely because of its role in toxicology-gene expression via AhR is induced by many aromatic hydrocarbons in mammals. Recently, however, it has become clear that AhR is involved in various aspects of development such as cell proliferation and differentiation, and cell motility and migration. The mechanisms by which AhR regulates these various functions remain poorly understood. Across-species comparative studies of AhR in invertebrates, non-mammalian vertebrates and mammals may help to reveal the multiple functions of AhR. Here, we have studied AhR during larval development of Baltic salmon (Salmon salar). Our results indicate that AhR protein is expressed in nervous system, liver and muscle tissues. We also present putative regulatory modules and module-matching genes, produced by chromatin immunoprecipitation (ChIP) cloning and in silico analysis, which may be associated with evolutionarily conserved functions of AhR during development. For example, the module NFKB-AHRR-CREB found from salmon ChIP sequences is present in human ULK3 (regulating formation of granule cell axons in mouse and axon outgrowth in Caernohabditis elegans) and SRGAP1 (GTPase-activating protein involved in the Slit/Robo pathway) promoters. We suggest that AhR may have an evolutionarily conserved role in neuronal development and nerve cell targeting, and in Wnt signaling pathway
Structure-sequence based analysis for identification of conserved regions in proteins

Science.gov (United States)

Zemla, Adam T; Zhou, Carol E; Lam, Marisa W; Smith, Jason R; Pardes, Elizabeth

2013-05-28

Disclosed are computational methods, and associated hardware and software products for scoring conservation in a protein structure based on a computationally identified family or cluster of protein structures. A method of computationally identifying a family or cluster of protein structures in also disclosed herein.
Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand.

Science.gov (United States)

Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai

2013-08-01

To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattering across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded that of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to human and from human to humans in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.
In Silico Characterization of Pectate Lyase Protein Sequences from Different Source Organisms

Directory of Open Access Journals (Sweden)

Amit Kumar Dubey

2010-01-01

Full Text Available A total of 121 protein sequences of pectate lyases were subjected to homology search, multiple sequence alignment, phylogenetic tree construction, and motif analysis. The phylogenetic tree constructed revealed different clusters based on different source organisms representing bacterial, fungal, plant, and nematode pectate lyases. The multiple accessions of bacterial, fungal, nematode, and plant pectate lyase protein sequences were placed closely revealing a sequence level similarity. The multiple sequence alignment of these pectate lyase protein sequences from different source organisms showed conserved regions at different stretches with maximum homology from amino acid residues 439–467, 715–816, and 829–910 which could be used for designing degenerate primers or probes specific for pectate lyases. The motif analysis revealed a conserved Pec_Lyase_C domain uniformly observed in all pectate lyases irrespective of variable sources suggesting its possible role in structural and enzymatic functions.
Identification of a conserved archaeal RNA polymerase subunit contacted by the basal transcription factor TFB.

Science.gov (United States)

Magill, C P; Jackson, S P; Bell, S D

2001-12-14

Archaea possess two general transcription factors that are required to recruit RNA polymerase (RNAP) to promoters in vitro. These are TBP, the TATA-box-binding protein and TFB, the archaeal homologue of TFIIB. Thus, the archaeal and eucaryal transcription machineries are fundamentally related. In both RNAP II and archaeal transcription systems, direct contacts between TFB/TFIIB and the RNAP have been demonstrated to mediate recruitment of the polymerase to the promoter. However the subunit(s) directly contacted by these factors has not been identified. Using systematic yeast two-hybrid and biochemical analyses we have identified an interaction between the N-terminal domain of TFB and an evolutionarily conserved subunit of the RNA polymerase, RpoK. Intriguingly, homologues of RpoK are found in all three nuclear RNA polymerases (Rpb6) and also in the bacterial RNA polymerase (omega-subunit).
Core genome conservation of Staphylococcus haemolyticus limits sequence based population structure analysis.

Science.gov (United States)

Cavanagh, Jorunn Pauline; Klingenberg, Claus; Hanssen, Anne-Merethe; Fredheim, Elizabeth Aarag; Francois, Patrice; Schrenzel, Jacques; Flægstad, Trond; Sollid, Johanna Ericson

2012-06-01

The notoriously multi-resistant Staphylococcus haemolyticus is an emerging pathogen causing serious infections in immunocompromised patients. Defining the population structure is important to detect outbreaks and spread of antimicrobial resistant clones. Currently, the standard typing technique is pulsed-field gel electrophoresis (PFGE). In this study we describe novel molecular typing schemes for S. haemolyticus using multi locus sequence typing (MLST) and multi locus variable number of tandem repeats (VNTR) analysis. Seven housekeeping genes (MLST) and five VNTR loci (MLVF) were selected for the novel typing schemes. A panel of 45 human and veterinary S. haemolyticus isolates was investigated. The collection had diverse PFGE patterns (38 PFGE types) and was sampled over a 20 year-period from eight countries. MLST resolved 17 sequence types (Simpsons index of diversity [SID]=0.877) and MLVF resolved 14 repeat types (SID=0.831). We found a low sequence diversity. Phylogenetic analysis clustered the isolates in three (MLST) and one (MLVF) clonal complexes, respectively. Taken together, neither the MLST nor the MLVF scheme was suitable to resolve the population structure of this S. haemolyticus collection. Future MLVF and MLST schemes will benefit from addition of more variable core genome sequences identified by comparing different fully sequenced S. haemolyticus genomes. Copyright © 2012 Elsevier B.V. All rights reserved.
Conservation of the Type IV secretion system throughout Wolbachia evolution

DEFF Research Database (Denmark)

Pichon, Samuel; Bouchon, Didier; Cordaux, Richard

2009-01-01

, encoding a T4SS were previously identified and characterized at two separate genomic loci. Using the largest data set of Wolbachia strains studied so far, we show that vir gene sequence and organization are strictly conserved among 37 Wolbachia strains inducing various phenotypes such as cytoplasmic...... incompatibility, feminization, or oogenesis in their arthropod hosts. In sharp contrast, extensive variation of genomic sequences flanking the virB8-D4 operon suggested its distinct location among Wolbachia genomes. Long term conservation of the T4SS may imply maintenance of a functional effector translocation...... system in Wolbachia, thereby suggesting the importance for the T4SS in Wolbachia biology and survival inside host cells....
Localization of Daucus carota NMCP1 to the nuclear periphery: the role of the N-terminal region and an NLS-linked sequence motif, RYNLRR, in the tail domain

Directory of Open Access Journals (Sweden)

Yuta eKimura

2014-02-01

Full Text Available Recent ultrastructural studies revealed that a structure similar to the vertebrate nuclear lamina exists in the nuclei of higher plants. However, plant genomes lack genes for lamins and intermediate-type filament proteins, and this suggests that plant-specific nuclear coiled-coil proteins make up the lamina-like structure in plants. NMCP1 is a protein, first identified in Daucus carota cells, that localizes exclusively to the nuclear periphery in interphase cells. It has a tripartite structure comprised of head, rod, and tail domains, and includes putative nuclear localization signal (NLS motifs. We identified the functional NLS of DcNMCP1 (carrot NMCP1 and determined the protein regions required for localizing to the nuclear periphery using EGFP-fused constructs transiently expressed in Apium graveolens epidermal cells. Transcription was driven under a CaMV35S promoter, and the genes were introduced into the epidermal cells by a DNA-coated microprojectile delivery system. Of the NLS motifs, KRRRK and RRHK in the tail domain were highly functional for nuclear localization. Addition of the N-terminal 141 amino acids from DcNMCP1 shifted the localization of a region including these NLSs from the entire nucleus to the nuclear periphery. Using this same construct, the replacement of amino acids in RRHK or its preceding sequence, YNL, with alanine residues abolished localization to the nuclear periphery, while replacement of KRRRK did not affect localization. The sequence R/Q/HYNLRR/H, including YNL and the first part of the sequence of RRHK, is evolutionarily conserved in a subclass of NMCP1 sequences from many plant species. These results show that NMCP1 localizes to the nuclear periphery by a combined action of a sequence composed of R/Q/HYNLRR/H, NLS, and the N-terminal region including the head and a portion of the rod domain, suggesting that more than one binding site is implicated in localization of NMCP1.
The architecture of mammalian ribosomal protein promoters

Directory of Open Access Journals (Sweden)

Perry Robert P

2005-02-01

Full Text Available Abstract Background Mammalian ribosomes contain 79 different proteins encoded by widely scattered single copy genes. Coordinate expression of these genes at transcriptional and post-transcriptional levels is required to ensure a roughly equimolar accumulation of ribosomal proteins. To date, detailed studies of only a very few ribosomal protein (rp promoters have been made. To elucidate the general features of rp promoter architecture, I made a detailed sequence comparison of the promoter regions of the entire set of orthologous human and mouse rp genes. Results A striking evolutionarily conserved feature of most rp genes is the separation by an intron of the sequences involved in transcriptional and translational regulation from the sequences with protein encoding function. Another conserved feature is the polypyrimidine initiator, which conforms to the consensus (Y2C+1TY(T2(Y3. At least 60 % of the rp promoters contain a largely conserved TATA box or A/T-rich motif, which should theoretically have TBP-binding capability. A remarkably high proportion of the promoters contain conserved binding sites for transcription factors that were previously implicated in rp gene expression, namely upstream GABP and Sp1 sites and downstream YY1 sites. Over 80 % of human and mouse rp genes contain a transposable element residue within 900 bp of 5' flanking sequence; very little sequence identity between human and mouse orthologues was evident more than 200 bp upstream of the transcriptional start point. Conclusions This analysis has provided some valuable insights into the general architecture of mammalian rp promoters and has identified parameters that might coordinately regulate the transcriptional activity of certain subsets of rp genes.
Strategies for measuring evolutionary conservation of RNA secondary structures

Directory of Open Access Journals (Sweden)

Hofacker Ivo L

2008-02-01

Full Text Available Abstract Background Evolutionary conservation of RNA secondary structure is a typical feature of many functional non-coding RNAs. Since almost all of the available methods used for prediction and annotation of non-coding RNA genes rely on this evolutionary signature, accurate measures for structural conservation are essential. Results We systematically assessed the ability of various measures to detect conserved RNA structures in multiple sequence alignments. We tested three existing and eight novel strategies that are based on metrics of folding energies, metrics of single optimal structure predictions, and metrics of structure ensembles. We find that the folding energy based SCI score used in the RNAz program and a simple base-pair distance metric are by far the most accurate. The use of more complex metrics like for example tree editing does not improve performance. A variant of the SCI performed particularly well on highly conserved alignments and is thus a viable alternative when only little evolutionary information is available. Surprisingly, ensemble based methods that, in principle, could benefit from the additional information contained in sub-optimal structures, perform particularly poorly. As a general trend, we observed that methods that include a consensus structure prediction outperformed equivalent methods that only consider pairwise comparisons. Conclusion Structural conservation can be measured accurately with relatively simple and intuitive metrics. They have the potential to form the basis of future RNA gene finders, that face new challenges like finding lineage specific structures or detecting mis-aligned sequences.
Interactions between the Nse3 and Nse4 components of the SMC5-6 complex identify evolutionarily conserved interactions between MAGE and EID Families.

Directory of Open Access Journals (Sweden)

Jessica J R Hudson

2011-02-01

Full Text Available The SMC5-6 protein complex is involved in the cellular response to DNA damage. It is composed of 6-8 polypeptides, of which Nse1, Nse3 and Nse4 form a tight sub-complex. MAGEG1, the mammalian ortholog of Nse3, is the founding member of the MAGE (melanoma-associated antigen protein family and Nse4 is related to the EID (E1A-like inhibitor of differentiation family of transcriptional repressors.Using site-directed mutagenesis, protein-protein interaction analyses and molecular modelling, we have identified a conserved hydrophobic surface on the C-terminal domain of Nse3 that interacts with Nse4 and identified residues in its N-terminal domain that are essential for interaction with Nse1. We show that these interactions are conserved in the human orthologs. Furthermore, interaction of MAGEG1, the mammalian ortholog of Nse3, with NSE4b, one of the mammalian orthologs of Nse4, results in transcriptional co-activation of the nuclear receptor, steroidogenic factor 1 (SF1. In an examination of the evolutionary conservation of the Nse3-Nse4 interactions, we find that several MAGE proteins can interact with at least one of the NSE4/EID proteins.We have found that, despite the evolutionary diversification of the MAGE family, the characteristic hydrophobic surface shared by all MAGE proteins from yeast to humans mediates its binding to NSE4/EID proteins. Our work provides new insights into the interactions, evolution and functions of the enigmatic MAGE proteins.
Analysis of Schizosaccharomyces pombe mediator reveals a set of essential subunits conserved between yeast and metazoan cells

DEFF Research Database (Denmark)

Spåhr, H; Samuelsen, C O; Baraznenok, V

2001-01-01

. cerevisiae share an essential protein module, which associates with nonessential speciesspecific subunits. In support of this view, sequence analysis of the conserved yeast Mediator components Med4 and Med8 reveals sequence homology to the metazoan Mediator components Trap36 and Arc32. Therefore, 8 of 10...... essential genes conserved between S. pombe and S. cerevisiae also have a metazoan homolog, indicating that an evolutionary conserved Mediator core is present in all eukaryotic cells. Our data suggest a closer functional relationship between yeast and metazoan Mediator than previously anticipated....
Subfamily logos: visualization of sequence deviations at alignment positions with high information content

Directory of Open Access Journals (Sweden)

Beitz Eric

2006-06-01

Full Text Available Abstract Background Recognition of relevant sequence deviations can be valuable for elucidating functional differences between protein subfamilies. Interesting residues at highly conserved positions can then be mutated and experimentally analyzed. However, identification of such sites is tedious because automated approaches are scarce. Results Subfamily logos visualize subfamily-specific sequence deviations. The display is similar to classical sequence logos but extends into the negative range. Positive, upright characters correspond to residues which are characteristic for the subfamily, negative, upside-down characters to residues typical for the remaining sequences. The symbol height is adjusted to the information content of the alignment position. Residues which are conserved throughout do not appear. Conclusion Subfamily logos provide an intuitive display of relevant sequence deviations. The method has proven to be valid using a set of 135 aligned aquaporin sequences in which established subfamily-specific positions were readily identified by the algorithm.
Wound-Induced Polyploidization: Regulation by Hippo and JNK Signaling and Conservation in Mammals.

Science.gov (United States)

Losick, Vicki P; Jun, Albert S; Spradling, Allan C

2016-01-01

Tissue integrity and homeostasis often rely on the proliferation of stem cells or differentiated cells to replace lost, aged, or damaged cells. Recently, we described an alternative source of cell replacement- the expansion of resident, non-dividing diploid cells by wound-induced polyploidization (WIP). Here we show that the magnitude of WIP is proportional to the extent of cell loss using a new semi-automated assay with single cell resolution. Hippo and JNK signaling regulate WIP; unexpectedly however, JNK signaling through AP-1 limits rather than stimulates the level of Yki activation and polyploidization in the Drosophila epidermis. We found that polyploidization also quantitatively compensates for cell loss in a mammalian tissue, mouse corneal endothelium, where increased cell death occurs with age in a mouse model of Fuchs Endothelial Corneal Dystrophy (FECD). Our results suggest that WIP is an evolutionarily conserved homeostatic mechanism that maintains the size and synthetic capacity of adult tissues.
Cooperativity, Specificity, and Evolutionary Stability of Polycomb Targeting in Drosophila

Directory of Open Access Journals (Sweden)

Bernd Schuettengruber

2014-10-01

Full Text Available Summary: Metazoan genomes are partitioned into modular chromosomal domains containing active or repressive chromatin. In flies, Polycomb group (PcG response elements (PREs recruit PHO and other DNA-binding factors and act as nucleation sites for the formation of Polycomb repressive domains. The sequence specificity of PREs is not well understood. Here, we use comparative epigenomics and transgenic assays to show that Drosophila domain organization and PRE specification are evolutionarily conserved despite significant cis-element divergence within Polycomb domains, whereas cis-element evolution is strongly correlated with transcription factor binding divergence outside of Polycomb domains. Cooperative interactions of PcG complexes and their recruiting factor PHO stabilize PHO recruitment to low-specificity sequences. Consistently, PHO recruitment to sites within Polycomb domains is stabilized by PRC1. These data suggest that cooperative rather than hierarchical interactions among low-affinity sequences, DNA-binding factors, and the Polycomb machinery are giving rise to specific and strongly conserved 3D structures in Drosophila. : Schuettengruber et al. present an extensive comparative epigenomics data set, providing new insights into cis-driven versus buffered evolution of Polycomb recruitment and Polycomb domain specificity. Using chromatin immunoprecipitation sequencing and transgenic assays, they demonstrate an extremely high conservation of Polycomb repressive domains in five Drosophila species. Using Hi-C and knockout experiments, they challenge the standard hierarchical Polycomb recruitment model and demonstrate that cooperative rather than hierarchical interactions among DNA motifs, transcription factors, and Polycomb group complexes define Polycomb domains.
Genetic Diversity and Population Structure of the Pelagic Thresher Shark (Alopias pelagicus) in the Pacific Ocean: Evidence for Two Evolutionarily Significant Units

Science.gov (United States)

Cardeñosa, Diego; Hyde, John; Caballero, Susana

2014-01-01

There has been an increasing concern about shark overexploitation in the last decade, especially for open ocean shark species, where there is a paucity of data about their life histories and population dynamics. Little is known regarding the population structure of the pelagic thresher shark, Alopias pelagicus. Though an earlier study using mtDNA control region data, showed evidence for differences between eastern and western Pacific populations, the study was hampered by low sample size and sparse geographic coverage, particularly a lack of samples from the central Pacific. Here, we present the population structure of Alopias pelagicus analyzing 351 samples from six different locations across the Pacific Ocean. Using data from mitochondrial DNA COI sequences and seven microsatellite loci we found evidence of strong population differentiation between western and eastern Pacific populations and evidence for reciprocally monophyly for organelle haplotypes and significant divergence of allele frequencies at nuclear loci, suggesting the existence of two Evolutionarily Significant Units (ESU) in the Pacific Ocean. Interestingly, the population in Hawaii appears to be composed of both ESUs in what seems to be clear sympatry with reproductive isolation. These results may indicate the existence of a new cryptic species in the Pacific Ocean. The presence of these distinct ESUs highlights the need for revised management plans for this highly exploited shark throughout its range. PMID:25337814

Cell type-specific termination of transcription by transposable element sequences

Directory of Open Access Journals (Sweden)

Conley Andrew B

2012-09-01

Full Text Available Abstract Background Transposable elements (TEs encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Results Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3′ UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. Conclusions TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are
Conserved queen pheromones in bumblebees: a reply to Amsalem et al.

Directory of Open Access Journals (Sweden)

Luke Holman

2017-05-01

Full Text Available In a recent study, Amsalem, Orlova & Grozinger (2015 performed experiments with Bombus impatiens bumblebees to test the hypothesis that saturated cuticular hydrocarbons are evolutionarily conserved signals used to regulate reproductive division of labor in many Hymenopteran social insects. They concluded that the cuticular hydrocarbon pentacosane (C25, previously identified as a queen pheromone in a congeneric bumblebee, does not affect worker reproduction in B. impatiens. Here we discuss some shortcomings of Amsalem et al.’s study that make its conclusions unreliable. In particular, several confounding effects may have affected the results of both experimental manipulations in the study. Additionally, the study’s low sample sizes (mean n per treatment = 13.6, range: 4–23 give it low power, not 96–99% power as claimed, such that its conclusions may be false negatives. Inappropriate statistical tests were also used, and our reanalysis found that C25 substantially reduced and delayed worker egg laying in B. impatiens. We review the evidence that cuticular hydrocarbons act as queen pheromones, and offer some recommendations for future queen pheromone experiments.
Clinical evaluation and mitochondrial DNA sequence analysis in three Chinese families with Leber's hereditary optic neuropathy

International Nuclear Information System (INIS)

Qian Yaping; Zhou Xiangtian; Hu Yongwu; Tong Yi; Li Ronghua; Lu Fan; Yang Huanming; Mo Junqin; Qu Jia; Guan Minxin

2005-01-01

We report here the clinical, genetic, and molecular characterization of three Chinese families (WZ4, WZ5, and WZ6) with Leber's hereditary optic neuropathy (LHON). Clinical and genetic evaluations revealed the variable severity and age-of-onset in visual impairment in these families. Penetrances of visual impairment in these Chinese families were 33.3%, 35.7%, and 35.5%, respectively, with an average 34.8%. Furthermore, the average age-at-onset in those Chinese families was 17, 20, and 18 years. In addition, the ratios between affected male and female matrilineal relatives in these Chinese families were 3:0, 1:1, and 1.2:1, respectively. Sequence analysis of the complete mitochondrial genomes in these pedigrees showed the distinct sets of mtDNA polymorphism, in addition to the identical G11778A mutation associated with LHON in many families. The fact that mtDNA of those pedigrees belonged to different haplogroups F1, D4, and M10 suggested that the G11778A mutation occurred sporadically and multiplied through evolution of the mtDNA in China. However, there was the absence of functionally significant mutations in tRNA and rRNAs or secondary LHON mutations in these Chinese families. The I187T mutation in the ND1, the S99A mutation in the A6, the V254I in CO3, and I58V in ND6 mutation, showing high evolutional conservation, may contribute to the phenotypic expression of the G11778A mutation in the WZ6 pedigree. By contrast, none of mtDNA variants are evolutionarily conserved and implicated to have significantly functional consequence in WZ4 and WZ5 pedigrees. Apparently, these variants do not have a potential modifying role in the development of visual impairment associated with G11778A mutation in those two families. Thus, nuclear modifier gene(s) or environmental factor(s) seem to account for the penetrance and expressivity of LHON in these three Chinese families carrying the G11778A mutation
Cloning of the cDNA for murine von Willebrand factor and identification of orthologous genes reveals the extent of conservation among diverse species.

Science.gov (United States)

Chitta, Mohan S; Duhé, Roy J; Kermode, John C

2007-05-01

Interaction of von Willebrand factor (VWF) with circulating platelets promotes hemostasis when a blood vessel is injured. The A1 domain of VWF is responsible for the initial interaction with platelets and is well conserved among species. Knowledge of the cDNA and genomic DNA sequences for human VWF allowed us to predict the cDNA sequence for murine VWF in silico and amplify its entire coding region by RT-PCR. The murine VWF cDNA has an open reading frame of 8,442 bp, encoding a protein of 2,813 amino acid residues with 83% identity to human pre-pro-VWF. The same strategy was used to predict in silico the cDNA sequence for the ortholog of VWF in a further six species. Many of these predictions diverged substantially from the putative Reference Sequences derived by ab initio methods. Our predicted sequences indicated that the VWF gene has a conserved structure of 52 exons in all seven mammalian species examined, as well as in the chicken. There is a minor structural variation in the pufferfish Takifugu rubripes insofar as the VWF gene in this species has 53 exons. Comparison of the translated amino acid sequences also revealed a high degree of conservation. In particular, the cysteine residues are conserved precisely throughout both the pro-peptide and the mature VWF sequence in all species, with a minor exception in the pufferfish VWF ortholog where two adjacent cysteine residues are omitted. The marked conservation of cysteine residues emphasizes the importance of the intricate pattern of disulfide bonds in governing the structure of pro-VWF and regulating the function of the mature VWF protein. It should also be emphasized that many of the conserved features of the VWF gene and protein were obscured when the comparison among species was based on the putative Reference Sequences instead of our predicted cDNA sequences.
Trade-off between Transcriptome Plasticity and Genome Evolution in Cephalopods.

Science.gov (United States)

Liscovitch-Brauer, Noa; Alon, Shahar; Porath, Hagit T; Elstein, Boaz; Unger, Ron; Ziv, Tamar; Admon, Arie; Levanon, Erez Y; Rosenthal, Joshua J C; Eisenberg, Eli

2017-04-06

RNA editing, a post-transcriptional process, allows the diversification of proteomes beyond the genomic blueprint; however it is infrequently used among animals for this purpose. Recent reports suggesting increased levels of RNA editing in squids thus raise the question of the nature and effects of these events. We here show that RNA editing is particularly common in behaviorally sophisticated coleoid cephalopods, with tens of thousands of evolutionarily conserved sites. Editing is enriched in the nervous system, affecting molecules pertinent for excitability and neuronal morphology. The genomic sequence flanking editing sites is highly conserved, suggesting that the process confers a selective advantage. Due to the large number of sites, the surrounding conservation greatly reduces the number of mutations and genomic polymorphisms in protein-coding regions. This trade-off between genome evolution and transcriptome plasticity highlights the importance of RNA recoding as a strategy for diversifying proteins, particularly those associated with neural function. PAPERCLIP. Copyright © 2017 Elsevier Inc. All rights reserved.
Mechanisms of stable lipid loss in a social insect

Science.gov (United States)

Ament, Seth A.; Chan, Queenie W.; Wheeler, Marsha M.; Nixon, Scott E.; Johnson, S. Peir; Rodriguez-Zas, Sandra L.; Foster, Leonard J.; Robinson, Gene E.

2011-01-01

SUMMARY Worker honey bees undergo a socially regulated, highly stable lipid loss as part of their behavioral maturation. We used large-scale transcriptomic and proteomic experiments, physiological experiments and RNA interference to explore the mechanistic basis for this lipid loss. Lipid loss was associated with thousands of gene expression changes in abdominal fat bodies. Many of these genes were also regulated in young bees by nutrition during an initial period of lipid gain. Surprisingly, in older bees, which is when maximum lipid loss occurs, diet played less of a role in regulating fat body gene expression for components of evolutionarily conserved nutrition-related endocrine systems involving insulin and juvenile hormone signaling. By contrast, fat body gene expression in older bees was regulated more strongly by evolutionarily novel regulatory factors, queen mandibular pheromone (a honey bee-specific social signal) and vitellogenin (a conserved yolk protein that has evolved novel, maturation-related functions in the bee), independent of nutrition. These results demonstrate that conserved molecular pathways can be manipulated to achieve stable lipid loss through evolutionarily novel regulatory processes. PMID:22031746
Mechanisms of stable lipid loss in a social insect.

Science.gov (United States)

Ament, Seth A; Chan, Queenie W; Wheeler, Marsha M; Nixon, Scott E; Johnson, S Peir; Rodriguez-Zas, Sandra L; Foster, Leonard J; Robinson, Gene E

2011-11-15

Worker honey bees undergo a socially regulated, highly stable lipid loss as part of their behavioral maturation. We used large-scale transcriptomic and proteomic experiments, physiological experiments and RNA interference to explore the mechanistic basis for this lipid loss. Lipid loss was associated with thousands of gene expression changes in abdominal fat bodies. Many of these genes were also regulated in young bees by nutrition during an initial period of lipid gain. Surprisingly, in older bees, which is when maximum lipid loss occurs, diet played less of a role in regulating fat body gene expression for components of evolutionarily conserved nutrition-related endocrine systems involving insulin and juvenile hormone signaling. By contrast, fat body gene expression in older bees was regulated more strongly by evolutionarily novel regulatory factors, queen mandibular pheromone (a honey bee-specific social signal) and vitellogenin (a conserved yolk protein that has evolved novel, maturation-related functions in the bee), independent of nutrition. These results demonstrate that conserved molecular pathways can be manipulated to achieve stable lipid loss through evolutionarily novel regulatory processes.
Effects of using coding potential, sequence conservation and mRNA structure conservation for predicting pyrroly-sine containing genes

DEFF Research Database (Denmark)

Have, Christian Theil; Zambach, Sine; Christiansen, Henning

2013-01-01

for prediction of pyrrolysine incorporating genes in genomes of bacteria and archaea leading to insights about the factors driving pyrrolysine translation and identification of new gene candidates. The method predicts known conserved genes with high recall and predicts several other promising candidates...... for experimental verification. The method is implemented as a computational pipeline which is available on request....
smRNAome profiling to identify conserved and novel microRNAs in Stevia rebaudiana Bertoni

Science.gov (United States)

2012-01-01

Background MicroRNAs (miRNAs) constitute a family of small RNA (sRNA) population that regulates the gene expression and plays an important role in plant development, metabolism, signal transduction and stress response. Extensive studies on miRNAs have been performed in different plants such as Arabidopsis thaliana, Oryza sativa etc. and volume of the miRNA database, mirBASE, has been increasing on day to day basis. Stevia rebaudiana Bertoni is an important perennial herb which accumulates high concentrations of diterpene steviol glycosides which contributes to its high indexed sweetening property with no calorific value. Several studies have been carried out for understanding molecular mechanism involved in biosynthesis of these glycosides, however, information about miRNAs has been lacking in S. rebaudiana. Deep sequencing of small RNAs combined with transcriptomic data is a powerful tool for identifying conserved and novel miRNAs irrespective of availability of genome sequence data. Results To identify miRNAs in S. rebaudiana, sRNA library was constructed and sequenced using Illumina genome analyzer II. A total of 30,472,534 reads representing 2,509,190 distinct sequences were obtained from sRNA library. Based on sequence similarity, we identified 100 miRNAs belonging to 34 highly conserved families. Also, we identified 12 novel miRNAs whose precursors were potentially generated from stevia EST and nucleotide sequences. All novel sequences have not been earlier described in other plant species. Putative target genes were predicted for most conserved and novel miRNAs. The predicted targets are mainly mRNA encoding enzymes regulating essential plant metabolic and signaling pathways. Conclusions This study led to the identification of 34 highly conserved miRNA families and 12 novel potential miRNAs indicating that specific miRNAs exist in stevia species. Our results provided information on stevia miRNAs and their targets building a foundation for future studies to
The most conserved genome segments for life detection on Earth and other planets.

Science.gov (United States)

Isenbarger, Thomas A; Carr, Christopher E; Johnson, Sarah Stewart; Finney, Michael; Church, George M; Gilbert, Walter; Zuber, Maria T; Ruvkun, Gary

2008-12-01

On Earth, very simple but powerful methods to detect and classify broad taxa of life by the polymerase chain reaction (PCR) are now standard practice. Using DNA primers corresponding to the 16S ribosomal RNA gene, one can survey a sample from any environment for its microbial inhabitants. Due to massive meteoritic exchange between Earth and Mars (as well as other planets), a reasonable case can be made for life on Mars or other planets to be related to life on Earth. In this case, the supremely sensitive technologies used to study life on Earth, including in extreme environments, can be applied to the search for life on other planets. Though the 16S gene has become the standard for life detection on Earth, no genome comparisons have established that the ribosomal genes are, in fact, the most conserved DNA segments across the kingdoms of life. We present here a computational comparison of full genomes from 13 diverse organisms from the Archaea, Bacteria, and Eucarya to identify genetic sequences conserved across the widest divisions of life. Our results identify the 16S and 23S ribosomal RNA genes as well as other universally conserved nucleotide sequences in genes encoding particular classes of transfer RNAs and within the nucleotide binding domains of ABC transporters as the most conserved DNA sequence segments across phylogeny. This set of sequences defines a core set of DNA regions that have changed the least over billions of years of evolution and provides a means to identify and classify divergent life, including ancestrally related life on other planets.
Sequence analysis of serum albumins reveals the molecular evolution of ligand recognition properties.

Science.gov (United States)

Fanali, Gabriella; Ascenzi, Paolo; Bernardi, Giorgio; Fasano, Mauro

2012-01-01

Serum albumin (SA) is a circulating protein providing a depot and carrier for many endogenous and exogenous compounds. At least seven major binding sites have been identified by structural and functional investigations mainly in human SA. SA is conserved in vertebrates, with at least 49 entries in protein sequence databases. The multiple sequence analysis of this set of entries leads to the definition of a cladistic tree for the molecular evolution of SA orthologs in vertebrates, thus showing the clustering of the considered species, with lamprey SAs (Lethenteron japonicum and Petromyzon marinus) in a separate outgroup. Sequence analysis aimed at searching conserved domains revealed that most SA sequences are made up by three repeated domains (about 600 residues), as extensively characterized for human SA. On the contrary, lamprey SAs are giant proteins (about 1400 residues) comprising seven repeated domains. The phylogenetic analysis of the SA family reveals a stringent correlation with the taxonomic classification of the species available in sequence databases. A focused inspection of the sequences of ligand binding sites in SA revealed that in all sites most residues involved in ligand binding are conserved, although the versatility towards different ligands could be peculiar of higher organisms. Moreover, the analysis of molecular links between the different sites suggests that allosteric modulation mechanisms could be restricted to higher vertebrates.
Chromosome-wide mapping of DNA methylation patterns in normal and malignant prostate cells reveals pervasive methylation of gene-associated and conserved intergenic sequences

Directory of Open Access Journals (Sweden)

De Marzo Angelo M

2011-06-01

Full Text Available Abstract Background DNA methylation has been linked to genome regulation and dysregulation in health and disease respectively, and methods for characterizing genomic DNA methylation patterns are rapidly emerging. We have developed/refined methods for enrichment of methylated genomic fragments using the methyl-binding domain of the human MBD2 protein (MBD2-MBD followed by analysis with high-density tiling microarrays. This MBD-chip approach was used to characterize DNA methylation patterns across all non-repetitive sequences of human chromosomes 21 and 22 at high-resolution in normal and malignant prostate cells. Results Examining this data using computational methods that were designed specifically for DNA methylation tiling array data revealed widespread methylation of both gene promoter and non-promoter regions in cancer and normal cells. In addition to identifying several novel cancer hypermethylated 5' gene upstream regions that mediated epigenetic gene silencing, we also found several hypermethylated 3' gene downstream, intragenic and intergenic regions. The hypermethylated intragenic regions were highly enriched for overlap with intron-exon boundaries, suggesting a possible role in regulation of alternative transcriptional start sites, exon usage and/or splicing. The hypermethylated intergenic regions showed significant enrichment for conservation across vertebrate species. A sampling of these newly identified promoter (ADAMTS1 and SCARF2 genes and non-promoter (downstream or within DSCR9, C21orf57 and HLCS genes hypermethylated regions were effective in distinguishing malignant from normal prostate tissues and/or cell lines. Conclusions Comparison of chromosome-wide DNA methylation patterns in normal and malignant prostate cells revealed significant methylation of gene-proximal and conserved intergenic sequences. Such analyses can be easily extended for genome-wide methylation analysis in health and disease.
Biochemical Conservation and Evolution of Germacrene A Oxidase in Asteraceae*

Science.gov (United States)

Nguyen, Don Trinh; Göpfert, Jens Christian; Ikezawa, Nobuhiro; MacNevin, Gillian; Kathiresan, Meena; Conrad, Jürgen; Spring, Otmar; Ro, Dae-Kyun

2010-01-01

Sesquiterpene lactones are characteristic natural products in Asteraceae, which constitutes ∼8% of all plant species. Despite their physiological and pharmaceutical importance, the biochemistry and evolution of sesquiterpene lactones remain unexplored. Here we show that germacrene A oxidase (GAO), evolutionarily conserved in all major subfamilies of Asteraceae, catalyzes three consecutive oxidations of germacrene A to yield germacrene A acid. Furthermore, it is also capable of oxidizing non-natural substrate amorphadiene. Co-expression of lettuce GAO with germacrene synthase in engineered yeast synthesized aberrant products, costic acids and ilicic acid, in an acidic condition. However, cultivation in a neutral condition allowed the de novo synthesis of a single novel compound that was identified as germacrene A acid by gas and liquid chromatography and NMR analyses. To trace the evolutionary lineage of GAO in Asteraceae, homologous genes were further isolated from the representative species of three major subfamilies of Asteraceae (sunflower, chicory, and costus from Asteroideae, Cichorioideae, and Carduoideae, respectively) and also from the phylogenetically basal species, Barnadesia spinosa, from Barnadesioideae. The recombinant GAOs from these genes clearly showed germacrene A oxidase activities, suggesting that GAO activity is widely conserved in Asteraceae including the basal lineage. All GAOs could catalyze the three-step oxidation of non-natural substrate amorphadiene to artemisinic acid, whereas amorphadiene oxidase diverged from GAO displayed negligible activity for germacrene A oxidation. The observed amorphadiene oxidase activity in GAOs suggests that the catalytic plasticity is embedded in ancestral GAO enzymes that may contribute to the chemical and catalytic diversity in nature. PMID:20351109
Meta-analysis of breast cancer microarray studies in conjunction with conserved cis-elements suggest patterns for coordinate regulation

Directory of Open Access Journals (Sweden)

Lundberg Cathryn

2008-01-01

Full Text Available Abstract Background Gene expression measurements from breast cancer (BrCa tumors are established clinical predictive tools to identify tumor subtypes, identify patients showing poor/good prognosis, and identify patients likely to have disease recurrence. However, diverse breast cancer datasets in conjunction with diagnostic clinical arrays show little overlap in the sets of genes identified. One approach to identify a set of consistently dysregulated candidate genes in these tumors is to employ meta-analysis of multiple independent microarray datasets. This allows one to compare expression data from a diverse collection of breast tumor array datasets generated on either cDNA or oligonucleotide arrays. Results We gathered expression data from 9 published microarray studies examining estrogen receptor positive (ER+ and estrogen receptor negative (ER- BrCa tumor cases from the Oncomine database. We performed a meta-analysis and identified genes that were universally up or down regulated with respect to ER+ versus ER- tumor status. We surveyed both the proximal promoter and 3' untranslated regions (3'UTR of our top-ranking genes in each expression group to test whether common sequence elements may contribute to the observed expression patterns. Utilizing a combination of known transcription factor binding sites (TFBS, evolutionarily conserved mammalian promoter and 3'UTR motifs, and microRNA (miRNA seed sequences, we identified numerous motifs that were disproportionately represented between the two gene classes suggesting a common regulatory network for the observed gene expression patterns. Conclusion Some of the genes we identified distinguish key transcripts previously seen in array studies, while others are newly defined. Many of the genes identified as overexpressed in ER- tumors were previously identified as expression markers for neoplastic transformation in multiple human cancers. Moreover, our motif analysis identified a collection of
Conserved domains and SINE diversity during animal evolution.

Science.gov (United States)

Luchetti, Andrea; Mantovani, Barbara

2013-10-01

Eukaryotic genomes harbour a number of mobile genetic elements (MGEs); moving from one genomic location to another, they are known to impact on the host genome. Short interspersed elements (SINEs) are well-represented, non-autonomous retroelements and they are likely the most diversified MGEs. In some instances, sequence domains conserved across unrelated SINEs have been identified; remarkably, one of these, called Nin, has been conserved since the Radiata-Bilateria splitting. Here we report on two new domains: Inv, derived from Nin, identified in insects and in deuterostomes, and Pln, restricted to polyneopteran insects. The identification of Inv and Pln sequences allowed us to retrieve new SINEs, two in insects and one in a hemichordate. The diverse structural combination of the different domains in different SINE families, during metazoan evolution, offers a clearer view of SINE diversity and their frequent de novo emergence through module exchange, possibly underlying the high evolutionary success of SINEs. © 2013 Elsevier Inc. All rights reserved.
Genome-wide identification of conserved microRNA and their response to drought stress in Dongxiang wild rice (Oryza rufipogon Griff.).

Science.gov (United States)

Zhang, Fantao; Luo, Xiangdong; Zhou, Yi; Xie, Jiankun

2016-04-01

To identify drought stress-responsive conserved microRNA (miRNA) from Dongxiang wild rice (Oryza rufipogon Griff., DXWR) on a genome-wide scale, high-throughput sequencing technology was used to sequence libraries of DXWR samples, treated with and without drought stress. 505 conserved miRNAs corresponding to 215 families were identified. 17 were significantly down-regulated and 16 were up-regulated under drought stress. Stem-loop qRT-PCR revealed the same expression patterns as high-throughput sequencing, suggesting the accuracy of the sequencing result was high. Potential target genes of the drought-responsive miRNA were predicted to be involved in diverse biological processes. Furthermore, 16 miRNA families were first identified to be involved in drought stress response from plants. These results present a comprehensive view of the conserved miRNA and their expression patterns under drought stress for DXWR, which will provide valuable information and sequence resources for future basis studies.
The sequence, structure and evolutionary features of HOTAIR in mammals

Science.gov (United States)

2011-01-01

Background An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals
Genome-wide identification of estrogen receptor alpha-binding sites in mouse liver

DEFF Research Database (Denmark)

Gao, Hui; Fält, Susann; Sandelin, Albin

2007-01-01

We report the genome-wide identification of estrogen receptor alpha (ERalpha)-binding regions in mouse liver using a combination of chromatin immunoprecipitation and tiled microarrays that cover all nonrepetitive sequences in the mouse genome. This analysis identified 5568 ERalpha-binding regions...... genes. The majority of ERalpha-binding regions lie in regions that are evolutionarily conserved between human and mouse. Motif-finding algorithms identified the estrogen response element, and variants thereof, together with binding sites for activator protein 1, basic-helix-loop-helix proteins, ETS...... signaling in mouse liver, by characterizing the first step in this signaling cascade, the binding of ERalpha to DNA in intact chromatin....
Neurobiology of rodent self-grooming and its value for translational neuroscience.

Science.gov (United States)

Kalueff, Allan V; Stewart, Adam Michael; Song, Cai; Berridge, Kent C; Graybiel, Ann M; Fentress, John C

2016-01-01

Self-grooming is a complex innate behaviour with an evolutionarily conserved sequencing pattern and is one of the most frequently performed behavioural activities in rodents. In this Review, we discuss the neurobiology of rodent self-grooming, and we highlight studies of rodent models of neuropsychiatric disorders--including models of autism spectrum disorder and obsessive compulsive disorder--that have assessed self-grooming phenotypes. We suggest that rodent self-grooming may be a useful measure of repetitive behaviour in such models, and therefore of value to translational psychiatry. Assessment of rodent self-grooming may also be useful for understanding the neural circuits that are involved in complex sequential patterns of action.
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

Science.gov (United States)

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.

Conservation of Tcrg-V5 and limited allelic sequence polymorphism of the other Tcrg-V genes used by mouse tissue-specific gd-T lymphocytes

Energy Technology Data Exchange (ETDEWEB)

Roger, T.; Morisset, J.; Seman, M. [Universite Denis Diderot, Paris (France)

1996-12-31

The mouse Tcrg locus comprises seven Tcrg-V, four Tcrg-J, and four Tcrg-C segments which generate only six major types of functional g chains, Vg7-, Vg4-, Vg6-, or Vg5-Jg1-Cg1, Vg2-Jg2-Cg2, and Vg1-Jg4-Cg4. A complete analysis of restriction fragment length polymorphism (RFLP) of the Tcrg locus in wild and inbred mice suggested its relative conservation compared to other loci of the immunoglobulin (Ig) gene family. Three haplotypes have been characterized in laboratory mice: gA, gB, and gC, represented by BALB/c, DBA/2, and AKR prototypes. Tcr-gA and -gC haplotypes are highly related. By contrast, Tcr-gB, likely inherited from Asian mouse subspecies, appeared very different by RFLP analysis. Yet only partial sequence data have been reported on gA and gB Tcrg-V genes. Here, the complete sequence of all Tcrg-V genes of the two haplotypes is described. 16 refs., 1 fig.
Expression analysis of an evolutionarily conserved alternative splicing factor, Sfrs10, in age-related macular degeneration.

Directory of Open Access Journals (Sweden)

Devi Krishna Priya Karunakaran

Full Text Available Age-related macular degeneration (AMD is the most common cause of blindness in the elderly population. Hypoxic stress created in the micro-environment of the photoreceptors is thought to be the underlying cause that results in the pathophysiology of AMD. However, association of AMD with alternative splicing mediated gene regulation is not well explored. Alternative Splicing is one of the primary mechanisms in humans by which fewer protein coding genes are able to generate a vast proteome. Here, we investigated the expression of a known stress response gene and an alternative splicing factor called Serine-Arginine rich splicing factor 10 (Sfrs10. Sfrs10 is a member of the serine-arginine (SR rich protein family and is 100% identical at the amino acid level in most mammals. Immunoblot analysis on retinal extracts from mouse, rat, and chicken showed a single immunoreactive band. Further, immunohistochemistry on adult mouse, rat and chicken retinae showed pan-retinal expression. However, SFRS10 was not detected in normal human retina but was observed as distinct nuclear speckles in AMD retinae. This is in agreement with previous reports that show Sfrs10 to be a stress response gene, which is upregulated under hypoxia. The difference in the expression of Sfrs10 between humans and lower mammals and the upregulation of SFRS10 in AMD is further reflected in the divergence of the promoter sequence between these species. Finally, SFRS10+ speckles were independent of the SC35+ SR protein speckles or the HSF1+ stress granules. In all, our data suggests that SFRS10 is upregulated and forms distinct stress-induced speckles and might be involved in AS of stress response genes in AMD.
CLONING AND SEQUENCING OF PGIP FROM ‘JIN SERIES’ ALMOND (PRUNUS DULCIS

Directory of Open Access Journals (Sweden)

Yuhu Han

2015-12-01

Full Text Available Specific primers synthesized according to conservative regions of polygalacturonase inhibiting protein (PGIP gene were used to amplify Prunus Dulcis genomic DNA by polymerase-chain reaction (PCR. Six bands (pgip1, pgip2, pgip3, pgip4, pgip5 and pgip6 of genes were obtained and cloned into PBS-T vector. According to the length of bands, 717bp, 864bp, 796bp were A1 (pgip1, pgip2, pgip3, A2 (pgip4, A4 (pgip5, pgip6, respectively. DNA sequences showed that the fragments taken together were the gene encoding PGIP. A2 and A3 contained two exons interrupted by one intron, which has GT-AG sequence. Its DNA and amino acid sequences were highly homologies to those from Prunus Persica; Prunus Salicina; Prunus Americana; Prunus Mume, respectively. A conserved lencinerial fragment exists in the derived protein sequence.
Conservation and divergence of ADAM family proteins in the Xenopus genome

Directory of Open Access Journals (Sweden)

Shah Anoop

2010-07-01

Full Text Available Abstract Background Members of the disintegrin metalloproteinase (ADAM family play important roles in cellular and developmental processes through their functions as proteases and/or binding partners for other proteins. The amphibian Xenopus has long been used as a model for early vertebrate development, but genome-wide analyses for large gene families were not possible until the recent completion of the X. tropicalis genome sequence and the availability of large scale expression sequence tag (EST databases. In this study we carried out a systematic analysis of the X. tropicalis genome and uncovered several interesting features of ADAM genes in this species. Results Based on the X. tropicalis genome sequence and EST databases, we identified Xenopus orthologues of mammalian ADAMs and obtained full-length cDNA clones for these genes. The deduced protein sequences, synteny and exon-intron boundaries are conserved between most human and X. tropicalis orthologues. The alternative splicing patterns of certain Xenopus ADAM genes, such as adams 22 and 28, are similar to those of their mammalian orthologues. However, we were unable to identify an orthologue for ADAM7 or 8. The Xenopus orthologue of ADAM15, an active metalloproteinase in mammals, does not contain the conserved zinc-binding motif and is hence considered proteolytically inactive. We also found evidence for gain of ADAM genes in Xenopus as compared to other species. There is a homologue of ADAM10 in Xenopus that is missing in most mammals. Furthermore, a single scaffold of X. tropicalis genome contains four genes encoding ADAM28 homologues, suggesting genome duplication in this region. Conclusions Our genome-wide analysis of ADAM genes in X. tropicalis revealed both conservation and evolutionary divergence of these genes in this amphibian species. On the one hand, all ADAMs implicated in normal development and health in other species are conserved in X. tropicalis. On the other hand, some
Using genomic information to conserve genetic diversity in livestock

NARCIS (Netherlands)

Eynard, Sonia E.

2018-01-01

Concern about the status of livestock breeds and their conservation has increased as selection and small population sizes caused loss of genetic diversity. Meanwhile, dense SNP chips and whole genome sequences (WGS) became available, providing opportunities to accurately quantify the impact of
Computational identification of developmental enhancers:conservation and function of transcription factor binding-site clustersin drosophila melanogaster and drosophila psedoobscura

Energy Technology Data Exchange (ETDEWEB)

Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.; Salzberg, Steven L.; Rubin, Gerald M.; Eisen, Michael B.; Celniker, SusanE.

2004-08-06

The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Measuring conservation of sequence features closely linked to function--such as binding-site clustering--makes better use of comparative sequence data than commonly used methods that examine only sequence identity.
A regulatory code for neuron-specific odor receptor expression.

Directory of Open Access Journals (Sweden)

Anandasankar Ray

2008-05-01

Full Text Available Olfactory receptor neurons (ORNs must select-from a large repertoire-which odor receptors to express. In Drosophila, most ORNs express one of 60 Or genes, and most Or genes are expressed in a single ORN class in a process that produces a stereotyped receptor-to-neuron map. The construction of this map poses a problem of receptor gene regulation that is remarkable in its dimension and about which little is known. By using a phylogenetic approach and the genome sequences of 12 Drosophila species, we systematically identified regulatory elements that are evolutionarily conserved and specific for individual Or genes of the maxillary palp. Genetic analysis of these elements supports a model in which each receptor gene contains a zip code, consisting of elements that act positively to promote expression in a subset of ORN classes, and elements that restrict expression to a single ORN class. We identified a transcription factor, Scalloped, that mediates repression. Some elements are used in other chemosensory organs, and some are conserved upstream of axon-guidance genes. Surprisingly, the odor response spectra and organization of maxillary palp ORNs have been extremely well-conserved for tens of millions of years, even though the amino acid sequences of the receptors are not highly conserved. These results, taken together, define the logic by which individual ORNs in the maxillary palp select which odor receptors to express.
Characterizing leader sequences of CRISPR loci

DEFF Research Database (Denmark)

Alkhnbashi, Omer; Shah, Shiraz Ali; Garrett, Roger Antony

2016-01-01

The CRISPR-Cas system is an adaptive immune system in many archaea and bacteria, which provides resistance against invading genetic elements. The first phase of CRISPR-Cas immunity is called adaptation, in which small DNA fragments are excised from genetic elements and are inserted into a CRISPR...... array generally adjacent to its so called leader sequence at one end of the array. It has been shown that transcription initiation and adaptation signals of the CRISPR array are located within the leader. However, apart from promoters, there is very little knowledge of sequence or structural motifs...... sequences by focusing on the consensus repeat of the adjacent CRISPR array and weak upstream conservation signals. We applied our tool to the analysis of a comprehensive genomic database and identified several characteristic properties of leader sequences specific to archaea and bacteria, ranging from...
Effects of main-sequence mass loss on stellar and galactic chemical evolution

International Nuclear Information System (INIS)

Guzik, J.A.

1988-01-01

L.A. Willson, G.H. Bowen and C. Struck-Marcell have proposed that 1 to 3 solar mass stars may experience evolutionarily significant mass loss during the early part of their main-sequence phase. The suggested mass-loss mechanism is pulsation, facilitated by rapid rotation. Initial mass-loss rates may be as large as several times 10 -9 M mass of sun/yr, diminishing over several times 10 8 years. The author attempts to test this hypothesis by comparing some theoretical implications with observations. Three areas are addressed: Solar models, cluster HR diagrams, and galactic chemical evolution. Mass-losing solar models were evolved that match the Sun's luminosity and radius at its present age. The most extreme viable models have initial mass 2.0 M 0 , and mass-loss rates decreasing exponentially over 2-3 x 10 8 years. Evolution calculations incorporating main-sequence mass loss were completed for a grid of models with initial masses 1.25 to 2.0 M mass of sun and mass loss timescales 0.2 to 2.0 Gry. Cluster HR diagrams synthesized with these models confirm the potential for the hypothesis to explain observed spreads or bifurcations in the upper main sequence, blue stragglers, anomalous giants, and poor fits of main-sequence turnoffs by standard isochrones. Simple closed galactic chemical evolution models were used to test the effects of main-sequence mass loss on the F and G dwarf distribution. Stars between 3.0 M mass of sun and a metallicity-dependent lower mass are assumed to lose mass. The models produce a 30 to 60% increase in the stars to stars-plus-remnants ratio, with fewer early-F dwarfs and many more late-F dwarfs remaining on the main sequence to the present
Genomic sequencing of Pleistocene cave bears

Energy Technology Data Exchange (ETDEWEB)

Noonan, James P.; Hofreiter, Michael; Smith, Doug; Priest, JamesR.; Rohland, Nadin; Rabeder, Gernot; Krause, Johannes; Detter, J. Chris; Paabo, Svante; Rubin, Edward M.

2005-04-01

Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of {approx}1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.
Novel nonphosphorylated peptides with conserved sequences selectively bind to Grb7 SH2 domain with affinity comparable to its phosphorylated ligand.

Directory of Open Access Journals (Sweden)

Dan Zhang

Full Text Available The Grb7 (growth factor receptor-bound 7 protein, a member of the Grb7 protein family, is found to be highly expressed in such metastatic tumors as breast cancer, esophageal cancer, liver cancer, etc. The src-homology 2 (SH2 domain in the C-terminus is reported to be mainly involved in Grb7 signaling pathways. Using the random peptide library, we identified a series of Grb7 SH2 domain-binding nonphosphorylated peptides in the yeast two-hybrid system. These peptides have a conserved GIPT/K/N sequence at the N-terminus and G/WD/IP at the C-terminus, and the region between the N-and C-terminus contains fifteen amino acids enriched with serines, threonines and prolines. The association between the nonphosphorylated peptides and the Grb7 SH2 domain occurred in vitro and ex vivo. When competing for binding to the Grb7 SH2 domain in a complex, one synthesized nonphosphorylated ligand, containing the twenty-two amino acid-motif sequence, showed at least comparable affinity to the phosphorylated ligand of ErbB3 in vitro, and its overexpression inhibited the proliferation of SK-BR-3 cells. Such nonphosphorylated peptides may be useful for rational design of drugs targeted against cancers that express high levels of Grb7 protein.
Genomic sequence around butterfly wing development genes: annotation and comparative analysis.

Directory of Open Access Journals (Sweden)

Inês C Conceição

Full Text Available BACKGROUND: Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. METHODOLOGY/PRINCIPAL FINDINGS: We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes. CONCLUSIONS: The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1 the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2 the high
Evolutionary conservation of essential and highly expressed genes in Pseudomonas aeruginosa

Directory of Open Access Journals (Sweden)

Scharfe Maren

2010-04-01

Full Text Available Abstract Background The constant increase in development and spread of bacterial resistance to antibiotics poses a serious threat to human health. New sequencing technologies are now on the horizon that will yield massive increases in our capacity for DNA sequencing and will revolutionize the drug discovery process. Since essential genes are promising novel antibiotic targets, the prediction of gene essentiality based on genomic information has become a major focus. Results In this study we demonstrate that pooled sequencing is applicable for the analysis of sequence variations of strain collections with more than 10 individual isolates. Pooled sequencing of 36 clinical Pseudomonas aeruginosa isolates revealed that essential and highly expressed proteins evolve at lower rates, whereas extracellular proteins evolve at higher rates. We furthermore refined the list of experimentally essential P. aeruginosa genes, and identified 980 genes that show no sequence variation at all. Among the conserved nonessential genes we found several that are involved in regulation, motility and virulence, indicating that they represent factors of evolutionary importance for the lifestyle of a successful environmental bacterium and opportunistic pathogen. Conclusion The detailed analysis of a comprehensive set of P. aeruginosa genomes in this study clearly disclosed detailed information of the genomic makeup and revealed a large set of highly conserved genes that play an important role for the lifestyle of this microorganism. Sequencing strain collections enables for a detailed and extensive identification of sequence variations as potential bacterial adaptation processes, e.g., during the development of antibiotic resistance in the clinical setting and thus may be the basis to uncover putative targets for novel treatment strategies.
Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.

Science.gov (United States)

van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J

2017-10-01

Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially ( P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is
Comparative analysis of the full genome sequence of European bat lyssavirus type 1 and type 2 with other lyssaviruses and evidence for a conserved transcription termination and polyadenylation motif in the G-L 3' non-translated region.

Science.gov (United States)

Marston, D A; McElhinney, L M; Johnson, N; Müller, T; Conzelmann, K K; Tordo, N; Fooks, A R

2007-04-01

We report the first full-length genomic sequences for European bat lyssavirus type-1 (EBLV-1) and type-2 (EBLV-2). The EBLV-1 genomic sequence was derived from a virus isolated from a serotine bat in Hamburg, Germany, in 1968 and the EBLV-2 sequence was derived from a virus isolate from a human case of rabies that occurred in Scotland in 2002. A long-distance PCR strategy was used to amplify the open reading frames (ORFs), followed by standard and modified RACE (rapid amplification of cDNA ends) techniques to amplify the 3' and 5' ends. The lengths of each complete viral genome for EBLV-1 and EBLV-2 were 11 966 and 11 930 base pairs, respectively, and follow the standard rhabdovirus genome organization of five viral proteins. Comparison with other lyssavirus sequences demonstrates variation in degrees of homology, with the genomic termini showing a high degree of complementarity. The nucleoprotein was the most conserved, both intra- and intergenotypically, followed by the polymerase (L), matrix and glyco- proteins, with the phosphoprotein being the most variable. In addition, we have shown that the two EBLVs utilize a conserved transcription termination and polyadenylation (TTP) motif, approximately 50 nt upstream of the L gene start codon. All available lyssavirus sequences to date, with the exception of Pasteur virus (PV) and PV-derived isolates, use the second TTP site. This observation may explain differences in pathogenicity between lyssavirus strains, dependent on the length of the untranslated region, which might affect transcriptional activity and RNA stability.
Comparative genome sequencing of drosophila pseudoobscura: Chromosomal, gene and cis-element evolution

Energy Technology Data Exchange (ETDEWEB)

Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Todd, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catherine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenee; Verduzco, Daniel; Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

2004-04-01

The genome sequence of a second fruit fly, D. pseudoobscura, presents an opportunity for comparative analysis of a primary model organism D. melanogaster. The vast majority of Drosophila genes have remained on the same arm, but within each arm gene order has been extensively reshuffled leading to the identification of approximately 1300 syntenic blocks. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 35 My since divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome wide average consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than control sequences between the species but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a picture of repeat mediated chromosomal rearrangement, and high co-adaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila.
Discovery of Conservation and Diversification of miR171 Genes by Phylogenetic Analysis based on Global Genomes

Directory of Open Access Journals (Sweden)

Xudong Zhu

2015-07-01

Full Text Available The microRNA171 (miR171 family is widely distributed and highly conserved in a range of species and plays critical roles in regulating plant growth and development through repressing expression of ( transcription factors. However, information on the evolutionary conservation and functional diversification of the miRNA171 family members remains scanty. We reconstructed the phylogenetic relationships among miR171 precursor and mature sequences so as to investigate the extent and degree of evolutionary conservation of miR171 in (L. Heynh. (ath, grape ( L. (vvi, poplar ( Torr. & A.Gray ex Hook. (ptc, and rice ( L. (osa. Despite strong conservation of over 80%, some mature miR171 sequences, such as , and and , -, and -, have undergone critical sequence variation, leading to functional diversification, since they target non gene transcript(s. Phylogenetic analyses revealed a combination of old ancestral relationships and recent lineage-specific diversification in the miR171 family within the four model plants. The -regulatory motifs on the upstream promoter sequences of genes were highly divergent and shared some similar elements, indicating their possible contribution to the functional variation observed within the miR171 family. This study will buttress our understanding of the functional differentiation of miRNAs and the relationships of miRNA–target pairs based on the evolutionary history of genes.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

Science.gov (United States)

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-11-16

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution.

Science.gov (United States)

2004-12-09

We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.
Structure of the conserved hypothetical protein MAL13P1.257 from Plasmodium falciparum

International Nuclear Information System (INIS)

Holmes, Margaret A.; Buckner, Frederick S.; Van Voorhis, Wesley C.; Mehlin, Christopher; Boni, Erica; Earnest, Thomas N.; DeTitta, George; Luft, Joseph; Lauricella, Angela; Anderson, Lori; Kalyuzhniy, Oleksandr; Zucker, Frank; Schoenfeld, Lori W.; Hol, Wim G. J.; Merritt, Ethan A.

2006-01-01

The crystal structure of a conserved hypothetical protein, MAL13P1.257 from P. falciparum, has been determined at 2.17 Å resolution. The structure represents a new protein fold and is the first structural representative for Pfam sequence family PF05907. The structure of a conserved hypothetical protein, PlasmoDB sequence MAL13P1.257 from Plasmodium falciparum, Pfam sequence family PF05907, has been determined as part of the structural genomics effort of the Structural Genomics of Pathogenic Protozoa consortium. The structure was determined by multiple-wavelength anomalous dispersion at 2.17 Å resolution. The structure is almost entirely β-sheet; it consists of 15 β-strands and one short 3 10 -helix and represents a new protein fold. The packing of the two monomers in the asymmetric unit indicates that the biological unit may be a dimer.

Role of an Absolutely Conserved Tryptophan Pair in the Extracellular Domain of Cys-Loop Receptors

DEFF Research Database (Denmark)

Braun, Nina; Lynagh, Timothy; Yu, Rilei

2016-01-01

Cys-loop receptors mediate fast synaptic transmission in the nervous system, and their dysfunction is associated with a number of diseases. While some sequence variability is essential to ensure specific recognition of a chemically diverse set of ligands, other parts of the underlying amino acid...... sequences show a high degree of conservation, possibly to preserve the overall structural fold across the protein family. In this study, we focus on the only two absolutely conserved residues across the Cys-loop receptor family, two Trp side chains in the WXD motif of Loop D and in the WXPD motif of Loop A...
A conserved segmental duplication within ELA.

Science.gov (United States)

Brinkmeyer-Langford, C L; Murphy, W J; Childers, C P; Skow, L C

2010-12-01

The assembled genomic sequence of the horse major histocompatibility complex (MHC) (equine lymphocyte antigen, ELA) is very similar to the homologous human HLA, with the notable exception of a large segmental duplication at the boundary of ELA class I and class III that is absent in HLA. The segmental duplication consists of a ∼ 710 kb region of at least 11 repeated blocks: 10 blocks each contain an MHC class I-like sequence and the helicase domain portion of a BAT1-like sequence, and the remaining unit contains the full-length BAT1 gene. Similar genomic features were found in other Perissodactyls, indicating an ancient origin, which is consistent with phylogenetic analyses. Reverse-transcriptase PCR (RT-PCR) of mRNA from peripheral white blood cells of healthy and chronically or acutely infected horses detected transcription from predicted open reading frames in several of the duplicated blocks. This duplication is not present in the sequenced MHCs of most other mammals, although a similar feature at the same relative position is present in the feline MHC (FLA). Striking sequence conservation throughout Perissodactyl evolution is consistent with a functional role for at least some of the genes included within this segmental duplication. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.
A conserved cysteine motif is critical for rice ceramide kinase activity and function.

Directory of Open Access Journals (Sweden)

Fang-Cheng Bi

Full Text Available Ceramide kinase (CERK is a key regulator of cell survival in dicotyledonous plants and animals. Much less is known about the roles of CERK and ceramides in mediating cellular processes in monocot plants. Here, we report the characterization of a ceramide kinase, OsCERK, from rice (Oryza sativa spp. Japonica cv. Nipponbare and investigate the effects of ceramides on rice cell viability.OsCERK can complement the Arabidopsis CERK mutant acd5. Recombinant OsCERK has ceramide kinase activity with Michaelis-Menten kinetics and optimal activity at 7.0 pH and 40°C. Mg2+ activates OsCERK in a concentration-dependent manner. Importantly, a CXXXCXXC motif, conserved in all ceramide kinases and important for the activity of the human enzyme, is critical for OsCERK enzyme activity and in planta function. In a rice protoplast system, inhibition of CERK leads to cell death and the ratio of added ceramide and ceramide-1-phosphate, CERK's substrate and product, respectively, influences cell survival. Ceramide-induced rice cell death has apoptotic features and is an active process that requires both de novo protein synthesis and phosphorylation, respectively. Finally, mitochondria membrane potential loss previously associated with ceramide-induced cell death in Arabidopsis was also found in rice, but it occurred with different timing.OsCERK is a bona fide ceramide kinase with a functionally and evolutionarily conserved Cys-rich motif that plays an important role in modulating cell fate in plants. The vital function of the conserved motif in both human and rice CERKs suggests that the biochemical mechanism of CERKs is similar in animals and plants. Furthermore, ceramides induce cell death with similar features in monocot and dicot plants.
EEG potentials associated with artificial grammar learning in the primate brain.

Science.gov (United States)

Attaheri, Adam; Kikuchi, Yukiko; Milne, Alice E; Wilson, Benjamin; Alter, Kai; Petkov, Christopher I

2015-09-01

Electroencephalography (EEG) has identified human brain potentials elicited by Artificial Grammar (AG) learning paradigms, which present participants with rule-based sequences of stimuli. Nonhuman animals are sensitive to certain AGs; therefore, evaluating which EEG Event Related Potentials (ERPs) are associated with AG learning in nonhuman animals could identify evolutionarily conserved processes. We recorded EEG potentials during an auditory AG learning experiment in two Rhesus macaques. The animals were first exposed to sequences of nonsense words generated by the AG. Then surface-based ERPs were recorded in response to sequences that were 'consistent' with the AG and 'violation' sequences containing illegal transitions. The AG violations strongly modulated an early component, potentially homologous to the Mismatch Negativity (mMMN), a P200 and a late frontal positivity (P500). The macaque P500 is similar in polarity and time of occurrence to a late EEG positivity reported in human AG learning studies but might differ in functional role. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Differential evolution-simulated annealing for multiple sequence alignment

Science.gov (United States)

Addawe, R. C.; Addawe, J. M.; Sueño, M. R. K.; Magadia, J. C.

2017-10-01

Multiple sequence alignments (MSA) are used in the analysis of molecular evolution and sequence structure relationships. In this paper, a hybrid algorithm, Differential Evolution - Simulated Annealing (DESA) is applied in optimizing multiple sequence alignments (MSAs) based on structural information, non-gaps percentage and totally conserved columns. DESA is a robust algorithm characterized by self-organization, mutation, crossover, and SA-like selection scheme of the strategy parameters. Here, the MSA problem is treated as a multi-objective optimization problem of the hybrid evolutionary algorithm, DESA. Thus, we name the algorithm as DESA-MSA. Simulated sequences and alignments were generated to evaluate the accuracy and efficiency of DESA-MSA using different indel sizes, sequence lengths, deletion rates and insertion rates. The proposed hybrid algorithm obtained acceptable solutions particularly for the MSA problem evaluated based on the three objectives.
Puzzling sequences: studying microbial genomes from 'Ötzi'

International Nuclear Information System (INIS)

Rattei, T.

2012-01-01

Ancient remains, and mummies in particular, are of central value for archaeological research. The Tyrolean iceman “Ötzi” was conserved in a glacier of the Ötztal Alps about 5000 years ago. Aside from morphological and phenotypical classification, the determination of DNA sequences and the subsequent genome analyses have been first applied to mitochondrial DNA and then been extended to genomic DNA. Typically also ancient microbial DNA is sequenced. These sequences allow the identification of pathogens as well as studying the evolution of microorganisms. The talk will explain the metagenomic aspects of the “Ötzi” genome project and discuss the first results. (author)
TARM1 Is a Novel Leukocyte Receptor Complex-Encoded ITAM Receptor That Costimulates Proinflammatory Cytokine Secretion by Macrophages and Neutrophils

DEFF Research Database (Denmark)

Radjabova, Valeria; Mastroeni, Piero; Skjødt, Karsten

2015-01-01

We identified a novel, evolutionarily conserved receptor encoded within the human leukocyte receptor complex and syntenic region of mouse chromosome 7, named T cell-interacting, activating receptor on myeloid cells-1 (TARM1). The transmembrane region of TARM1 contained a conserved arginine residu...
Redefining the role of syndecans in C. elegans biology

DEFF Research Database (Denmark)

Gopal, Sandeep; Couchman, John; Pocock, Roger

2016-01-01

in the activation of several downstream signaling pathways. We identified a previously unappreciated role of syndecans in cytosolic calcium regulation in mammals that is conserved in C. elegans. We concluded that calcium regulation is the basic, evolutionarily conserved role for syndecans, which enables them...
Conservation of Three-Dimensional Helix-Loop-Helix Structure through the Vertebrate Lineage Reopens the Cold Case of Gonadotropin-Releasing Hormone-Associated Peptide.

Science.gov (United States)

Pérez Sirkin, Daniela I; Lafont, Anne-Gaëlle; Kamech, Nédia; Somoza, Gustavo M; Vissio, Paula G; Dufour, Sylvie

2017-01-01

GnRH-associated peptide (GAP) is the C-terminal portion of the gonadotropin-releasing hormone (GnRH) preprohormone. Although it was reported in mammals that GAP may act as a prolactin-inhibiting factor and can be co-secreted with GnRH into the hypophyseal portal blood, GAP has been practically out of the research circuit for about 20 years. Comparative studies highlighted the low conservation of GAP primary amino acid sequences among vertebrates, contributing to consider that this peptide only participates in the folding or carrying process of GnRH. Considering that the three-dimensional (3D) structure of a protein may define its function, the aim of this study was to evaluate if GAP sequences and 3D structures are conserved in the vertebrate lineage. GAP sequences from various vertebrates were retrieved from databases. Analysis of primary amino acid sequence identity and similarity, molecular phylogeny, and prediction of 3D structures were performed. Amino acid sequence comparison and phylogeny analyses confirmed the large variation of GAP sequences throughout vertebrate radiation. In contrast, prediction of the 3D structure revealed a striking conservation of the 3D structure of GAP1 (GAP associated with the hypophysiotropic type 1 GnRH), despite low amino acid sequence conservation. This GAP1 peptide presented a typical helix-loop-helix (HLH) structure in all the vertebrate species analyzed. This HLH structure could also be predicted for GAP2 in some but not all vertebrate species and in none of the GAP3 analyzed. These results allowed us to infer that selective pressures have maintained GAP1 HLH structure throughout the vertebrate lineage. The conservation of the HLH motif, known to confer biological activity to various proteins, suggests that GAP1 peptides may exert some hypophysiotropic biological functions across vertebrate radiation.
Conservation of Three-Dimensional Helix-Loop-Helix Structure through the Vertebrate Lineage Reopens the Cold Case of Gonadotropin-Releasing Hormone-Associated Peptide

Directory of Open Access Journals (Sweden)

Daniela I. Pérez Sirkin

2017-08-01

Full Text Available GnRH-associated peptide (GAP is the C-terminal portion of the gonadotropin-releasing hormone (GnRH preprohormone. Although it was reported in mammals that GAP may act as a prolactin-inhibiting factor and can be co-secreted with GnRH into the hypophyseal portal blood, GAP has been practically out of the research circuit for about 20 years. Comparative studies highlighted the low conservation of GAP primary amino acid sequences among vertebrates, contributing to consider that this peptide only participates in the folding or carrying process of GnRH. Considering that the three-dimensional (3D structure of a protein may define its function, the aim of this study was to evaluate if GAP sequences and 3D structures are conserved in the vertebrate lineage. GAP sequences from various vertebrates were retrieved from databases. Analysis of primary amino acid sequence identity and similarity, molecular phylogeny, and prediction of 3D structures were performed. Amino acid sequence comparison and phylogeny analyses confirmed the large variation of GAP sequences throughout vertebrate radiation. In contrast, prediction of the 3D structure revealed a striking conservation of the 3D structure of GAP1 (GAP associated with the hypophysiotropic type 1 GnRH, despite low amino acid sequence conservation. This GAP1 peptide presented a typical helix-loop-helix (HLH structure in all the vertebrate species analyzed. This HLH structure could also be predicted for GAP2 in some but not all vertebrate species and in none of the GAP3 analyzed. These results allowed us to infer that selective pressures have maintained GAP1 HLH structure throughout the vertebrate lineage. The conservation of the HLH motif, known to confer biological activity to various proteins, suggests that GAP1 peptides may exert some hypophysiotropic biological functions across vertebrate radiation.
Climate-Driven Reshuffling of Species and Genes: Potential Conservation Roles for Species Translocations and Recombinant Hybrid Genotypes

Directory of Open Access Journals (Sweden)

Jon Mark Scriber

2013-12-01

, and genomes may become increasingly ecologically and evolutionarily predictable, but future conservation management programs are more likely to remain constrained by human behavior than by lack of academic knowledge.
Climate-Driven Reshuffling of Species and Genes: Potential Conservation Roles for Species Translocations and Recombinant Hybrid Genotypes.

Science.gov (United States)

Scriber, Jon Mark

2013-12-24

increasingly ecologically and evolutionarily predictable, but future conservation management programs are more likely to remain constrained by human behavior than by lack of academic knowledge.
Essentials of Conservation Biotechnology: A mini review

Science.gov (United States)

Merlyn Keziah, S.; Subathra Devi, C.

2017-11-01

Equilibrium of biodiversity is essential for the maintenance of the ecosystem as they are interdependent on each other. The decline in biodiversity is a global problem and an inevitable threat to the mankind. Major threats include unsustainable exploitation, habitat destruction, fragmentation, transformation, genetic pollution, invasive exotic species and degradation. This review covers the management strategies of biotechnology which include sin situ, ex situ conservation, computerized taxonomic analysis through construction of phylogenetic trees, calculating genetic distance, prioritizing the group for conservation, digital preservation of biodiversities within the coding and decoding keys, molecular approaches to asses biodiversity like polymerase chain reaction, real time, randomly amplified polymorphic DNA, restriction fragment length polymorphism, amplified fragment length polymorphism, single sequence repeats, DNA finger printing, single nucleotide polymorphism, cryopreservation and vitrification.
DNA barcodes for ecology, evolution, and conservation.

Science.gov (United States)

Kress, W John; García-Robledo, Carlos; Uriarte, Maria; Erickson, David L

2015-01-01

The use of DNA barcodes, which are short gene sequences taken from a standardized portion of the genome and used to identify species, is entering a new phase of application as more and more investigations employ these genetic markers to address questions relating to the ecology and evolution of natural systems. The suite of DNA barcode markers now applied to specific taxonomic groups of organisms are proving invaluable for understanding species boundaries, community ecology, functional trait evolution, trophic interactions, and the conservation of biodiversity. The application of next-generation sequencing (NGS) technology will greatly expand the versatility of DNA barcodes across the Tree of Life, habitats, and geographies as new methodologies are explored and developed. Published by Elsevier Ltd.
Conservation of the TRAPPII-specific subunits of a Ypt/Rab exchanger complex

Directory of Open Access Journals (Sweden)

Yoo Eunice

2007-02-01

Full Text Available Abstract Background Ypt/Rab GTPases and their GEF activators regulate intra-cellular trafficking in all eukaryotic cells. In S. cerivisiae, the modular TRAPP complex acts as a GEF for the Golgi gatekeepers: Ypt1 and the functional pair Ypt31/32. While TRAPPI, which acts in early Golgi, is conserved from fungi to animals, not much is known about TRAPPII, which acts in late Golgi and consists of TRAPPI plus three additional subunits. Results Here, we show a phylogenetic analysis of the three TRAPPII-specific subunits. One copy of each of the two essential subunits, Trs120 and Trs130, is present in almost every fully sequenced eukaryotic genome. Moreover, the primary, as well as the predicted secondary, structure of the Trs120- and Trs130-related sequences are conserved from fungi to animals. The mammalian orthologs of Trs120 and Trs130, NIBP and TMEM1, respectively, are candidates for human disorders. Currently, NIBP is implicated in signaling, and TMEM1 is suggested to have trans-membrane domains (TMDs and to function as a membrane channel. However, we show here that the yeast Trs130 does not function as a trans-membrane protein, and the human TMEM1 does not contain putative TMDs. The non-essential subunit, Trs65, is conserved only among many fungi and some unicellular eukaryotes. Multiple alignment analysis of each TRAPPII-specific subunit revealed conserved domains that include highly conserved amino acids. Conclusion We suggest that the function of both NIBP and TMEM1 in the regulation of intra-cellular trafficking is conserved from yeast to man. The conserved domains and amino acids discovered here can be used for functional analysis that should help to resolve the differences in the assigned functions of these proteins in fungi and animals.
Identification and Characterization of a Chloroplast-Targeted Obg GTPase in Dendrobium officinale.

Science.gov (United States)

Chen, Ji; Deng, Feng; Deng, Mengsheng; Han, Jincheng; Chen, Jianbin; Wang, Li; Yan, Shen; Tong, Kai; Liu, Fan; Tian, Mengliang

2016-12-01

Bacterial homologous chloroplast-targeted Obg GTPases (ObgCs) belong to the plant-typical Obg group, which is involved in diverse physiological processes during chloroplast development. However, the evolutionarily conserved function of ObgC in plants remains elusive and requires further investigation. In this study, we identified DoObgC from an epiphytic plant Dendrobium officinale and demonstrated the characteristics of DoObgC. Sequence analysis indicated that DoObgC is highly conserved with other plant ObgCs, which contain the chloroplast transit peptide (cTP), Obg fold, G domain, and OCT regions. The C terminus of DoObgC lacking the chloroplast-targeting cTP region, DoObgC Δ1-160 , showed strong similarity to ObgE and other bacterial Obgs. Overexpression of DoObgC Δ1-160 in Escherichia coli caused slow cell growth and an increased number of elongated cells. This phenotype was consistent with the phenotype of cells overexpressing ObgE. Furthermore, the expression of recombinant DoObgC Δ1-160 enhanced the cell persistence of E. coli to streptomycin. Results of transient expression assays revealed that DoObgC was localized to chloroplasts. Moreover, we demonstrated that DoObgC could rescue the embryotic lethal phenotype of the Arabidopsis obgc-t mutant, suggesting that DoObgC is a functional homolog to Arabidopsis AtObgC in D. officinale. Gene expression profiles showed that DoObgC was expressed in leaf-specific and light-dependent patterns and that DoObgC responded to wounding treatments. Our previous and present studies reveal that ObgC has an evolutionarily conserved role in ribosome biogenesis to adapt chloroplast development to the environment.
The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase.

Science.gov (United States)

Haggarty, N W; Dunbar, B; Fothergill, L A

1983-01-01

The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase, comprising 239 residues, was determined. The sequence was deduced from the four cyanogen bromide fragments, and from the peptides derived from these fragments after digestion with a number of proteolytic enzymes. Comparison of this sequence with that of the yeast glycolytic enzyme, phosphoglycerate mutase, shows that these enzymes are 47% identical. Most, but not all, of the residues implicated as being important for the activity of the glycolytic mutase are conserved in the erythrocyte diphosphoglycerate mutase. PMID:6313356
Multiplexed microsatellite recovery using massively parallel sequencing

Science.gov (United States)

Jennings, T.N.; Knaus, B.J.; Mullins, T.D.; Haig, S.M.; Cronn, R.C.

2011-01-01

Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the comparatively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availability of massively parallel sequencing (MPS), it is now possible to sequence microsatellite-enriched genomic libraries in multiplex pools. To test this approach, we prepared seven microsatellite-enriched, barcoded genomic libraries from diverse taxa (two conifer trees, five birds) and sequenced these on one lane of the Illumina Genome Analyzer using paired-end 80-bp reads. In this experiment, we screened 6.1 million sequences and identified 356958 unique microreads that contained di- or trinucleotide microsatellites. Examination of four species shows that our conversion rate from raw sequences to polymorphic markers compares favourably to Sanger- and 454-based methods. The advantage of multiplexed MPS is that the staggering capacity of modern microread sequencing is spread across many libraries; this reduces sample preparation and sequencing costs to less than $400 (USD) per species. This price is sufficiently low that microsatellite libraries could be prepared and sequenced for all 1373 organisms listed as 'threatened' and 'endangered' in the United States for under $0.5M (USD).
Lymphatic filarial species differentiation using evolutionarily modified tandem repeats: generation of new genetic markers.

Science.gov (United States)

Sakthidevi, Moorthy; Murugan, Vadivel; Hoti, Sugeerappa Laxmanappa; Kaliraj, Perumal

2010-05-01

Polymerase chain reaction based methods are promising tools for the monitoring and evaluation of the Global Program for the Elimination of Lymphatic Filariasis. The currently available PCR methods do not differentiate the DNA of Wuchereria bancrofti or Brugia malayi by a single PCR and hence are cumbersome. Therefore, we designed a single step PCR strategy for differentiating Bancroftian infection from Brugian infection based on a newly identified gene from the W. bancrofti genome, abundant larval transcript-2 (alt-2), which is abundantly expressed. The difference in PCR product sizes generated from the presence or absence of evolutionarily altered tandem repeats in alt-2 intron-3 differentiated W. bancrofti from B. malayi. The analysis was performed on the genomic DNA of microfilariae from a number of patient blood samples or microfilariae positive slides from different Indian geographical regions. The assay gave consistent results, differentiating the two filarial parasite species accurately. This alt-2 intron-3 based PCR assay can be a potential tool for the diagnosis and differentiation of co-infections by lymphatic filarial parasites. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Pervasive Effects of Aging on Gene Expression in Wild Wolves

Science.gov (United States)

Charruau, Pauline; Johnston, Rachel A.; Stahler, Daniel R.; Lea, Amanda; Snyder-Mackler, Noah; Smith, Douglas W.; vonHoldt, Bridgett M.; Cole, Steven W.; Tung, Jenny; Wayne, Robert K.

2016-01-01

Abstract Gene expression levels change as an individual ages and responds to environmental conditions. With the exception of humans, such patterns have principally been studied under controlled conditions, overlooking the array of developmental and environmental influences that organisms encounter under conditions in which natural selection operates. We used high-throughput RNA sequencing (RNA-Seq) of whole blood to assess the relative impacts of social status, age, disease, and sex on gene expression levels in a natural population of gray wolves (Canis lupus). Our findings suggest that age is broadly associated with gene expression levels, whereas other examined factors have minimal effects on gene expression patterns. Further, our results reveal evolutionarily conserved signatures of senescence, such as immunosenescence and metabolic aging, between wolves and humans despite major differences in life history and environment. The effects of aging on gene expression levels in wolves exhibit conservation with humans, but the more rapid expression differences observed in aging wolves is evolutionarily appropriate given the species’ high level of extrinsic mortality due to intraspecific aggression. Some expression changes that occur with age can facilitate physical age-related changes that may enhance fitness in older wolves. However, the expression of these ancestral patterns of aging in descendant modern dogs living in highly modified domestic environments may be maladaptive and cause disease. This work provides evolutionary insight into aging patterns observed in domestic dogs and demonstrates the applicability of studying natural populations to investigate the mechanisms of aging. PMID:27189566

The mitochondrial genome of the stingless bee Melipona bicolor (Hymenoptera, Apidae, Meliponini: sequence, gene organization and a unique tRNA translocation event conserved across the tribe Meliponini

Directory of Open Access Journals (Sweden)

Daniela Silvestre

2008-01-01

Full Text Available At present a complete mtDNA sequence has been reported for only two hymenopterans, the Old World honey bee, Apis mellifera and the sawfly Perga condei. Among the bee group, the tribe Meliponini (stingless bees has some distinction due to its Pantropical distribution, great number of species and large importance as main pollinators in several ecosystems, including the Brazilian rain forest. However few molecular studies have been conducted on this group of bees and few sequence data from mitochondrial genomes have been described. In this project, we PCR amplified and sequenced 78% of the mitochondrial genome of the stingless bee Melipona bicolor (Apidae, Meliponini. The sequenced region contains all of the 13 mitochondrial protein-coding genes, 18 of 22 tRNA genes, and both rRNA genes (one of them was partially sequenced. We also report the genome organization (gene content and order, gene translation, genetic code, and other molecular features, such as base frequencies, codon usage, gene initiation and termination. We compare these characteristics of M. bicolor to those of the mitochondrial genome of A. mellifera and other insects. A highly biased A+T content is a typical characteristic of the A. mellifera mitochondrial genome and it was even more extreme in that of M. bicolor. Length and compositional differences between M. bicolor and A. mellifera genes were detected and the gene order was compared. Eleven tRNA gene translocations were observed between these two species. This latter finding was surprising, considering the taxonomic proximity of these two bee tribes. The tRNA Lys gene translocation was investigated within Meliponini and showed high conservation across the Pantropical range of the tribe.
Selection on start codons in prokaryotes and potential compensatory nucleotide substitutions.

Science.gov (United States)

Belinky, Frida; Rogozin, Igor B; Koonin, Eugene V

2017-09-29

Reconstruction of the evolution of start codons in 36 groups of closely related bacterial and archaeal genomes reveals purifying selection affecting AUG codons. The AUG starts are replaced by GUG and especially UUG significantly less frequently than expected under the neutral expectation derived from the frequencies of the respective nucleotide triplet substitutions in non-coding regions and in 4-fold degenerate sites. Thus, AUG is the optimal start codon that is actively maintained by purifying selection. However, purifying selection on start codons is significantly weaker than the selection on the same codons in coding sequences, although the switches between the codons result in conservative amino acid substitutions. The only exception is the AUG to UUG switch that is strongly selected against among start codons. Selection on start codons is most pronounced in evolutionarily conserved, highly expressed genes. Mutation of the start codon to a sub-optimal form (GUG or UUG) tends to be compensated by mutations in the Shine-Dalgarno sequence towards a stronger translation initiation signal. Together, all these findings indicate that in prokaryotes, translation start signals are subject to weak but significant selection for maximization of initiation rate and, consequently, protein production.
Viruses are a dominant driver of protein adaptation in mammals.

Science.gov (United States)

Enard, David; Cai, Le; Gwennap, Carina; Petrov, Dmitri A

2016-05-17

Viruses interact with hundreds to thousands of proteins in mammals, yet adaptation against viruses has only been studied in a few proteins specialized in antiviral defense. Whether adaptation to viruses typically involves only specialized antiviral proteins or affects a broad array of virus-interacting proteins is unknown. Here, we analyze adaptation in ~1300 virus-interacting proteins manually curated from a set of 9900 proteins conserved in all sequenced mammalian genomes. We show that viruses (i) use the more evolutionarily constrained proteins within the cellular functions they interact with and that (ii) despite this high constraint, virus-interacting proteins account for a high proportion of all protein adaptation in humans and other mammals. Adaptation is elevated in virus-interacting proteins across all functional categories, including both immune and non-immune functions. We conservatively estimate that viruses have driven close to 30% of all adaptive amino acid changes in the part of the human proteome conserved within mammals. Our results suggest that viruses are one of the most dominant drivers of evolutionary change across mammalian and human proteomes.
Conservation of Charge and Conservation of Current

OpenAIRE

Eisenberg, Bob

2016-01-01

Conservation of current and conservation of charge are nearly the same thing: when enough is known about charge movement, conservation of current can be derived from conservation of charge, in ideal dielectrics, for example. Conservation of current is enforced implicitly in ideal dielectrics by theories that conserve charge. But charge movement in real materials like semiconductors or ionic solutions is never ideal. We present an apparently universal derivation of conservation of current and ...
A structural study for the optimisation of functional motifs encoded in protein sequences

Directory of Open Access Journals (Sweden)

Helmer-Citterich Manuela

2004-04-01

Full Text Available Abstract Background A large number of PROSITE patterns select false positives and/or miss known true positives. It is possible that – at least in some cases – the weak specificity and/or sensitivity of a pattern is due to the fact that one, or maybe more, functional and/or structural key residues are not represented in the pattern. Multiple sequence alignments are commonly used to build functional sequence patterns. If residues structurally conserved in proteins sharing a function cannot be aligned in a multiple sequence alignment, they are likely to be missed in a standard pattern construction procedure. Results Here we present a new procedure aimed at improving the sensitivity and/ or specificity of poorly-performing patterns. The procedure can be summarised as follows: 1. residues structurally conserved in different proteins, that are true positives for a pattern, are identified by means of a computational technique and by visual inspection. 2. the sequence positions of the structurally conserved residues falling outside the pattern are used to build extended sequence patterns. 3. the extended patterns are optimised on the SWISS-PROT database for their sensitivity and specificity. The method was applied to eight PROSITE patterns. Whenever structurally conserved residues are found in the surface region close to the pattern (seven out of eight cases, the addition of information inferred from structural analysis is shown to improve pattern selectivity and in some cases selectivity and sensitivity as well. In some of the cases considered the procedure allowed the identification of functionally interesting residues, whose biological role is also discussed. Conclusion Our method can be applied to any type of functional motif or pattern (not only PROSITE ones which is not able to select all and only the true positive hits and for which at least two true positive structures are available. The computational technique for the identification of
BLSSpeller: exhaustive comparative discovery of conserved cis-regulatory elements.

Science.gov (United States)

De Witte, Dieter; Van de Velde, Jan; Decap, Dries; Van Bel, Michiel; Audenaert, Pieter; Demeester, Piet; Dhoedt, Bart; Vandepoele, Klaas; Fostier, Jan

2015-12-01

The accurate discovery and annotation of regulatory elements remains a challenging problem. The growing number of sequenced genomes creates new opportunities for comparative approaches to motif discovery. Putative binding sites are then considered to be functional if they are conserved in orthologous promoter sequences of multiple related species. Existing methods for comparative motif discovery usually rely on pregenerated multiple sequence alignments, which are difficult to obtain for more diverged species such as plants. As a consequence, misaligned regulatory elements often remain undetected. We present a novel algorithm that supports both alignment-free and alignment-based motif discovery in the promoter sequences of related species. Putative motifs are exhaustively enumerated as words over the IUPAC alphabet and screened for conservation using the branch length score. Additionally, a confidence score is established in a genome-wide fashion. In order to take advantage of a cloud computing infrastructure, the MapReduce programming model is adopted. The method is applied to four monocotyledon plant species and it is shown that high-scoring motifs are significantly enriched for open chromatin regions in Oryza sativa and for transcription factor binding sites inferred through protein-binding microarrays in O.sativa and Zea mays. Furthermore, the method is shown to recover experimentally profiled ga2ox1-like KN1 binding sites in Z.mays. BLSSpeller was written in Java. Source code and manual are available at http://bioinformatics.intec.ugent.be/blsspeller Klaas.Vandepoele@psb.vib-ugent.be or jan.fostier@intec.ugent.be. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Identification of multiple distinct Snf2 subfamilies with conserved structural motifs.

Science.gov (United States)

Flaus, Andrew; Martin, David M A; Barton, Geoffrey J; Owen-Hughes, Tom

2006-01-01

The Snf2 family of helicase-related proteins includes the catalytic subunits of ATP-dependent chromatin remodelling complexes found in all eukaryotes. These act to regulate the structure and dynamic properties of chromatin and so influence a broad range of nuclear processes. We have exploited progress in genome sequencing to assemble a comprehensive catalogue of over 1300 Snf2 family members. Multiple sequence alignment of the helicase-related regions enables 24 distinct subfamilies to be identified, a considerable expansion over earlier surveys. Where information is known, there is a good correlation between biological or biochemical function and these assignments, suggesting Snf2 family motor domains are tuned for specific tasks. Scanning of complete genomes reveals all eukaryotes contain members of multiple subfamilies, whereas they are less common and not ubiquitous in eubacteria or archaea. The large sample of Snf2 proteins enables additional distinguishing conserved sequence blocks within the helicase-like motor to be identified. The establishment of a phylogeny for Snf2 proteins provides an opportunity to make informed assignments of function, and the identification of conserved motifs provides a framework for understanding the mechanisms by which these proteins function.
Latency transition of plasminogen activator inhibitor type 1 is evolutionarily conserved

DEFF Research Database (Denmark)

Jendroszek, Agnieszka; Sønnichsen, Malene; Chana Munoz, Andres

2017-01-01

relevance of latency transition. In order to study the origin of PAI-1 latency transition, we produced PAI-1 from Spiny dogfish shark (Squalus acanthias) and African lungfish (Protopterus sp.), which represent central species in the evolution of vertebrates. Although human PAI-1 and the non-mammalian PAI-1...
Cytochrome b conservation between six camel breeds reared in Egypt

Directory of Open Access Journals (Sweden)

Othman E. Othman

2017-06-01

It is concluded that cyto b sequence is highly conserved among all camel breeds reared in Egypt which belong to Camelus dromedaries in addition to the advantage of cyto b in differentiation between different livestock sources which enables it to widely use for the adulteration detection in mixed meat.
Combining specificity determining and conserved residues improves functional site prediction

Directory of Open Access Journals (Sweden)

Gelfand Mikhail S

2009-06-01

Full Text Available Abstract Background Predicting the location of functionally important sites from protein sequence and/or structure is a long-standing problem in computational biology. Most current approaches make use of sequence conservation, assuming that amino acid residues conserved within a protein family are most likely to be functionally important. Most often these approaches do not consider many residues that act to define specific sub-functions within a family, or they make no distinction between residues important for function and those more relevant for maintaining structure (e.g. in the hydrophobic core. Many protein families bind and/or act on a variety of ligands, meaning that conserved residues often only bind a common ligand sub-structure or perform general catalytic activities. Results Here we present a novel method for functional site prediction based on identification of conserved positions, as well as those responsible for determining ligand specificity. We define Specificity-Determining Positions (SDPs, as those occupied by conserved residues within sub-groups of proteins in a family having a common specificity, but differ between groups, and are thus likely to account for specific recognition events. We benchmark the approach on enzyme families of known 3D structure with bound substrates, and find that in nearly all families residues predicted by SDPsite are in contact with the bound substrate, and that the addition of SDPs significantly improves functional site prediction accuracy. We apply SDPsite to various families of proteins containing known three-dimensional structures, but lacking clear functional annotations, and discusse several illustrative examples. Conclusion The results suggest a better means to predict functional details for the thousands of protein structures determined prior to a clear understanding of molecular function.
Bioactive endophytes warrant intensified exploration and conservation.

Science.gov (United States)

Smith, Stephen A; Tank, David C; Boulanger, Lori-Ann; Bascom-Slack, Carol A; Eisenman, Kaury; Kingery, David; Babbs, Beatrice; Fenn, Kathleen; Greene, Joshua S; Hann, Bradley D; Keehner, Jocelyn; Kelley-Swift, Elizabeth G; Kembaiyan, Vivek; Lee, Sun Jin; Li, Puyao; Light, David Y; Lin, Emily H; Ma, Cong; Moore, Emily; Schorn, Michelle A; Vekhter, Daniel; Nunez, Percy V; Strobel, Gary A; Donoghue, Michael J; Strobel, Scott A

2008-08-25

A key argument in favor of conserving biodiversity is that as yet undiscovered biodiversity will yield products of great use to humans. However, the link between undiscovered biodiversity and useful products is largely conjectural. Here we provide direct evidence from bioassays of endophytes isolated from tropical plants and bioinformatic analyses that novel biology will indeed yield novel chemistry of potential value. We isolated and cultured 135 endophytic fungi and bacteria from plants collected in Peru. nrDNAs were compared to samples deposited in GenBank to ascertain the genetic novelty of cultured specimens. Ten endophytes were found to be as much as 15-30% different than any sequence in GenBank. Phylogenetic trees, using the most similar sequences in GenBank, were constructed for each endophyte to measure phylogenetic distance. Assays were also conducted on each cultured endophyte to record bioactivity, of which 65 were found to be bioactive. The novelty of our contribution is that we have combined bioinformatic analyses that document the diversity found in environmental samples with culturing and bioassays. These results highlight the hidden hyperdiversity of endophytic fungi and the urgent need to explore and conserve hidden microbial diversity. This study also showcases how undergraduate students can obtain data of great scientific significance.
Bioactive endophytes warrant intensified exploration and conservation.

Directory of Open Access Journals (Sweden)

Stephen A Smith

2008-08-01

Full Text Available A key argument in favor of conserving biodiversity is that as yet undiscovered biodiversity will yield products of great use to humans. However, the link between undiscovered biodiversity and useful products is largely conjectural. Here we provide direct evidence from bioassays of endophytes isolated from tropical plants and bioinformatic analyses that novel biology will indeed yield novel chemistry of potential value.We isolated and cultured 135 endophytic fungi and bacteria from plants collected in Peru. nrDNAs were compared to samples deposited in GenBank to ascertain the genetic novelty of cultured specimens. Ten endophytes were found to be as much as 15-30% different than any sequence in GenBank. Phylogenetic trees, using the most similar sequences in GenBank, were constructed for each endophyte to measure phylogenetic distance. Assays were also conducted on each cultured endophyte to record bioactivity, of which 65 were found to be bioactive.The novelty of our contribution is that we have combined bioinformatic analyses that document the diversity found in environmental samples with culturing and bioassays. These results highlight the hidden hyperdiversity of endophytic fungi and the urgent need to explore and conserve hidden microbial diversity. This study also showcases how undergraduate students can obtain data of great scientific significance.
Conservation genetics and genomics of amphibians and reptiles.

Science.gov (United States)

Shaffer, H Bradley; Gidiş, Müge; McCartney-Melstad, Evan; Neal, Kevin M; Oyamaguchi, Hilton M; Tellez, Marisa; Toffelmier, Erin M

2015-01-01

Amphibians and reptiles as a group are often secretive, reach their greatest diversity often in remote tropical regions, and contain some of the most endangered groups of organisms on earth. Particularly in the past decade, genetics and genomics have been instrumental in the conservation biology of these cryptic vertebrates, enabling work ranging from the identification of populations subject to trade and exploitation, to the identification of cryptic lineages harboring critical genetic variation, to the analysis of genes controlling key life history traits. In this review, we highlight some of the most important ways that genetic analyses have brought new insights to the conservation of amphibians and reptiles. Although genomics has only recently emerged as part of this conservation tool kit, several large-scale data sources, including full genomes, expressed sequence tags, and transcriptomes, are providing new opportunities to identify key genes, quantify landscape effects, and manage captive breeding stocks of at-risk species.
Gene family size conservation is a good indicator of evolutionary rates.

Science.gov (United States)

Chen, Feng-Chi; Chen, Chiuan-Jung; Li, Wen-Hsiung; Chuang, Trees-Juen

2010-08-01

The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.
Identification of putative regulatory upstream ORFs in the yeast genome using heuristics and evolutionary conservation

Directory of Open Access Journals (Sweden)

Bilsland Elizabeth

2007-08-01

Full Text Available Abstract Background The translational efficiency of an mRNA can be modulated by upstream open reading frames (uORFs present in certain genes. A uORF can attenuate translation of the main ORF by interfering with translational reinitiation at the main start codon. uORFs also occur by chance in the genome, in which case they do not have a regulatory role. Since the sequence determinants for functional uORFs are not understood, it is difficult to discriminate functional from spurious uORFs by sequence analysis. Results We have used comparative genomics to identify novel uORFs in yeast with a high likelihood of having a translational regulatory role. We examined uORFs, previously shown to play a role in regulation of translation in Saccharomyces cerevisiae, for evolutionary conservation within seven Saccharomyces species. Inspection of the set of conserved uORFs yielded the following three characteristics useful for discrimination of functional from spurious uORFs: a length between 4 and 6 codons, a distance from the start of the main ORF between 50 and 150 nucleotides, and finally a lack of overlap with, and clear separation from, neighbouring uORFs. These derived rules are inherently associated with uORFs with properties similar to the GCN4 locus, and may not detect most uORFs of other types. uORFs with high scores based on these rules showed a much higher evolutionary conservation than randomly selected uORFs. In a genome-wide scan in S. cerevisiae, we found 34 conserved uORFs from 32 genes that we predict to be functional; subsequent analysis showed the majority of these to be located within transcripts. A total of 252 genes were found containing conserved uORFs with properties indicative of a functional role; all but 7 are novel. Functional content analysis of this set identified an overrepresentation of genes involved in transcriptional control and development. Conclusion Evolutionary conservation of uORFs in yeasts can be traced up to 100
Genes involved in complex adaptive processes tend to have highly conserved upstream regions in mammalian genomes

Directory of Open Access Journals (Sweden)

Kohane Isaac

2005-11-01

Full Text Available Abstract Background Recent advances in genome sequencing suggest a remarkable conservation in gene content of mammalian organisms. The similarity in gene repertoire present in different organisms has increased interest in studying regulatory mechanisms of gene expression aimed at elucidating the differences in phenotypes. In particular, a proximal promoter region contains a large number of regulatory elements that control the expression of its downstream gene. Although many studies have focused on identification of these elements, a broader picture on the complexity of transcriptional regulation of different biological processes has not been addressed in mammals. The regulatory complexity may strongly correlate with gene function, as different evolutionary forces must act on the regulatory systems under different biological conditions. We investigate this hypothesis by comparing the conservation of promoters upstream of genes classified in different functional categories. Results By conducting a rank correlation analysis between functional annotation and upstream sequence alignment scores obtained by human-mouse and human-dog comparison, we found a significantly greater conservation of the upstream sequence of genes involved in development, cell communication, neural functions and signaling processes than those involved in more basic processes shared with unicellular organisms such as metabolism and ribosomal function. This observation persists after controlling for G+C content. Considering conservation as a functional signature, we hypothesize a higher density of cis-regulatory elements upstream of genes participating in complex and adaptive processes. Conclusion We identified a class of functions that are associated with either high or low promoter conservation in mammals. We detected a significant tendency that points to complex and adaptive processes were associated with higher promoter conservation, despite the fact that they have emerged
A Bioinformatic Pipeline for Monitoring of the Mutational Stability of Viral Drug Targets with Deep-Sequencing Technology.

Science.gov (United States)

Kravatsky, Yuri; Chechetkin, Vladimir; Fedoseeva, Daria; Gorbacheva, Maria; Kravatskaya, Galina; Kretova, Olga; Tchurikov, Nickolai

2017-11-23

The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
Amino acid sequence analysis of the annexin super-gene family of proteins.

Science.gov (United States)

Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J

1991-06-15

The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of
Survey sequencing and comparative analysis of the elephant shark (Callorhinchus milii genome.

Directory of Open Access Journals (Sweden)

Byrappa Venkatesh

2007-04-01

Full Text Available Owing to their phylogenetic position, cartilaginous fishes (sharks, rays, skates, and chimaeras provide a critical reference for our understanding of vertebrate genome evolution. The relatively small genome of the elephant shark, Callorhinchus milii, a chimaera, makes it an attractive model cartilaginous fish genome for whole-genome sequencing and comparative analysis. Here, the authors describe survey sequencing (1.4x coverage and comparative analysis of the elephant shark genome, one of the first cartilaginous fish genomes to be sequenced to this depth. Repetitive sequences, represented mainly by a novel family of short interspersed element-like and long interspersed element-like sequences, account for about 28% of the elephant shark genome. Fragments of approximately 15,000 elephant shark genes reveal specific examples of genes that have been lost differentially during the evolution of tetrapod and teleost fish lineages. Interestingly, the degree of conserved synteny and conserved sequences between the human and elephant shark genomes are higher than that between human and teleost fish genomes. Elephant shark contains putative four Hox clusters indicating that, unlike teleost fish genomes, the elephant shark genome has not experienced an additional whole-genome duplication. These findings underscore the importance of the elephant shark as a critical reference vertebrate genome for comparative analysis of the human and other vertebrate genomes. This study also demonstrates that a survey-sequencing approach can be applied productively for comparative analysis of distantly related vertebrate genomes.
The silkworm (Bombyx mori microRNAs and their expressions in multiple developmental stages.

Directory of Open Access Journals (Sweden)

Xiaomin Yu

Full Text Available BACKGROUND: MicroRNAs (miRNAs play crucial roles in various physiological processes through post-transcriptional regulation of gene expressions and are involved in development, metabolism, and many other important molecular mechanisms and cellular processes. The Bombyx mori genome sequence provides opportunities for a thorough survey for miRNAs as well as comparative analyses with other sequenced insect species. METHODOLOGY/PRINCIPAL FINDINGS: We identified 114 non-redundant conserved miRNAs and 148 novel putative miRNAs from the B. mori genome with an elaborate computational protocol. We also sequenced 6,720 clones from 14 developmental stage-specific small RNA libraries in which we identified 35 unique miRNAs containing 21 conserved miRNAs (including 17 predicted miRNAs and 14 novel miRNAs (including 11 predicted novel miRNAs. Among the 114 conserved miRNAs, we found six pairs of clusters evolutionarily conserved cross insect lineages. Our observations on length heterogeneity at 5' and/or 3' ends of nine miRNAs between cloned and predicted sequences, and three mature forms deriving from the same arm of putative pre-miRNAs suggest a mechanism by which miRNAs gain new functions. Analyzing development-related miRNAs expression at 14 developmental stages based on clone-sampling and stem-loop RT PCR, we discovered an unusual abundance of 33 sequences representing 12 different miRNAs and sharply fluctuated expression of miRNAs at larva-molting stage. The potential functions of several stage-biased miRNAs were also analyzed in combination with predicted target genes and silkworm's phenotypic traits; our results indicated that miRNAs may play key regulatory roles in specific developmental stages in the silkworm, such as ecdysis. CONCLUSIONS/SIGNIFICANCE: Taking a combined approach, we identified 118 conserved miRNAs and 151 novel miRNA candidates from the B. mori genome sequence. Our expression analyses by sampling miRNAs and real-time PCR over

PknB remains an essential and a conserved target for drug development in susceptible and MDR strains of M. Tuberculosis.

Science.gov (United States)

Gupta, Anamika; Pal, Sudhir K; Pandey, Divya; Fakir, Najneen A; Rathod, Sunita; Sinha, Dhiraj; SivaKumar, S; Sinha, Pallavi; Periera, Mycal; Balgam, Shilpa; Sekar, Gomathi; UmaDevi, K R; Anupurba, Shampa; Nema, Vijay

2017-08-18

The Mycobacterium tuberculosis (M.tb) protein kinase B (PknB) which is now proved to be essential for the growth and survival of M.tb, is a transmembrane protein with a potential to be a good drug target. However it is not known if this target remains conserved in otherwise resistant isolates from clinical origin. The present study describes the conservation analysis of sequences covering the inhibitor binding domain of PknB to assess if it remains conserved in susceptible and resistant clinical strains of mycobacteria picked from three different geographical areas of India. A total of 116 isolates from North, South and West India were used in the study with a variable profile of their susceptibilities towards streptomycin, isoniazid, rifampicin, ethambutol and ofloxacin. Isolates were also spoligotyped in order to find if the conservation pattern of pknB gene remain consistent or differ with different spoligotypes. The impact of variation as found in the study was analyzed using Molecular dynamics simulations. The sequencing results with 115/116 isolates revealed the conserved nature of pknB sequences irrespective of their susceptibility status and spoligotypes. The only variation found was in one strains wherein pnkB sequence had G to A mutation at 664 position translating into a change of amino acid, Valine to Isoleucine. After analyzing the impact of this sequence variation using Molecular dynamics simulations, it was observed that the variation is causing no significant change in protein structure or the inhibitor binding. Hence, the study endorses that PknB is an ideal target for drug development and there is no pre-existing or induced resistance with respect to the sequences involved in inhibitor binding. Also if the mutation that we are reporting for the first time is found again in subsequent work, it should be checked with phenotypic profile before drawing the conclusion that it would affect the activity in any way. Bioinformatics analysis in our study
Mutations in the newly identified RAX regulatory sequence are not a frequent cause of micro/anophthalmia.

Science.gov (United States)

Chassaing, Nicolas; Vigouroux, Adeline; Calvas, Patrick

2009-06-01

Microphthalmia and anophthalmia are at the severe end of the spectrum of abnormalities in ocular development. A few genes (SOX2, OTX2, RAX, and CHX10) have been implicated in isolated micro/anophthalmia, but causative mutations of these genes explain less than a quarter of these developmental defects. A specifically conserved SOX2/OTX2-mediated RAX expression regulatory sequence has recently been identified. We postulated that mutations in this sequence could lead to micro/anophthalmia, and thus we performed molecular screening of this regulatory element in patients suffering from micro/anophthalmia. Fifty-one patients suffering from nonsyndromic microphthalmia (n = 40) or anophthalmia (n = 11) were included in this study after negative molecular screening for SOX2, OTX2, RAX, and CHX10 mutations. Mutation screening of the RAX regulatory sequence was performed by direct sequencing for these patients. No mutations were identified in the highly conserved RAX regulatory sequence in any of the 51 patients. Mutations in the newly identified RAX regulatory sequence do not represent a frequent cause of nonsyndromic micro/anophthalmia.
Comparative analysis of vertebrate EIF2AK2 (PKR genes and assignment of the equine gene to ECA15q24–q25 and the bovine gene to BTA11q12–q15

Directory of Open Access Journals (Sweden)

Zharkikh Andrey A

2006-09-01

Full Text Available Abstract The structures of the canine, rabbit, bovine and equine EIF2AK2 genes were determined. Each of these genes has a 5' non-coding exon as well as 15 coding exons. All of the canine, bovine and equine EIF2AK2 introns have consensus donor and acceptor splice sites. In the equine EIF2AK2 gene, a unique single nucleotide polymorphism that encoded a Tyr329Cys substitution was detected. Regulatory elements predicted in the promoter region were conserved in ungulates, primates, rodents, Afrotheria (elephant and Insectifora (shrew. Western clawed frog and fugu EIF2AK2 gene sequences were detected in the USCS Genome Browser and compared to those of other vertebrate EIF2AK2 genes. A comparison of EIF2AK2 protein domains in vertebrates indicates that the kinase catalytic domains were evolutionarily more conserved than the nucleic acid-binding motifs. Nucleotide substitution rates were uniform among the vertebrate sequences with the exception of the zebrafish and goldfish EIF2AK2 genes, which showed substitution rates about 20% higher than those of other vertebrates. FISH was used to physically assign the horse and cattle genes to chromosome locations, ECA15q24–q25 and BTA11q12–15, respectively. Comparative mapping data confirmed conservation of synteny between ungulates, humans and rodents.
Sequence-based Screening for Rare Enzymes: New Insights into the World of AMDases Reveal a Conserved Motif and 58 Novel Enzymes Clustering in Eight Distinct Families.

Directory of Open Access Journals (Sweden)

Janine Maimanakos

2016-08-01

Full Text Available Arylmalonate-Decarboxylases (AMDases, EC 4.1.1.76 are very rare and mostly underexplored enzymes. Currently only four known and biochemically characterized representatives exist. However, their ability to decarboxylate α-disubstituted malonic acid derivatives to optically pure products without cofactors makes them attractive and promising candidates for the use as biocatalysts in industrial processes. Until now, AMDases could not be separated from other members of the aspartate/glutamate racemase superfamily based on their gene sequences. Within this work, a search algorithm was developed that enables a reliable prediction of AMDase activity for potential candidates. Based on specific sequence patterns and screening methods 58 novel AMDase candidate genes could be identified in this work. Thereby, AMDases with the conserved sequence pattern of Bordetella bronchiseptica’s prototype appeared to be limited to the classes of Alpha-, Beta- and Gammaproteobacteria. Amino acid homologies and comparison of gene surrounding sequences enabled the classification of eight enzyme clusters. Particularly striking is the accumulation of genes coding for different transporters of the TTT family, TRAP transporters and ABC transporters as well as genes coding for mandelate racemases/muconate lactonizing enzymes that might be involved in substrate uptake or degradation of AMDase products. Further, three novel AMDases were characterized which showed a high enantiomeric excess (>99% of the (R-enantiomer of flurbiprofen. These are the recombinant AmdA and AmdV from Variovorax sp. strains HH01 and HH02, originated from soil, and AmdP from Polymorphum gilvum found by a data base search. Altogether our findings give new insights into the class of AMDases and reveal many previously unknown enzyme candidates with high potential for bioindustrial processes.
The Highly Conserved Proline at Position 438 in Pseudorabies Virus gH Is Important for Regulation of Membrane Fusion

OpenAIRE

Schröter, Christina; Klupp, Barbara G.; Fuchs, Walter; Gerhard, Marika; Backovic, Marija; Rey, Felix A.; Mettenleiter, Thomas C.

2014-01-01

Membrane fusion in herpesviruses requires viral glycoproteins (g) gB and gH/gL. While gB is considered the actual fusion protein but is nonfusogenic per se, the function of gH/gL remains enigmatic. Crystal structures for different gH homologs are strikingly similar despite only moderate amino acid sequence conservation. A highly conserved sequence motif comprises the residues serine-proline-cysteine corresponding to positions 437 to 439 in pseudorabies virus (PrV) gH. The PrV-gH structure sho...
Conserved residues and their role in the structure, function, and stability of acyl-coenzyme A binding protein

DEFF Research Database (Denmark)

Kragelund, B B; Poulsen, K; Andersen, K V

1999-01-01

In the family of acyl-coenzyme A binding proteins, a subset of 26 sequence sites are identical in all eukaryotes and conserved throughout evolution of the eukaryotic kingdoms. In the context of the bovine protein, the importance of these 26 sequence positions for structure, function, stability...
Myxobolus cerebralis internal transcribed spacer 1 (ITS-1) sequences support recent spread of the parasite to North America and within Europe

Science.gov (United States)

Whipps, Christopher M.; El-Matbouli, M.; Hedrick, R.P.; Blazer, V.; Kent, M.L.

2004-01-01

Molecular approaches for resolving relationships among the Myxozoa have relied mainly on small subunit (SSU) ribosomal DNA (rDNA) sequence analysis. This region of the gene is generally used for higher phylogenetic studies, and the conservative nature of this gene may make it inadequate for intraspecific comparisons. Previous intraspecific studies of Myxobolus cerebralis based on molecular analyses reported that the sequence of SSU rDNA and the internal transcribed spacer (ITS) were highly conserved in representatives of the parasite from North America and Europe. Considering that the ITS is usually a more variable region than the SSU, we reanalyzed available sequences on GenBank and obtained sequences from other M. cerebralis representatives from the states of California and West Virginia in the USA and from Germany and Russia. With the exception of 7 base pairs, most of the sequence designated as ITS-1 in GenBank was a highly conserved portion of the rDNA near the 3-prime end of the SSU region. Nonetheless, the additional ITS-1 sequences obtained from the available geographic representatives were well conserved. It is unlikely that we would have observed virtually identical ITS-1 sequences between European and American M. cerebralis samples had it spread naturally over time, particularly when compared to the variation seen between isolates of another myxozoan (Kudoa thyrsites) that has most likely spread naturally. These data further support the hypothesis that the current distribution of M. cerebralis in North America is a result of recent introductions followed by dispersal via anthropogenic means, largely through the stocking of infected trout for sport fishing.
Sequence Capture versus Restriction Site Associated DNA Sequencing for Shallow Systematics.

Science.gov (United States)

Harvey, Michael G; Smith, Brian Tilston; Glenn, Travis C; Faircloth, Brant C; Brumfield, Robb T

2016-09-01

Sequence capture and restriction site associated DNA sequencing (RAD-Seq) are two genomic enrichment strategies for applying next-generation sequencing technologies to systematics studies. At shallow timescales, such as within species, RAD-Seq has been widely adopted among researchers, although there has been little discussion of the potential limitations and benefits of RAD-Seq and sequence capture. We discuss a series of issues that may impact the utility of sequence capture and RAD-Seq data for shallow systematics in non-model species. We review prior studies that used both methods, and investigate differences between the methods by re-analyzing existing RAD-Seq and sequence capture data sets from a Neotropical bird (Xenops minutus). We suggest that the strengths of RAD-Seq data sets for shallow systematics are the wide dispersion of markers across the genome, the relative ease and cost of laboratory work, the deep coverage and read overlap at recovered loci, and the high overall information that results. Sequence capture's benefits include flexibility and repeatability in the genomic regions targeted, success using low-quality samples, more straightforward read orthology assessment, and higher per-locus information content. The utility of a method in systematics, however, rests not only on its performance within a study, but on the comparability of data sets and inferences with those of prior work. In RAD-Seq data sets, comparability is compromised by low overlap of orthologous markers across species and the sensitivity of genetic diversity in a data set to an interaction between the level of natural heterozygosity in the samples examined and the parameters used for orthology assessment. In contrast, sequence capture of conserved genomic regions permits interrogation of the same loci across divergent species, which is preferable for maintaining comparability among data sets and studies for the purpose of drawing general conclusions about the impact of
Community standards for genomic resources, genetic conservation, and data integration

Science.gov (United States)

Jill Wegrzyn; Meg Staton; Emily Grau; Richard Cronn; C. Dana Nelson

2017-01-01

Genetics and genomics are increasingly important in forestry management and conservation. Next generation sequencing can increase analytical power, but still relies on building on the structure of previously acquired data. Data standards and data sharing allow the community to maximize the analytical power of high throughput genomics data. The landscape of incomplete...
Structural basis for sequence-specific recognition of DNA by TAL effectors

KAUST Repository

Deng, Dong; Yan, Chuangye; Pan, Xiaojing; Mahfouz, Magdy M.; Wang, Jiawei; Zhu, Jiankang; Shi, Yi Gong; Yan, Nieng

2012-01-01

TAL (transcription activator-like) effectors, secreted by phytopathogenic bacteria, recognize host DNA sequences through a central domain of tandem repeats. Each repeat comprises 33 to 35 conserved amino acids and targets a specific base pair
Investigation of genome sequences within the family Pasteurellaceae

DEFF Research Database (Denmark)

Angen, Øystein; Ussery, David

Introduction The bacterial genome sequences are now available for an increasing number of strains within the family Pasteurellaceae. At present, 24 Pasteurellaceae genomes are publicly available through internet databases, and another 40 genomes are being sequenced. This investigation will describe...... the core genome for both the family Pasteurellaceae and for the species Haemophilus influenzae. Methods Twenty genome sequences from the following species were included: Haemophilus influenzae (11 strains), Haemophilus ducreyi (1 strain), Histophilus somni (2 strains), Haemophilus parasuis (1 strain......), Actinobacillus pleuropneumoniae (2 strains), Actinobacillus succinogenes (1 strain), Mannheimia succiniciproducens (1 strain), and Pasteurella multocida (1 strain). The predicted proteins for each genome were BLASTed against each other, and a set of conserved core gene families was determined as described...
Conservation of nucleotide sequences for molecular diagnosis of Middle East respiratory syndrome coronavirus, 2015

Directory of Open Access Journals (Sweden)

Yuki Furuse

2015-11-01

Full Text Available Infection due to the Middle East respiratory syndrome coronavirus (MERS-CoV is widespread. The present study was performed to assess the protocols used for the molecular diagnosis of MERS-CoV by analyzing the nucleotide sequences of viruses detected between 2012 and 2015, including sequences from the large outbreak in eastern Asia in 2015. Although the diagnostic protocols were established only 2 years ago, mismatches between the sequences of primers/probes and viruses were found for several of the assays. Such mismatches could lead to a lower sensitivity of the assay, thereby leading to false-negative diagnosis. A slight modification in the primer design is suggested. Protocols for the molecular diagnosis of viral infections should be reviewed regularly after they are established, particularly for viruses that pose a great threat to public health such as MERS-CoV.
Single nucleotide polymorphism barcoding of cytochrome c oxidase I sequences for discriminating 17 species of Columbidae by decision tree algorithm.

Science.gov (United States)

Yang, Cheng-Hong; Wu, Kuo-Chuan; Dahms, Hans-Uwe; Chuang, Li-Yeh; Chang, Hsueh-Wei

2017-07-01

DNA barcodes are widely used in taxonomy, systematics, species identification, food safety, and forensic science. Most of the conventional DNA barcode sequences contain the whole information of a given barcoding gene. Most of the sequence information does not vary and is uninformative for a given group of taxa within a monophylum. We suggest here a method that reduces the amount of noninformative nucleotides in a given barcoding sequence of a major taxon, like the prokaryotes, or eukaryotic animals, plants, or fungi. The actual differences in genetic sequences, called single nucleotide polymorphism (SNP) genotyping, provide a tool for developing a rapid, reliable, and high-throughput assay for the discrimination between known species. Here, we investigated SNPs as robust markers of genetic variation for identifying different pigeon species based on available cytochrome c oxidase I (COI) data. We propose here a decision tree-based SNP barcoding (DTSB) algorithm where SNP patterns are selected from the DNA barcoding sequence of several evolutionarily related species in order to identify a single species with pigeons as an example. This approach can make use of any established barcoding system. We here firstly used as an example the mitochondrial gene COI information of 17 pigeon species (Columbidae, Aves) using DTSB after sequence trimming and alignment. SNPs were chosen which followed the rule of decision tree and species-specific SNP barcodes. The shortest barcode of about 11 bp was then generated for discriminating 17 pigeon species using the DTSB method. This method provides a sequence alignment and tree decision approach to parsimoniously assign a unique and shortest SNP barcode for any known species of a chosen monophyletic taxon where a barcoding sequence is available.
MODexplorer: an integrated tool for exploring protein sequence, structure and function relationships.

KAUST Repository

Kosinski, Jan; Barbato, Alessandro; Tramontano, Anna

2013-01-01

SUMMARY: MODexplorer is an integrated tool aimed at exploring the sequence, structural and functional diversity in protein families useful in homology modeling and in analyzing protein families in general. It takes as input either the sequence or the structure of a protein and provides alignments with its homologs along with a variety of structural and functional annotations through an interactive interface. The annotations include sequence conservation, similarity scores, ligand-, DNA- and RNA-binding sites, secondary structure, disorder, crystallographic structure resolution and quality scores of models implied by the alignments to the homologs of known structure. MODexplorer can be used to analyze sequence and structural conservation among the structures of similar proteins, to find structures of homologs solved in different conformational state or with different ligands and to transfer functional annotations. Furthermore, if the structure of the query is not known, MODexplorer can be used to select the modeling templates taking all this information into account and to build a comparative model. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://modorama.biocomputing.it/modexplorer. Website implemented in HTML and JavaScript with all major browsers supported. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MODexplorer: an integrated tool for exploring protein sequence, structure and function relationships.

KAUST Repository

Kosinski, Jan

2013-02-08

SUMMARY: MODexplorer is an integrated tool aimed at exploring the sequence, structural and functional diversity in protein families useful in homology modeling and in analyzing protein families in general. It takes as input either the sequence or the structure of a protein and provides alignments with its homologs along with a variety of structural and functional annotations through an interactive interface. The annotations include sequence conservation, similarity scores, ligand-, DNA- and RNA-binding sites, secondary structure, disorder, crystallographic structure resolution and quality scores of models implied by the alignments to the homologs of known structure. MODexplorer can be used to analyze sequence and structural conservation among the structures of similar proteins, to find structures of homologs solved in different conformational state or with different ligands and to transfer functional annotations. Furthermore, if the structure of the query is not known, MODexplorer can be used to select the modeling templates taking all this information into account and to build a comparative model. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://modorama.biocomputing.it/modexplorer. Website implemented in HTML and JavaScript with all major browsers supported. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Does the evolutionary conservation of microsatellite loci imply function?

Energy Technology Data Exchange (ETDEWEB)

Shriver, M.D.; Deka, R.; Ferrell, R.E. [Univ. of Pittsburgh, PA (United States)] [and others

1994-09-01

Microsatellites are highly polymorphic tandem arrays of short (1-6 bp) sequence motifs which have been found widely distributed in the genomes of all eukaryotes. We have analyzed allele frequency data on 16 microsatellite loci typed in the great apes (human, chimp, orangutan, and gorilla). The majority of these loci (13) were isolated from human genomic libraries; three were cloned from chimpanzee genomic DNA. Most of these loci are not only present in all apes species, but are polymorphic with comparable levels of heterozygosity and have alleles which overlap in size. The extent of divergence of allele frequencies among these four species were studies using the stepwise-weighted genetic distance (Dsw), which was previously shown to conform to linearity with evolutionary time since divergence for loci where mutations exist in a stepwise fashion. The phylogenetic tree of the great apes constructed from this distance matrix was consistent with the expected topology, with a high bootstrap confidence (82%) for the human/chimp clade. However, the allele frequency distributions of these species are 10 times more similar to each other than expected when they were calibrated with a conservative estimate of the time since separation of humans and the apes. These results are in agreement with sequence-based surveys of microsatellites which have demonstrated that they are highly (90%) conserved over short periods of evolutionary time (< 10 million years) and moderately (30%) conserved over long periods of evolutionary time (> 60-80 million years). This evolutionary conservation has prompted some authors to speculate that there are functional constraints on microsatellite loci. In contrast, the presence of directional bias of mutations with constraints and/or selection against aberrant sized alleles can explain these results.
Tigers of Sundarbans in India: is the population a separate conservation unit?

Science.gov (United States)

Singh, Sujeet Kumar; Mishra, Sudhanshu; Aspi, Jouni; Kvist, Laura; Nigam, Parag; Pandey, Puneet; Sharma, Reeta; Goyal, Surendra Prakash

2014-01-01

The Sundarbans tiger inhabits a unique mangrove habitat and are morphologically distinct from the recognized tiger subspecies in terms of skull morphometrics and body size. Thus, there is an urgent need to assess their ecological and genetic distinctiveness and determine if Sundarbans tigers should be defined and managed as separate conservation unit. We utilized nine microsatellites and 3 kb from four mitochondrial DNA (mtDNA) genes to estimate genetic variability, population structure, demographic parameters and visualize historic and contemporary connectivity among tiger populations from Sundarbans and mainland India. We also evaluated the traits that determine exchangeability or adaptive differences among tiger populations. Data from both markers suggest that Sundarbans tiger is not a separate tiger subspecies and should be regarded as Bengal tiger (P. t. tigris) subspecies. Maximum likelihood phylogenetic analyses of the mtDNA data revealed reciprocal monophyly. Genetic differentiation was found stronger for mtDNA than nuclear DNA. Microsatellite markers indicated low genetic variation in Sundarbans tigers (He= 0.58) as compared to other mainland populations, such as northern and Peninsular (Hebetween 0.67- 0.70). Molecular data supports migration between mainland and Sundarbans populations until very recent times. We attribute this reduction in gene flow to accelerated fragmentation and habitat alteration in the landscape over the past few centuries. Demographic analyses suggest that Sundarbans tigers have diverged recently from peninsular tiger population within last 2000 years. Sundarbans tigers are the most divergent group of Bengal tigers, and ecologically non-exchangeable with other tiger populations, and thus should be managed as a separate "evolutionarily significant unit" (ESU) following the adaptive evolutionary conservation (AEC) concept.
Tigers of Sundarbans in India: is the population a separate conservation unit?

Directory of Open Access Journals (Sweden)

Sujeet Kumar Singh

Full Text Available The Sundarbans tiger inhabits a unique mangrove habitat and are morphologically distinct from the recognized tiger subspecies in terms of skull morphometrics and body size. Thus, there is an urgent need to assess their ecological and genetic distinctiveness and determine if Sundarbans tigers should be defined and managed as separate conservation unit. We utilized nine microsatellites and 3 kb from four mitochondrial DNA (mtDNA genes to estimate genetic variability, population structure, demographic parameters and visualize historic and contemporary connectivity among tiger populations from Sundarbans and mainland India. We also evaluated the traits that determine exchangeability or adaptive differences among tiger populations. Data from both markers suggest that Sundarbans tiger is not a separate tiger subspecies and should be regarded as Bengal tiger (P. t. tigris subspecies. Maximum likelihood phylogenetic analyses of the mtDNA data revealed reciprocal monophyly. Genetic differentiation was found stronger for mtDNA than nuclear DNA. Microsatellite markers indicated low genetic variation in Sundarbans tigers (He= 0.58 as compared to other mainland populations, such as northern and Peninsular (Hebetween 0.67- 0.70. Molecular data supports migration between mainland and Sundarbans populations until very recent times. We attribute this reduction in gene flow to accelerated fragmentation and habitat alteration in the landscape over the past few centuries. Demographic analyses suggest that Sundarbans tigers have diverged recently from peninsular tiger population within last 2000 years. Sundarbans tigers are the most divergent group of Bengal tigers, and ecologically non-exchangeable with other tiger populations, and thus should be managed as a separate "evolutionarily significant unit" (ESU following the adaptive evolutionary conservation (AEC concept.
Tigers of Sundarbans in India: Is the Population a Separate Conservation Unit?

Science.gov (United States)

Singh, Sujeet Kumar; Mishra, Sudhanshu; Aspi, Jouni; Kvist, Laura; Nigam, Parag; Pandey, Puneet; Sharma, Reeta; Goyal, Surendra Prakash

2015-01-01

The Sundarbans tiger inhabits a unique mangrove habitat and are morphologically distinct from the recognized tiger subspecies in terms of skull morphometrics and body size. Thus, there is an urgent need to assess their ecological and genetic distinctiveness and determine if Sundarbans tigers should be defined and managed as separate conservation unit. We utilized nine microsatellites and 3 kb from four mitochondrial DNA (mtDNA) genes to estimate genetic variability, population structure, demographic parameters and visualize historic and contemporary connectivity among tiger populations from Sundarbans and mainland India. We also evaluated the traits that determine exchangeability or adaptive differences among tiger populations. Data from both markers suggest that Sundarbans tiger is not a separate tiger subspecies and should be regarded as Bengal tiger (P. t. tigris) subspecies. Maximum likelihood phylogenetic analyses of the mtDNA data revealed reciprocal monophyly. Genetic differentiation was found stronger for mtDNA than nuclear DNA. Microsatellite markers indicated low genetic variation in Sundarbans tigers (He= 0.58) as compared to other mainland populations, such as northern and Peninsular (Hebetween 0.67- 0.70). Molecular data supports migration between mainland and Sundarbans populations until very recent times. We attribute this reduction in gene flow to accelerated fragmentation and habitat alteration in the landscape over the past few centuries. Demographic analyses suggest that Sundarbans tigers have diverged recently from peninsular tiger population within last 2000 years. Sundarbans tigers are the most divergent group of Bengal tigers, and ecologically non-exchangeable with other tiger populations, and thus should be managed as a separate “evolutionarily significant unit” (ESU) following the adaptive evolutionary conservation (AEC) concept. PMID:25919139
The Mediator complex of Caenorhabditis elegans: insights into the developmental and physiological roles of a conserved transcriptional coregulator.

Science.gov (United States)

Grants, Jennifer M; Goh, Grace Y S; Taubert, Stefan

2015-02-27

The Mediator multiprotein complex ('Mediator') is an important transcriptional coregulator that is evolutionarily conserved throughout eukaryotes. Although some Mediator subunits are essential for the transcription of all protein-coding genes, others influence the expression of only subsets of genes and participate selectively in cellular signaling pathways. Here, we review the current knowledge of Mediator subunit function in the nematode Caenorhabditis elegans, a metazoan in which established and emerging genetic technologies facilitate the study of developmental and physiological regulation in vivo. In this nematode, unbiased genetic screens have revealed critical roles for Mediator components in core developmental pathways such as epidermal growth factor (EGF) and Wnt/β-catenin signaling. More recently, important roles for C. elegans Mediator subunits have emerged in the regulation of lipid metabolism and of systemic stress responses, engaging conserved transcription factors such as nuclear hormone receptors (NHRs). We emphasize instances where similar functions for individual Mediator subunits exist in mammals, highlighting parallels between Mediator subunit action in nematode development and in human cancer biology. We also discuss a parallel between the association of the Mediator subunit MED12 with several human disorders and the role of its C. elegans ortholog mdt-12 as a regulatory hub that interacts with numerous signaling pathways. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Myotonin protein-kinase [AGC]n trinucleotide repeat in seven nonhuman primates

Energy Technology Data Exchange (ETDEWEB)

Novelli, G.; Sineo, L.; Pontieri, E. [Catholic Univ. of Rome (Italy)]|[Univ. of Milan (Italy)]|[Univ. Florence (Italy)] [and others

1994-09-01

Myotonic dystrophy (DM) is due to a genomic instability of a trinucleotide [AGC]n motif, located at the 3{prime} UTR region of a protein-kinase gene (myotonin protein kinase, MT-PK). The [AGC] repeat is meiotically and mitotically unstable, and it is directly related to the manifestations of the disorder. Although a gene dosage effect of the MT-PK has been demonstrated n DM muscle, the mechanism(s) by which the intragenic repeat expansion leads to disease is largely unknown. This non-standard mutational event could reflect an evolutionary mechanism widespread among animal genomes. We have isolated and sequenced the complete 3{prime}UTR region of the MT-PK gene in seven primates (macaque, orangutan, gorilla, chimpanzee, gibbon, owl monkey, saimiri), and examined by comparative sequence nucleotide analysis the [AGC]n intragenic repeat and the surrounding nucleotides. The genomic organization, including the [AGC]n repeat structure, was conserved in all examined species, excluding the gibbon (Hylobates agilis), in which the [AGC]n upstream sequence (GGAA) is replaced by a GA dinucleotide. The number of [AGC]n in the examined species ranged between 7 (gorilla) and 13 repeats (owl monkeys), with a polymorphism informative content (PIC) similar to that observed in humans. These results indicate that the 3{prime}UTR [AGC] repeat within the MT-PK gene is evolutionarily conserved, supporting that this region has important regulatory functions.
Plastome Sequence Determination and Comparative Analysis for Members of the Lolium-Festuca Grass Species Complex

Science.gov (United States)

Hand, Melanie L.; Spangenberg, German C.; Forster, John W.; Cogan, Noel O. I.

2013-01-01

Chloroplast genome sequences are of broad significance in plant biology, due to frequent use in molecular phylogenetics, comparative genomics, population genetics, and genetic modification studies. The present study used a second-generation sequencing approach to determine and assemble the plastid genomes (plastomes) of four representatives from the agriculturally important Lolium-Festuca species complex of pasture grasses (Lolium multiflorum, Festuca pratensis, Festuca altissima, and Festuca ovina). Total cellular DNA was extracted from either roots or leaves, was sequenced, and the output was filtered for plastome-related reads. A comparison between sources revealed fewer plastome-related reads from root-derived template but an increase in incidental bacterium-derived sequences. Plastome assembly and annotation indicated high levels of sequence identity and a conserved organization and gene content between species. However, frequent deletions within the F. ovina plastome appeared to contribute to a smaller plastid genome size. Comparative analysis with complete plastome sequences from other members of the Poaceae confirmed conservation of most grass-specific features. Detailed analysis of the rbcL–psaI intergenic region, however, revealed a “hot-spot” of variation characterized by independent deletion events. The evolutionary implications of this observation are discussed. The complete plastome sequences are anticipated to provide the basis for potential organelle-specific genetic modification of pasture grasses. PMID:23550121
Discovery of cis-elements between sorghum and rice using co-expression and evolutionary conservation

Directory of Open Access Journals (Sweden)

Haberer Georg

2009-06-01

Full Text Available Abstract Background The spatiotemporal regulation of gene expression largely depends on the presence and absence of cis-regulatory sites in the promoter. In the economically highly important grass family, our knowledge of transcription factor binding sites and transcriptional networks is still very limited. With the completion of the sorghum genome and the available rice genome sequence, comparative promoter analyses now allow genome-scale detection of conserved cis-elements. Results In this study, we identified thousands of phylogenetic footprints conserved between orthologous rice and sorghum upstream regions that are supported by co-expression information derived from three different rice expression data sets. In a complementary approach, cis-motifs were discovered by their highly conserved co-occurrence in syntenic promoter pairs. Sequence conservation and matches to known plant motifs support our findings. Expression similarities of gene pairs positively correlate with the number of motifs that are shared by gene pairs and corroborate the importance of similar promoter architectures for concerted regulation. This strongly suggests that these motifs function in the regulation of transcript levels in rice and, presumably also in sorghum. Conclusion Our work provides the first large-scale collection of cis-elements for rice and sorghum and can serve as a paradigm for cis-element analysis through comparative genomics in grasses in general.
Phylogenomics of Phrynosomatid Lizards: Conflicting Signals from Sequence Capture versus Restriction Site Associated DNA Sequencing

Science.gov (United States)

Leaché, Adam D.; Chavez, Andreas S.; Jones, Leonard N.; Grummer, Jared A.; Gottscho, Andrew D.; Linkem, Charles W.

2015-01-01

Sequence capture and restriction site associated DNA sequencing (RADseq) are popular methods for obtaining large numbers of loci for phylogenetic analysis. These methods are typically used to collect data at different evolutionary timescales; sequence capture is primarily used for obtaining conserved loci, whereas RADseq is designed for discovering single nucleotide polymorphisms (SNPs) suitable for population genetic or phylogeographic analyses. Phylogenetic questions that span both “recent” and “deep” timescales could benefit from either type of data, but studies that directly compare the two approaches are lacking. We compared phylogenies estimated from sequence capture and double digest RADseq (ddRADseq) data for North American phrynosomatid lizards, a species-rich and diverse group containing nine genera that began diversifying approximately 55 Ma. Sequence capture resulted in 584 loci that provided a consistent and strong phylogeny using concatenation and species tree inference. However, the phylogeny estimated from the ddRADseq data was sensitive to the bioinformatics steps used for determining homology, detecting paralogs, and filtering missing data. The topological conflicts among the SNP trees were not restricted to any particular timescale, but instead were associated with short internal branches. Species tree analysis of the largest SNP assembly, which also included the most missing data, supported a topology that matched the sequence capture tree. This preferred phylogeny provides strong support for the paraphyly of the earless lizard genera Holbrookia and Cophosaurus, suggesting that the earless morphology either evolved twice or evolved once and was subsequently lost in Callisaurus. PMID:25663487
The identification and functional annotation of RNA structures conserved in vertebrates.

Science.gov (United States)

Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan

2017-08-01

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.
Genetic diversity in breonadia salicina based on intra-species sequence variation of chloroplast dna spacer sequence

International Nuclear Information System (INIS)

Qurainy, F.A.; Gaafar, A.R.Z.

2014-01-01

Assessment and knowledge of the genetic diversity and variation within and between populations of rare and endangered plants is very important for effective conservation. Intergenic spacer sequences variation of psbA-trnH locus of chloroplast genome was assessed within Breonadia salicina (Rubiaceae), a critically endangered and endemic plant species to South western part of Kingdom of Saudi Arabia. The obtained sequence data from 19 individuals in three populations revealed nine haplotypes. The aligned sequences obtained from the overall Saudi accessions extended to 355 bp, revealing nine haplotypes. A high level of haplotype diversity (Hd = 0.842) and low level of nucleotide diversity (Pi = 0.0058) were detected. Consistently, both hierarchical analysis of molecular variance (AMOVA) and constructed neighbor-joining tree indicated null genetic differentiation among populations. This level of differentiation between populations or between regions in psbA-trnH sequences may be due to effects of the abundance of ancestral haplotype sharing and the presence of private haplotypes fixed for each population. Furthermore, the results revealed almost the same level of genetic diversity in comparison with Yemeni accessions, in which Saudi accessions were sharing three haplotypes from the four haplotypes found in Yemeni accessions. (author)
A Bioinformatic Pipeline for Monitoring of the Mutational Stability of Viral Drug Targets with Deep-Sequencing Technology

Directory of Open Access Journals (Sweden)

Yuri Kravatsky

2017-11-01

Full Text Available The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs, requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s. Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s. The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi targets in human immunodeficiency virus 1 (HIV-1 subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
Two different groups of signal sequence in M-superfamily conotoxins.

Science.gov (United States)

Wang, Qi; Jiang, Hui; Han, Yu-Hong; Yuan, Duo-Duo; Chi, Cheng-Wu

2008-04-01

M-superfamily conotoxins can be divided into four branches (M-1, M-2, M-3 and M-4) according to the number of amino acid residues in the third Cys loop. In general, it is widely accepted that the conotoxin signal peptides of each superfamily are strictly conserved. Recently, we cloned six cDNAs of novel M-superfamily conotoxins from Conus leopardus, Conus marmoreus and Conus quercinus, belonging to either M-1 or M-3 branch. These conotoxins, judging from the putative peptide sequences deducted from cDNAs, are rich in acidic residues and share highly conserved signal and pro-peptide region. However, they are quite different from the reported conotoxins of M-2 and M-4 branches even in their signal peptides, which in general are considered highly conserved for each superfamily of conotoxins. The signal sequences of M-1 and M-3 conotoxins composed of 24 residues start with MLKMGVVL-, while those of M-2 and M-4 conotoxins composed of 25 residues start with MMSKLGVL-. It is another example that different types of signal peptides can exist within a superfamily besides the I-conotoxin superfamily. In addition to the different disulfide connectivity of M-1 conotoxins from that of M-4 or M-2 conotoxins, the sequence alignment, preferential Cys codon usage and phylogenetic tree analysis suggest that M-1 and M-3 conotoxins have much closer relationship, being different from the conotoxins of other two branches (M-4 and M-2) of M-superfamily.
Energy Conservation in Optical Fibers With Distributed Brick-Walls Filters

Science.gov (United States)

Garcia, Javier; Ghozlan, Hassan; Kramer, Gerhard

2018-05-01

A band-pass filtering scheme is proposed to mitigate spectral broadening and channel coupling in the Nonlinear Schr\\"odinger (NLS) fiber optic channel. The scheme is modeled by modifying the NLS Equation to include an attenuation profile with multiple brick-wall filters centered at different frequencies. It is shown that this brick-walls profile conserves the total in-band energy of the launch signal. Furthermore, energy fluctuations between the filtered channels are characterized, and conditions on the channel spacings are derived that ensure energy conservation in each channel. The maximum spectral efficiency of such a system is derived, and a constructive rule for achieving it using Sidon sequences is provided.
Conservation of the LexA repressor binding site in Deinococcus radiodurans

Directory of Open Access Journals (Sweden)

Khan Feroz

2008-03-01

Full Text Available The LexA protein is a transcriptional repressor of the bacterial SOS DNA repair system, which comprises a set of DNA repair and cellular survival genes that are induced in response to DNA damage. Its varied DNA binding motifs have been characterized and reported in the Escherichia coli, Bacillus subtilis, rhizobia family members, marine magnetotactic bacterium, Salmonella typhimurium and recently in Mycobacterium tuberculosis and this motifs information has been used in our theoretical analysis to detect its novel regulated genes in radio-resistant Deinococcus radiodurans genome. This bacterium showed presence of SOS-box like consensus sequence in the upstream sequences of 3166 genes with >60% motif score similarity percentage (MSSP on both strands. Attempts to identify LexA-binding sites and the composition of the putative SOS regulon in D. radiodurans have been unsuccessful so far. To resolve the problem we performed theoretical analysis with modifications on reported data set of genes related to DNA repair (61 genes, stress response (145 genes and some unusual predicted operons (21 clusters. Expression of some of the predicted SOS-box regulated operon members then was examined through the previously reported microarray data which confirm the expression of only single predicted operon i.e. DRB0143 (AAA superfamily NTPase related to 5-methylcytosine specific restriction enzyme subunit McrB and DRB0144 (homolog of the McrC subunit of the McrBC restriction modification system. The methodology involved weight matrix construction through CONSENSUS algorithm using information of conserved upstream sequences of eight known genes including dinB, tagC, lexA, recA, uvrB, yneA of B. subtilis while lexA and recA of D. radiodurans through phylogenetic footprinting method and later detection of similar conserved SOS-box like LexA binding motifs through both RSAT & PoSSuMsearch programs. The resultant DNA consensus sequence had highly conserved 14 bp SOS
ChIP-seq Identification of Weakly Conserved Heart Enhancers

Energy Technology Data Exchange (ETDEWEB)

Blow, Matthew J.; McCulley, David J.; Li, Zirong; Zhang, Tao; Akiyama, Jennifer A.; Holt, Amy; Plajzer-Frick, Ingrid; Shoukry, Malak; Wright, Crystal; Chen, Feng; Afzal, Veena; Bristow, James; Ren, Bing; Black, Brian L.; Rubin, Edward M.; Visel, Axel; Pennacchio, Len A.

2010-07-01

Accurate control of tissue-specific gene expression plays a pivotal role in heart development, but few cardiac transcriptional enhancers have thus far been identified. Extreme non-coding sequence conservation successfully predicts enhancers active in many tissues, but fails to identify substantial numbers of heart enhancers. Here we used ChIP-seq with the enhancer-associated protein p300 from mouse embryonic day 11.5 heart tissue to identify over three thousand candidate heart enhancers genome-wide. Compared to other tissues studied at this time-point, most candidate heart enhancers are less deeply conserved in vertebrate evolution. Nevertheless, the testing of 130 candidate regions in a transgenic mouse assay revealed that most of them reproducibly function as enhancers active in the heart, irrespective of their degree of evolutionary constraint. These results provide evidence for a large population of poorly conserved heart enhancers and suggest that the evolutionary constraint of embryonic enhancers can vary depending on tissue type.
An online conserved SSR discovery through cross-species comparison

Directory of Open Access Journals (Sweden)

Tun-Wen Pai

2009-02-01

Full Text Available Tun-Wen Pai1, Chien-Ming Chen1, Meng-Chang Hsiao1, Ronshan Cheng2, Wen-Shyong Tzou3, Chin-Hua Hu31Department of Computer Science and Engineering; 2Department of Aquaculture, 3Institute of Bioscience and Biotechnology, National Taiwan Ocean University, Keelung, Taiwan, Republic of ChinaAbstract: Simple sequence repeats (SSRs play important roles in gene regulation and genome evolution. Although there exist several online resources for SSR mining, most of them only extract general SSR patterns without providing functional information. Here, an online search tool, CG-SSR (Comparative Genomics SSR discovery, has been developed for discovering potential functional SSRs from vertebrate genomes through cross-species comparison. In addition to revealing SSR candidates in conserved regions among various species, it also combines accurate coordinate and functional genomics information. CG-SSR is the first comprehensive and efficient online tool for conserved SSR discovery.Keywords: microsatellites, genome, comparative genomics, functional SSR, gene ontology, conserved region
Dynamic Epigenetic Control of Highly Conserved Noncoding Elements

KAUST Repository

Seridi, Loqmane

2014-10-07

Background Many noncoding genomic loci have remained constant over long evolutionary periods, suggesting that they are exposed to strong selective pressures. The molecular functions of these elements have been partially elucidated, but the fundamental reason for their extreme conservation is still unknown. Results To gain new insights into the extreme selection of highly conserved noncoding elements (HCNEs), we used a systematic analysis of multi-omic data to study the epigenetic regulation of such elements during the development of Drosophila melanogaster. At the sequence level, HCNEs are GC-rich and have a characteristic oligomeric composition. They have higher levels of stable nucleosome occupancy than their flanking regions, and lower levels of mononucleosomes and H3.3, suggesting that these regions reside in compact chromatin. Furthermore, these regions showed remarkable modulations in histone modification and the expression levels of adjacent genes during development. Although HCNEs are primarily initiated late in replication, about 10% were related to early replication origins. Finally, HCNEs showed strong enrichment within lamina-associated domains. Conclusion HCNEs have distinct and protective sequence properties, undergo dynamic epigenetic regulation, and appear to be associated with the structural components of the chromatin, replication origins, and nuclear matrix. These observations indicate that such elements are likely to have essential cellular functions, and offer insights into their epigenetic properties.
Dynamic Epigenetic Control of Highly Conserved Noncoding Elements

KAUST Repository

Seridi, Loqmane; Ryu, Tae Woo; Ravasi, Timothy

2014-01-01

Background Many noncoding genomic loci have remained constant over long evolutionary periods, suggesting that they are exposed to strong selective pressures. The molecular functions of these elements have been partially elucidated, but the fundamental reason for their extreme conservation is still unknown. Results To gain new insights into the extreme selection of highly conserved noncoding elements (HCNEs), we used a systematic analysis of multi-omic data to study the epigenetic regulation of such elements during the development of Drosophila melanogaster. At the sequence level, HCNEs are GC-rich and have a characteristic oligomeric composition. They have higher levels of stable nucleosome occupancy than their flanking regions, and lower levels of mononucleosomes and H3.3, suggesting that these regions reside in compact chromatin. Furthermore, these regions showed remarkable modulations in histone modification and the expression levels of adjacent genes during development. Although HCNEs are primarily initiated late in replication, about 10% were related to early replication origins. Finally, HCNEs showed strong enrichment within lamina-associated domains. Conclusion HCNEs have distinct and protective sequence properties, undergo dynamic epigenetic regulation, and appear to be associated with the structural components of the chromatin, replication origins, and nuclear matrix. These observations indicate that such elements are likely to have essential cellular functions, and offer insights into their epigenetic properties.
Strong conservation of rhoptry-associated-protein-1 (RAP-1) locus organization and sequence among Babesia isolates infecting sheep from China (Babesia motasi-like phylogenetic group).

Science.gov (United States)

Niu, Qingli; Valentin, Charlotte; Bonsergent, Claire; Malandrin, Laurence

2014-12-01

Rhoptry-associated-protein 1 (RAP-1) is considered as a potential vaccine candidate due to its involvement in red blood cell invasion by parasites in the genus Babesia. We examined its value as a vaccine candidate by studying RAP-1 conservation in isolates of Babesia sp. BQ1 Ningxian, Babesia sp. Tianzhu and Babesia sp. Hebei, responsible for ovine babesiosis in different regions of China. The rap-1 locus in these isolates has very similar features to those described for Babesia sp. BQ1 Lintan, another Chinese isolate also in the B. motasi-like phylogenetic group, namely the presence of three types of rap-1 genes (rap-1a, rap-1b and rap-1c), multiple conserved rap-1b copies (5) interspaced with more or less variable rap-1a copies (6), and the 3' localization of one rap-1c. The isolates Babesia sp. Tianzhu, Babesia sp. BQ1 Lintan and Ningxian were almost identical (average nucleotide identity of 99.9%) over a putative locus of about 31 Kb, including the intergenic regions. Babesia sp. Hebei showed a similar locus organization but differed in the rap-1 locus sequence, for each gene and intergenic region, with an average nucleotide identity of 78%. Our results are in agreement with 18S rDNA phylogenetic studies performed on these isolates. However, in extremely closely related isolates the rap-1 locus seems more conserved (99.9%) than the 18S rDNA (98.7%), whereas in still closely related isolates the identities are much lower (78%) compared with the 18S rDNA (97.7%). The particularities of the rap-1 locus in terms of evolution, phylogeny, diagnosis and vaccine development are discussed. Copyright © 2014 The Authors. Published by Elsevier B.V. All rights reserved.
Evolutionarily conserved TCR binding sites, identification of T cells in primary lymphoid tissues, and surprising trans-rearrangements in nurse shark.

Science.gov (United States)

Criscitiello, Michael F; Ohta, Yuko; Saltis, Mark; McKinney, E Churchill; Flajnik, Martin F

2010-06-15

Cartilaginous fish are the oldest animals that generate RAG-based Ag receptor diversity. We have analyzed the genes and expressed transcripts of the four TCR chains for the first time in a cartilaginous fish, the nurse shark (Ginglymostoma cirratum). Northern blotting found TCR mRNA expression predominantly in lymphoid and mucosal tissues. Southern blotting suggested translocon-type loci encoding all four chains. Based on diversity of V and J segments, the expressed combinatorial diversity for gamma is similar to that of human, alpha and beta may be slightly lower, and delta diversity is the highest of any organism studied to date. Nurse shark TCRdelta have long CDR3 loops compared with the other three chains, creating binding site topologies comparable to those of mammalian TCR in basic paratope structure; additionally, nurse shark TCRdelta CDR3 are more similar to IgH CDR3 in length and heterogeneity than to other TCR chains. Most interestingly, several cDNAs were isolated that contained IgM or IgW V segments rearranged to other gene segments of TCRdelta and alpha. Finally, in situ hybridization experiments demonstrate a conservation of both alpha/beta and gamma/delta T cell localization in the thymus across 450 million years of vertebrate evolution, with gamma/delta TCR expression especially high in the subcapsular region. Collectively, these data make the first cellular identification of TCR-expressing lymphocytes in a cartilaginous fish.
A lower isoelectric point increases signal sequence-mediated secretion of recombinant proteins through a bacterial ABC transporter.

Science.gov (United States)

Byun, Hyunjong; Park, Jiyeon; Kim, Sun Chang; Ahn, Jung Hoon

2017-12-01

Efficient protein production for industrial and academic purposes often involves engineering microorganisms to produce and secrete target proteins into the culture. Pseudomonas fluorescens has a TliDEF ATP-binding cassette transporter, a type I secretion system, which recognizes C-terminal LARD3 signal sequence of thermostable lipase TliA. Many proteins are secreted by TliDEF in vivo when recombined with LARD3, but there are still others that cannot be secreted by TliDEF even when LARD3 is attached. However, the factors that determine whether or not a recombinant protein can be secreted through TliDEF are still unknown. Here, we recombined LARD3 with several proteins and examined their secretion through TliDEF. We found that the proteins secreted via LARD3 are highly negatively charged with highly-acidic isoelectric points (pI) lower than 5.5. Attaching oligo-aspartate to lower the pI of negatively-charged recombinant proteins improved their secretion, and attaching oligo-arginine to negatively-charged proteins blocked their secretion by LARD3. In addition, negatively supercharged green fluorescent protein (GFP) showed improved secretion, whereas positively supercharged GFP did not secrete. These results disclosed that proteins' acidic pI and net negative charge are major factors that determine their secretion through TliDEF. Homology modeling for TliDEF revealed that TliD dimer forms evolutionarily-conserved positively-charged clusters in its pore and substrate entrance site, which also partially explains the pI dependence of the TliDEF-dependent secretions. In conclusion, lowering the isoelectric point improved LARD3-mediated protein secretion, both widening the range of protein targets for efficient production via secretion and signifying an important aspect of ABC transporter-mediated secretions. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Acetylornithine deacetylase, succinyldiaminopimelate desuccinylase and carboxypeptidase G2 are evolutionarily related.

Science.gov (United States)

Boyen, A; Charlier, D; Charlier, J; Sakanyan, V; Mett, I; Glansdorff, N

1992-07-01

The nucleotide (nt) sequence of the Escherichia coli argE gene, encoding the acetylornithine deacetylase (AO) subunit, has been established and corresponds to a 43-kDa (M(r) 42,320) polypeptide. The enzyme has been purified to near homogeneity and it appears to be a dimer consisting of two 43-kDa subunits. The amino acid sequence deduced from the nt sequence was compared to that of the subunit of E. coli succinyldiaminopimelate desuccinylase (the dapE gene product involved in the diaminopimelate pathway for lysine biosynthesis), since both enzymes share functional and biochemical features. Significant similarity covering the entire sequence allows us to infer a common origin for both deacylases. This homology extends to the Pseudomonas sp. G2 carboxypeptidase (G2CP); this or a functionally related enzyme may be responsible for the minor AO activity found in organisms relying on ornithine acetyltransferase for ornithine biosynthesis.
Examining the Conservation of Kinks in Alpha Helices.

Directory of Open Access Journals (Sweden)

Eleanor C Law

Full Text Available Kinks are a structural feature of alpha-helices and many are known to have functional roles. Kinks have previously tended to be defined in a binary fashion. In this paper we have deliberately moved towards defining them on a continuum, which given the unimodal distribution of kink angles is a better description. From this perspective, we examine the conservation of kinks in proteins. We find that kink angles are not generally a conserved property of homologs, pointing either to their not being functionally critical or to their function being related to conformational flexibility. In the latter case, the different structures of homologs are providing snapshots of different conformations. Sequence identity between homologous helices is informative in terms of kink conservation, but almost equally so is the sequence identity of residues in spatial proximity to the kink. In the specific case of proline, which is known to be prevalent in kinked helices, loss of a proline from a kinked helix often also results in the loss of a kink or reduction in its kink angle. We carried out a study of the seven transmembrane helices in the GPCR family and found that changes in kinks could be related both to subfamilies of GPCRs and also, in a particular subfamily, to the binding of agonists or antagonists. These results suggest conformational change upon receptor activation within the GPCR family. We also found correlation between kink angles in different helices, and the possibility of concerted motion could be investigated further by applying our method to molecular dynamics simulations. These observations reinforce the belief that helix kinks are key, functional, flexible points in structures.
Crystal structure of AFV3-109, a highly conserved protein from crenarchaeal viruses

Directory of Open Access Journals (Sweden)

Quevillon-Cheruel Sophie

2007-01-01

Full Text Available Abstract The extraordinary morphologies of viruses infecting hyperthermophilic archaea clearly distinguish them from bacterial and eukaryotic viruses. Moreover, their genomes code for proteins that to a large extend have no related sequences in the extent databases. However, a small pool of genes is shared by overlapping subsets of these viruses, and the most conserved gene, exemplified by the ORF109 of the Acidianus Filamentous Virus 3, AFV3, is present on genomes of members of three viral familes, the Lipothrixviridae, Rudiviridae, and "Bicaudaviridae", as well as of the unclassified Sulfolobus Turreted Icosahedral Virus, STIV. We present here the crystal structure of the protein (Mr = 13.1 kD, 109 residues encoded by the AFV3 ORF 109 in two different crystal forms at 1.5 and 1.3 Å resolution. The structure of AFV3-109 is a five stranded β-sheet with loops on one side and three helices on the other. It forms a dimer adopting the shape of a cradle that encompasses the best conserved regions of the sequence. No protein with a related fold could be identified except for the ortholog from STIV1, whose structure was deposited at the Protein Data Bank. We could clearly identify a well bound glycerol inside the cradle, contacting exclusively totally conserved residues. This interaction was confirmed in solution by fluorescence titration. Although the function of AFV3-109 cannot be deduced directly from its structure, structural homology with the STIV1 protein, and the size and charge distribution of the cavity suggested it could interact with nucleic acids. Fluorescence quenching titrations also showed that AFV3-109 interacts with dsDNA. Genomic sequence analysis revealed bacterial homologs of AFV3-109 as a part of a putative previously unidentified prophage sequences in some Firmicutes.

A Novel Missense Mutation of Doublecortin: Mutation Analysis of Korean Patients with Subcortical Band Heterotopia

Science.gov (United States)

Kim, Myeong-Kyu; Park, Man-Seok; Kim, Byeong-Chae; Cho, Ki-Hyun; Kim, Young-Seon; Kim, Jin-Hee; Heo, Tag; Kim, Eun-Young

2005-01-01

The neuronal migration disorders, X-linked lissencephaly syndrome (XLIS) and subcortical band heterotopia (SBH), also called "double cortex", have been linked to missense, nonsense, aberrant splicing, deletion, and insertion mutations in doublecortin (DCX) in families and sporadic cases. Most DCX mutations identified to date are located in two evolutionarily conserved domains. We performed mutation analysis of DCX in two Korean patients with SBH. The SBH patients had mild to moderate developmental delays, drug-resistant generalized seizures, and diffuse thick SBH upon brain MRI. Sequence analysis of the DCX coding region in Patient 1 revealed a c.386 C>T change in exon 3. The sequence variation results in a serine to leucine amino acid change at position 129 (S129L), which has not been found in other family members of Patient 1 or in a large panel of 120 control X-chromosomes. We report here a novel c.386 C>T mutation of DCX that is responsible for SBH. PMID:16100463
Population structure of the endangered franciscana dolphin (Pontoporia blainvillei: reassessing management units.

Directory of Open Access Journals (Sweden)

Haydée A Cunha

Full Text Available Franciscanas are the most endangered dolphins in the Southwestern Atlantic. Due to their coastal and estuarine habits, franciscanas suffer from extensive fisheries bycatch, as well as from habitat loss and degradation. Four Franciscana Management Areas (FMA, proposed based on biology, demography, morphology and genetic data, were incorporated into management planning and in the delineation of research efforts. We re-evaluated that proposal through the analysis of control region sequences from franciscanas throughout their distribution range (N = 162, including novel sequences from the northern limit of the species and two other previously unsampled localities in Brazil. A deep evolutionary break was observed between franciscanas from the northern and southern portions of the species distribution, indicating that they must be managed as two Evolutionarily Significant Units (ESU. Furthermore, additional FMAs should be recognised to accommodate the genetic differentiation found in each ESU. These results have immediate consequences for the conservation and management of this endangered species.
Cell density-dependent nuclear/cytoplasmic localization of NORPEG (RAI14) protein

International Nuclear Information System (INIS)

Kutty, R. Krishnan; Chen, Shanyi; Samuel, William; Vijayasarathy, Camasamudram; Duncan, Todd; Tsai, Jen-Yue; Fariss, Robert N.; Carper, Deborah; Jaworski, Cynthia; Wiggert, Barbara

2006-01-01

NORPEG (RAI14), a developmentally regulated gene induced by retinoic acid, encodes a 980 amino acid (aa) residue protein containing six ankyrin repeats and a long coiled-coil domain [Kutty et al., J. Biol. Chem. 276 (2001), pp. 2831-2840]. We have expressed aa residues 1-287 of NORPEG and used the recombinant protein to produce an anti-NORPEG polyclonal antibody. Confocal immunofluorescence analysis showed that the subcellular localization of NORPEG in retinal pigment epithelial (ARPE-19) cells varies with cell density, with predominantly nuclear localization in nonconfluent cells, but a cytoplasmic localization, reminiscent of cytoskeleton, in confluent cultures. Interestingly, an evolutionarily conserved putative monopartite nuclear localization signal (P 27 KKRKAP 276 ) was identified by analyzing the sequences of NORPEG and its orthologs. GFP-NORPEG (2-287 aa), a fusion protein containing this signal, was indeed localized to nuclei when expressed in ARPE-19 or COS-7 cells. Deletion and mutation analysis indicated that the identified nuclear localization sequence is indispensable for nuclear targeting
Alterations of MicroRNAs in Solid Cancers and Their Prognostic Value

International Nuclear Information System (INIS)

Chira, Panagiota; Vareli, Katerina; Sainis, Ioannis; Papandreou, Christos; Briasoulis, Evangelos

2010-01-01

MicroRNAs (miRNAs) are evolutionarily conserved, naturally abundant, small, regulatory non-coding RNAs that inhibit gene expression at the post-transcriptional level in a sequence-specific manner. Each miRNA represses the protein expression of several coding genes in a manner proportional to the sequence complementarity with the target transcripts. MicroRNAs play key regulatory roles in organismal development and homeostasis. They control fundamental biological processes, such as stem-cell regulation and cellular metabolism, proliferation, differentiation, stress resistance, and apoptosis. Differential miRNA expression is found in malignant tumors in comparison to normal tissue counterparts. This indicates that miRNA deregulation contributes to the initiation and progression of cancer. Currently, miRNA expression signatures are being rigorously investigated in various tumor types, with the aim of developing novel, efficient biomarkers that can improve clinical management of cancer patients. This review discusses deregulated miRNAs in solid tumors, and focuses on their emerging prognostic potential
Direct uptake and degradation of DNA by lysosomes

Science.gov (United States)

Fujiwara, Yuuki; Kikuchi, Hisae; Aizawa, Shu; Furuta, Akiko; Hatanaka, Yusuke; Konya, Chiho; Uchida, Kenko; Wada, Keiji; Kabuta, Tomohiro

2013-01-01

Lysosomes contain various hydrolases that can degrade proteins, lipids, nucleic acids and carbohydrates. We recently discovered “RNautophagy,” an autophagic pathway in which RNA is directly taken up by lysosomes and degraded. A lysosomal membrane protein, LAMP2C, a splice variant of LAMP2, binds to RNA and acts as a receptor for this pathway. In the present study, we show that DNA is also directly taken up by lysosomes and degraded. Like RNautophagy, this autophagic pathway, which we term “DNautophagy,” is dependent on ATP. The cytosolic sequence of LAMP2C also directly interacts with DNA, and LAMP2C functions as a receptor for DNautophagy, in addition to RNautophagy. Similarly to RNA, DNA binds to the cytosolic sequences of fly and nematode LAMP orthologs. Together with the findings of our previous study, our present findings suggest that RNautophagy and DNautophagy are evolutionarily conserved systems in Metazoa. PMID:23839276
An aureobasidin A resistance gene isolated from Aspergillus is a homolog of yeast AUR1, a gene responsible for inositol phosphorylceramide (IPC) synthase activity.

Science.gov (United States)

Kuroda, M; Hashida-Okado, T; Yasumoto, R; Gomi, K; Kato, I; Takesako, K

1999-03-01

The AUR1 gene of Saccharomyces cerevisiae, mutations in which confer resistance to the antibiotic aureobasidin A, is necessary for inositol phosphorylceramide (IPC) synthase activity. We report the molecular cloning and characterization of the Aspergillus nidulans aurA gene, which is homologous to AUR1. A single point mutation in the aurA gene of A. nidulans confers a high level of resistance to aureobasidin A. The A. nidulans aurA gene was used to identify its homologs in other Aspergillus species, including A. fumigatus, A. niger, and A. oryzae. The deduced amino acid sequence of an aurA homolog from the pathogenic fungus A. fumigatus showed 87% identity to that of A. nidulans. The AurA proteins of A. nidulans and A. fumigatus shared common characteristics in primary structure, including sequence, hydropathy profile, and N-glycosylation sites, with their S. cerevisiae, Schizosaccharomyces pombe, and Candida albicans counterparts. These results suggest that the aureobasidin resistance gene is conserved evolutionarily in various fungi.
Evolution of the C-Type Lectin-Like Receptor Genes of the DECTIN-1 Cluster in the NK Gene Complex

Directory of Open Access Journals (Sweden)

Susanne Sattler

2012-01-01

Full Text Available Pattern recognition receptors are crucial in initiating and shaping innate and adaptive immune responses and often belong to families of structurally and evolutionarily related proteins. The human C-type lectin-like receptors encoded in the DECTIN-1 cluster within the NK gene complex contain prominent receptors with pattern recognition function, such as DECTIN-1 and LOX-1. All members of this cluster share significant homology and are considered to have arisen from subsequent gene duplications. Recent developments in sequencing and the availability of comprehensive sequence data comprising many species showed that the receptors of the DECTIN-1 cluster are not only homologous to each other but also highly conserved between species. Even in Caenorhabditis elegans, genes displaying homology to the mammalian C-type lectin-like receptors have been detected. In this paper, we conduct a comprehensive phylogenetic survey and give an up-to-date overview of the currently available data on the evolutionary emergence of the DECTIN-1 cluster genes.
Packaging of Mason-Pfizer monkey virus (MPMV) genomic RNA depends upon conserved long-range interactions (LRIs) between U5 and gag sequences.

Science.gov (United States)

Kalloush, Rawan M; Vivet-Boudou, Valérie; Ali, Lizna M; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A

2016-06-01

MPMV has great potential for development as a vector for gene therapy. In this respect, precisely defining the sequences and structural motifs that are important for dimerization and packaging of its genomic RNA (gRNA) are of utmost importance. A distinguishing feature of the MPMV gRNA packaging signal is two phylogenetically conserved long-range interactions (LRIs) between U5 and gag complementary sequences, LRI-I and LRI-II. To test their biological significance in the MPMV life cycle, we introduced mutations into these structural motifs and tested their effects on MPMV gRNA packaging and propagation. Furthermore, we probed the structure of key mutants using SHAPE (selective 2'hydroxyl acylation analyzed by primer extension). Disrupting base-pairing of the LRIs affected gRNA packaging and propagation, demonstrating their significance to the MPMV life cycle. A double mutant restoring a heterologous LRI-I was fully functional, whereas a similar LRI-II mutant failed to restore gRNA packaging and propagation. These results demonstrate that while LRI-I acts at the structural level, maintaining base-pairing is not sufficient for LRI-II function. In addition, in vitro RNA dimerization assays indicated that the loss of RNA packaging in LRI mutants could not be attributed to the defects in dimerization. Our findings suggest that U5-gag LRIs play an important architectural role in maintaining the structure of the 5' region of the MPMV gRNA, expanding the crucial role of LRIs to the nonlentiviral group of retroviruses. © 2016 Kalloush et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Tandemly repeated sequence in 5'end of mtDNA control region of ...

African Journals Online (AJOL)

STORAGESEVER

2008-12-17

Dec 17, 2008 ... chain reaction (PCR). Japanese Spanish ... mainly covered general ecology and fishery biology. No study concerning the ... Conserved sequence blocks and the repeat units are indicated by boxes. performed using the exact ...
Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

DEFF Research Database (Denmark)

Richards, Stephen; Liu, Yue; Bettencourt, Brian R.

2005-01-01

years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences......We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each...... between the species-but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence...
Genome sequencing of chimpanzee malaria parasites reveals possible pathways of adaptation to human hosts

KAUST Repository

Otto, Thomas D.

2014-09-09

Plasmodium falciparum causes most human malaria deaths, having prehistorically evolved from parasites of African Great Apes. Here we explore the genomic basis of P. falciparum adaptation to human hosts by fully sequencing the genome of the closely related chimpanzee parasite species P. reichenowi, and obtaining partial sequence data from a more distantly related chimpanzee parasite (P. gaboni). The close relationship between P. reichenowi and P. falciparum is emphasized by almost complete conservation of genomic synteny, but against this strikingly conserved background we observe major differences at loci involved in erythrocyte invasion. The organization of most virulence-associated multigene families, including the hypervariable var genes, is broadly conserved, but P. falciparum has a smaller subset of rif and stevor genes whose products are expressed on the infected erythrocyte surface. Genome-wide analysis identifies other loci under recent positive selection, but a limited number of changes at the host–parasite interface may have mediated host switching.
Molecular Characterization of the Skate Peripherin/rds Gene: Relationship to Its Orthologues and Paralogues

Science.gov (United States)

Li, Chibo; Ding, Xi-Qin; O’Brien, John; Al-Ubaidi, Muayyad R.

2010-01-01

PURPOSE A great deal of information about functionally significant domains of a protein may be obtained by comparison of primary sequences of gene homologues over a broad phylogenetic base. This study was designed to identify evolutionarily conserved domains of the photoreceptor disc membrane protein peripherin/rds by analysis of the homologue in a primitive vertebrate, the skate. METHODS A skate retinal cDNA library was screened using a mouse peripherin/rds clone. The 5′ and 3′ untranslated regions of the skate peripherin/rds (srds) cDNA were isolated by the rapid amplification of cDNA ends (RACE) approach. The gene structure was characterized by PCR amplification and sequencing of genomic fragments. Northern and Western blot analyses were used to identify srds transcript and protein, respectively. RESULTS A new homologue of peripherin/rds was identified from the skate retinal cDNA library. SRDS is a glycoprotein with a predicted molecular mass of 40.2 kDa. The srds gene consists of two exons and one small intron and transcribes into a single 6-kb message. Phylogenetic analysis places SRDS at the base of peripherin/rds family and near the division of that group and the branch leading to rds-like and rom-1 genes. SRDS protein is 54.5% identical with peripherin/rds across species. Identity is significantly higher (73%) in the intradiscal domains. Sequence comparison revealed the conservation of all residues that have been shown, on mutation, to associate with retinitis pigmentosa and showed conservation of most residues associated with macular dystrophies. Comparison with ROM-1 and other rds-like proteins revealed the presence of a highly conserved domain in the large intradiscal loop. CONCLUSIONS Srds represents the skate orthologue of mammalian peripherin/rds genes. Conservation of most of the residues associated with human retinal diseases indicates that these residues serve important functional roles. The high degree of conservation of a short stretch within
Cloning and sequence analysis of the defective in anther ...

African Journals Online (AJOL)

To clone the defective in anther dehiscence1 (DAD1) gene fragment of Chinese kale, about 700 bp product was obtained by PCR amplification using Chinese kale genomic DNA as the template and a pair of specific primers designed according to the conserved sequence of DAD1 genes of Arabidopsis thaliana and ...
A transcriptome resource for the koala (Phascolarctos cinereus): insights into koala retrovirus transcription and sequence diversity.

Science.gov (United States)

Hobbs, Matthew; Pavasovic, Ana; King, Andrew G; Prentis, Peter J; Eldridge, Mark D B; Chen, Zhiliang; Colgan, Donald J; Polkinghorne, Adam; Wilkins, Marc R; Flanagan, Cheyne; Gillett, Amber; Hanger, Jon; Johnson, Rebecca N; Timms, Peter

2014-09-11

The koala, Phascolarctos cinereus, is a biologically unique and evolutionarily distinct Australian arboreal marsupial. The goal of this study was to sequence the transcriptome from several tissues of two geographically separate koalas, and to create the first comprehensive catalog of annotated transcripts for this species, enabling detailed analysis of the unique attributes of this threatened native marsupial, including infection by the koala retrovirus. RNA-Seq data was generated from a range of tissues from one male and one female koala and assembled de novo into transcripts using Velvet-Oases. Transcript abundance in each tissue was estimated. Transcripts were searched for likely protein-coding regions and a non-redundant set of 117,563 putative protein sequences was produced. In similarity searches there were 84,907 (72%) sequences that aligned to at least one sequence in the NCBI nr protein database. The best alignments were to sequences from other marsupials. After applying a reciprocal best hit requirement of koala sequences to those from tammar wallaby, Tasmanian devil and the gray short-tailed opossum, we estimate that our transcriptome dataset represents approximately 15,000 koala genes. The marsupial alignment information was used to look for potential gene duplications and we report evidence for copy number expansion of the alpha amylase gene, and of an aldehyde reductase gene.Koala retrovirus (KoRV) transcripts were detected in the transcriptomes. These were analysed in detail and the structure of the spliced envelope gene transcript was determined. There was appreciable sequence diversity within KoRV, with 233 sites in the KoRV genome showing small insertions/deletions or single nucleotide polymorphisms. Both koalas had sequences from the KoRV-A subtype, but the male koala transcriptome has, in addition, sequences more closely related to the KoRV-B subtype. This is the first report of a KoRV-B-like sequence in a wild population. This transcriptomic
A mammalian conserved element derived from SINE displays enhancer properties recapitulating Satb2 expression in early-born callosal projection neurons.

Directory of Open Access Journals (Sweden)

Kensuke Tashiro

Full Text Available Short interspersed repetitive elements (SINEs are highly repeated sequences that account for a significant proportion of many eukaryotic genomes and are usually considered "junk DNA". However, we previously discovered that many AmnSINE1 loci are evolutionarily conserved across mammalian genomes, suggesting that they may have acquired significant functions involved in controlling mammalian-specific traits. Notably, we identified the AS021 SINE locus, located 390 kbp upstream of Satb2. Using transgenic mice, we showed that this SINE displays specific enhancer activity in the developing cerebral cortex. The transcription factor Satb2 is expressed by cortical neurons extending axons through the corpus callosum and is a determinant of callosal versus subcortical projection. Mouse mutants reveal a crucial function for Sabt2 in corpus callosum formation. In this study, we compared the enhancer activity of the AS021 locus with Satb2 expression during telencephalic development in the mouse. First, we showed that the AS021 enhancer is specifically activated in early-born Satb2(+ neurons. Second, we demonstrated that the activity of the AS021 enhancer recapitulates the expression of Satb2 at later embryonic and postnatal stages in deep-layer but not superficial-layer neurons, suggesting the possibility that the expression of Satb2 in these two subpopulations of cortical neurons is under genetically distinct transcriptional control. Third, we showed that the AS021 enhancer is activated in neurons projecting through the corpus callosum, as described for Satb2(+ neurons. Notably, AS021 drives specific expression in axons crossing through the ventral (TAG1(-/NPY(+ portion of the corpus callosum, confirming that it is active in a subpopulation of callosal neurons. These data suggest that exaptation of the AS021 SINE locus might be involved in enhancement of Satb2 expression, leading to the establishment of interhemispheric communication via the corpus callosum
The complete mitochondrial genome sequence of Oceanic whitetip shark, Carcharhinus longimanus (Carcharhiniformes: Carcharhinidae).

Science.gov (United States)

Li, Weiwen; Dai, Xiaojie; Xu, Qianghua; Wu, Feng; Gao, Chunxia; Zhang, Yanbo

2016-05-01

The complete mitochondrial DNA sequence of Carcharhinus longimanus was determined and analyzed. The complete mtDNA genome sequence of C. longimanus was 16,706 bp in length. It contained 22 tRNA genes, 2 rRNA genes, 13 protein-coding genes and 2 non-conding regions: control region (D-loop) and origin of light-strand replication (OL). The complete mitogenome sequence information of C. longimanus can provide a useful data for further studies on molecular systematics, stock evaluation, taxonomic status and conservation genetics.
Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

Directory of Open Access Journals (Sweden)

Down Thomas A

2010-09-01

Full Text Available Abstract Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS" but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq not to be biological transcription factor binding sites ("empirical TFBS". We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.
Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

DEFF Research Database (Denmark)

Thomsen, Martin Christen Frølund; Nielsen, Morten

2012-01-01

Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active...... related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein...... sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally...
Finding a (pine) needle in a haystack: chloroplast genome sequence divergence in rare and widespread pines

Science.gov (United States)

J.B. Whittall; J. Syring; M. Parks; J. Buenrostro; C. Dick; A. Liston; R. Cronn

2010-01-01

Critical to conservation efforts and other investigations at low taxonomic levels, DNA sequence data offer important insights into the distinctiveness, biogeographic partitioning, and evolutionary histories of species. The resolving power of DNA sequences is often limited by insufficient variability at the intraspecific level. This is particularly true of studies...
Domain duplication, divergence, and loss events in vertebrate Msx paralogs reveal phylogenomically informed disease markers.

Science.gov (United States)

Finnerty, John R; Mazza, Maureen E; Jezewski, Peter A

2009-01-20

Msx originated early in animal evolution and is implicated in human genetic disorders. To reconstruct the functional evolution of Msx and inform the study of human mutations, we analyzed the phylogeny and synteny of 46 metazoan Msx proteins and tracked the duplication, diversification and loss of conserved motifs. Vertebrate Msx sequences sort into distinct Msx1, Msx2 and Msx3 clades. The sister-group relationship between MSX1 and MSX2 reflects their derivation from the 4p/5q chromosomal paralogon, a derivative of the original "MetaHox" cluster. We demonstrate physical linkage between Msx and other MetaHox genes (Hmx, NK1, Emx) in a cnidarian. Seven conserved domains, including two Groucho repression domains (N- and C-terminal), were present in the ancestral Msx. In cnidarians, the Groucho domains are highly similar. In vertebrate Msx1, the N-terminal Groucho domain is conserved, while the C-terminal domain diverged substantially, implying a novel function. In vertebrate Msx2 and Msx3, the C-terminal domain was lost. MSX1 mutations associated with ectodermal dysplasia or orofacial clefting disorders map to conserved domains in a non-random fashion. Msx originated from a MetaHox ancestor that also gave rise to Tlx, Demox, NK, and possibly EHGbox, Hox and ParaHox genes. Duplication, divergence or loss of domains played a central role in the functional evolution of Msx. Duplicated domains allow pleiotropically expressed proteins to evolve new functions without disrupting existing interaction networks. Human missense sequence variants reside within evolutionarily conserved domains, likely disrupting protein function. This phylogenomic evaluation of candidate disease markers will inform clinical and functional studies.

Domain duplication, divergence, and loss events in vertebrate Msx paralogs reveal phylogenomically informed disease markers

Directory of Open Access Journals (Sweden)

Finnerty John R

2009-01-01

Full Text Available Abstract Background Msx originated early in animal evolution and is implicated in human genetic disorders. To reconstruct the functional evolution of Msx and inform the study of human mutations, we analyzed the phylogeny and synteny of 46 metazoan Msx proteins and tracked the duplication, diversification and loss of conserved motifs. Results Vertebrate Msx sequences sort into distinct Msx1, Msx2 and Msx3 clades. The sister-group relationship between MSX1 and MSX2 reflects their derivation from the 4p/5q chromosomal paralogon, a derivative of the original "MetaHox" cluster. We demonstrate physical linkage between Msx and other MetaHox genes (Hmx, NK1, Emx in a cnidarian. Seven conserved domains, including two Groucho repression domains (N- and C-terminal, were present in the ancestral Msx. In cnidarians, the Groucho domains are highly similar. In vertebrate Msx1, the N-terminal Groucho domain is conserved, while the C-terminal domain diverged substantially, implying a novel function. In vertebrate Msx2 and Msx3, the C-terminal domain was lost. MSX1 mutations associated with ectodermal dysplasia or orofacial clefting disorders map to conserved domains in a non-random fashion. Conclusion Msx originated from a MetaHox ancestor that also gave rise to Tlx, Demox, NK, and possibly EHGbox, Hox and ParaHox genes. Duplication, divergence or loss of domains played a central role in the functional evolution of Msx. Duplicated domains allow pleiotropically expressed proteins to evolve new functions without disrupting existing interaction networks. Human missense sequence variants reside within evolutionarily conserved domains, likely disrupting protein function. This phylogenomic evaluation of candidate disease markers will inform clinical and functional studies.
Development of ent-kaurene Oxidase-Based Conserved Intron Spanning Primers for Species Identification in the Genus Poa (Poaceae; Bluegrass

Directory of Open Access Journals (Sweden)

Jonathan M. LaMantia

2018-04-01

Full Text Available Interspecific hybridization has been attempted to combine the heat and drought of Poa arachnifera Torr. with the turf quality characteristics of several Poa species. Confirmation of an F1 hybrid through morphological analysis of vegetative and flowering characteristics is often time consuming and ambiguous. Ent-kaurene oxidase (KO has been sequenced in rice, barley, and wheat. In rice, each of the five copies of KO gene has unique lengths for the first intron. Conserved intron spanning primers (CISP can be used as a DNA marker to exploit variations of intron lengths that flank conserved gene sequences. In the present study, we developed CISP to sequence partial genomic fragments of the KO gene from seven Poa species. Through sequence analysis, species-specific primers were also developed to produce co-dominant markers that can be used to identify interspecific hybrids between Texas bluegrass and six other Poa species used in the present study.
Aging: fruit flies break the chain to a longer life.

Science.gov (United States)

Linford, Nancy J; Pletcher, Scott D

2009-10-13

Mitochondria have long had an enigmatic role in the biology of aging. New research in Drosophila reveals an evolutionarily conserved function for the mitochondrial electron transport chain in the modulation of animal lifespan.
Identification of evolutionarily invariant sequences in the protein C gene promoter

NARCIS (Netherlands)

Spek, C. A.; Bertina, R. M.; Reitsma, P. H.

1998-01-01

Recent studies on human protein C gene expression have revealed the presence of three transcription factor binding sites in close proximity to the transcription start site. Binding sites for the liver-enriched hepatocyte nuclear factors 1 and 3 (HNF-1 and HNF-3, respectively) are located immediately
eShadow: A tool for comparing closely related sequences

Energy Technology Data Exchange (ETDEWEB)

Ovcharenko, Ivan; Boffelli, Dario; Loots, Gabriela G.

2004-01-15

Primate sequence comparisons are difficult to interpret due to the high degree of sequence similarity shared between such closely related species. Recently, a novel method, phylogenetic shadowing, has been pioneered for predicting functional elements in the human genome through the analysis of multiple primate sequence alignments. We have expanded this theoretical approach to create a computational tool, eShadow, for the identification of elements under selective pressure in multiple sequence alignments of closely related genomes, such as in comparisons of human to primate or mouse to rat DNA. This tool integrates two different statistical methods and allows for the dynamic visualization of the resulting conservation profile. eShadow also includes a versatile optimization module capable of training the underlying Hidden Markov Model to differentially predict functional sequences. This module grants the tool high flexibility in the analysis of multiple sequence alignments and in comparing sequences with different divergence rates. Here, we describe the eShadow comparative tool and its potential uses for analyzing both multiple nucleotide and protein alignments to predict putative functional elements. The eShadow tool is publicly available at http://eshadow.dcode.org/
A robust, simple genotyping-by-sequencing (GBS approach for high diversity species.

Directory of Open Access Journals (Sweden)

Robert J Elshire

Full Text Available Advances in next generation technologies have driven the costs of DNA sequencing down to the point that genotyping-by-sequencing (GBS is now feasible for high diversity, large genome species. Here, we report a procedure for constructing GBS libraries based on reducing genome complexity with restriction enzymes (REs. This approach is simple, quick, extremely specific, highly reproducible, and may reach important regions of the genome that are inaccessible to sequence capture approaches. By using methylation-sensitive REs, repetitive regions of genomes can be avoided and lower copy regions targeted with two to three fold higher efficiency. This tremendously simplifies computationally challenging alignment problems in species with high levels of genetic diversity. The GBS procedure is demonstrated with maize (IBM and barley (Oregon Wolfe Barley recombinant inbred populations where roughly 200,000 and 25,000 sequence tags were mapped, respectively. An advantage in species like barley that lack a complete genome sequence is that a reference map need only be developed around the restriction sites, and this can be done in the process of sample genotyping. In such cases, the consensus of the read clusters across the sequence tagged sites becomes the reference. Alternatively, for kinship analyses in the absence of a reference genome, the sequence tags can simply be treated as dominant markers. Future application of GBS to breeding, conservation, and global species and population surveys may allow plant breeders to conduct genomic selection on a novel germplasm or species without first having to develop any prior molecular tools, or conservation biologists to determine population structure without prior knowledge of the genome or diversity in the species.
Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana.

Science.gov (United States)

Hoffmann, Robert D; Palmgren, Michael

2016-06-13

Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3' untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication.
Application of DNA barcodes in wildlife conservation in Tropical East Asia.

Science.gov (United States)

Wilson, John-James; Sing, Kong-Wah; Lee, Ping-Shin; Wee, Alison K S

2016-10-01

Over the past 50 years, Tropical East Asia has lost more biodiversity than any tropical region. Tropical East Asia is a megadiverse region with an acute taxonomic impediment. DNA barcodes are short standardized DNA sequences used for taxonomic purposes and have the potential to lessen the challenges of biodiversity inventory and assessments in regions where they are most needed. We reviewed DNA barcoding efforts in Tropical East Asia relative to other tropical regions. We suggest DNA barcodes (or metabarcodes from next-generation sequencers) may be especially useful for characterizing and connecting species-level biodiversity units in inventories encompassing taxa lacking formal description (particularly arthropods) and in large-scale, minimal-impact approaches to vertebrate monitoring and population assessments through secondary sources of DNA (invertebrate derived DNA and environmental DNA). We suggest interest and capacity for DNA barcoding are slowly growing in Tropical East Asia, particularly among the younger generation of researchers who can connect with the barcoding analogy and understand the need for new approaches to the conservation challenges being faced. © 2016 Society for Conservation Biology.
Characterization of the expression, promoter activity and molecular architecture of fibin

Directory of Open Access Journals (Sweden)

Hermsdorf Thomas

2011-05-01

Full Text Available Abstract Background Fibin was initially discovered as a secreted signal molecule essential for pectoral fin bud initiation in zebrafish. Currently, there is little information about the molecular architecture and biological relevance of fibin in humans and other mammals. Results Fibin is expressed in cerebellum, skeletal muscle and many other embryonic and adult mouse tissues suggesting not only a role during embryonic development but also in adult functions. A 2.5-kbp genomic sequence fragment upstream of the coding sequence is sufficient to drive and regulate fibin expression through stimulation by glucocorticoids, activators of the protein kinase C signalling pathways and manganese ions. Fibin is an evolutionarily conserved protein, carries a cleavable signal peptide (amino acids 1-18 and is glycosylated at Asn30. The two conserved cysteines participate in intermolecular disulfide bond and multimer formation. Although fibin displays all features of a secretory protein, it is mostly retained in the endoplasmic reticulum when heterologously expressed. Conclusion Fibin is functionally relevant during embryogenesis and adult life. Its expression is regulated by a number of cellular signalling pathways and the protein is routed via the secretory pathway. However, proper secretion presumably requires an unknown covalently-linked or associated co-factor.
Repetitive sequences: the hidden diversity of heterochromatin in prochilodontid fish

Directory of Open Access Journals (Sweden)

Maria L. Terencio

2015-08-01

Full Text Available The structure and organization of repetitive elements in fish genomes are still relatively poorly understood, although most of these elements are believed to be located in heterochromatic regions. Repetitive elements are considered essential in evolutionary processes as hotspots for mutations and chromosomal rearrangements, among other functions – thus providing new genomic alternatives and regulatory sites for gene expression. The present study sought to characterize repetitive DNA sequences in the genomes of Semaprochilodus insignis (Jardine & Schomburgk, 1841 and Semaprochilodus taeniurus (Valenciennes, 1817 and identify regions of conserved syntenic blocks in this genome fraction of three species of Prochilodontidae (S. insignis, S. taeniurus, and Prochilodus lineatus (Valenciennes, 1836 by cross-FISH using Cot-1 DNA (renaturation kinetics probes. We found that the repetitive fractions of the genomes of S. insignis and S. taeniurus have significant amounts of conserved syntenic blocks in hybridization sites, but with low degrees of similarity between them and the genome of P. lineatus, especially in relation to B chromosomes. The cloning and sequencing of the repetitive genomic elements of S. insignis and S. taeniurus using Cot-1 DNA identified 48 fragments that displayed high similarity with repetitive sequences deposited in public DNA databases and classified as microsatellites, transposons, and retrotransposons. The repetitive fractions of the S. insignis and S. taeniurus genomes exhibited high degrees of conserved syntenic blocks in terms of both the structures and locations of hybridization sites, but a low degree of similarity with the syntenic blocks of the P. lineatus genome. Future comparative analyses of other prochilodontidae species will be needed to advance our understanding of the organization and evolution of the genomes in this group of fish.
Usb1 controls U6 snRNP assembly through evolutionarily divergent cyclic phosphodiesterase activities.

Science.gov (United States)

Didychuk, Allison L; Montemayor, Eric J; Carrocci, Tucker J; DeLaitsch, Andrew T; Lucarelli, Stefani E; Westler, William M; Brow, David A; Hoskins, Aaron A; Butcher, Samuel E

2017-09-08

U6 small nuclear ribonucleoprotein (snRNP) biogenesis is essential for spliceosome assembly, but not well understood. Here, we report structures of the U6 RNA processing enzyme Usb1 from yeast and a substrate analog bound complex from humans. Unlike the human ortholog, we show that yeast Usb1 has cyclic phosphodiesterase activity that leaves a terminal 3' phosphate which prevents overprocessing. Usb1 processing of U6 RNA dramatically alters its affinity for cognate RNA-binding proteins. We reconstitute the post-transcriptional assembly of yeast U6 snRNP in vitro, which occurs through a complex series of handoffs involving 10 proteins (Lhp1, Prp24, Usb1 and Lsm2-8) and anti-cooperative interactions between Prp24 and Lhp1. We propose a model for U6 snRNP assembly that explains how evolutionarily divergent and seemingly antagonistic proteins cooperate to protect and chaperone the nascent snRNA during its journey to the spliceosome.The mechanism of U6 small nuclear ribonucleoprotein (snRNP) biogenesis is not well understood. Here the authors characterize the enzymatic activities and structures of yeast and human U6 RNA processing enzyme Usb1, reconstitute post-transcriptional assembly of yeast U6 snRNP in vitro, and propose a model for U6 snRNP assembly.
Planarian homeobox genes: cloning, sequence analysis, and expression.

Science.gov (United States)

Garcia-Fernàndez, J; Baguñà, J; Saló, E

1991-01-01

Freshwater planarians (Platyhelminthes, Turbellaria, and Tricladida) are acoelomate, triploblastic, unsegmented, and bilaterally symmetrical organisms that are mainly known for their ample power to regenerate a complete organism from a small piece of their body. To identify potential pattern-control genes in planarian regeneration, we have isolated two homeobox-containing genes, Dth-1 and Dth-2 [Dugesia (Girardia) tigrina homeobox], by using degenerate oligonucleotides corresponding to the most conserved amino acid sequence from helix-3 of the homeodomain. Dth-1 and Dth-2 homeodomains are closely related (68% at the nucleotide level and 78% at the protein level) and show the conserved residues characteristic of the homeodomains identified to data. Similarity with most homeobox sequences is low (30-50%), except with Drosophila NK homeodomains (80-82% with NK-2) and the rodent TTF-1 homeodomain (77-87%). Some unusual amino acid residues specific to NK-2, TTF-1, Dth-1, and Dth-2 can be observed in the recognition helix (helix-3) and may define a family of homeodomains. The deduced amino acid sequences from the cDNAs contain, in addition to the homeodomain, other domains also present in various homeobox-containing genes. The expression of both genes, detected by Northern blot analysis, appear slightly higher in cephalic regions than in the rest of the intact organism, while a slight increase is detected in the central period (5 days) or regeneration. Images PMID:1714599
Sequence analysis of dolphin ferritin H and L subunits and possible iron-dependent translational control of dolphin ferritin gene

Directory of Open Access Journals (Sweden)

Sasaki Yukako

2008-10-01

Full Text Available Abstract Background Iron-storage protein, ferritin plays a central role in iron metabolism. Ferritin has dual function to store iron and segregate iron for protection of iron-catalyzed reactive oxygen species. Tissue ferritin is composed of two kinds of subunits (H: heavy chain or heart-type subunit; L: light chain or liver-type subunit. Ferritin gene expression is controlled at translational level in iron-dependent manner or at transcriptional level in iron-independent manner. However, sequencing analysis of marine mammalian ferritin subunits has not yet been performed fully. The purpose of this study is to reveal cDNA-derived amino acid sequences of cetacean ferritin H and L subunits, and demonstrate the possibility of expression of these subunits, especially H subunit, by iron. Methods Sequence analyses of cetacean ferritin H and L subunits were performed by direct sequencing of polymerase chain reaction (PCR fragments from cDNAs generated via reverse transcription-PCR of leukocyte total RNA prepared from blood samples of six different dolphin species (Pseudorca crassidens, Lagenorhynchus obliquidens, Grampus griseus, Globicephala macrorhynchus, Tursiops truncatus, and Delphinapterus leucas. The putative iron-responsive element sequence in the 5'-untranslated region of the six different dolphin species was revealed by direct sequencing of PCR fragments obtained using leukocyte genomic DNA. Results Dolphin H and L subunits consist of 182 and 174 amino acids, respectively, and amino acid sequence identities of ferritin subunits among these dolphins are highly conserved (H: 99–100%, (99→98 ; L: 98–100%. The conserved 28 bp IRE sequence was located -144 bp upstream from the initiation codon in the six different dolphin species. Conclusion These results indicate that six different dolphin species have conserved ferritin sequences, and suggest that these genes are iron-dependently expressed.
Effects of Main-Sequence Mass Loss on Stellar and Galactic Chemical Evolution.

Science.gov (United States)

Guzik, Joyce Ann

1988-06-01

L. A. Willson, G. H. Bowen and C. Struck -Marcell have proposed that 1 to 3 solar mass stars may experience evolutionarily significant mass loss during the early part of their main-sequence phase. The suggested mass-loss mechanism is pulsation, facilitated by rapid rotation. Initial mass-loss rates may be as large as several times 10^{-9}M o/yr, diminishing over several times 10^8 years. We attempted to test this hypothesis by comparing some theoretical implications with observations. Three areas are addressed: Solar models, cluster HR diagrams, and galactic chemical evolution. Mass-losing solar models were evolved that match the Sun's luminosity and radius at its present age. The most extreme viable models have initial mass 2.0 M o, and mass-loss rates decreasing exponentially over 2-3 times 10^8 years. Compared to a constant -mass model, these models require a reduced initial ^4He abundance, have deeper envelope convection zones and higher ^8B neutrino fluxes. Early processing of present surface layers at higher interior temperatures increases the surface ^3He abundance, destroys Li, Be and B, and decreases the surface C/N ratio following first dredge-up. Evolution calculations incorporating main-sequence mass loss were completed for a grid of models with initial masses 1.25 to 2.0 Mo and mass loss timescales 0.2 to 2.0 Gyr. Cluster HR diagrams synthesized with these models confirm the potential for the hypothesis to explain observed spreads or bifurcations in the upper main sequence, blue stragglers, anomalous giants, and poor fits of main-sequence turnoffs by standard isochrones. Simple closed galactic chemical evolution models were used to test the effects of main-sequence mass loss on the F and G dwarf distribution. Stars between 3.0 M o and a metallicity -dependent lower mass are assumed to lose mass. The models produce a 30 to 60% increase in the stars to stars-plus -remnants ratio, with fewer early-F dwarfs and many more late-F dwarfs remaining on the main
Comparative Annotation of Viral Genomes with Non-Conserved Gene Structure

DEFF Research Database (Denmark)

de Groot, Saskia; Mailund, Thomas; Hein, Jotun

2007-01-01

Motivation: Detecting genes in viral genomes is a complex task. Due to the biological necessity of them being constrained in length, RNA viruses in particular tend to code in overlapping reading frames. Since one amino acid is encoded by a triplet of nucleic acids, up to three genes may be coded...... allows for coding in unidirectional nested and overlapping reading frames, to annotate two homologous aligned viral genomes. Our method does not insist on conserved gene structure between the two sequences, thus making it applicable for the pairwise comparison of more distantly related sequences. Results...... and HIV2, as well as of two different Hepatitis Viruses, attaining results of ~87% sensitivity and ~98.5% specificity. We subsequently incorporate prior knowledge by "knowing" the gene structure of one sequence and annotating the other conditional on it. Boosting accuracy close to perfect we demonstrate...
Conservation Genetics of the Cheetah: Lessons Learned and New Opportunities.

Science.gov (United States)

O'Brien, Stephen J; Johnson, Warren E; Driscoll, Carlos A; Dobrynin, Pavel; Marker, Laurie

2017-09-01

The dwindling wildlife species of our planet have become a cause célèbre for conservation groups, governments, and concerned citizens throughout the world. The application of powerful new genetic technologies to surviving populations of threatened mammals has revolutionized our ability to recognize hidden perils that afflict them. We have learned new lessons of survival, adaptation, and evolution from viewing the natural history of genomes in hundreds of detailed studies. A single case history of one species, the African cheetah, Acinonyx jubatus, is here reviewed to reveal a long-term story of conservation challenges and action informed by genetic discoveries and insights. A synthesis of 3 decades of data, interpretation, and controversy, capped by whole genome sequence analysis of cheetahs, provides a compelling tale of conservation relevance and action to protect this species and other threatened wildlife. © The American Genetic Association 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Conservation patterns of HIV-1 RT connection and RNase H domains: identification of new mutations in NRTI-treated patients.

Directory of Open Access Journals (Sweden)

André F A Santos

Full Text Available BACKGROUND: Although extensive HIV drug resistance information is available for the first 400 amino acids of its reverse transcriptase, the impact of antiretroviral treatment in C-terminal domains of Pol (thumb, connection and RNase H is poorly understood. METHODS AND FINDINGS: We wanted to characterize conserved regions in RT C-terminal domains among HIV-1 group M subtypes and CRF. Additionally, we wished to identify NRTI-related mutations in HIV-1 RT C-terminal domains. We sequenced 118 RNase H domains from clinical viral isolates in Brazil, and analyzed 510 thumb and connection domain and 450 RNase H domain sequences collected from public HIV sequence databases, together with their treatment status and histories. Drug-naïve and NRTI-treated datasets were compared for intra- and inter-group conservation, and differences were determined using Fisher's exact tests. One third of RT C-terminal residues were found to be conserved among group M variants. Three mutations were found exclusively in NRTI-treated isolates. Nine mutations in the connection and 6 mutations in the RNase H were associated with NRTI treatment in subtype B. Some of them lay in or close to amino acid residues which contact nucleic acid or near the RNase H active site. Several of the residues pointed out herein have been recently associated to NRTI exposure or increase drug resistance to NRTI. CONCLUSIONS: This is the first comprehensive genotypic analysis of a large sequence dataset that describes NRTI-related mutations in HIV-1 RT C-terminal domains in vivo. The findings into the conservation of RT C-terminal domains may pave the way to more rational drug design initiatives targeting those regions.
Sequencing and analysis of the Mediterranean amphioxus (Branchiostoma lanceolatum transcriptome.

Directory of Open Access Journals (Sweden)

Silvan Oulion

Full Text Available BACKGROUND: The basally divergent phylogenetic position of amphioxus (Cephalochordata, as well as its conserved morphology, development and genetics, make it the best proxy for the chordate ancestor. Particularly, studies using the amphioxus model help our understanding of vertebrate evolution and development. Thus, interest for the amphioxus model led to the characterization of both the transcriptome and complete genome sequence of the American species, Branchiostoma floridae. However, recent technical improvements allowing induction of spawning in the laboratory during the breeding season on a daily basis with the Mediterranean species Branchiostoma lanceolatum have encouraged European Evo-Devo researchers to adopt this species as a model even though no genomic or transcriptomic data have been available. To fill this need we used the pyrosequencing method to characterize the B. lanceolatum transcriptome and then compared our results with the published transcriptome of B. floridae. RESULTS: Starting with total RNA from nine different developmental stages of B. lanceolatum, a normalized cDNA library was constructed and sequenced on Roche GS FLX (Titanium mode. Around 1.4 million of reads were produced and assembled into 70,530 contigs (average length of 490 bp. Overall 37% of the assembled sequences were annotated by BlastX and their Gene Ontology terms were determined. These results were then compared to genomic and transcriptomic data of B. floridae to assess similarities and specificities of each species. CONCLUSION: We obtained a high-quality amphioxus (B. lanceolatum reference transcriptome using a high throughput sequencing approach. We found that 83% of the predicted genes in the B. floridae complete genome sequence are also found in the B. lanceolatum transcriptome, while only 41% were found in the B. floridae transcriptome obtained with traditional Sanger based sequencing. Therefore, given the high degree of sequence conservation
Evolutionary relationships in the ilarviruses: nucleotide sequence of prunus necrotic ringspot virus RNA 3.

Science.gov (United States)

Sánchez-Navarro, J A; Pallás, V

1997-01-01

The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of A1MV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, A1MV should be considered as a true ilarvirus instead of forming a distinct group of viruses.
Sequencing and characterisation of rearrangements in three S. pastorianus strains reveals the presence of chimeric genes and gives evidence of breakpoint reuse.

Directory of Open Access Journals (Sweden)

Sarah K Hewitt

Full Text Available Gross chromosomal rearrangements have the potential to be evolutionarily advantageous to an adapting organism. The generation of a hybrid species increases opportunity for recombination by bringing together two homologous genomes. We sought to define the location of genomic rearrangements in three strains of Saccharomyces pastorianus, a natural lager-brewing yeast hybrid of Saccharomyces cerevisiae and Saccharomyces eubayanus, using whole genome shotgun sequencing. Each strain of S. pastorianus has lost species-specific portions of its genome and has undergone extensive recombination, producing chimeric chromosomes. We predicted 30 breakpoints that we confirmed at the single nucleotide level by designing species-specific primers that flank each breakpoint, and then sequencing the PCR product. These rearrangements are the result of recombination between areas of homology between the two subgenomes, rather than repetitive elements such as transposons or tRNAs. Interestingly, 28/30 S. cerevisiae-S. eubayanus recombination breakpoints are located within genic regions, generating chimeric genes. Furthermore we show evidence for the reuse of two breakpoints, located in HSP82 and KEM1, in strains of proposed independent origin.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.