WorldWideScience

Sample records for netb structural gene

  1. NetB, a new toxin that is associated with avian necrotic enteritis caused by Clostridium perfringens.

    Directory of Open Access Journals (Sweden)

    Anthony L Keyburn

    2008-02-01

    For over 30 years a phospholipase C enzyme called alpha-toxin was thought to be the key virulence factor in necrotic enteritis caused by Clostridium perfringens. However, using a gene knockout mutant we have recently shown that alpha-toxin is not essential for pathogenesis. We have now discovered a key virulence determinant. A novel toxin (NetB) was identified in a C. perfringens strain isolated from a chicken suffering from necrotic enteritis (NE). The toxin displayed limited amino acid sequence similarity to several pore forming toxins including beta-toxin from C. perfringens (38% identity) and alpha-toxin from Staphylococcus aureus (31% identity). NetB was only identified in C. perfringens type A strains isolated from chickens suffering NE. Both purified native NetB and recombinant NetB displayed cytotoxic activity against the chicken leghorn male hepatoma cell line LMH, inducing cell rounding and lysis. To determine the role of NetB in NE a netB mutant of a virulent C. perfringens chicken isolate was constructed by homologous recombination, and its virulence assessed in a chicken disease model. The netB mutant was unable to cause disease whereas the wild-type parent strain and the netB mutant complemented with a wild-type netB gene caused significant levels of NE. These data show unequivocally that in this isolate a functional NetB toxin is critical for the ability of C. perfringens to cause NE in chickens. This novel toxin is the first definitive virulence factor to be identified in avian C. perfringens strains capable of causing NE. Furthermore, the netB mutant is the first rationally attenuated strain obtained in an NE-causing isolate of C. perfringens; as such it has considerable vaccine potential.

  2. vaccination using profilin and NetB proteins in Montanide IMS adjuvant increases protective immunity against experimentally-induced necrotic enteritis

    Directory of Open Access Journals (Sweden)

    Hyun Soon Lillehoj

    2017-10-01

    Objective The effects of vaccinating 18-day-old chicken embryos with the combination of recombinant Eimeria profilin plus Clostridium perfringens (C. perfringens) NetB proteins mixed in the Montanide IMS adjuvant on the chicken immune response to necrotic enteritis (NE) were investigated using an Eimeria maxima (E. maxima)/C. perfringens co-infection NE disease model that we previously developed. Methods Eighteen-day-old broiler embryos were injected with 100 μL of phosphate-buffered saline, profilin, profilin plus necrotic enteritis B-like (NetB), profilin plus NetB/Montanide adjuvant (IMS 106), and profilin plus NetB/Montanide adjuvant (IMS 101). After post-hatch birds were challenged with our NE experimental disease model, body weights, intestinal lesions, serum antibody levels to NetB, and proinflammatory cytokine and chemokine mRNA levels in intestinal intraepithelial lymphocytes were measured. Results Chickens in ovo vaccinated with recombinant profilin plus NetB proteins/IMS 106 and recombinant profilin plus NetB proteins/IMS 101 showed significantly increased body weight gains and reduced gut damage compared with the profilin-only group. Greater antibody responses to NetB toxin were observed in the profilin plus NetB/IMS 106 and profilin plus NetB/IMS 101 groups compared with the other three vaccine/adjuvant groups. Finally, diminished levels of transcripts encoding proinflammatory cytokines such as lipopolysaccharide-induced tumor necrosis factor-α factor, tumor necrosis factor superfamily 15, and interleukin-8 were observed in the intestinal lymphocytes of chickens in ovo injected with profilin plus NetB toxin in combination with IMS 106, and profilin plus NetB toxin in combination with IMS 101, compared with birds given profilin protein alone. Conclusion These results suggest that the Montanide IMS adjuvants potentiate host immunity to experimentally-induced avian NE when administered in ovo in conjunction with the profilin and

  3. In ovo vaccines based on recombinant NetB toxin and Montanide IMS adjuvants induced protective immunity against Necrotic Enteritis in chickens

    Science.gov (United States)

    The current study was conducted to investigate the effects of in ovo injection of recombinant clostridium NetB toxin plus Eimeria profilin proteins in combination with Montanide adjuvants in modulating immune system in chickens infected for experimental necrotic enteritis (NE) disease. Broiler eggs ...

  4. Synergistic effect of embryo vaccination with Eimeria profilin and Clostridium perfringens NetB proteins on inducing protective immunity against necrotic enteritis in broiler chickens

    Science.gov (United States)

    The effects of embryo vaccination with Eimeria profilin plus Clostridium perfringens NetB toxin proteins in combination with the Montanide IMS-OVO adjuvant on the chicken immune response to necrotic enteritis were investigated using an E. maxima/C. perfringens co-infection model. Eighteen-day-old br...

  5. Clostridium difficile and Clostridium perfringens from wild carnivore species in Brazil.

    Science.gov (United States)

    Silva, Rodrigo Otávio Silveira; D'Elia, Mirella Lauria; Tostes Teixeira, Erika Procópio; Pereira, Pedro Lúcio Lithg; de Magalhães Soares, Danielle Ferreira; Cavalcanti, Álvaro Roberto; Kocuvan, Aleksander; Rupnik, Maja; Santos, André Luiz Quagliatto; Junior, Carlos Augusto Oliveira; Lobato, Francisco Carlos Faria

    2014-08-01

    Despite some case reports, the importance of Clostridium perfringens and Clostridium difficile for wild carnivores remains unclear. Thus, the objective of this study was to identify C. perfringens and C. difficile strains in stool samples from wild carnivore species in Brazil. A total of 34 stool samples were collected and subjected to C. perfringens and C. difficile isolation. Suggestive colonies of C. perfringens were then analyzed for genes encoding the major C. perfringens toxins (alpha, beta, epsilon and iota) and the beta-2 toxin (cpb2), enterotoxin (cpe) and NetB (netb) genes. C. difficile strains were analyzed by multiplex-PCR for toxins A (tcdA) and B (tcdB) and a binary toxin gene (cdtB) and also submitted to PCR ribotyping. Unthawed aliquots of samples positive for C. difficile isolation were subjected to the detection of A/B toxins by a cytotoxicity assay (CTA). C. perfringens was isolated from 26 samples (76.5%), all of which were genotyped as type A. The netb gene was not detected, whereas the cpb2 and cpe genes were found in nine and three C. perfringens strains, respectively. C. difficile was isolated from two (5.9%) samples. A non-toxigenic strain was recovered from a non-diarrheic maned wolf (Chrysocyon brachyurus). Conversely, a toxigenic strain was found in the sample of a diarrheic ocelot (Leopardus pardalis); an unthawed stool sample was also positive for A/B toxins by CTA, indicating a diagnosis of C. difficile-associated diarrhea in this animal. The present work suggests that wild carnivore species could carry C. difficile strains and that they could be susceptible to C. difficile infection. Copyright © 2014 Elsevier Ltd. All rights reserved.

  6. Toxinotyping of Clostridium perfringens strains isolated from packed chicken portions

    Directory of Open Access Journals (Sweden)

    Maryam Poursoltani

    2014-06-01

    Background and Aim: Clostridium perfringens strains are classified into five toxin types, A to E, on the basis of production of alpha, beta, epsilon and iota toxins. Some strains are able to produce enterotoxin, which can cause food poisoning in humans. The bacteria are also able to produce NetB and TpeL toxins, which are virulence factors in necrotic enteritis in poultry. The aim of this study was to determine the toxin profile of C. perfringens strains isolated from packed chicken portions using single and multiplex PCR assays. Materials and Methods: In a cross-sectional study, 180 samples of chicken portions including wing (n=50), liver (n=50), neck (n=50) and gizzard (n=30) were collected randomly and examined for C. perfringens contamination. For this purpose all of the samples were cultured on 7% defibrinated sheep blood agar, TSN and TSC culture media. All of the isolates were investigated for the presence of alpha, beta, epsilon and iota toxin genes and virulence (tpeL and netB) genes. Results: In the present study, 6 isolates out of 180 samples were confirmed as C. perfringens by culture and molecular methods. All of the isolates (100%) were confirmed as cpa- and cpb-positive strains and belong to type C of C. perfringens. The netB gene was detected in 5 isolates (83.33%) and the tpeL gene in three isolates (50%). Conclusions: Our findings show that the majority of C. perfringens isolates in broilers belong to type C, which produces necrotic enteritis in poultry and may be transmitted to humans through poultry products.
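
    The typing rule used in this abstract (toxin type follows from which major toxin genes a PCR detects) can be sketched in a few lines. This is an illustrative sketch only, not the authors' assay: the gene names etx (epsilon) and iap (iota) are assumed standard names that do not appear in the abstract, and the function names are hypothetical.

```python
# Classic five-type scheme (A to E): the type is defined by which of the
# four major toxin genes are present. cpa and cpb are named in the abstract;
# etx (epsilon) and iap (iota) are ASSUMED gene names for illustration.
MAJOR_GENES = {"cpa", "cpb", "etx", "iap"}

TOXINOTYPES = {
    frozenset({"cpa"}): "A",
    frozenset({"cpa", "cpb", "etx"}): "B",
    frozenset({"cpa", "cpb"}): "C",
    frozenset({"cpa", "etx"}): "D",
    frozenset({"cpa", "iap"}): "E",
}


def toxinotype(detected_genes):
    """Return the toxin type for a set of PCR-positive genes.

    Accessory genes such as netB or tpeL are ignored for typing purposes.
    """
    major = frozenset(g for g in detected_genes if g in MAJOR_GENES)
    return TOXINOTYPES.get(major, "untypeable")


def percent(count, total):
    """Share of isolates carrying a gene, as a percentage with two decimals."""
    return round(100 * count / total, 2)


if __name__ == "__main__":
    # An isolate positive for cpa, cpb and netB, as reported in the abstract.
    print(toxinotype({"cpa", "cpb", "netB"}))
    # netB in 5 of 6 isolates, tpeL in 3 of 6 isolates.
    print(percent(5, 6), percent(3, 6))
```

    Keeping the accessory genes (netB, tpeL) out of the lookup key mirrors the abstract, where they are reported alongside the type rather than as part of it.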

  7. MADS-box gene evolution - structure and transcription patterns

    DEFF Research Database (Denmark)

    Johansen, Bo; Pedersen, Louise Buchholt; Skipper, Martin

    2002-01-01

    Mads-box genes, ABC model, Evolution, Phylogeny, Transcription patterns, Gene structure, Conserved motifs...

  8. Gene Composer in a structural genomics environment

    International Nuclear Information System (INIS)

    Lorimer, Don; Raymond, Amy; Mixon, Mark; Burgin, Alex; Staker, Bart; Stewart, Lance

    2011-01-01

    For structural biology applications, protein-construct engineering is guided by comparative sequence analysis and structural information, which allow the researcher to better define domain boundaries for terminal deletions and nonconserved regions for surface mutants. A database software application called Gene Composer has been developed to facilitate construct design. The structural genomics effort at the Seattle Structural Genomics Center for Infectious Disease (SSGCID) requires the manipulation of large numbers of amino-acid sequences and the underlying DNA sequences which are to be cloned into expression vectors. To improve efficiency in high-throughput protein structure determination, a database software package, Gene Composer, has been developed which facilitates the information-rich design of protein constructs and their underlying gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bioinformatics steps used in modern structure-guided protein engineering and synthetic gene engineering. An example of the structure determination of H1N1 RNA-dependent RNA polymerase PB2 subunit is given

  9. Evaluating bacterial gene-finding HMM structures as probabilistic logic programs

    DEFF Research Database (Denmark)

    Mørk, Søren; Holmes, Ian

    2012-01-01

    , a probabilistic dialect of Prolog. Results: We evaluate Hidden Markov Model structures for bacterial protein-coding gene potential, including a simple null model structure, three structures based on existing bacterial gene finders and two novel model structures. We test standard versions as well as ADPH length...

  10. Causal gene identification using combinatorial V-structure search.

    Science.gov (United States)

    Cai, Ruichu; Zhang, Zhenjie; Hao, Zhifeng

    2013-07-01

    With the advances of biomedical techniques in the last decade, the costs of human genomic sequencing and genomic activity monitoring are coming down rapidly. To support the huge genome-based business in the near future, researchers are eager to find killer applications based on human genome information. Causal gene identification is one of the most promising applications, which may help potential patients to estimate the risk of certain genetic diseases and locate the target gene for further genetic therapy. Unfortunately, existing pattern recognition techniques, such as Bayesian networks, cannot be directly applied to find the accurate causal relationship between genes and diseases. This is mainly due to the insufficient number of samples and the extremely high dimensionality of the gene space. In this paper, we present the first practical solution to causal gene identification, utilizing a new combinatorial formulation over V-Structures commonly used in conventional Bayesian networks, by exploring the combinations of significant V-Structures. We prove the NP-hardness of the combinatorial search problem under general settings on the significance measure on the V-Structures, and present a greedy algorithm to find sub-optimal results. Extensive experiments show that our proposal is both scalable and effective, particularly with interesting findings on the causal genes over real human genome data. Copyright © 2013 Elsevier Ltd. All rights reserved.
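
    For readers unfamiliar with the term, a V-structure in a Bayesian network is a pair of non-adjacent nodes that share a common child (X -> Z <- Y). The sketch below only enumerates V-structures in a given directed graph; it is not the combinatorial search or the greedy algorithm of the paper, and the function name and the example node labels are hypothetical.

```python
from itertools import combinations


def v_structures(edges):
    """List all V-structures (x, child, y) in a directed graph.

    `edges` is an iterable of (parent, child) pairs. A V-structure is a
    child with two parents x and y that are not connected to each other
    in either direction.
    """
    edges = set(edges)
    parents = {}
    for parent, child in edges:
        parents.setdefault(child, set()).add(parent)

    found = []
    for child, pars in parents.items():
        for x, y in combinations(sorted(pars), 2):
            if (x, y) not in edges and (y, x) not in edges:
                found.append((x, child, y))
    return sorted(found)


if __name__ == "__main__":
    # Two hypothetical genes G1 and G2 both pointing at a disease node D.
    print(v_structures([("G1", "D"), ("G2", "D")]))
```

    Adding an edge between the two parents (for example G1 -> G2) removes the V-structure, which is why the non-adjacency check is part of the definition.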

  11. Structure and expression of thyroglobulin gene

    Energy Technology Data Exchange (ETDEWEB)

    Vassart, G; Brocas, H; Christophe, D; de Martynoff, G; Leriche, A; Mercken, L; Pohl, V; van Heuverswyn, B [Institut de Recherche Interdisciplinaire en Biologie Humaine et Nucleaire (IRIBHN), Faculte de Medecine, Universite libre de Bruxelles, Campus Hopital Erasme, Brussels (Belgium)

    1982-01-01

    Thyroglobulin is composed of two 300000 dalton polypeptide chains, translated from an 8000 base mRNA. Preparation of a full length cDNA and its cloning in E. coli have led to the demonstration that the polypeptides of thyroglobulin protomers were identical. Used as molecular probes, the cloned cDNA allowed the isolation of a fragment of thyroglobulin gene. Electron microscopic studies have demonstrated that this gene contains more than 90 % intronic material separating small size exons (<200 bp). Sequencing of bovine thyroglobulin structural gene is in progress. Preliminary results show evidence for the existence of repetitive segments. Availability of cloned DNA complementary to bovine and human thyroglobulin mRNA allows the study of genetic defects of thyroglobulin gene expression in the human and in various animal models.

  12. Evaluating bacterial gene-finding HMM structures as probabilistic logic programs.

    Science.gov (United States)

    Mørk, Søren; Holmes, Ian

    2012-03-01

    Probabilistic logic programming offers a powerful way to describe and evaluate structured statistical models. To investigate the practicality of probabilistic logic programming for structure learning in bioinformatics, we undertook a simplified bacterial gene-finding benchmark in PRISM, a probabilistic dialect of Prolog. We evaluate Hidden Markov Model structures for bacterial protein-coding gene potential, including a simple null model structure, three structures based on existing bacterial gene finders and two novel model structures. We test standard versions as well as ADPH length modeling and three-state versions of the five model structures. The models are all represented as probabilistic logic programs and evaluated using the PRISM machine learning system in terms of statistical information criteria and gene-finding prediction accuracy, in two bacterial genomes. Neither of our implementations of the two currently most used model structures are best performing in terms of statistical information criteria or prediction performances, suggesting that better-fitting models might be achievable. The source code of all PRISM models, data and additional scripts are freely available for download at: http://github.com/somork/codonhmm. Supplementary data are available at Bioinformatics online.
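
    As a rough illustration of what decoding with a gene-finding HMM involves, the sketch below runs the Viterbi algorithm on a toy two-state model (N = non-coding, C = coding). It is written in Python rather than PRISM, and every state name and probability in it is an invented assumption; it does not reproduce any of the model structures evaluated in the paper.

```python
import math

# Toy two-state HMM. All numbers below are ASSUMED for illustration only:
# the "coding" state C is made GC-rich, the "non-coding" state N AT-rich.
STATES = ("N", "C")
START = {"N": 0.5, "C": 0.5}
TRANS = {"N": {"N": 0.9, "C": 0.1}, "C": {"N": 0.1, "C": 0.9}}
EMIT = {
    "N": {"A": 0.4, "T": 0.4, "C": 0.1, "G": 0.1},
    "C": {"A": 0.1, "T": 0.1, "C": 0.4, "G": 0.4},
}


def viterbi(seq):
    """Return the most probable state path for `seq` as a string of N/C."""
    if not seq:
        return ""
    # Log-probability of the best path ending in each state at position 0.
    score = {s: math.log(START[s]) + math.log(EMIT[s][seq[0]]) for s in STATES}
    back = []  # back[i][s] = best predecessor of state s at position i + 1
    for base in seq[1:]:
        new_score, pointers = {}, {}
        for s in STATES:
            prev = max(STATES, key=lambda p: score[p] + math.log(TRANS[p][s]))
            new_score[s] = (
                score[prev] + math.log(TRANS[prev][s]) + math.log(EMIT[s][base])
            )
            pointers[s] = prev
        back.append(pointers)
        score = new_score
    # Trace back from the best final state.
    state = max(STATES, key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return "".join(reversed(path))


if __name__ == "__main__":
    print(viterbi("AAAAAAGGGGGG"))
```

    Working in log space avoids numerical underflow on genome-length sequences; real gene finders add codon-aware states and length modeling on top of this basic recursion.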

  13. GETDB: 113406 [GETDB

    Lifescience Database Archive (English)

    Full Text Available 113406 Link to Original w[*] P{GawB}NP4151 / FM7c 12F3 Link to DGRC Genome Viewer: 113406...prob not (SH) muscle subset, cns midline, pns (ch) (TU) cns sg internal - - - - - g1 g2 - Show 113406... DGRC Number 113406 Link to Original Genotype w[*] P{GawB}NP4151 / FM7c Insertion Site 1...2F3 Map Viewer Link to DGRC Genome Viewer: 113406 Related Genes NetB CG15890 CG32595 Original Number 4151 Ch

  14. Gene structure, phylogeny and expression profile of the sucrose ...

    Indian Academy of Sciences (India)

    Gene structure, phylogeny and expression profile of the sucrose synthase gene family in .... 24, 701–713. Bate N. and Twell D. 1998 Functional architecture of a late pollen .... Manzara T. and Gruissem W. 1988 Organization and expression.

  15. The structure of the human interferon alpha/beta receptor gene.

    Science.gov (United States)

    Lutfalla, G; Gardiner, K; Proudhon, D; Vielh, E; Uzé, G

    1992-02-05

    Using the cDNA coding for the human interferon alpha/beta receptor (IFNAR), the IFNAR gene has been physically mapped relative to the other loci of the chromosome 21q22.1 region. 32,906 base pairs covering the IFNAR gene have been cloned and sequenced. Primer extension and solution hybridization-ribonuclease protection have been used to determine that the transcription of the gene is initiated in a broad region of 20 base pairs. Some aspects of the polymorphism of the gene, including noncoding sequences, have been analyzed; some are allelic differences in the coding sequence that induce amino acid variations in the resulting protein. The exon structure of the IFNAR gene and of that of the available genes for the receptors of the cytokine/growth hormone/prolactin/interferon receptor family have been compared with the predictions for the secondary structure of those receptors. From this analysis, we postulate a common origin and propose an hypothesis for the divergence from the immunoglobulin superfamily.

  16. Genome-wide identification of structural variants in genes encoding drug targets

    DEFF Research Database (Denmark)

    Rasmussen, Henrik Berg; Dahmcke, Christina Mackeprang

    2012-01-01

    The objective of the present study was to identify structural variants of drug target-encoding genes on a genome-wide scale. We also aimed at identifying drugs that are potentially amenable for individualization of treatments based on knowledge about structural variation in the genes encoding...

  17. Functional understanding of the diverse exon-intron structures of human GPCR genes.

    Science.gov (United States)

    Hammond, Dorothy A; Olman, Victor; Xu, Ying

    2014-02-01

    The GPCR genes have a variety of exon-intron structures even though their proteins are all structurally homologous. We have examined all human GPCR genes with at least two functional protein isoforms, totaling 199, aiming to gain an understanding of what may have contributed to the large diversity of the exon-intron structures of the GPCR genes. The 199 genes have a total of 808 known protein splicing isoforms with experimentally verified functions. Our analysis reveals that 1,301 (80.6%) adjacent exon-exon pairs out of the total of 1,613 in the 199 genes have either exactly one exon skipped or the intron in-between retained in at least one of the 808 protein splicing isoforms. This observation has a statistical significance p-value of 2.051762e-09, assuming that the observed splicing isoforms are independent of the exon-intron structures. Our interpretation of this observation is that the exon boundaries of the GPCR genes are not randomly determined; instead they may be selected to facilitate specific alternative splicing for functional purposes.

  18. Recurring Necrotic Enteritis Outbreaks in Commercial Broiler Chicken Flocks Strongly Influence Toxin Gene Carriage and Species Richness in the Resident Clostridium perfringens Population

    Directory of Open Access Journals (Sweden)

    Marie-Lou Gaucher

    2017-05-01

    Extensive use of antibiotic growth promoters (AGPs) in food animals has been questioned due to the globally increasing problem of antibiotic resistance. For the poultry industry, digestive health management following AGP withdrawal in Europe has been a challenge, especially the control of necrotic enteritis. Much research work has focused on gut health in commercial broiler chicken husbandry. Understanding the behavior of Clostridium perfringens in its ecological niche, the poultry barn, is key to a sustainable and cost-effective production in the absence of AGPs. Using polymerase chain reaction and pulsed-field gel electrophoresis, we evaluated how the C. perfringens population evolved in drug-free commercial broiler chicken farms, either healthy or affected with recurring clinical necrotic enteritis outbreaks, over a 14-month period. We show that a high genotypic richness was associated with an increased risk of clinical necrotic enteritis. Also, necrotic enteritis-affected farms had a significant reduction of C. perfringens genotypic richness over time, an increase in the proportion of C. perfringens strains harboring the cpb2 gene, the netB gene, or both. Thus, necrotic enteritis occurrence is correlated with the presence of an initial highly diverse C. perfringens population, increasing the opportunity for the selective sweep of particularly virulent genotypes. Disease outbreaks also appear to largely influence the evolution of this bacterial species in poultry farms over time.

  19. Recurring Necrotic Enteritis Outbreaks in Commercial Broiler Chicken Flocks Strongly Influence Toxin Gene Carriage and Species Richness in the Resident Clostridium perfringens Population

    Science.gov (United States)

    Gaucher, Marie-Lou; Perron, Gabriel G.; Arsenault, Julie; Letellier, Ann; Boulianne, Martine; Quessy, Sylvain

    2017-01-01

    Extensive use of antibiotic growth promoters (AGPs) in food animals has been questioned due to the globally increasing problem of antibiotic resistance. For the poultry industry, digestive health management following AGP withdrawal in Europe has been a challenge, especially the control of necrotic enteritis. Much research work has focused on gut health in commercial broiler chicken husbandry. Understanding the behavior of Clostridium perfringens in its ecological niche, the poultry barn, is key to a sustainable and cost-effective production in the absence of AGPs. Using polymerase chain reaction and pulsed-field gel electrophoresis, we evaluated how the C. perfringens population evolved in drug-free commercial broiler chicken farms, either healthy or affected with recurring clinical necrotic enteritis outbreaks, over a 14-month period. We show that a high genotypic richness was associated with an increased risk of clinical necrotic enteritis. Also, necrotic enteritis-affected farms had a significant reduction of C. perfringens genotypic richness over time, an increase in the proportion of C. perfringens strains harboring the cpb2 gene, the netB gene, or both. Thus, necrotic enteritis occurrence is correlated with the presence of an initial highly diverse C. perfringens population, increasing the opportunity for the selective sweep of particularly virulent genotypes. Disease outbreaks also appear to largely influence the evolution of this bacterial species in poultry farms over time. PMID:28567032

  20. Recurring Necrotic Enteritis Outbreaks in Commercial Broiler Chicken Flocks Strongly Influence Toxin Gene Carriage and Species Richness in the Resident Clostridium perfringens Population.

    Science.gov (United States)

    Gaucher, Marie-Lou; Perron, Gabriel G; Arsenault, Julie; Letellier, Ann; Boulianne, Martine; Quessy, Sylvain

    2017-01-01

    Extensive use of antibiotic growth promoters (AGPs) in food animals has been questioned due to the globally increasing problem of antibiotic resistance. For the poultry industry, digestive health management following AGP withdrawal in Europe has been a challenge, especially the control of necrotic enteritis. Much research work has focused on gut health in commercial broiler chicken husbandry. Understanding the behavior of Clostridium perfringens in its ecological niche, the poultry barn, is key to a sustainable and cost-effective production in the absence of AGPs. Using polymerase chain reaction and pulsed-field gel electrophoresis, we evaluated how the C. perfringens population evolved in drug-free commercial broiler chicken farms, either healthy or affected with recurring clinical necrotic enteritis outbreaks, over a 14-month period. We show that a high genotypic richness was associated with an increased risk of clinical necrotic enteritis. Also, necrotic enteritis-affected farms had a significant reduction of C. perfringens genotypic richness over time, an increase in the proportion of C. perfringens strains harboring the cpb2 gene, the netB gene, or both. Thus, necrotic enteritis occurrence is correlated with the presence of an initial highly diverse C. perfringens population, increasing the opportunity for the selective sweep of particularly virulent genotypes. Disease outbreaks also appear to largely influence the evolution of this bacterial species in poultry farms over time.

  1. Comparative Annotation of Viral Genomes with Non-Conserved Gene Structure

    DEFF Research Database (Denmark)

    de Groot, Saskia; Mailund, Thomas; Hein, Jotun

    2007-01-01

    Motivation: Detecting genes in viral genomes is a complex task. Due to the biological necessity of them being constrained in length, RNA viruses in particular tend to code in overlapping reading frames. Since one amino acid is encoded by a triplet of nucleic acids, up to three genes may be coded...... allows for coding in unidirectional nested and overlapping reading frames, to annotate two homologous aligned viral genomes. Our method does not insist on conserved gene structure between the two sequences, thus making it applicable for the pairwise comparison of more distantly related sequences. Results...... and HIV2, as well as of two different Hepatitis Viruses, attaining results of ~87% sensitivity and ~98.5% specificity. We subsequently incorporate prior knowledge by "knowing" the gene structure of one sequence and annotating the other conditional on it. Boosting accuracy close to perfect we demonstrate...
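
    The point about overlapping reading frames can be made concrete with a naive scan of the three forward frames for open reading frames (ATG to the next in-frame stop codon). This is only a minimal sketch under simplifying assumptions (forward strand only, DNA alphabet, no minimum length); it is not the comparative method described in the abstract, and the function name and example sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq):
    """Return (frame, start, end) for each forward-strand ORF in `seq`.

    An ORF runs from an ATG to the first in-frame stop codon; `end` is
    exclusive and includes the stop codon. Coordinates are 0-based.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open a new ORF at the first ATG seen
            elif start is not None and codon in STOP_CODONS:
                orfs.append((frame, start, i + 3))
                start = None  # close it and keep scanning this frame
    return orfs


if __name__ == "__main__":
    # Toy sequence with one ORF in frame 0 and an overlapping ORF in frame 1.
    print(find_orfs("ATGCATGCCTAACTGA"))
```

    In the toy sequence the two ORFs share positions 4 to 11 while being read in different frames, which is the situation that makes single-frame gene models inadequate for compact viral genomes.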

  2. Comparative genomics of the relationship between gene structure and expression

    NARCIS (Netherlands)

    Ren, X.

    2006-01-01

    The relationship between the structure of genes and their expression is a relatively new aspect of genome organization and regulation. With more genome sequences and expression data becoming available, bioinformatics approaches can help the further elucidation of the relationships between gene

  3. Chromosome structures: reduction of certain problems with unequal gene content and gene paralogs to integer linear programming.

    Science.gov (United States)

    Lyubetsky, Vassily; Gershgorin, Roman; Gorbunov, Konstantin

    2017-12-06

    Chromosome structure is a very limited model of the genome including the information about its chromosomes such as their linear or circular organization, the order of genes on them, and the DNA strand encoding a gene. Gene lengths, nucleotide composition, and intergenic regions are ignored. Although highly incomplete, such structure can be used in many cases, e.g., to reconstruct phylogeny and evolutionary events, to identify gene synteny, regulatory elements and promoters (considering highly conserved elements), etc. Three problems are considered; all assume unequal gene content and the presence of gene paralogs. The distance problem is to determine the minimum number of operations required to transform one chromosome structure into another and the corresponding transformation itself including the identification of paralogs in two structures. We use the DCJ model which is one of the most studied combinatorial rearrangement models. Double-, sesqui-, and single-operations as well as deletion and insertion of a chromosome region are considered in the model; the single ones comprise cut and join. In the reconstruction problem, a phylogenetic tree with chromosome structures in the leaves is given. It is necessary to assign the structures to inner nodes of the tree to minimize the sum of distances between terminal structures of each edge and to identify the mutual paralogs in a fairly large set of structures. A linear algorithm is known for the distance problem without paralogs, while the presence of paralogs makes it NP-hard. If paralogs are allowed but the insertion and deletion operations are missing (and special constraints are imposed), the reduction of the distance problem to integer linear programming is known. Apparently, the reconstruction problem is NP-hard even in the absence of paralogs. The problem of contigs is to find the optimal arrangements for each given set of contigs, which also includes the mutual identification of paralogs. We proved that these

  4. The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

    Science.gov (United States)

    Holland, M J; Holland, J P; Thill, G P; Jackson, K A

    1981-02-10

    Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous, indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing of messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5

  5. Automated Eukaryotic Gene Structure Annotation Using EVidenceModeler and the Program to Assemble Spliced Alignments

    Energy Technology Data Exchange (ETDEWEB)

    Haas, B J; Salzberg, S L; Zhu, W; Pertea, M; Allen, J E; Orvis, J; White, O; Buell, C R; Wortman, J R

    2007-12-10

    EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

  6. Optimal structural inference of signaling pathways from unordered and overlapping gene sets.

    Science.gov (United States)

    Acharya, Lipi R; Judeh, Thair; Wang, Guangdi; Zhu, Dongxiao

    2012-02-15

    A plethora of bioinformatics analysis has led to the discovery of numerous gene sets, which can be interpreted as discrete measurements emitted from latent signaling pathways. Their potential to infer signaling pathway structures, however, has not been sufficiently exploited. Existing methods accommodating discrete data do not explicitly consider signal cascading mechanisms that characterize a signaling pathway. Novel computational methods are thus needed to fully utilize gene sets and broaden the scope from focusing only on pairwise interactions to the more general cascading events in the inference of signaling pathway structures. We propose a gene set based simulated annealing (SA) algorithm for the reconstruction of signaling pathway structures. A signaling pathway structure is a directed graph containing up to a few hundred nodes and many overlapping signal cascades, where each cascade represents a chain of molecular interactions from the cell surface to the nucleus. Gene sets in our context refer to discrete sets of genes participating in signal cascades, the basic building blocks of a signaling pathway, with no prior information about gene orderings in the cascades. From a compendium of gene sets related to a pathway, SA aims to search for signal cascades that characterize the optimal signaling pathway structure. In the search process, the extent of overlap among signal cascades is used to measure the optimality of a structure. Throughout, we treat gene sets as random samples from a first-order Markov chain model. We evaluated the performance of SA in three case studies. In the first study conducted on 83 KEGG pathways, SA demonstrated a significantly better performance than Bayesian network methods. Since both SA and Bayesian network methods accommodate discrete data, use a 'search and score' network learning strategy and output a directed network, they can be compared in terms of performance and computational time. In the second study, we compared SA and

  7. Structure of genes for dermaseptins B, antimicrobial peptides from frog skin. Exon 1-encoded prepropeptide is conserved in genes for peptides of highly different structures and activities.

    Science.gov (United States)

    Vouille, V; Amiche, M; Nicolas, P

    1997-09-01

    We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.

  8. Gene-Transformation-Induced Changes in Chemical Functional Group Features and Molecular Structure Conformation in Alfalfa Plants Co-Expressing Lc-bHLH and C1-MYB Transcriptive Flavanoid Regulatory Genes: Effects of Single-Gene and Two-Gene Insertion.

    Science.gov (United States)

    Heendeniya, Ravindra G; Yu, Peiqiang

    2017-03-20

    Alfalfa (Medicago sativa L.) genotypes transformed with Lc-bHLH and Lc transcription genes were developed with the intention of stimulating proanthocyanidin synthesis in the aerial parts of the plant. To our knowledge, there are no studies on the effect of single-gene and two-gene transformation on chemical functional groups and molecular structure changes in these plants. The objective of this study was to use advanced molecular spectroscopy with multivariate chemometrics to determine chemical functional group intensity and molecular structure changes in alfalfa plants when co-expressing Lc-bHLH and C1-MYB transcriptive flavanoid regulatory genes in comparison with non-transgenic (NT) and AC Grazeland (ACGL) genotypes. The results showed that, compared to the NT genotype, the presence of double genes (Lc and C1) increased ratios of both the area and peak height of protein structural Amide I/II and the height ratio of α-helix to β-sheet. In carbohydrate-related spectral analysis, the double gene-transformed alfalfa genotypes exhibited lower peak heights at 1370, 1240, 1153, and 1020 cm⁻¹ compared to the NT genotype. Furthermore, the effect of double gene transformation on carbohydrate molecular structure was clearly revealed in the principal component analysis of the spectra. In conclusion, single or double transformation of Lc and C1 genes resulted in changing functional groups and molecular structure related to proteins and carbohydrates compared to the NT alfalfa genotype. The current study provided molecular structural information on the transgenic alfalfa plants and provided insight into the impact of transgenes on protein and carbohydrate properties and on changes in their molecular structure.

  9. Community Structure Analysis of Gene Interaction Networks in Duchenne Muscular Dystrophy.

    Directory of Open Access Journals (Sweden)

    Tejaswini Narayanan

    Full Text Available Duchenne Muscular Dystrophy (DMD) is an important pathology associated with the human skeletal muscle and has been studied extensively. Gene expression measurements on skeletal muscle of patients afflicted with DMD provide the opportunity to understand the underlying mechanisms that lead to the pathology. Community structure analysis is a useful computational technique for understanding and modeling genetic interaction networks. In this paper, we leverage this technique in combination with gene expression measurements from normal and DMD patient skeletal muscle tissue to study the structure of genetic interactions in the context of DMD. We define a novel framework for transforming a raw dataset of gene expression measurements into an interaction network, and subsequently apply algorithms for community structure analysis for the extraction of topological communities. The emergent communities are analyzed from a biological standpoint in terms of their constituent biological pathways, and an interpretation that draws correlations between functional and structural organization of the genetic interactions is presented. We also compare these communities and associated functions in pathology against those in normal human skeletal muscle. In particular, differential enhancements are observed in the following pathways between pathological and normal cases: Metabolic, Focal adhesion, Regulation of actin cytoskeleton and Cell adhesion, and the implication of these mechanisms is supported by prior work. Furthermore, our study also includes a gene-level analysis to identify genes that are involved in the coupling between the pathways of interest. We believe that our results serve to highlight important distinguishing features in the structural/functional organization of constituent biological pathways, as it relates to normal and DMD cases, and provide the mechanistic basis for further biological investigations into specific pathways differently regulated

  10. Analysis of Gene Expression Variance in Schizophrenia Using Structural Equation Modeling

    Directory of Open Access Journals (Sweden)

    Anna A. Igolkina

    2018-06-01

    Full Text Available Schizophrenia (SCZ) is a psychiatric disorder of unknown etiology. There is evidence suggesting that aberrations in neurodevelopment are a significant attribute of schizophrenia pathogenesis and progression. To identify biologically relevant molecular abnormalities affecting neurodevelopment in SCZ we used cultured neural progenitor cells derived from olfactory neuroepithelium (CNON) cells. Here, we tested the hypothesis that variance in gene expression differs between individuals from SCZ and control groups. In CNON cells, variance in gene expression was significantly higher in SCZ samples in comparison with control samples. Variance in gene expression was enriched in five molecular pathways: serine biosynthesis, PI3K-Akt, MAPK, neurotrophin and focal adhesion. More than 14% of variance in disease status was explained within the logistic regression model (C-value = 0.70) by predictors accounting for gene expression in 69 genes from these five pathways. Structural equation modeling (SEM) was applied to explore how the structure of these five pathways was altered between SCZ patients and controls. Four out of five pathways showed differences in the estimated relationships among genes: between KRAS and NF1, and KRAS and SOS1 in the MAPK pathway; between PSPH and SHMT2 in serine biosynthesis; between AKT3 and TSC2 in the PI3K-Akt signaling pathway; and between CRK and RAPGEF1 in the focal adhesion pathway. Our analysis provides evidence that variance in gene expression is an important characteristic of SCZ, and SEM is a promising method for uncovering altered relationships between specific genes, thus suggesting affected gene regulation associated with the disease. We identified altered gene-gene interactions in pathways enriched for genes with increased variance in expression in SCZ. These pathways and loci were previously implicated in SCZ, providing further support for the hypothesis that gene expression variance plays an important role in the etiology

  11. Phylogenetics and Gene Structure Dynamics of Polygalacturonase Genes in Aspergillus and Neurospora crassa

    Directory of Open Access Journals (Sweden)

    Jin-Sung Hong

    2013-09-01

    Full Text Available The polygalacturonase (PG) gene family is a typical gene family present in eukaryotes. Forty-nine PGs were mined from the genomes of Neurospora crassa and five Aspergillus species. The PGs were classified into three clades (clade 1 for rhamno-PGs, clade 2 for exo-PGs and clade 3 for exo- and endo-PGs), which were further grouped into 13 sub-clades based on polypeptide sequence similarity. In gene structure analysis, a total of 124 introns were present in 44 genes and five genes lacked introns, giving an average of 2.5 introns per gene. Intron phase distribution was 64.5% for phase 0, 21.8% for phase 1, and 13.7% for phase 2. The introns varied in their sequences and their lengths ranged from 20 bp to 424 bp with an average of 65.9 bp, which is approximately half the size of introns in other fungal genes. There were 29 homologous intron blocks and 26 of those were sub-clade specific. Intron losses were counted in 18 introns, with no obvious phase preference for intron loss observed. Eighteen introns were placed at novel positions, which is considerably higher than in plant PGs. In an evolutionary sense, both intron loss and gain must have taken place in shaping the current PGs in these fungi. Together with the small intron size, low conservation of homologous intron blocks and higher number of novel introns, PGs of fungal species seem to have recently undergone highly dynamic evolution.
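
    For readers unfamiliar with the term, intron phase is the position within a codon at which an intron interrupts the coding sequence: phase 0 falls between codons, phase 1 after the first base of a codon, and phase 2 after the second. The sketch below computes phases from coding exon lengths and rechecks the summary figures quoted in the abstract. The example gene is hypothetical, and the per-phase counts (80, 27, 17) are inferred here from the reported percentages rather than stated in the abstract.

```python
def intron_phases(cds_exon_lengths):
    """Return the phase of each intron, given the coding length (bp)
    of each exon in transcript order. Phase = coding bases seen so far mod 3."""
    phases = []
    coding_so_far = 0
    for length in cds_exon_lengths[:-1]:  # no intron follows the last exon
        coding_so_far += length
        phases.append(coding_so_far % 3)
    return phases


# Hypothetical gene with four coding exons (lengths in bp).
example_phases = intron_phases([120, 65, 101, 200])

# Figures from the abstract: 124 introns across 49 genes.
total_introns = 124
total_genes = 49
mean_introns_per_gene = round(total_introns / total_genes, 1)

# Counts inferred from the reported percentages (an assumption, see lead-in).
inferred_counts = {0: 80, 1: 27, 2: 17}
percentages = {
    phase: round(100 * count / total_introns, 1)
    for phase, count in inferred_counts.items()
}

print(example_phases, mean_introns_per_gene, percentages)
```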

  12. WebScipio: An online tool for the determination of gene structures using protein sequences

    Directory of Open Access Journals (Sweden)

    Waack Stephan

    2008-09-01

    Full Text Available Background: Obtaining the gene structure for a given protein-encoding gene is an important step in many analyses. Software suited for this task should be readily accessible, accurate, easy to handle and should provide the user with a coherent representation of the most probable gene structure. It should be rigorous enough to optimise features on the level of single bases and at the same time flexible enough to allow for cross-species searches. Results: WebScipio, a web interface to the Scipio software, allows a user to obtain the corresponding coding sequence structure of a given query protein sequence that belongs to an already assembled eukaryotic genome. The resulting gene structure is presented in various human-readable formats, such as a schematic representation and a detailed alignment of the query and the target sequence highlighting any discrepancies. WebScipio can also be used to identify and characterise the gene structures of homologs in related organisms. In addition, it offers a web service for integration with other programs. Conclusion: WebScipio is a tool that allows users to get a high-quality gene structure prediction from a protein query. It offers more than 250 eukaryotic genomes that can be searched and produces predictions that are close to what can be achieved by manual annotation, for in-species and cross-species searches alike. WebScipio is freely accessible at http://www.webscipio.org.

  13. Identification of Enzyme Genes Using Chemical Structure Alignments of Substrate-Product Pairs.

    Science.gov (United States)

    Moriya, Yuki; Yamada, Takuji; Okuda, Shujiro; Nakagawa, Zenichi; Kotera, Masaaki; Tokimatsu, Toshiaki; Kanehisa, Minoru; Goto, Susumu

    2016-03-28

    Although there are several databases that contain data on many metabolites and reactions in biochemical pathways, there is still a big gap between the numbers of experimentally identified enzymes and metabolites. It is supposed that many catalytic enzyme genes are still unknown. Although there are previous studies that estimate the number of candidate enzyme genes, these studies required some additional information aside from the structures of metabolites, such as gene expression and order in the genome. In this study, we developed a novel method to identify a candidate enzyme gene of a reaction using the chemical structures of the substrate-product pair (reactant pair). The proposed method is based on a search for similar reactant pairs in a reference database and offers ortholog groups that possibly mediate the given reaction. We applied the proposed method to two experimentally validated reactions. As a result, we confirmed that the histidine transaminase was correctly identified. Although our method could not directly identify the asparagine oxo-acid transaminase, we successfully found the paralog gene most similar to the correct enzyme gene. We also applied our method to infer candidate enzyme genes in the mesaconate pathway. The advantage of our method lies in the prediction of possible genes for orphan enzyme reactions for which no associated gene sequences have yet been determined. We believe that this approach will facilitate experimental identification of genes for orphan enzymes.

  14. Quantitative Structure-Activity Relationships and Docking Studies of Calcitonin Gene-Related Peptide Antagonists

    DEFF Research Database (Denmark)

    Jenssen, Håvard; Mehrabian, Mohadeseh; Kyani, Anahita

    2012-01-01

    Defining the role of calcitonin gene-related peptide in migraine pathogenesis could lead to the application of calcitonin gene-related peptide antagonists as novel migraine therapeutics. In this work, quantitative structure-activity relationship modeling of biological activities of a large range of calcitonin gene-related peptide antagonists was performed using a panel of physicochemical descriptors. The computational studies evaluated different variable selection techniques and demonstrated shuffling stepwise multiple linear regression to be superior over genetic algorithm-multiple linear regression. The linear quantitative structure-activity relationship model revealed better statistical parameters of cross-validation in comparison with the non-linear support vector regression technique. Implementing only five peptide descriptors into this linear quantitative structure-activity relationship model...

  15. Structure of the human hepatic triglyceride lipase gene

    International Nuclear Information System (INIS)

    Cai, Shengjian; Wong, D.M.; Chen, Sanhwan; Chan, L.

    1989-01-01

    The structure of the human hepatic triglyceride lipase gene was determined from multiple cosmid clones. All the exons, exon-intron junctions, and 845 bp of the 5' and 254 bp of the 3' flanking DNA were sequenced. Comparison of the exon sequences to three previously published cDNA sequences revealed differences in the sequence of the codons for residues 133, 193, 202, and 234 that may represent sequence polymorphisms. By primer extension, hepatic lipase mRNA initiates at an adenine 77 bases upstream of the translation initiation site. The hepatic lipase gene spans over 60 kb containing 9 exons and 8 introns, the latter being all located within the region encoding the mature protein. The exons are all of average size (118-234 bp). Exon 1 encodes the signal peptide, exon 4, a region that binds to the lipoprotein substrate, and exon 5, an evolutionarily highly conserved region of potential catalytic function, and exons 6 and 9 encode sequences rich in basic amino acids thought to be important in anchoring the enzyme to the endothelial surface by interacting with acidic domains of the surface glycosaminoglycans. The human lipoprotein lipase gene has been recently reported to have an identical exon-intron organization containing the analogous structural domains. The observations strongly support the common evolutionary origin of these two lipolytic enzymes.

  16. Improvisation in evolution of genes and genomes: whose structure is it anyway?

    Science.gov (United States)

    Shakhnovich, Boris E; Shakhnovich, Eugene I

    2008-06-01

    Significant progress has been made in recent years in a variety of seemingly unrelated fields such as sequencing, protein structure prediction, and high-throughput transcriptomics and metabolomics. At the same time, new microscopic models have been developed that made it possible to analyze the evolution of genes and genomes from first principles. The results from these efforts enable, for the first time, a comprehensive insight into the evolution of complex systems and organisms on all scales--from sequences to organisms and populations. Every newly sequenced genome uncovers new genes, families, and folds. Where do these new genes come from? How do gene duplication and subsequent divergence of sequence and structure affect the fitness of the organism? What role does regulation play in the evolution of proteins and folds? Emerging synergism between data and modeling provides first robust answers to these questions.

  17. Structural organization of glycophorin A and B genes: Glycophorin B gene evolved by homologous recombination at Alu repeat sequences

    International Nuclear Information System (INIS)

    Kudo, Shinichi; Fukuda, Minoru

    1989-01-01

    Glycophorins A (GPA) and B (GPB) are two major sialoglycoproteins of the human erythrocyte membrane. Here the authors present a comparison of the genomic structures of GPA and GPB developed by analyzing DNA clones isolated from a K562 genomic library. Nucleotide sequences of exon-intron junctions and 5' and 3' flanking sequences revealed that the GPA and GPB genes consist of 7 and 5 exons, respectively, and both genes have >95% identical sequence from the 5' flanking region to the region ∼ 1 kilobase downstream from the exon encoding the transmembrane regions. In this homologous part of the genes, GPB lacks one exon due to a point mutation at the 5' splicing site of the third intron, which inactivates the 5' cleavage event of splicing and leads to ligation of the second to the fourth exon. Following these very homologous sequences, the genomic sequences for GPA and GPB diverge significantly and no homology can be detected in their 3' end sequences. The analysis of the Alu sequences and their flanking direct repeat sequences suggest that an ancestral genomic structure has been maintained in the GPA gene, whereas the GPB gene has arisen from the acquisition of 3' sequences different from those of the GPA gene by homologous recombination at the Alu repeats during or after gene duplication

  18. Structure models of G72, the product of a susceptibility gene to schizophrenia.

    Science.gov (United States)

    Kato, Yusuke; Fukui, Kiyoshi

    2017-02-01

    The G72 gene is one of the most susceptible genes to schizophrenia and is contained exclusively in the genomes of primates. The product of the G72 gene modulates the activity of D-amino acid oxidase (DAO) and is a small protein prone to aggregate, which hampers its structural studies. In addition, lack of a known structure of a homologue makes it difficult to use the homology modelling method for the prediction of the structure. Thus, we first developed a hybrid ab initio approach for small proteins prior to the prediction of the structure of G72. The approach uses three known ab initio algorithms. To evaluate the hybrid approach, we tested our prediction of the structure of the amino acid sequences whose structures were already solved and compared the predicted structures with the experimentally solved structures. Based on these comparisons, the average accuracy of our approach was calculated to be ∼5 Å. We then applied the approach to the sequence of G72 and successfully predicted the structures of the N- and C-terminal domains (ND and CD, respectively) of G72. The predicted structures of ND and CD were similar to membrane-bound proteins and adaptor proteins, respectively. © The Authors 2016. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.

  19. Recognizing genes and other components of genomic structure

    Energy Technology Data Exchange (ETDEWEB)

    Burks, C. (Los Alamos National Lab., NM (USA)); Myers, E. (Arizona Univ., Tucson, AZ (USA). Dept. of Computer Science); Stormo, G.D. (Colorado Univ., Boulder, CO (USA). Dept. of Molecular, Cellular and Developmental Biology)

    1991-01-01

    The Aspen Center for Physics (ACP) sponsored a three-week workshop, with 26 scientists participating, from 28 May to 15 June, 1990. The workshop, entitled Recognizing Genes and Other Components of Genomic Structure, focussed on discussion of current needs and future strategies for developing the ability to identify and predict the presence of complex functional units on sequenced, but otherwise uncharacterized, genomic DNA. We addressed the need for computationally-based, automatic tools for synthesizing available data about individual consensus sequences and local compositional patterns into the composite objects (e.g., genes) that are, as composite entities, the true object of interest when scanning DNA sequences. The workshop was structured to promote sustained informal contact and exchange of expertise between molecular biologists, computer scientists, and mathematicians. No participant stayed for less than one week, and most attended for two or three weeks. Computers, software, and databases were available for use as "electronic blackboards" and as the basis for collaborative exploration of ideas being discussed and developed at the workshop. 23 refs., 2 tabs.

  20. Evolutionary Relationship and Structural Characterization of the EPF/EPFL Gene Family

    OpenAIRE

    Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu

    2013-01-01

    EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that...

  1. Optimization of cationic lipid mediated gene transfer: structure-function, physico-chemical, and cellular studies.

    Science.gov (United States)

    Carrière, Marie; Tranchant, Isabelle; Niore, Pierre-Antoine; Byk, Gerardo; Mignet, Nathalie; Escriou, Virginie; Scherman, Daniel; Herscovici, Jean

    2002-01-01

    The rational design of cationic lipids aimed at enhancing cationic lipid mediated gene transfer is discussed. These improvements are based on direct evaluation of the structure-activity relationship and on the introduction of new structures. Much attention has been given to the supramolecular structures of the lipid/DNA complexes, to the effect of serum on gene transfer and to the intracellular trafficking of the lipoplexes. Finally, a new avenue using reducible cationic lipids is discussed.

  2. Gene structure, phylogeny and expression profile of the sucrose synthase gene family in cacao (Theobroma cacao L.).

    Science.gov (United States)

    Li, Fupeng; Hao, Chaoyun; Yan, Lin; Wu, Baoduo; Qin, Xiaowei; Lai, Jianxiong; Song, Yinghui

    2015-09-01

    In higher plants, sucrose synthase (Sus, EC 2.4.1.13) is widely considered a key enzyme involved in sucrose metabolism. Although several paralogous genes encoding different isozymes of Sus have been identified and characterized in multiple plant genomes, to date detailed information about the Sus genes is lacking for cacao. This study reports the identification of six novel Sus genes from the economically important cacao tree. Analyses of the gene structure and phylogeny of the Sus genes demonstrated evolutionary conservation in the Sus family across cacao and other plant species. The expression of cacao Sus genes was investigated via real-time PCR in various tissues and in different developmental phases of leaf, flower bud and pod. The Sus genes exhibited distinct but partially redundant expression profiles in cacao, with TcSus1, TcSus5 and TcSus6 being the predominant genes in the bark with phloem, and TcSus2 predominantly expressed in the seed during the stereotype stage. TcSus3 and TcSus4 were detected at significantly higher levels in the pod husk and seed coat during pod development, and showed development-dependent expression profiles in the cacao pod. These results provide new insights into the evolution of the cacao Sus gene family and basic information that will assist in elucidating its functions.

  3. Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

    Science.gov (United States)

    Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

    1993-01-01

    Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. PMID:8497043

  4. The mammalian adult neurogenesis gene ontology (MANGO) provides a structural framework for published information on genes regulating adult hippocampal neurogenesis.

    Directory of Open Access Journals (Sweden)

    Rupert W Overall

    Full Text Available BACKGROUND: Adult hippocampal neurogenesis is not a single phenotype, but consists of a number of sub-processes, each of which is under complex genetic control. Interpretation of gene expression studies using existing resources often does not lead to results that address the interrelatedness of these processes. Formal structure, such as provided by ontologies, is essential in any field for comprehensive interpretation of existing knowledge but, until now, such a structure has been lacking for adult neurogenesis. METHODOLOGY/PRINCIPAL FINDINGS: We have created a resource with three components: 1. A structured ontology describing the key stages in the development of adult hippocampal neural stem cells into functional granule cell neurons. 2. A comprehensive survey of the literature to annotate the results of all published reports on gene function in adult hippocampal neurogenesis (257 manuscripts covering 228 genes) to the appropriate terms in our ontology. 3. An easy-to-use searchable interface to the resulting database made freely available online. The manuscript presents an overview of the database highlighting global trends such as the current bias towards research on early proliferative stages, and an example gene set enrichment analysis. A limitation of the resource is the current scope of the literature which, however, is growing by around 100 publications per year. With the ontology and database in place, new findings can be rapidly annotated and regular updates of the database will be made publicly available. CONCLUSIONS/SIGNIFICANCE: The resource we present allows relevant interpretation of gene expression screens in terms of defined stages of postnatal neuronal development. Annotation of genes by hand from the adult neurogenesis literature ensures the data are directly applicable to the system under study. We believe this approach could also serve as an example to other fields in a 'bottom-up' community effort complementing the already

  5. Population structuring of multi-copy, antigen-encoding genes in Plasmodium falciparum

    Science.gov (United States)

    Artzy-Randrup, Yael; Rorick, Mary M; Day, Karen; Chen, Donald; Dobson, Andrew P; Pascual, Mercedes

    2012-01-01

    The coexistence of multiple independently circulating strains in pathogen populations that undergo sexual recombination is a central question of epidemiology with profound implications for control. An agent-based model is developed that extends earlier ‘strain theory’ by addressing the var gene family of Plasmodium falciparum. The model explicitly considers the extensive diversity of multi-copy genes that undergo antigenic variation via sequential, mutually exclusive expression. It tracks the dynamics of all unique var repertoires in a population of hosts, and shows that even under high levels of sexual recombination, strain competition mediated through cross-immunity structures the parasite population into a subset of coexisting dominant repertoires of var genes whose degree of antigenic overlap depends on transmission intensity. Empirical comparison of patterns of genetic variation at antigenic and neutral sites supports this role for immune selection in structuring parasite diversity. DOI: http://dx.doi.org/10.7554/eLife.00093.001 PMID:23251784

  6. Relationships among msx gene structure and function in zebrafish and other vertebrates.

    Science.gov (United States)

    Ekker, M; Akimenko, M A; Allende, M L; Smith, R; Drouin, G; Langille, R M; Weinberg, E S; Westerfield, M

    1997-10-01

    The zebrafish genome contains at least five msx homeobox genes, msxA, msxB, msxC, msxD, and the newly isolated msxE. Although these genes share structural features common to all Msx genes, phylogenetic analyses of protein sequences indicate that the msx genes from zebrafish are not orthologous to the Msx1 and Msx2 genes of mammals, birds, and amphibians. The zebrafish msxB and msxC are more closely related to each other and to the mouse Msx3. Similarly, although the combinatorial expression of the zebrafish msx genes in the embryonic dorsal neuroectoderm, visceral arches, fins, and sensory organs suggests functional similarities with the Msx genes of other vertebrates, differences in the expression patterns preclude precise assignment of orthological relationships. Distinct duplication events may have given rise to the msx genes of modern fish and other vertebrate lineages whereas many aspects of msx gene functions during embryonic development have been preserved.

  7. Gene Structures, Evolution and Transcriptional Profiling of the WRKY Gene Family in Castor Bean (Ricinus communis L.).

    Science.gov (United States)

    Zou, Zhi; Yang, Lifu; Wang, Danhua; Huang, Qixing; Mo, Yeyong; Xie, Guishui

    2016-01-01

    WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceous plants.

  8. A framework for scalable parameter estimation of gene circuit models using structural information

    KAUST Repository

    Kuwahara, Hiroyuki

    2013-06-21

    Motivation: Systematic and scalable parameter estimation is a key to construct complex gene regulatory models and to ultimately facilitate an integrative systems biology approach to quantitatively understand the molecular mechanisms underpinning gene regulation. Results: Here, we report a novel framework for efficient and scalable parameter estimation that focuses specifically on modeling of gene circuits. Exploiting the structure commonly found in gene circuit models, this framework decomposes a system of coupled rate equations into individual ones and efficiently integrates them separately to reconstruct the mean time evolution of the gene products. The accuracy of the parameter estimates is refined by iteratively increasing the accuracy of numerical integration using the model structure. As a case study, we applied our framework to four gene circuit models with complex dynamics based on three synthetic datasets and one time series microarray data set. We compared our framework to three state-of-the-art parameter estimation methods and found that our approach consistently generated higher quality parameter solutions efficiently. Although many general-purpose parameter estimation methods have been applied for modeling of gene circuits, our results suggest that the use of more tailored approaches to use domain-specific information may be a key to reverse engineering of complex biological systems. The Author 2013.

  9. A framework for scalable parameter estimation of gene circuit models using structural information

    KAUST Repository

    Kuwahara, Hiroyuki; Fan, Ming; Wang, Suojin; Gao, Xin

    2013-01-01

    Motivation: Systematic and scalable parameter estimation is a key to construct complex gene regulatory models and to ultimately facilitate an integrative systems biology approach to quantitatively understand the molecular mechanisms underpinning gene regulation. Results: Here, we report a novel framework for efficient and scalable parameter estimation that focuses specifically on modeling of gene circuits. Exploiting the structure commonly found in gene circuit models, this framework decomposes a system of coupled rate equations into individual ones and efficiently integrates them separately to reconstruct the mean time evolution of the gene products. The accuracy of the parameter estimates is refined by iteratively increasing the accuracy of numerical integration using the model structure. As a case study, we applied our framework to four gene circuit models with complex dynamics based on three synthetic datasets and one time series microarray data set. We compared our framework to three state-of-the-art parameter estimation methods and found that our approach consistently generated higher quality parameter solutions efficiently. Although many general-purpose parameter estimation methods have been applied for modeling of gene circuits, our results suggest that the use of more tailored approaches to use domain-specific information may be a key to reverse engineering of complex biological systems. The Author 2013.

  10. Characterization of the linkage disequilibrium structure and identification of tagging-SNPs in five DNA repair genes

    International Nuclear Information System (INIS)

    Allen-Brady, Kristina; Camp, Nicola J

    2005-01-01

    Characterization of the linkage disequilibrium (LD) structure of candidate genes is the basis for an effective association study of complex diseases such as cancer. In this study, we report the LD and haplotype architecture and tagging-single nucleotide polymorphisms (tSNPs) for five DNA repair genes: ATM, MRE11A, XRCC4, NBS1 and RAD50. The genes ATM, MRE11A, and XRCC4 were characterized using a panel of 94 unrelated female subjects (47 breast cancer cases, 47 controls) obtained from high-risk breast cancer families. A similar LD structure and tSNP analysis was performed for NBS1 and RAD50, using publicly available genotyping data. We studied a total of 61 SNPs at an average marker density of 10 kb. Using a matrix decomposition algorithm, based on principal component analysis, we captured >90% of the intragenetic variation for each gene. Our results revealed that three of the five genes did not conform to a haplotype block structure (MRE11A, RAD50 and XRCC4). Instead, the data fit a more flexible LD group paradigm, where SNPs in high LD are not required to be contiguous. Traditional haplotype blocks assume recombination is the only dynamic at work. For ATM, MRE11A and XRCC4 we repeated the analysis in cases and controls separately to determine whether LD structure was consistent across breast cancer cases and controls. No substantial difference in LD structures was found. This study suggests that appropriate SNP selection for an association study involving candidate genes should allow for both mutation and recombination, which shape the population-level genomic structure. Furthermore, LD structure characterization in either breast cancer cases or controls appears to be sufficient for future cancer studies utilizing these genes
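
    The "LD group" idea in the record above (SNPs in high LD need not be contiguous) can be illustrated with a minimal sketch. It does not reproduce the authors' principal-component method. It only computes the standard pairwise r^2 statistic on a small set of hypothetical phased haplotypes; the function name, the data and the 0.8 cut-off are all assumptions made for illustration.

```python
from itertools import combinations


def r_squared(haplotypes, i, j):
    """Pairwise LD (r^2) between biallelic SNPs i and j.

    `haplotypes` is a list of phased haplotypes, each a tuple of 0/1 alleles.
    """
    n = len(haplotypes)
    p_a = sum(h[i] for h in haplotypes) / n          # allele-1 frequency at SNP i
    p_b = sum(h[j] for h in haplotypes) / n          # allele-1 frequency at SNP j
    p_ab = sum(1 for h in haplotypes if h[i] == 1 and h[j] == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0  # a monomorphic SNP: r^2 is undefined, reported as 0 here
    d = p_ab - p_a * p_b                             # disequilibrium coefficient D
    return d * d / denom


# Hypothetical data: SNP 0 and SNP 2 always co-occur, SNP 1 is independent.
haplotypes = [
    (0, 0, 0),
    (0, 1, 0),
    (1, 0, 1),
    (1, 1, 1),
]

THRESHOLD = 0.8  # assumed cut-off for "high LD"
high_ld_pairs = [
    (i, j)
    for i, j in combinations(range(3), 2)
    if r_squared(haplotypes, i, j) >= THRESHOLD
]
print(high_ld_pairs)
```

    In this toy data set the only high-LD pair is made of the first and third SNPs, which are separated by an unlinked SNP. A strict haplotype-block model would not group them together.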

  11. A Subset of Autism-Associated Genes Regulate the Structural Stability of Neurons

    Science.gov (United States)

    Lin, Yu-Chih; Frei, Jeannine A.; Kilander, Michaela B. C.; Shen, Wenjuan; Blatt, Gene J.

    2016-01-01

    Autism spectrum disorder (ASD) comprises a range of neurological conditions that affect individuals’ ability to communicate and interact with others. People with ASD often exhibit marked qualitative difficulties in social interaction, communication, and behavior. Alterations in neurite arborization and dendritic spine morphology, including size, shape, and number, are hallmarks of almost all neurological conditions, including ASD. As experimental evidence emerges in recent years, it becomes clear that although there is broad heterogeneity of identified autism risk genes, many of them converge into similar cellular pathways, including those regulating neurite outgrowth, synapse formation and spine stability, and synaptic plasticity. These mechanisms together regulate the structural stability of neurons and are vulnerable targets in ASD. In this review, we discuss the current understanding of those autism risk genes that affect the structural connectivity of neurons. We sub-categorize them into (1) cytoskeletal regulators, e.g., motors and small RhoGTPase regulators; (2) adhesion molecules, e.g., cadherins, NCAM, and neurexin superfamily; (3) cell surface receptors, e.g., glutamatergic receptors and receptor tyrosine kinases; (4) signaling molecules, e.g., protein kinases and phosphatases; and (5) synaptic proteins, e.g., vesicle and scaffolding proteins. Although the roles of some of these genes in maintaining neuronal structural stability are well studied, how mutations contribute to the autism phenotype is still largely unknown. Investigating whether and how the neuronal structure and function are affected when these genes are mutated will provide insights toward developing effective interventions aimed at improving the lives of people with autism and their families. PMID:27909399

  12. Primary structure and mapping of the hupA gene of Salmonella typhimurium.

    OpenAIRE

    Higgins, N P; Hillyard, D

    1988-01-01

    In bacteria, the complex nucleoid structure is folded and maintained by negative superhelical tension and a set of type II DNA-binding proteins, also called histonelike proteins. The most abundant type II DNA-binding protein is HU. Southern blot analysis showed that Salmonella typhimurium contained two HU genes that corresponded to Escherichia coli genes hupA (encoding HU-2 protein) and hupB (encoding HU-1). Salmonella hupA was cloned, and the nucleotide sequence of the gene was determined. C...

  13. Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.

    Science.gov (United States)

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2015-01-01

    Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. Given these facts, analysis of the general properties of the structural organization of the functional sites, both at the protein level and at the level of the exon-intron structure of the coding gene, remains a relevant problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. We observed a tendency for the exons that code for functional sites to cluster; such clusters could be considered a unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and

  14. Occurrence of the structural enterocin A, P, B, L50B genes in enterococci of different origin.

    Science.gov (United States)

    Strompfová, Viola; Lauková, Andrea; Simonová, Monika; Marcináková, Miroslava

    2008-12-10

    Enterococci are well-known producers of antimicrobial peptides, the bacteriocins (enterocins), and the number of characterized enterocins has increased significantly. Enterocins have recently attracted great interest for their potential as biopreservatives in food or feed, while research on enterocins as alternative antimicrobials in humans and animals is only beginning. The present study surveys the occurrence of the enterocin structural genes A, P, B and L50B in a set of 427 strains of Enterococcus faecium (368) and Enterococcus faecalis (59) from different sources (animal isolates, food and feed) by PCR. Based on our results, 234 strains possessed one or more enterocin structural gene(s). The genes of enterocin P and enterocin A were the most frequently detected structural genes among the PCR-positive strains (170 and 155 strains, respectively). The frequency of enterocin genes differed according to strain origin; strains from horses and silage showed the highest frequency. All possible combinations of the tested genes occurred at least twice, except the combination of the enterocin B and L50B genes, which no strain possessed. The gene of enterocin A was detected exclusively among E. faecium strains, while the genes of enterocins P, B and L50B were detected in strains of both species, E. faecium and E. faecalis. In conclusion, enterocin structural genes occur at high frequency and with high variability among enterococci of different origin, which offers a good chance of finding effective bacteriocin-producing strains for application in veterinary medicine.

  15. DNA breaks and chromatin structural changes enhance the transcription of autoimmune regulator target genes.

    Science.gov (United States)

    Guha, Mithu; Saare, Mario; Maslovskaja, Julia; Kisand, Kai; Liiv, Ingrid; Haljasorg, Uku; Tasa, Tõnis; Metspalu, Andres; Milani, Lili; Peterson, Pärt

    2017-04-21

    The autoimmune regulator (AIRE) protein is the key factor in thymic negative selection of autoreactive T cells by promoting the ectopic expression of tissue-specific genes in the thymic medullary epithelium. Mutations in AIRE cause a monogenic autoimmune disease called autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy. AIRE has been shown to promote DNA breaks via its interaction with topoisomerase 2 (TOP2). In this study, we investigated topoisomerase-induced DNA breaks and chromatin structural alterations in conjunction with AIRE-dependent gene expression. Using RNA sequencing, we found that inhibition of TOP2 religation activity by etoposide in AIRE-expressing cells had a synergistic effect on genes with low expression levels. AIRE-mediated transcription was not only enhanced by TOP2 inhibition but also by the TOP1 inhibitor camptothecin. The transcriptional activation was associated with structural rearrangements in chromatin, notably the accumulation of γH2AX and the exchange of histone H1 with HMGB1 at AIRE target gene promoters. In addition, we found the transcriptional up-regulation to co-occur with the chromatin structural changes within the genomic cluster of carcinoembryonic antigen-like cellular adhesion molecule genes. Overall, our results suggest that the presence of AIRE can trigger molecular events leading to an altered chromatin landscape and the enhanced transcription of low-expressed genes. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  16. Differential accumulation of nif structural gene mRNA in Azotobacter vinelandii.

    Science.gov (United States)

    Hamilton, Trinity L; Jacobson, Marty; Ludwig, Marcus; Boyd, Eric S; Bryant, Donald A; Dean, Dennis R; Peters, John W

    2011-09-01

    Northern analysis was employed to investigate mRNA produced by mutant strains of Azotobacter vinelandii with defined deletions in the nif structural genes and in the intergenic noncoding regions. The results indicate that intergenic RNA secondary structures effect the differential accumulation of transcripts, supporting the high Fe protein-to-MoFe protein ratio required for optimal diazotrophic growth.

  17. Evolutionary relationship and structural characterization of the EPF/EPFL gene family.

    Science.gov (United States)

    Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu

    2013-01-01

    EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes.

  18. A framework for scalable parameter estimation of gene circuit models using structural information.

    Science.gov (United States)

    Kuwahara, Hiroyuki; Fan, Ming; Wang, Suojin; Gao, Xin

    2013-07-01

    Systematic and scalable parameter estimation is a key to construct complex gene regulatory models and to ultimately facilitate an integrative systems biology approach to quantitatively understand the molecular mechanisms underpinning gene regulation. Here, we report a novel framework for efficient and scalable parameter estimation that focuses specifically on modeling of gene circuits. Exploiting the structure commonly found in gene circuit models, this framework decomposes a system of coupled rate equations into individual ones and efficiently integrates them separately to reconstruct the mean time evolution of the gene products. The accuracy of the parameter estimates is refined by iteratively increasing the accuracy of numerical integration using the model structure. As a case study, we applied our framework to four gene circuit models with complex dynamics based on three synthetic datasets and one time series microarray data set. We compared our framework to three state-of-the-art parameter estimation methods and found that our approach consistently generated higher quality parameter solutions efficiently. Although many general-purpose parameter estimation methods have been applied for modeling of gene circuits, our results suggest that the use of more tailored approaches to use domain-specific information may be a key to reverse engineering of complex biological systems. http://sfb.kaust.edu.sa/Pages/Software.aspx. Supplementary data are available at Bioinformatics online.

  19. Correlations in the population structure of music, genes and language

    Science.gov (United States)

    Brown, Steven; Savage, Patrick E.; Ko, Albert Min-Shan; Stoneking, Mark; Ko, Ying-Chin; Loo, Jun-Hun; Trejaut, Jean A.

    2014-01-01

    We present, to our knowledge, the first quantitative evidence that music and genes may have coevolved by demonstrating significant correlations between traditional group-level folk songs and mitochondrial DNA variation among nine indigenous populations of Taiwan. These correlations were of comparable magnitude to those between language and genes for the same populations, although music and language were not significantly correlated with one another. An examination of population structure for genetics showed stronger parallels to music than to language. Overall, the results suggest that music might have a sufficient time-depth to retrace ancient population movements and, additionally, that it might be capturing different aspects of population history than language. Music may therefore have the potential to serve as a novel marker of human migrations to complement genes, language and other markers. PMID:24225453

  20. Structural and functional organization of ribosomal genes within the mammalian cell nucleolus.

    Science.gov (United States)

    Derenzini, Massimo; Pasquinelli, Gianandrea; O'Donohue, Marie-Françoise; Ploton, Dominique; Thiry, Marc

    2006-02-01

    Data on the in situ structural-functional organization of ribosomal genes in the mammalian cell nucleolus are reviewed here. Major findings on chromatin structure in situ come from investigations carried out using the Feulgen-like osmium ammine reaction as a highly specific electron-opaque DNA tracer. Intranucleolar chromatin shows three different levels of organization: compact clumps, fibers ranging from 11 to 30 nm, and loose agglomerates of extended DNA filaments. Both clumps and fibers of chromatin exhibit a nucleosomal organization that is lacking in the loose agglomerates of extended DNA filaments. In fact, these filaments constantly show a thickness of 2-3 nm, the same as a DNA double-helix molecule. The loose agglomerates of DNA filaments are located in the fibrillar centers, the interphase counterpart of metaphase NORs, therefore being constituted by ribosomal DNA. The extended, non-nucleosomal configuration of this rDNA has been shown to be independent of transcriptional activity and characterizes ribosome genes that are either transcribed or transcriptionally silent. Data reviewed are consistent with a model of control for ribosome gene activity that is not mediated by changes in chromatin structure. The presence of rDNA in mammalian cells always structurally ready for transcription might facilitate a more rapid adjustment of the ribosome production in response to the metabolic needs of the cell.

  1. Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eQTL hotspots associated with functional gene modules.

    Science.gov (United States)

    Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P

    2013-03-21

    Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group

  2. The Mycoplasma hominis vaa gene displays a mosaic gene structure

    DEFF Research Database (Denmark)

    Boesen, Thomas; Emmersen, Jeppe M. G.; Jensen, Lise T.

    1998-01-01

    Mycoplasma hominis contains a variable adherence-associated (vaa) gene. To classify variants of the vaa genes, we examined 42 M. hominis isolates by PCR, DNA sequencing and immunoblotting. This uncovered the existence of five gene categories. Comparison of the gene types revealed a modular...

  3. Revised Mimivirus major capsid protein sequence reveals intron-containing gene structure and extra domain

    Directory of Open Access Journals (Sweden)

    Suzan-Monti Marie

    2009-05-01

    Background: Acanthamoebae polyphaga Mimivirus (APM) is the largest known dsDNA virus. The viral particle has a nearly icosahedral structure with an internal capsid shell surrounded with a dense layer of fibrils. A Capsid protein sequence, D13L, was deduced from the APM L425 coding gene and was shown to be the most abundant protein found within the viral particle. However this protein remained poorly characterised until now. A revised protein sequence deposited in a database suggested an additional N-terminal stretch of 142 amino acids missing from the original deduced sequence. This result led us to investigate the L425 gene structure and the biochemical properties of the complete APM major Capsid protein. Results: This study describes the full length 3430 bp Capsid coding gene and characterises the 593 amino acids long corresponding Capsid protein 1. The recombinant full length protein allowed the production of a specific monoclonal antibody able to detect the Capsid protein 1 within the viral particle. This protein appeared to be post-translationally modified by glycosylation and phosphorylation. We proposed a secondary structure prediction of APM Capsid protein 1 compared to the Capsid protein structure of Paramecium Bursaria Chlorella Virus 1, another member of the Nucleo-Cytoplasmic Large DNA virus family. Conclusion: The characterisation of the full length L425 Capsid coding gene of Acanthamoebae polyphaga Mimivirus provides new insights into the structure of the main Capsid protein. The production of a full length recombinant protein will be useful for further structural studies.

  4. The IQD gene family in soybean: structure, phylogeny, evolution and expression.

    Directory of Open Access Journals (Sweden)

    Lin Feng

    Members of the plant-specific IQ67-domain (IQD) protein family are involved in plant development and the basal defense response. Although systematic characterization of this family has been carried out in Arabidopsis, tomato (Solanum lycopersicum), Brachypodium distachyon and rice (Oryza sativa), systematic analysis and expression profiling of this gene family in soybean (Glycine max) have not previously been reported. In this study, we identified and structurally characterized IQD genes in the soybean genome. A complete set of 67 soybean IQD genes (GmIQD1-67) was identified using Blast search tools, and the genes were clustered into four subfamilies (IQD I-IV) based on phylogeny. These soybean IQD genes are distributed unevenly across all 20 chromosomes, with 30 segmental duplication events, suggesting that segmental duplication has played a major role in the expansion of the soybean IQD gene family. Analysis of the Ka/Ks ratios showed that the duplicated genes of the GmIQD family primarily underwent purifying selection. Microsynteny was detected in most pairs: genes in clade 1-3 might be present in genome regions that were inverted, expanded or contracted after the divergence; most gene pairs in clade 4 showed high conservation with little rearrangement among these gene-residing regions. Of the soybean IQD genes examined, six were most highly expressed in young leaves, six in flowers, one in roots and two in nodules. Our qRT-PCR analysis of 24 soybean IQD III genes confirmed that these genes are regulated by MeJA stress. Our findings present a comprehensive overview of the soybean IQD gene family and provide insights into the evolution of this family. In addition, this work lays a solid foundation for further experiments aimed at determining the biological functions of soybean IQD genes in growth and development.
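
    The Ka/Ks reasoning mentioned in the record above follows a common convention: a ratio well below 1 is read as purifying selection, a ratio near 1 as neutral evolution, and a ratio above 1 as positive selection. The sketch below only illustrates that convention. The function name, the tolerance band around 1 and the example values are assumptions and are not taken from the study.

```python
def selection_mode(ka, ks, tol=0.05):
    """Classify a duplicated gene pair by its Ka/Ks ratio.

    ka:  nonsynonymous substitutions per nonsynonymous site
    ks:  synonymous substitutions per synonymous site
    tol: assumed half-width of the band around 1 treated as "neutral"
    """
    if ks <= 0:
        raise ValueError("Ks must be positive to form a Ka/Ks ratio")
    ratio = ka / ks
    if ratio < 1 - tol:
        return "purifying"
    if ratio > 1 + tol:
        return "positive"
    return "neutral"


# Hypothetical (Ka, Ks) values for three duplicated gene pairs.
pairs = {"pairA": (0.05, 0.40), "pairB": (0.30, 0.20), "pairC": (0.20, 0.20)}
for name, (ka, ks) in pairs.items():
    print(name, round(ka / ks, 3), selection_mode(ka, ks))
```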

  5. Evolutionary relationship and structural characterization of the EPF/EPFL gene family.

    Directory of Open Access Journals (Sweden)

    Naoki Takata

    EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes.

  6. Structural influence of gene networks on their inference: analysis of C3NET

    Directory of Open Access Journals (Sweden)

    Emmert-Streib Frank

    2011-06-01

    Background: The availability of large-scale high-throughput data poses considerable challenges for their functional analysis. For this reason gene network inference methods have gained considerable interest. However, our current knowledge, especially about the influence of the structure of a gene network on its inference, is limited. Results: In this paper we present a comprehensive investigation of the structural influence of gene networks on the inferential characteristics of C3NET - a recently introduced gene network inference algorithm. We employ local as well as global performance metrics in combination with an ensemble approach. The results from our numerical study for various biological and synthetic network structures and simulation conditions, also comparing C3NET with other inference algorithms, lead to a multitude of theoretical and practical insights into the working behavior of C3NET. In addition, in order to facilitate the practical usage of C3NET we provide a user-friendly R package, called c3net, and describe its functionality. It is available from https://r-forge.r-project.org/projects/c3net and from the CRAN package repository. Conclusions: The availability of gene network inference algorithms with known inferential properties opens a new era of large-scale screening experiments that could be equally beneficial for basic biological and biomedical research with auspicious prospects. The availability of our easy to use software package c3net may contribute to the popularization of such methods. Reviewers: This article was reviewed by Lev Klebanov, Joel Bader and Yuriy Gusev.

  7. Structure and expression of the human and mouse T4 genes

    International Nuclear Information System (INIS)

    Maddon, P.J.; Molineaux, S.M.; Maddon, D.F.; Zimmerman, K.A.; Godfrey, M.; Alt, F.W.; Chess, L.; Axel, R.

    1987-01-01

    The T4 molecule may serve as a T-cell receptor recognizing molecules on the surface of specific target cells and also serves as the receptor for the human immunodeficiency virus. To define the mechanisms of interaction of T4 with the surface of antigen-presenting cells as well as with human immunodeficiency virus, the authors have further analyzed the sequence, structure, and expression of the human and mouse T4 genes. T4 consists of an extracellular segment comprised of a leader sequence followed by four tandem variable-joining (VJ)-like domains, a transmembrane domain, and a cytoplasmic segment. The structural domains of the T4 protein deduced from amino acid sequence are precisely reflected in the intron-exon organization of the gene. Analysis of the expression of the T4 gene indicates that T4 RNA is expressed not only in T lymphocytes, but also in B cells, macrophages, and granulocytes. T4 is also expressed in a developmentally regulated manner in specific regions of the brain. It is, therefore, possible that T4 plays a more general role in mediating cell recognition events that are not restricted to the cellular immune response.

  8. Engaging Students in a Bioinformatics Activity to Introduce Gene Structure and Function

    Directory of Open Access Journals (Sweden)

    Barbara J. May

    2013-02-01

    Full Text Available Bioinformatics spans many fields of biological research and plays a vital role in mining and analyzing data. Therefore, there is an ever-increasing need for students to understand not only what can be learned from this data, but also how to use basic bioinformatics tools. This activity is designed to introduce secondary and undergraduate biology students to gene structure through hands-on use of basic bioinformatics tools. Students are provided an “unknown” sequence and asked to use a free online gene finder program to identify the gene. Students then predict the putative function of this gene with the use of additional online databases.

  9. Use of tiling array data and RNA secondary structure predictions to identify noncoding RNA genes

    DEFF Research Database (Denmark)

    Weile, Christian; Gardner, Paul P; Hedegaard, Mads M

    2007-01-01

    neuroblastoma cell line SK-N-AS. Using this strategy, we identify thousands of human candidate RNA genes. To further verify the expression of these genes, we focused on candidate genes that had stable hairpin structures or a high level of covariance. Using northern blotting, we verify the expression of 2 out...

  10. Structure of the Elastin-Contractile Units in the Thoracic Aorta and How Genes That Cause Thoracic Aortic Aneurysms and Dissections Disrupt This Structure.

    Science.gov (United States)

    Karimi, Ashkan; Milewicz, Dianna M

    2016-01-01

    The medial layer of the aorta confers elasticity and strength to the aortic wall and is composed of alternating layers of smooth muscle cells (SMCs) and elastic fibres. The SMC elastin-contractile unit is a structural unit that links the elastin fibres to the SMCs and is characterized by the following: (1) layers of elastin fibres that are surrounded by microfibrils; (2) microfibrils that bind to the integrin receptors in focal adhesions on the cell surface of the SMCs; and (3) SMC contractile filaments that are linked to the focal adhesions on the inner side of the membrane. The genes that are altered to cause thoracic aortic aneurysms and aortic dissections encode proteins involved in the structure or function of the SMC elastin-contractile unit. Included in this gene list are the genes encoding proteins that are structural components of elastin fibres and microfibrils: FBN1, MFAP5, ELN, and FBLN4. Also included are genes that encode structural proteins in the SMC contractile unit, including ACTA2, which encodes SMC-specific α-actin, and MYH11, which encodes SMC-specific myosin heavy chain, along with MYLK and PRKG1, which encode kinases that control SMC contraction. Finally, mutations in FLNA, the gene encoding the protein linking integrin receptors to the contractile filaments, also predispose to thoracic aortic disease. Thus, these data suggest that functional SMC elastin-contractile units are important for maintaining the structural integrity of the aorta. Copyright © 2016 Canadian Cardiovascular Society. Published by Elsevier Inc. All rights reserved.

  11. Macro optical projection tomography for large scale 3D imaging of plant structures and gene activity.

    Science.gov (United States)

    Lee, Karen J I; Calder, Grant M; Hindle, Christopher R; Newman, Jacob L; Robinson, Simon N; Avondo, Jerome J H Y; Coen, Enrico S

    2017-01-01

    Optical projection tomography (OPT) is a well-established method for visualising gene activity in plants and animals. However, a limitation of conventional OPT is that the specimen upper size limit precludes its application to larger structures. To address this problem we constructed a macro version called Macro OPT (M-OPT). We apply M-OPT to 3D live imaging of gene activity in growing whole plants and to visualise structural morphology in large optically cleared plant and insect specimens up to 60 mm tall and 45 mm deep. We also show how M-OPT can be used to image gene expression domains in 3D within fixed tissue and to visualise gene activity in 3D in clones of growing young whole Arabidopsis plants. A further application of M-OPT is to visualise plant-insect interactions. Thus M-OPT provides an effective 3D imaging platform that allows the study of gene activity, internal plant structures and plant-insect interactions at a macroscopic scale. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  12. VizPrimer: a web server for visualized PCR primer design based on known gene structure.

    Science.gov (United States)

    Zhou, Yang; Qu, Wubin; Lu, Yiming; Zhang, Yanchun; Wang, Xiaolei; Zhao, Dongsheng; Yang, Yi; Zhang, Chenggang

    2011-12-15

    The visualization of gene structure plays an important role in polymerase chain reaction (PCR) primer design, especially for eukaryotic genes with a number of splice variants that users need to distinguish between via PCR. Here, we describe a visualized web server for primer design named VizPrimer. It utilizes new information technology (IT) tools: HTML5 to display gene structure and JavaScript to interact with users. In VizPrimer, users can focus their attention on the gene structure and primer design strategy, without wasting time calculating the exon positions of splice variants or manually configuring complicated parameters. In addition, VizPrimer is also suitable for the design of PCR primers for amplifying open reading frames and detecting single nucleotide polymorphisms (SNPs). VizPrimer is freely available at http://biocompute.bmi.ac.cn/CZlab/VizPrimer/. The web server supports the following browsers: Chrome (≥5.0), Firefox (≥3.0), Safari (≥4.0) and Opera (≥10.0). Contact: zhangcg@bmi.ac.cn; yangyi528@vip.sina.com.

  13. A genomic perspective on protein tyrosine phosphatases: gene structure, pseudogenes, and genetic disease linkage

    DEFF Research Database (Denmark)

    Andersen, Jannik N; Jansen, Peter G; Echwald, Søren M

    2004-01-01

    sequence databases, we discovered one novel human PTP gene and defined chromosomal loci and exon structure of the additional 37 genes encoding known PTP transcripts. Direct orthologs were present in the mouse genome for all 38 human PTP genes. In addition, we identified 12 PTP pseudogenes unique to humans...... that have probably contaminated previous bioinformatics analysis of this gene family. PCR amplification and transcript sequencing indicate that some PTP pseudogenes are expressed, but their function (if any) is unknown. Furthermore, we analyzed the enhanced diversity generated by alternative splicing...

  14. Faktiske tekster 2. udgave

    DEFF Research Database (Denmark)

    Fibiger, Johannes; Lorentzen, Rasmus Fink; Iversen, Gurli Bjørn

    The article describes the spread of web-based multimodal texts such as websites, tweets and wikis. Building on this, it introduces a communication-critical competence and an extended communication model as analytical tools for the school subject Danish. Finally, the discussion is put into perspective with regard to the teacher's...

  15. Genome-wide analysis of the expansin gene superfamily reveals grapevine-specific structural and functional characteristics.

    Directory of Open Access Journals (Sweden)

    Silvia Dal Santo

    Full Text Available BACKGROUND: Expansins are proteins that loosen plant cell walls in a pH-dependent manner, probably by increasing the relative movement among polymers thus causing irreversible expansion. The expansin superfamily (EXP) comprises four distinct families: expansin A (EXPA), expansin B (EXPB), expansin-like A (EXLA) and expansin-like B (EXLB). There is experimental evidence that EXPA and EXPB proteins are required for cell expansion and developmental processes involving cell wall modification, whereas the exact functions of EXLA and EXLB remain unclear. The complete grapevine (Vitis vinifera) genome sequence has allowed the characterization of many gene families, but an exhaustive genome-wide analysis of expansin gene expression has not been attempted thus far. METHODOLOGY/PRINCIPAL FINDINGS: We identified 29 EXP superfamily genes in the grapevine genome, representing all four EXP families. Members of the same EXP family shared the same exon-intron structure, and phylogenetic analysis confirmed a closer relationship between EXP genes from woody species, i.e. grapevine and poplar (Populus trichocarpa), compared to those from Arabidopsis thaliana and rice (Oryza sativa). We also identified grapevine-specific duplication events involving the EXLB family. Global gene expression analysis confirmed a strong correlation among EXP genes expressed in mature and green/vegetative samples, respectively, as reported for other gene families in the recently-published grapevine gene expression atlas. We also observed the specific co-expression of EXLB genes in woody organs, and the involvement of certain grapevine EXP genes in berry development and post-harvest withering. CONCLUSION: Our comprehensive analysis of the grapevine EXP superfamily confirmed and extended current knowledge about the structural and functional characteristics of this gene family, and also identified properties that are currently unique to grapevine expansin genes. Our data provide a model for the

  16. Structure and Chromosomal Organization of Yeast Genes Regulated by Topoisomerase II.

    Science.gov (United States)

    Joshi, Ricky S; Nikolaou, Christoforos; Roca, Joaquim

    2018-01-03

    Cellular DNA topoisomerases (topo I and topo II) are highly conserved enzymes that regulate the topology of DNA during normal genome transactions, such as DNA transcription and replication. In budding yeast, topo I is dispensable whereas topo II is essential, suggesting fundamental and exclusive roles for topo II, which might include the functions of the topo IIa and topo IIb isoforms found in mammalian cells. In this review, we discuss major findings of the structure and chromosomal organization of genes regulated by topo II in budding yeast. Experimental data was derived from short (10 min) and long term (120 min) responses to topo II inactivation in top-2 ts mutants. First, we discuss how short term responses reveal a subset of yeast genes that are regulated by topo II depending on their promoter architecture. These short term responses also uncovered topo II regulation of transcription across multi-gene clusters, plausibly by common DNA topology management. Finally, we examine the effects of deactivated topo II on the elongation of RNA transcripts. Each study provides an insight into the particular chromatin structure that interacts with the activity of topo II. These findings are of notable clinical interest as numerous anti-cancer therapies interfere with topo II activity.

  17. Phylogenetic analysis and protein structure modelling identifies distinct Ca(2+)/Cation antiporters and conservation of gene family structure within Arabidopsis and rice species.

    Science.gov (United States)

    Pittman, Jon K; Hirschi, Kendal D

    2016-12-01

    The Ca(2+)/Cation Antiporter (CaCA) superfamily is an ancient and widespread family of ion-coupled cation transporters found in nearly all kingdoms of life. In animals, K(+)-dependent and K(+)-independent Na(+)/Ca(2+) exchangers (NCKX and NCX) are important CaCA members. Recently it was proposed that all rice and Arabidopsis CaCA proteins should be classified as NCX proteins. Here we performed phylogenetic analysis of CaCA genes and protein structure homology modelling to further characterise members of this transporter superfamily. Phylogenetic analysis of rice and Arabidopsis CaCAs in comparison with selected CaCA members from non-plant species demonstrated that these genes form clearly distinct families, with the H(+)/Cation exchanger (CAX) and cation/Ca(2+) exchanger (CCX) families dominant in higher plants but the NCKX and NCX families absent. NCX-related Mg(2+)/H(+) exchanger (MHX) and CAX-related Na(+)/Ca(2+) exchanger-like (NCL) proteins are instead present. Analysis of genomes of ten closely-related rice species and four Arabidopsis-related species found that CaCA gene family structures are highly conserved within related plants, apart from minor variation. Protein structures were modelled for OsCAX1a and OsMHX1. Despite broad structural conservation, there are clear structural differences between the different CaCA types. Members of the CaCA superfamily form clearly distinct families with different phylogenetic, structural and functional characteristics, and therefore should not be simply classified as NCX proteins, which should remain as a separate gene family.

  18. Analysis of ribosomal protein gene structures: implications for intron evolution.

    Directory of Open Access Journals (Sweden)

    2006-03-01

    Full Text Available Many spliceosomal introns exist in the eukaryotic nuclear genome. Despite much research, the evolution of spliceosomal introns remains poorly understood. In this paper, we tried to gain insights into intron evolution from a novel perspective by comparing the gene structures of cytoplasmic ribosomal proteins (CRPs) and mitochondrial ribosomal proteins (MRPs), which are held to be of archaeal and bacterial origin, respectively. We analyzed 25 homologous pairs of CRP and MRP genes that together had a total of 527 intron positions. We found that all 12 of the intron positions shared by CRP and MRP genes resulted from parallel intron gains and none could be considered to be "conserved," i.e., descendants of the same ancestor. This was supported further by the high frequency of proto-splice sites at these shared positions; proto-splice sites are proposed to be sites for intron insertion. Although we could not definitively disprove that spliceosomal introns were already present in the last universal common ancestor, our results lend more support to the idea that introns were gained late. At least, our results show that MRP genes were intronless at the time of endosymbiosis. The parallel intron gains between CRP and MRP genes accounted for 2.3% of total intron positions, which should provide a reliable estimate for future inferences of intron evolution.
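    As a quick check of the percentage quoted above, assuming it is simply the 12 shared positions divided by the 527 total intron positions reported in the abstract:

```latex
\frac{12}{527} \approx 0.0228 \approx 2.3\%
```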

  19. Self-similarities of periodic structures for a discrete model of a two-gene system

    International Nuclear Information System (INIS)

    Souza, S.L.T. de; Lima, A.A.; Caldas, I.L.; Medrano-T, R.O.; Guimarães-Filho, Z.O.

    2012-01-01

    We report self-similar properties of periodic structures remarkably organized in the two-parameter space for a two-gene system, described by a two-dimensional symmetric map. The map consists of difference equations derived from the chemical reactions for gene expression and regulation. We characterize the system by using Lyapunov exponents and isoperiodic diagrams identifying periodic windows, denominated Arnold tongues and shrimp-shaped structures. Period-adding sequences are observed for both periodic windows. We also identify Fibonacci-type series and the golden ratio for Arnold tongues, and period multiple-of-three windows for shrimps. -- Highlights: ► The existence of noticeable periodic windows has been reported recently for several nonlinear systems. ► The periodic window distributions appear highly organized in two-parameter space. ► We characterize self-similar properties of Arnold tongues and shrimps for a two-gene model. ► We determine the period of the Arnold tongues recognizing a Fibonacci-type sequence. ► We explore self-similar features of the shrimps identifying multiple period-three structures.
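    To illustrate how a Lyapunov exponent separates periodic windows from chaotic regions in parameter space, here is a minimal sketch. The abstract does not give the two-gene map, so the well-known Hénon map is used purely as a stand-in; the function names, starting point and iteration counts are assumptions. A negative largest exponent indicates a periodic window, a positive one indicates chaos.

```python
import math


def henon_step(x, y, a, b):
    """One iteration of the Hénon map (stand-in for the two-gene map)."""
    return 1.0 - a * x * x + y, b * x


def henon_jacobian(x, y, a, b):
    """Jacobian of the Hénon map at (x, y), as rows of a 2x2 matrix."""
    return (-2.0 * a * x, 1.0), (b, 0.0)


def largest_lyapunov(a, b, x0=0.1, y0=0.1, n_transient=1000, n_iter=20000):
    """Estimate the largest Lyapunov exponent by tangent-vector renormalization."""
    x, y = x0, y0
    for _ in range(n_transient):  # discard the transient
        x, y = henon_step(x, y, a, b)
    vx, vy = 1.0, 0.0  # tangent vector
    total = 0.0
    for _ in range(n_iter):
        (j11, j12), (j21, j22) = henon_jacobian(x, y, a, b)
        vx, vy = j11 * vx + j12 * vy, j21 * vx + j22 * vy
        norm = math.hypot(vx, vy)
        total += math.log(norm)  # accumulate the local stretching rate
        vx, vy = vx / norm, vy / norm
        x, y = henon_step(x, y, a, b)
    return total / n_iter


lam_chaotic = largest_lyapunov(1.4, 0.3)   # classic chaotic parameters
lam_periodic = largest_lyapunov(1.0, 0.3)  # inside a stable period-4 window
print(lam_chaotic > 0, lam_periodic < 0)
```

    Scanning such an exponent over a grid of two parameters is one common way to produce diagrams in which periodic windows stand out; the isoperiodic diagrams mentioned in the abstract instead record the period itself.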

  20. Self-similarities of periodic structures for a discrete model of a two-gene system

    Energy Technology Data Exchange (ETDEWEB)

    Souza, S.L.T. de, E-mail: thomaz@ufsj.edu.br [Departamento de Física e Matemática, Universidade Federal de São João del-Rei, Ouro Branco, MG (Brazil); Lima, A.A. [Escola de Farmácia, Universidade Federal de Ouro Preto, Ouro Preto, MG (Brazil); Caldas, I.L. [Instituto de Física, Universidade de São Paulo, São Paulo, SP (Brazil); Medrano-T, R.O. [Departamento de Ciências Exatas e da Terra, Universidade Federal de São Paulo, Diadema, SP (Brazil); Guimarães-Filho, Z.O. [Aix-Marseille Univ., CNRS PIIM UMR6633, International Institute for Fusion Science, Marseille (France)

    2012-03-12

    We report self-similar properties of periodic structures remarkably organized in the two-parameter space for a two-gene system, described by a two-dimensional symmetric map. The map consists of difference equations derived from the chemical reactions for gene expression and regulation. We characterize the system by using Lyapunov exponents and isoperiodic diagrams identifying periodic windows, denominated Arnold tongues and shrimp-shaped structures. Period-adding sequences are observed for both periodic windows. We also identify Fibonacci-type series and the golden ratio for Arnold tongues, and period multiple-of-three windows for shrimps. -- Highlights: ► The existence of noticeable periodic windows has been reported recently for several nonlinear systems. ► The periodic window distributions appear highly organized in two-parameter space. ► We characterize self-similar properties of Arnold tongues and shrimps for a two-gene model. ► We determine the period of the Arnold tongues recognizing a Fibonacci-type sequence. ► We explore self-similar features of the shrimps identifying multiple period-three structures.

  1. Structure of Mycobacterium tuberculosis Rv2714, a representative of a duplicated gene family in Actinobacteria

    International Nuclear Information System (INIS)

    Graña, Martin; Bellinzoni, Marco; Miras, Isabelle; Fiez-Vandal, Cedric; Haouz, Ahmed; Shepard, William; Buschiazzo, Alejandro; Alzari, Pedro M.

    2009-01-01

    The crystal structure of Rv2714, a protein of unknown function from M. tuberculosis, has been determined at 2.6 Å resolution using single-wavelength anomalous diffraction methods. The gene Rv2714 from Mycobacterium tuberculosis, which codes for a hypothetical protein of unknown function, is a representative member of a gene family that is largely confined to the order Actinomycetales of Actinobacteria. Sequence analysis indicates the presence of two paralogous genes in most mycobacterial genomes and suggests that gene duplication was an ancient event in bacterial evolution. The crystal structure of Rv2714 has been determined at 2.6 Å resolution, revealing a trimer in which the topology of the protomer core is similar to that observed in a functionally diverse set of enzymes, including purine nucleoside phosphorylases, some carboxypeptidases, bacterial peptidyl-tRNA hydrolases and even the plastidic form of an intron splicing factor. However, some structural elements, such as a β-hairpin insertion involved in protein oligomerization and a C-terminal α-helical domain that serves as a lid to the putative substrate-binding (or ligand-binding) site, are only found in Rv2714 bacterial homologues and represent specific signatures of this protein family.

  2. Structure of Mycobacterium tuberculosis Rv2714, a representative of a duplicated gene family in Actinobacteria

    Energy Technology Data Exchange (ETDEWEB)

    Graña, Martin; Bellinzoni, Marco [Institut Pasteur, Unité de Biochimie Structurale, URA CNRS 2185, 25 Rue du Dr Roux, 75724 Paris (France); Miras, Isabelle; Fiez-Vandal, Cedric; Haouz, Ahmed; Shepard, William [Institut Pasteur, Plate-forme de Cristallogenèse et Diffraction des Rayons X, 25 Rue du Dr Roux, 75724 Paris (France); Buschiazzo, Alejandro; Alzari, Pedro M., E-mail: alzari@pasteur.fr [Institut Pasteur, Unité de Biochimie Structurale, URA CNRS 2185, 25 Rue du Dr Roux, 75724 Paris (France)

    2009-10-01

    The crystal structure of Rv2714, a protein of unknown function from M. tuberculosis, has been determined at 2.6 Å resolution using single-wavelength anomalous diffraction methods. The gene Rv2714 from Mycobacterium tuberculosis, which codes for a hypothetical protein of unknown function, is a representative member of a gene family that is largely confined to the order Actinomycetales of Actinobacteria. Sequence analysis indicates the presence of two paralogous genes in most mycobacterial genomes and suggests that gene duplication was an ancient event in bacterial evolution. The crystal structure of Rv2714 has been determined at 2.6 Å resolution, revealing a trimer in which the topology of the protomer core is similar to that observed in a functionally diverse set of enzymes, including purine nucleoside phosphorylases, some carboxypeptidases, bacterial peptidyl-tRNA hydrolases and even the plastidic form of an intron splicing factor. However, some structural elements, such as a β-hairpin insertion involved in protein oligomerization and a C-terminal α-helical domain that serves as a lid to the putative substrate-binding (or ligand-binding) site, are only found in Rv2714 bacterial homologues and represent specific signatures of this protein family.

  3. K-shuff: A Novel Algorithm for Characterizing Structural and Compositional Diversity in Gene Libraries.

    Science.gov (United States)

    Jangid, Kamlesh; Kao, Ming-Hung; Lahamge, Aishwarya; Williams, Mark A; Rathbun, Stephen L; Whitman, William B

    2016-01-01

    K-shuff is a new algorithm for comparing the similarity of gene sequence libraries, providing measures of the structural and compositional diversity as well as the significance of the differences between these measures. Inspired by Ripley's K-function for spatial point pattern analysis, the Intra K-function or IKF measures the structural diversity, including both the richness and overall similarity of the sequences, within a library. The Cross K-function or CKF measures the compositional diversity between gene libraries, reflecting both the number of OTUs shared as well as the overall similarity in OTUs. A Monte Carlo testing procedure then enables statistical evaluation of both the structural and compositional diversity between gene libraries. For 16S rRNA gene libraries from complex bacterial communities such as those found in seawater, salt marsh sediments, and soils, K-shuff yields reproducible estimates of structural and compositional diversity with libraries greater than 50 sequences. Similarly, for pyrosequencing libraries generated from a glacial retreat chronosequence and Illumina® libraries generated from US homes, K-shuff required >300 and 100 sequences per sample, respectively. Power analyses demonstrated that K-shuff is sensitive to small differences in Sanger or Illumina® libraries. This extra sensitivity of K-shuff enabled examination of compositional differences at much deeper taxonomic levels, such as within abundant OTUs. This is especially useful when comparing communities that are compositionally very similar but functionally different. K-shuff will therefore prove beneficial for conventional microbiome analysis as well as specific hypothesis testing.

  4. Analysis of flavonoids and the flavonoid structural genes in brown fiber of upland cotton.

    Directory of Open Access Journals (Sweden)

    Hongjie Feng

    Full Text Available BACKGROUND: As a result of changing consumer preferences, cotton (Gossypium hirsutum L.) from varieties with naturally colored fibers is becoming increasingly sought after in the textile industry. The molecular mechanisms leading to colored fiber development are still largely unknown, although it is expected that the color is derived from flavonoids. EXPERIMENTAL DESIGN: First, four key genes of the flavonoid biosynthetic pathway in cotton (GhC4H, GhCHS, GhF3'H, and GhF3'5'H) were cloned and their expression profiles during the development of brown and white cotton fibers were studied by QRT-PCR. Then, the concentrations of four components of the flavonoid biosynthetic pathway (naringenin, quercetin, kaempferol and myricetin) in brown and white fibers were analyzed at different developmental stages by HPLC. RESULT: The predicted proteins encoded by the four flavonoid structural genes exhibit strong sequence similarity to their counterparts in various plant species. Transcript levels for all four genes were considerably higher in developing brown fibers than in white fibers from a near isogenic line (NIL). The contents of four flavonoids (naringenin, quercetin, kaempferol and myricetin) were significantly higher in brown than in white fibers, corresponding to the biosynthetic gene expression levels. CONCLUSIONS: Flavonoid structural gene expression and flavonoid metabolism are important in the development of pigmentation in brown cotton fibers.

  5. Exon organization of the mouse entactin gene corresponds to the structural domains of the polypeptide and has regional homology to the low-density lipoprotein receptor gene

    DEFF Research Database (Denmark)

    Durkin, M E; Wewer, U M; Chung, A E

    1995-01-01

    of the mouse entactin gene closely corresponds to the organization of the polypeptide into distinct structural and functional domains. The two amino-terminal globular domains are encoded by three exons each. Single exons encode the two protease-sensitive, O-glycosylated linking regions. The six EGF......Entactin is a widespread basement membrane protein of 150 kDa that binds to type IV collagen and laminin. The complete exon-intron structure of the mouse entactin gene has been determined from lambda genomic DNA clones. The gene spans at least 65 kb and contains 20 exons. The exon organization...

  6. The population genomics of begomoviruses: global scale population structure and gene flow

    Directory of Open Access Journals (Sweden)

    Prasanna HC

    2010-09-01

    Full Text Available Abstract. Background: The rapidly growing availability of diverse full genome sequences from across the world is increasing the feasibility of studying the large-scale population processes that underlie observable patterns of virus diversity. In particular, characterizing the genetic structure of virus populations could potentially reveal much about how factors such as geographical distributions, host ranges and gene flow between populations combine to produce the discontinuous patterns of genetic diversity that we perceive as distinct virus species. Among the richest and most diverse full genome datasets that are available is that for the dicotyledonous-plant-infecting genus Begomovirus, in the family Geminiviridae. The begomoviruses all share the same whitefly vector, are highly recombinogenic and are distributed throughout tropical and subtropical regions where they seriously threaten the food security of the world's poorest people. Results: We focus here on using a model-based population genetic approach to identify the genetically distinct sub-populations within the global begomovirus meta-population. We demonstrate the existence of at least seven major sub-populations that can further be sub-divided into as many as thirty-four significantly differentiated and genetically cohesive minor sub-populations. Using the population structure framework revealed in the present study, we further explored the extent of gene flow and recombination between genetic populations. Conclusions: Although geographical barriers are apparently the most significant underlying cause of the seven major population sub-divisions, within the framework of these sub-divisions, we explore patterns of gene flow to reveal that both host range differences and genetic barriers to recombination have probably been major contributors to the minor population sub-divisions that we have identified. We believe that the global Begomovirus population structure revealed here could

  7. Mapping hisS, the structural gene for histidyl-transfer ribonucleic acid synthetase, in Escherichia coli.

    Science.gov (United States)

    Parker, J; Fishman, S E

    1979-04-01

    The structural gene for histidyl-tRNA synthetase was localized to 53.8 min on the Escherichia coli genome. The gene order in this region was determined to be dapE-purC-upp-purG-(guaA, guaB)-hisS-glyA.

  8. Comparing large covariance matrices under weak conditions on the dependence structure and its application to gene clustering.

    Science.gov (United States)

    Chang, Jinyuan; Zhou, Wen; Zhou, Wen-Xin; Wang, Lan

    2017-03-01

    Comparing large covariance matrices has important applications in modern genomics, where scientists are often interested in understanding whether relationships (e.g., dependencies or co-regulations) among a large number of genes vary between different biological states. We propose a computationally fast procedure for testing the equality of two large covariance matrices when the dimensions of the covariance matrices are much larger than the sample sizes. A distinguishing feature of the new procedure is that it imposes no structural assumptions on the unknown covariance matrices. Hence, the test is robust with respect to various complex dependence structures that frequently arise in genomics. We prove that the proposed procedure is asymptotically valid under weak moment conditions. As an interesting application, we derive a new gene clustering algorithm which shares the same nice property of avoiding restrictive structural assumptions for high-dimensional genomics data. Using an asthma gene expression dataset, we illustrate how the new test helps compare the covariance matrices of the genes across different gene sets/pathways between the disease group and the control group, and how the gene clustering algorithm provides new insights on the way gene clustering patterns differ between the two groups. The proposed methods have been implemented in an R-package HDtest and are available on CRAN. © 2016, The International Biometric Society.
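    The comparison described above can be sketched roughly as follows. This is not the authors' procedure and not the HDtest API: it swaps in a plain permutation test on the largest absolute entrywise difference between two sample covariance matrices, and the function names, sample sizes and permutation count are assumptions. Permuting group labels also presumes the two groups share the same means, which holds for the synthetic data here.

```python
import random


def covariance_matrix(rows):
    """Sample covariance matrix of observations given as rows of length p."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[k] for r in rows) / n for k in range(p)]
    cov = [[0.0] * p for _ in range(p)]
    for r in rows:
        d = [r[k] - means[k] for k in range(p)]
        for a in range(p):
            for b in range(p):
                cov[a][b] += d[a] * d[b]
    return [[c / (n - 1) for c in row] for row in cov]


def max_abs_difference(x_rows, y_rows):
    """Max-type statistic: largest absolute entrywise covariance difference."""
    cx, cy = covariance_matrix(x_rows), covariance_matrix(y_rows)
    return max(abs(a - b) for ra, rb in zip(cx, cy) for a, b in zip(ra, rb))


def permutation_pvalue(x_rows, y_rows, n_perm=200, seed=0):
    """Permutation p-value for equality of the two covariance matrices."""
    rng = random.Random(seed)
    observed = max_abs_difference(x_rows, y_rows)
    pooled = list(x_rows) + list(y_rows)
    n_x = len(x_rows)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # reassign group labels at random
        if max_abs_difference(pooled[:n_x], pooled[n_x:]) >= observed:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)


# Group X: two strongly co-regulated "genes"; group Y: two independent ones.
rng = random.Random(1)
x_rows = []
for _ in range(100):
    z = rng.gauss(0, 1)
    x_rows.append((z, z + 0.1 * rng.gauss(0, 1)))
y_rows = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(100)]
p_value = permutation_pvalue(x_rows, y_rows)
print(p_value)
```

    A max-type statistic makes no assumption about which entries differ, which is in the spirit of the abstract's emphasis on avoiding structural assumptions; the published method is designed for dimensions far larger than this toy example.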

  9. The global relationship between chromatin physical topology, fractal structure, and gene expression

    DEFF Research Database (Denmark)

    Almassalha, Luay M; Tiwari, A; Ruhoff, P T

    2017-01-01

    in an empty space, but in a highly complex, interrelated, and dense nanoenvironment that profoundly influences chemical interactions. We explored the relationship between the physical nanoenvironment of chromatin and gene transcription in vitro. We analytically show that changes in the fractal dimension, D...... show that the increased heterogeneity of physical structure of chromatin due to increase in fractal dimension correlates with increased heterogeneity of gene networks. These findings indicate that the higher order folding of chromatin topology may act as a molecular-pathway independent code regulating...

  10. Mapping hisS, the structural gene for histidyl-transfer ribonucleic acid synthetase, in Escherichia coli.

    Science.gov (United States)

    Parker, J; Fishman, S E

    1979-01-01

    The structural gene for histidyl-tRNA synthetase was localized to 53.8 min on the Escherichia coli genome. The gene order in this region was determined to be dapE-purC-upp-purG-(guaA, guaB)-hisS-glyA. PMID:374370

  11. Acinetobacter baumannii K27 and K44 capsular polysaccharides have the same K unit but different structures due to the presence of distinct wzy genes in otherwise closely related K gene clusters.

    Science.gov (United States)

    Shashkov, Alexander S; Kenyon, Johanna J; Senchenkova, Sof'ya N; Shneider, Mikhail M; Popova, Anastasiya V; Arbatsky, Nikolay P; Miroshnikov, Konstantin A; Volozhantsev, Nikolay V; Hall, Ruth M; Knirel, Yuriy A

    2016-05-01

    Capsular polysaccharides (CPSs) from Acinetobacter baumannii isolates 1432, 4190 and NIPH 70, which have related gene content at the K locus, were examined, and the chemical structures were established using 2D ¹H and ¹³C NMR spectroscopy. The three isolates produce the same pentasaccharide repeat unit, which consists of 5-N-acetyl-7-N-[(S)-3-hydroxybutanoyl] (major) or 5,7-di-N-acetyl (minor) derivatives of 5,7-diamino-3,5,7,9-tetradeoxy-D-glycero-D-galacto-non-2-ulosonic (legionaminic) acid (Leg5Ac7R), D-galactose, N-acetyl-D-galactosamine and N-acetyl-D-glucosamine. However, the linkage between repeat units in NIPH 70 was different to that in 1432 and 4190, and this significantly alters the CPS structure. The KL27 gene cluster in 4190 and KL44 gene cluster in NIPH 70 are organized identically and contain lga genes for Leg5Ac7R synthesis, genes for the synthesis of the common sugars, as well as an itrA2 initiating transferase and four glycosyltransferase genes. They share high-level nucleotide sequence identity for corresponding genes, but differ in the wzy gene encoding the Wzy polymerase. The Wzy proteins, which have different lengths and share no similarity, would form the unrelated linkages in the K27 and K44 structures. The linkages formed by the four shared glycosyltransferases were predicted by comparison with gene clusters that synthesize related structures. These findings unambiguously identify the linkages formed by WzyK27 and WzyK44, and show that the presence of different wzy genes in otherwise closely related K gene clusters changes the structure of the CPS. This may affect its capacity as a protective barrier for A. baumannii. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  12. Gene structure and expression characteristic of a novel odorant receptor gene cluster in the parasitoid wasp Microplitis mediator (Hymenoptera: Braconidae).

    Science.gov (United States)

    Wang, S-N; Shan, S; Zheng, Y; Peng, Y; Lu, Z-Y; Yang, Y-Q; Li, R-J; Zhang, Y-J; Guo, Y-Y

    2017-08-01

    Odorant receptors (ORs) expressed in the antennae of parasitoid wasps are responsible for detection of various lipophilic airborne molecules. In the present study, 107 novel OR genes were identified from Microplitis mediator antennal transcriptome data. Phylogenetic analysis of the set of OR genes from M. mediator and Microplitis demolitor revealed that M. mediator OR (MmedOR) genes can be classified into different subfamilies, and the majority of MmedORs in each subfamily shared high sequence identities and clear orthologous relationships to M. demolitor ORs. Within a subfamily, six MmedOR genes, MmedOR98, 124, 125, 126, 131 and 155, shared a similar gene structure and were tightly linked in the genome. To evaluate whether the clustered MmedOR genes share common regulatory features, the transcription profile and expression characteristics of the six closely related OR genes were investigated in M. mediator. Rapid amplification of cDNA ends-PCR experiments revealed that the OR genes within the cluster were transcribed as single mRNAs, and a bicistronic mRNA for two adjacent genes (MmedOR124 and MmedOR98) was also detected in female antennae by reverse transcription PCR. In situ hybridization experiments indicated that each OR gene within the cluster was expressed in a different number of cells. Moreover, there was no co-expression of the two highly related OR genes, MmedOR124 and MmedOR98, which appeared to be individually expressed in a distinct population of neurons. Overall, there were distinct expression profiles of closely related MmedOR genes from the same cluster in M. mediator. These data provide a basic understanding of the olfactory coding in parasitoid wasps. © 2017 The Royal Entomological Society.

  13. Structural and functional studies of a family of Dictyostelium discoideum developmentally regulated, prestalk genes coding for small proteins

    Directory of Open Access Journals (Sweden)

    Escalante Ricardo

    2008-01-01

    Full Text Available Abstract Background: The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Results: Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87–89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13-nucleotide-long open reading frame and a second exon comprising the remainder of the putative coding region. The expression of these genes is induced at 10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. Conclusion: A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.

  14. Novel sequence variations in LAMA2 and SGCG genes modulating cis-acting regulatory elements and RNA secondary structure

    Directory of Open Access Journals (Sweden)

    Olfa Siala

    2010-01-01

    Full Text Available In this study, we detected new sequence variations in LAMA2 and SGCG genes in 5 ethnic populations, and analysed their effect on enhancer composition and mRNA structure. PCR amplification and DNA sequencing were performed and followed by bioinformatics analyses using ESEfinder as well as MFOLD software. We found 3 novel sequence variations in the LAMA2 (c.3174+22_23insAT and c.6085+12delA) and SGCG (c.*102A/C) genes. These variations were present in 210 tested healthy controls from Tunisian, Moroccan, Algerian, Lebanese and French populations, suggesting that they represent novel polymorphisms within LAMA2 and SGCG gene sequences. ESEfinder showed that the c.*102A/C substitution created a new exonic splicing enhancer in the 3'UTR of the SGCG gene, whereas the c.6085+12delA deletion was situated in the base pairing region between LAMA2 mRNA and the U1 snRNA spliceosomal components. The RNA structure analyses showed that both variations modulated RNA secondary structure. Our results are suggestive of correlations between mRNA folding and the recruitment of spliceosomal components mediating splicing, including SR proteins. Understanding the contribution of common sequence variations to mRNA structural and functional diversity will aid the study of gene expression.

  15. Two potential hookworm DAF-16 target genes, SNR-3 and LPP-1: gene structure, expression profile, and implications of a cis-regulatory element in the regulation of gene expression.

    Science.gov (United States)

    Gao, Xin; Goggin, Kevin; Dowling, Camille; Qian, Jason; Hawdon, John M

    2015-01-08

    Hookworms infect nearly 700 million people, causing anemia and developmental stunting in heavy infections. Little is known about the genomic structure or gene regulation in hookworms, although recent publication of draft genome assemblies has allowed the first investigations of these topics to be undertaken. The transcription factor DAF-16 mediates multiple developmental pathways in the free living nematode Caenorhabditis elegans, and is involved in the recovery from the developmentally arrested L3 in hookworms. Identification of downstream targets of DAF-16 will provide a better understanding of the molecular mechanism of hookworm infection. Genomic Fragment 2.23 containing a DAF-16 binding element (DBE) was used to identify overlapping complementary expressed sequence tags (ESTs). These sequences were used to search a draft assembly of the Ancylostoma caninum genome, and identified two neighboring genes, snr-3 and lpp-1, in a tail-to-tail orientation. Expression patterns of both genes during parasitic development were determined by qRT-PCR. DAF-16 dependent cis-regulatory activity of fragment 2.23 was investigated using an in vitro reporter system. The snr-3 gene spans approximately 5.6 kb in the genome and contains 3 exons and 2 introns, and contains the DBE in its 3' untranslated region. Downstream from snr-3 in a tail-to-tail arrangement is the gene lpp-1. The lpp-1 gene spans more than 6 kb and contains 10 exons and 9 introns. The A. caninum genome contains 2 apparent splice variants, but there are 7 splice variants in the A. ceylanicum genome. While the gene order is similar, the gene structures of the hookworm genes differ from their C. elegans orthologs. Both genes show peak expression in the late L4 stage. Using a cell culture based expression system, fragment 2.23 was found to have both DAF-16-dependent promoter and enhancer activity that required an intact DBE. Two putative DAF-16 targets were identified by genome wide screening for DAF-16 binding

  16. Sieve element occlusion (SEO) genes encode structural phloem proteins involved in wound sealing of the phloem.

    Science.gov (United States)

    Ernst, Antonia M; Jekat, Stephan B; Zielonka, Sascia; Müller, Boje; Neumann, Ulla; Rüping, Boris; Twyman, Richard M; Krzyzanek, Vladislav; Prüfer, Dirk; Noll, Gundula A

    2012-07-10

    The sieve element occlusion (SEO) gene family originally was delimited to genes encoding structural components of forisomes, which are specialized crystalloid phloem proteins found solely in the Fabaceae. More recently, SEO genes discovered in various non-Fabaceae plants were proposed to encode the common phloem proteins (P-proteins) that plug sieve plates after wounding. We carried out a comprehensive characterization of two tobacco (Nicotiana tabacum) SEO genes (NtSEO). Reporter genes controlled by the NtSEO promoters were expressed specifically in immature sieve elements, and GFP-SEO fusion proteins formed parietal agglomerates in intact sieve elements as well as sieve plate plugs after wounding. NtSEO proteins with and without fluorescent protein tags formed agglomerates similar in structure to native P-protein bodies when transiently coexpressed in Nicotiana benthamiana, and the analysis of these protein complexes by electron microscopy revealed ultrastructural features resembling those of native P-proteins. NtSEO-RNA interference lines were essentially devoid of P-protein structures and lost photoassimilates more rapidly after injury than control plants, thus confirming the role of P-proteins in sieve tube sealing. We therefore provide direct evidence that SEO genes in tobacco encode P-protein subunits that affect translocation. We also found that peptides recently identified in fascicular phloem P-protein plugs from squash (Cucurbita maxima) represent cucurbit members of the SEO family. Our results therefore suggest a common evolutionary origin for P-proteins found in the sieve elements of all dicotyledonous plants and demonstrate the exceptional status of extrafascicular P-proteins in cucurbits.

  17. Population Structure and Gene Flow of the Yellow Anaconda (Eunectes notaeus) in Northern Argentina

    Science.gov (United States)

    McCartney-Melstad, Evan; Waller, Tomás; Micucci, Patricio A.; Barros, Mariano; Draque, Juan; Amato, George; Mendez, Martin

    2012-01-01

    Yellow anacondas (Eunectes notaeus) are large, semiaquatic boid snakes found in wetland systems in South America. These snakes are commercially harvested under a sustainable management plan in Argentina, so information regarding population structuring can be helpful for determination of management units. We evaluated genetic structure and migration using partial sequences from the mitochondrial control region and mitochondrial genes cyt-b and ND4 for 183 samples collected within northern Argentina. A group of landscape features and environmental variables including several treatments of temperature and precipitation were explored as potential drivers of observed genetic patterns. We found significant population structure between most putative population comparisons and bidirectional but asymmetric migration in several cases. The configuration of rivers and wetlands was found to be significantly associated with yellow anaconda population structure (IBD), and important for gene flow, although genetic distances were not significantly correlated with the environmental variables used here. More in-depth analyses of environmental data may be needed to fully understand the importance of environmental conditions on population structure and migration. These analyses indicate that our putative populations are demographically distinct and should be treated as such in Argentina's management plan for the harvesting of yellow anacondas. PMID:22675425

  18. Generation of antigenic diversity in Plasmodium falciparum by structured rearrangement of Var genes during mitosis.

    Science.gov (United States)

    Claessens, Antoine; Hamilton, William L; Kekre, Mihir; Otto, Thomas D; Faizullabhoy, Adnan; Rayner, Julian C; Kwiatkowski, Dominic

    2014-12-01

    The most polymorphic gene family in P. falciparum is the ∼60 var genes distributed across parasite chromosomes, both in the subtelomeres and in internal regions. They encode hypervariable surface proteins known as P. falciparum erythrocyte membrane protein 1 (PfEMP1) that are critical for pathogenesis and immune evasion in Plasmodium falciparum. How var gene sequence diversity is generated is not currently completely understood. To address this, we constructed large clone trees and performed whole genome sequence analysis to study the generation of novel var gene sequences in asexually replicating parasites. While single nucleotide polymorphisms (SNPs) were scattered across the genome, structural variants (deletions, duplications, translocations) were focused in and around var genes, with considerable variation in frequency between strains. Analysis of more than 100 recombination events involving var exon 1 revealed that the average nucleotide sequence identity of two recombining exons was only 63% (range: 52.7-72.4%) yet the crossovers were error-free and occurred in such a way that the resulting sequence was in frame and domain architecture was preserved. Var exon 1, which encodes the immunologically exposed part of the protein, recombined in up to 0.2% of infected erythrocytes in vitro per life cycle. The high rate of var exon 1 recombination indicates that millions of new antigenic structures could potentially be generated each day in a single infected individual. We propose a model whereby var gene sequence polymorphism is mainly generated during the asexual part of the life cycle.
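    The step from "up to 0.2% of infected erythrocytes per life cycle" to "millions of new antigenic structures each day" can be checked with a short calculation. The sketch below assumes a 48-hour asexual cycle for P. falciparum and a few hypothetical parasite burdens; the abstract gives no burden figure, so those values and the function name are illustrative only. Because 0.2% is the reported upper bound, the outputs are upper estimates.

```python
def recombinants_per_day(infected_cells, rate_per_cycle=0.002, cycle_hours=48.0):
    """Upper estimate of var exon 1 recombination events arising per day.

    rate_per_cycle: fraction of infected erythrocytes recombining per life
                    cycle (0.2% is the upper bound quoted in the abstract).
    cycle_hours:    assumed length of one asexual life cycle.
    """
    cycles_per_day = 24.0 / cycle_hours
    return infected_cells * rate_per_cycle * cycles_per_day


# Hypothetical parasite burdens, chosen only to show the order of magnitude.
for burden in (1e9, 1e10, 1e11):
    events = recombinants_per_day(burden)
    print(f"{burden:.0e} infected cells: about {events:.1e} events per day")
```

    Under these assumptions a burden of 10^9 infected erythrocytes already gives on the order of a million events per day, which is consistent with the abstract's claim; not every event need yield a distinct sequence.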

  19. GeneViTo: Visualizing gene-product functional and structural features in genomic datasets

    Directory of Open Access Journals (Sweden)

    Promponas Vasilis J

    2003-10-01

    Full Text Available Abstract Background: The availability of increasing amounts of sequence data from completely sequenced genomes boosts the development of new computational methods for automated genome annotation and comparative genomics. Therefore, there is a need for tools that facilitate the visualization of raw data and results produced by bioinformatics analysis, providing new means for interactive genome exploration. Visual inspection can be used as a basis to assess the quality of various analysis algorithms and to aid in-depth genomic studies. Results: GeneViTo is a JAVA-based computer application that serves as a workbench for genome-wide analysis through visual interaction. The application deals with various experimental information concerning both DNA and protein sequences (derived from public sequence databases or proprietary data sources) and meta-data obtained by various prediction algorithms, classification schemes or user-defined features. Interaction with a Graphical User Interface (GUI) allows easy extraction of genomic and proteomic data referring to the sequence itself, sequence features, or general structural and functional features. Emphasis is laid on the potential comparison between annotation and prediction data in order to offer a supplement to the provided information, especially in cases of "poor" annotation, or an evaluation of available predictions. Moreover, desired information can be output in high quality JPEG image files for further elaboration and scientific use. A compilation of properly formatted GeneViTo input data for demonstration is available to interested readers for two completely sequenced prokaryotes, Chlamydia trachomatis and Methanococcus jannaschii. Conclusions: GeneViTo offers an inspectional view of genomic functional elements, concerning data stemming both from database annotation and analysis tools for an overall analysis of existing genomes. The application is compatible with Linux or Windows ME-2000-XP operating

  20. Gene Structures, Classification, and Expression Models of the DREB Transcription Factor Subfamily in Populus trichocarpa

    Directory of Open Access Journals (Sweden)

    Yunlin Chen

    2013-01-01

    Full Text Available We identified 75 dehydration-responsive element-binding (DREB) protein genes in Populus trichocarpa. We analyzed gene structures, phylogenies, domain duplications, genome localizations, and expression profiles. The phylogenetic reconstruction suggests that the PtrDREB gene subfamily can be classified broadly into six subtypes (DREB A-1 to A-6) in Populus. The chromosomal localizations of the PtrDREB genes indicated 18 segmental duplication events involving 36 genes, and six redundant PtrDREB genes were involved in tandem duplication events. There were fewer introns in the PtrDREB subfamily. The motif composition of PtrDREB was highly conserved in the same subtype. We investigated expression profiles of this gene subfamily from different tissues and/or developmental stages. Sixteen genes present in the digital expression analysis had high levels of transcript accumulation. The microarray results suggest that 18 genes were upregulated. We further examined the stress responsiveness of 15 genes by qRT-PCR. A digital northern analysis showed that the PtrDREB17, 18, and 32 genes were highly induced in leaves under cold stress, and the same expression trends were shown by qRT-PCR. Taken together, these observations may lay the foundation for future functional analyses to unravel the biological roles of Populus’ DREB genes.

  1. Inference of gene regulatory networks with sparse structural equation models exploiting genetic perturbations.

    Directory of Open Access Journals (Sweden)

    Xiaodong Cai

    Full Text Available Integrating genetic perturbations with gene expression data not only improves accuracy of regulatory network topology inference, but also enables learning of causal regulatory relations between genes. Although a number of methods have been developed to integrate both types of data, the need for efficient and powerful algorithms remains. In this paper, sparse structural equation models (SEMs) are employed to integrate both gene expression data and cis-expression quantitative trait loci (cis-eQTL) for modeling gene regulatory networks in accordance with biological evidence about genes regulating or being regulated by a small number of genes. A systematic inference method named sparsity-aware maximum likelihood (SML) is developed for SEM estimation. Using simulated directed acyclic or cyclic networks, the SML performance is compared with that of two state-of-the-art algorithms: the adaptive Lasso (AL)-based scheme, and the QTL-directed dependency graph (QDG) method. Computer simulations demonstrate that the novel SML algorithm offers significantly better performance than the AL-based and QDG algorithms across all sample sizes from 100 to 1,000, in terms of detection power and false discovery rate, in all the cases tested, which include acyclic or cyclic networks of 10, 30 and 300 genes. The SML method is further applied to infer a network of 39 human genes that are related to the immune function and are chosen to have a reliable eQTL per gene. The resulting network consists of 9 genes and 13 edges. Most of the edges represent interactions reasonably expected from experimental evidence, while the remaining edges may indicate new interactions. The sparse SEM and efficient SML algorithm provide an effective means of exploiting both gene expression and perturbation data to infer gene regulatory networks. An open-source computer program implementing the SML algorithm is freely available upon request.

  2. Structure-related clustering of gene expression fingerprints of thp-1 cells exposed to smaller polycyclic aromatic hydrocarbons.

    Science.gov (United States)

    Wan, B; Yarbrough, J W; Schultz, T W

    2008-01-01

    This study was undertaken to test the hypothesis that structurally similar PAHs induce similar gene expression profiles. THP-1 cells were exposed to a series of 12 selected PAHs at 50 µM for 24 hours and gene expression profiles were analyzed using both unsupervised and supervised methods. Clustering analysis of gene expression profiles revealed that the 12 tested chemicals were grouped into five clusters. Within each cluster, the gene expression profiles are more similar to each other than to the ones outside the cluster. 1-Methylanthracene and 1-methylfluorene were found to have the most similar profiles; dibenzothiophene and dibenzofuran were found to share common profiles with fluorene. As expression pattern comparisons were expanded, similarity in genomic fingerprint dropped off dramatically. Prediction analysis of microarrays (PAM) based on the clustering pattern generated 49 predictor genes that can be used for sample discrimination. Moreover, significance analysis of microarrays (SAM) identified 598 genes being modulated by tested chemicals, with a variety of biological processes, such as cell cycle, metabolism, and protein binding, and KEGG pathways being significantly (p < 0.05) affected. It is feasible to distinguish structurally different PAHs based on their genomic fingerprints, which are mechanism based.
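    Grouping chemicals by the similarity of their expression profiles, as described above, can be sketched with a small agglomerative clustering routine. The abstract does not say which clustering method or distance the authors used, so the choices here (average linkage on a Pearson correlation distance) are assumptions, and the compound names and values in the usage example are invented for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson_distance(x, y):
    """1 - Pearson correlation: 0 for perfectly correlated profiles, 2 for opposite."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return 1.0 - cov / (sx * sy)


def average_linkage(profiles, k):
    """Merge the two closest clusters repeatedly until only k clusters remain.

    profiles: dict mapping a chemical name to its expression profile (list of floats).
    Returns a sorted list of sorted name lists, one per cluster.
    """
    clusters = [[name] for name in profiles]

    def dist(c1, c2):
        # Average pairwise distance between members of the two clusters.
        total = sum(pearson_distance(profiles[a], profiles[b]) for a in c1 for b in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > k:
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: dist(clusters[ij[0]], clusters[ij[1]]))
        clusters[i] = clusters[i] + clusters[j]  # i < j, so deleting j is safe
        del clusters[j]
    return sorted(sorted(c) for c in clusters)


# Hypothetical log-ratio profiles for four made-up compounds.
toy = {
    "chem_A": [1.0, 2.0, 3.0, 4.0],
    "chem_B": [1.1, 2.1, 2.9, 4.2],
    "chem_C": [4.0, 3.0, 2.0, 1.0],
    "chem_D": [4.2, 2.9, 2.1, 1.1],
}
print(average_linkage(toy, 2))
```

    Correlation distance is used because it compares the shape of a profile rather than its absolute magnitude. With real microarray data one would normally use an established library rather than this quadratic-time loop.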

  3. Evolution of GHF5 endoglucanase gene structure in plant-parasitic nematodes: no evidence for an early domain shuffling event

    Directory of Open Access Journals (Sweden)

    Gheysen Godelieve

    2008-11-01

    Full Text Available Abstract Background: Endo-1,4-beta-glucanases or cellulases from the glycosyl hydrolase family 5 (GHF5) have been found in numerous bacteria and fungi, and recently also in higher eukaryotes, particularly in plant-parasitic nematodes (PPN). The origin of these genes has been attributed to horizontal gene transfer from bacteria, although there still is a lot of uncertainty about the origin and structure of the ancestral GHF5 PPN endoglucanase. It is not clear whether this ancestral endoglucanase consisted of the whole gene cassette, containing a catalytic domain and a carbohydrate-binding module (CBM, type 2 in PPN and bacteria) or only of the catalytic domain while the CBM2 was retrieved by domain shuffling later in evolution. Previous studies on the evolution of these genes have focused primarily on data of sedentary nematodes, while in this study, extra data from migratory nematodes were included. Results: Two new endoglucanases from the migratory nematodes Pratylenchus coffeae and Ditylenchus africanus were included in this study. The latter one is the first gene isolated from a PPN of a different superfamily (Sphaerularioidea); all previously known nematode endoglucanases belong to the superfamily Tylenchoidea (order Rhabditida). Phylogenetic analyses were conducted with the PPN GHF5 endoglucanases and homologous endoglucanases from bacterial and other eukaryotic lineages such as beetles, fungi and plants. No statistical incongruence between the phylogenetic trees deduced from the catalytic domain and the CBM2 was found, which could suggest that both domains have evolved together. Furthermore, based on gene structure data, we inferred a model for the evolution of the GHF5 endoglucanase gene structure in plant-parasitic nematodes. Our data confirm a close relationship between Pratylenchus spp. and the root knot nematodes, while some Radopholus similis endoglucanases are more similar to cyst nematode genes. Conclusion: We conclude that the ancestral

  4. Evolution of GHF5 endoglucanase gene structure in plant-parasitic nematodes: no evidence for an early domain shuffling event.

    Science.gov (United States)

    Kyndt, Tina; Haegeman, Annelies; Gheysen, Godelieve

    2008-11-03

    Endo-1,4-beta-glucanases or cellulases from the glycosyl hydrolase family 5 (GHF5) have been found in numerous bacteria and fungi, and recently also in higher eukaryotes, particularly in plant-parasitic nematodes (PPN). The origin of these genes has been attributed to horizontal gene transfer from bacteria, although there still is a lot of uncertainty about the origin and structure of the ancestral GHF5 PPN endoglucanase. It is not clear whether this ancestral endoglucanase consisted of the whole gene cassette, containing a catalytic domain and a carbohydrate-binding module (CBM, type 2 in PPN and bacteria) or only of the catalytic domain while the CBM2 was retrieved by domain shuffling later in evolution. Previous studies on the evolution of these genes have focused primarily on data of sedentary nematodes, while in this study, extra data from migratory nematodes were included. Two new endoglucanases from the migratory nematodes Pratylenchus coffeae and Ditylenchus africanus were included in this study. The latter one is the first gene isolated from a PPN of a different superfamily (Sphaerularioidea); all previously known nematode endoglucanases belong to the superfamily Tylenchoidea (order Rhabditida). Phylogenetic analyses were conducted with the PPN GHF5 endoglucanases and homologous endoglucanases from bacterial and other eukaryotic lineages such as beetles, fungi and plants. No statistical incongruence between the phylogenetic trees deduced from the catalytic domain and the CBM2 was found, which could suggest that both domains have evolved together. Furthermore, based on gene structure data, we inferred a model for the evolution of the GHF5 endoglucanase gene structure in plant-parasitic nematodes. Our data confirm a close relationship between Pratylenchus spp. and the root knot nematodes, while some Radopholus similis endoglucanases are more similar to cyst nematode genes. We conclude that the ancestral PPN GHF5 endoglucanase gene most probably consisted of

  5. Crystal structure of the MSMEG_4306 gene product from Mycobacterium smegmatis.

    Science.gov (United States)

    Kumar, Adarsh; Karthikeyan, Subramanian

    2018-03-01

    The MSMEG_4306 gene from Mycobacterium smegmatis encodes a protein of unknown function with 242 amino-acid residues that contains a conserved zinc-ribbon domain at its C-terminus. Here, the crystal structure of MSMEG_4306 determined by the single-wavelength anomalous dispersion method using just one zinc ion co-purified with the protein is reported. The crystal structure of MSMEG_4306 shows a coiled-coil helix domain in the N-terminal region and a zinc-ribbon domain in the C-terminal region. A structural similarity search against the Protein Data Bank using MSMEG_4306 as a query revealed two similar structures, namely CT398 from Chlamydia trachomatis and HP0958 from Helicobacter pylori, although they share only ∼15% sequence identity with MSMEG_4306. Based on comparative analysis, it is predicted that MSMEG_4306 may be involved in secretion systems, possibly by interacting with multiple proteins or nucleic acids.

  6. Structural basis for regulation of rhizobial nodulation and symbiosis gene expression by the regulatory protein NolR.

    Science.gov (United States)

    Lee, Soon Goo; Krishnan, Hari B; Jez, Joseph M

    2014-04-29

    The symbiosis between rhizobial microbes and host plants involves the coordinated expression of multiple genes, which leads to nodule formation and nitrogen fixation. As part of the transcriptional machinery for nodulation and symbiosis across a range of Rhizobium, NolR serves as a global regulatory protein. Here, we present the X-ray crystal structures of NolR in the unliganded form and complexed with two different 22-base pair (bp) double-stranded operator sequences (oligos AT and AA). Structural and biochemical analysis of NolR reveals protein-DNA interactions with an asymmetric operator site and defines a mechanism for conformational switching of a key residue (Gln56) to accommodate variation in target DNA sequences from diverse rhizobial genes for nodulation and symbiosis. This conformational switching alters the energetic contributions to DNA binding without changes in affinity for the target sequence. Two possible models for the role of NolR in the regulation of different nodulation and symbiosis genes are proposed. To our knowledge, these studies provide the first structural insight on the regulation of genes involved in the agriculturally and ecologically important symbiosis of microbes and plants that leads to nodule formation and nitrogen fixation.

  7. Metagenomes reveal microbial structures, functional potentials, and biofouling-related genes in a membrane bioreactor.

    Science.gov (United States)

    Ma, Jinxing; Wang, Zhiwei; Li, Huan; Park, Hee-Deung; Wu, Zhichao

    2016-06-01

    Metagenomic sequencing was used to investigate the microbial structures, functional potentials, and biofouling-related genes in a membrane bioreactor (MBR). The results showed that the microbial community in the MBR was highly diverse. Notably, function analysis of the dominant genera indicated that common genes from different phylotypes were identified for important functional potentials with the observation of variation of abundances of genes in a certain taxon (e.g., Dechloromonas). Despite maintaining similar metabolic functional potentials with a parallel full-scale conventional activated sludge (CAS) system due to treating the identical wastewater, the MBR had more abundant nitrification-related bacteria and coding genes of ammonia monooxygenase, which could well explain its excellent ammonia removal in the low-temperature period. Furthermore, according to quantification of the genes involved in exopolysaccharide and extracellular polymeric substance (EPS) protein metabolism, the MBR did not show a much different potential in producing EPS compared to the CAS system, and bacteria from the membrane biofilm had lower abundances of genes associated with EPS biosynthesis and transport compared to the activated sludge in the MBR.

  8. Cationic niosomes an effective gene carrier composed of novel spermine-derivative cationic lipids: effect of central core structures.

    Science.gov (United States)

    Opanasopit, Praneet; Leksantikul, Lalita; Niyomtham, Nattisa; Rojanarata, Theerasak; Ngawhirunpat, Tanasait; Yingyongnarongkul, Boon-Ek

    2017-05-01

    Cationic niosomes formulated from Span 20, cholesterol (Chol) and novel spermine-based cationic lipids of multiple central core structures (di(oxyethyl)amino, di(oxyethyl)amino carboxy, 3-amino-1,2-dioxypropyl and 2-amino-1,3-dioxypropyl) were successfully prepared for improving transfection efficiency in vitro. The niosomes composed of spermine cationic lipid with central core structure of di(oxyethyl)amino revealed the highest gene transfection efficiency. The aim was to investigate the factors affecting gene transfection and cell viability, including differences in the central core structures of cationic lipids, the composition of vesicles, the molar ratio of cationic lipids in formulations and the weight ratio of niosomes to DNA. Cationic niosomes composed of nonionic surfactants (Span20), cholesterol and spermine-based cationic lipids of multiple central core structures were formulated. Gene transfection and cell viability were evaluated on a human cervical carcinoma cell line (HeLa cells) using pDNA encoding green fluorescent protein (pEGFP-C2). The morphology, size and charge were also characterized. High transfection efficiency was obtained from cationic niosomes composed of Span20:Chol:cationic lipid at the molar ratio of 2.5:2.5:0.5 mM. Cationic lipids with di(oxyethyl)amino as a central core structure exhibited the highest transfection efficiency. In addition, there was also no serum effect on transfection efficiency. These novel cationic niosomes may constitute a good alternative carrier for gene transfection.

  9. Gene structure, expression, and DNA methylation characteristics of sea cucumber cyclin B gene during aestivation.

    Science.gov (United States)

    Zhu, Aijun; Chen, Muyan; Zhang, Xiumei; Storey, Kenneth B

    2016-12-05

    The sea cucumber, Apostichopus japonicus, is a good model for studying environmentally-induced aestivation by a marine invertebrate. One of the central requirements of aestivation is the repression of energy-expensive cellular processes such as cell cycle progression. The present study identified the gene structure of the cell cycle regulator, cyclin B, and detected the expression levels of this gene over three stages of the annual aestivation-arousal cycle. Furthermore, the DNA methylation characteristics of cyclin B were analyzed in non-aestivation and deep-aestivation stages of sea cucumbers. We found that the cyclin B promoter contains a CpG island, three CCAAT-boxes and three cell cycle gene homology regions (CHRs). Application of qRT-PCR analysis showed significant downregulation of cyclin B transcript levels during deep-aestivation in comparison with non-aestivation in both intestine and longitudinal muscle, and these returned to basal levels after arousal from aestivation. Methylation analysis of the cyclin B core promoter revealed that its methylation level showed significant differences between non-aestivation and deep-aestivation stages (p<0.05) and interestingly, a positive correlation between Cyclin B transcripts expression and methylation levels of the core promoter was also observed. Our findings suggest that cell cycle progression may be reversibly arrested during aestivation as indicated by the changes in cyclin B expression levels and we propose that DNA methylation is one of the regulatory mechanisms involved in cyclin B transcriptional variation. Copyright © 2016 Elsevier B.V. All rights reserved.

  10. Transcriptional Regulation in Ebola Virus: Effects of Gene Border Structure and Regulatory Elements on Gene Expression and Polymerase Scanning Behavior.

    Science.gov (United States)

    Brauburger, Kristina; Boehmann, Yannik; Krähling, Verena; Mühlberger, Elke

    2016-02-15

    The highly pathogenic Ebola virus (EBOV) has a nonsegmented negative-strand (NNS) RNA genome containing seven genes. The viral genes either are separated by intergenic regions (IRs) of variable length or overlap. The structure of the EBOV gene overlaps is conserved throughout all filovirus genomes and is distinct from that of the overlaps found in other NNS RNA viruses. Here, we analyzed how diverse gene borders and noncoding regions surrounding the gene borders influence transcript levels and govern polymerase behavior during viral transcription. Transcription of overlapping genes in EBOV bicistronic minigenomes followed the stop-start mechanism, similar to that followed by IR-containing gene borders. When the gene overlaps were extended, the EBOV polymerase was able to scan the template in an upstream direction. This polymerase feature seems to be generally conserved among NNS RNA virus polymerases. Analysis of IR-containing gene borders showed that the IR sequence plays only a minor role in transcription regulation. Changes in IR length were generally well tolerated, but specific IR lengths led to a strong decrease in downstream gene expression. Correlation analysis revealed that these effects were largely independent of the surrounding gene borders. Each EBOV gene contains exceptionally long untranslated regions (UTRs) flanking the open reading frame. Our data suggest that the UTRs adjacent to the gene borders are the main regulators of transcript levels. A highly complex interplay between the different cis-acting elements to modulate transcription was revealed for specific combinations of IRs and UTRs, emphasizing the importance of the noncoding regions in EBOV gene expression control. Our data extend those from previous analyses investigating the implication of noncoding regions at the EBOV gene borders for gene expression control. We show that EBOV transcription is regulated in a highly complex yet not easily predictable manner by a set of interacting cis

  11. Structure and Expression Analyses of SVA Elements in Relation to Functional Genes

    Directory of Open Access Journals (Sweden)

    Yun-Jeong Kwon

    2013-09-01

    SINE-VNTR-Alu (SVA) elements are present in hominoid primates and are divided into 6 subfamilies (SVA-A to SVA-F) and active in the human population. Using a bioinformatic tool, 22 SVA element-associated genes are identified in the human genome. In an analysis of genomic structure, SVA elements are detected in the 5' untranslated region (UTR) of HGSNAT (SVA-B), MRGPRX3 (SVA-D), HYAL1 (SVA-F), TCHH (SVA-F), and ATXN2L (SVA-F) genes, while some elements are observed in the 3'UTR of SPICE1 (SVA-B), TDRKH (SVA-C), GOSR1 (SVA-D), BBS5 (SVA-D), NEK5 (SVA-D), ABHD2 (SVA-F), C1QTNF7 (SVA-F), ORC6L (SVA-F), TMEM69 (SVA-F), and CCDC137 (SVA-F) genes. They could contribute to exon extension or supplying poly A signals. LEPR (SVA-C), ALOX5 (SVA-D), PDS5B (SVA-D), and ABCA10 (SVA-F) genes also showed alternative transcripts by SVA exonization events. Dominant expression of HYAL1_SVA appeared in lung tissues, while HYAL1_noSVA showed ubiquitous expression in various human tissues. Expression of both transcripts (TDRKH_SVA and TDRKH_noSVA) of the TDRKH gene appeared to be ubiquitous. Taken together, these data suggest that SVA elements cause transcript isoforms that contribute to modulation of gene regulation in various human tissues.

  12. Evidence of strain structure in Plasmodium falciparum var gene repertoires in children from Gabon, West Africa.

    Science.gov (United States)

    Day, Karen P; Artzy-Randrup, Yael; Tiedje, Kathryn E; Rougeron, Virginie; Chen, Donald S; Rask, Thomas S; Rorick, Mary M; Migot-Nabias, Florence; Deloron, Philippe; Luty, Adrian J F; Pascual, Mercedes

    2017-05-16

    Existing theory on competition for hosts between pathogen strains has proposed that immune selection can lead to the maintenance of strain structure consisting of discrete, weakly overlapping antigenic repertoires. This prediction of strain theory has conceptual overlap with fundamental ideas in ecology on niche partitioning and limiting similarity between coexisting species in an ecosystem, which oppose the hypothesis of neutral coexistence. For Plasmodium falciparum, strain theory has been specifically proposed in relation to the major surface antigen of the blood stage, known as PfEMP1 and encoded by the multicopy multigene family known as the var genes. Deep sampling of the DBLα domain of var genes in the local population of Bakoumba, West Africa, was completed to define whether patterns of repertoire overlap support a role of immune selection under the opposing force of high outcrossing, a characteristic of areas of intense malaria transmission. Using a 454 high-throughput sequencing protocol, we report extremely high diversity of the DBLα domain and a large parasite population with DBLα repertoires structured into nonrandom patterns of overlap. Such population structure, significant for the high diversity of var genes that compose it at a local level, supports the existence of "strains" characterized by distinct var gene repertoires. Nonneutral, frequency-dependent competition would be at play and could underlie these patterns. With a computational experiment that simulates an intervention similar to mass drug administration, we argue that the observed repertoire structure matters for the antigenic var diversity of the parasite population remaining after intervention.

  13. The role of gene flow in shaping genetic structures of the subtropical conifer species Araucaria angustifolia.

    Science.gov (United States)

    Stefenon, V M; Gailing, O; Finkeldey, R

    2008-05-01

    The morphological features of pollen and seed of Araucaria angustifolia have led to the proposal of limited gene dispersal for this species. We used nuclear microsatellite and AFLP markers to assess patterns of genetic variation in six natural populations at the intra- and inter-population level, and related our findings to gene dispersal in this species. Estimates of both fine-scale spatial genetic structure (SGS) and migration rate suggest relatively short-distance gene dispersal. However, gene dispersal differed among populations, and effects of more efficient dispersal within population were observed in at least one stand. In addition, even though some seed dispersal may be aggregated in this principally barochorous species, reasonable secondary seed dispersal, presumably facilitated by animals, and overlap of seed shadows within populations is suggested. Overall, no correlation was observed between levels of SGS and inbreeding, density or age structure, except that a higher level of SGS was revealed for the population with a higher number of juvenile individuals. A low estimate for the number of migrants per generation between two neighbouring populations implies limited gene flow. We expect that stepping-stone pollen flow may have contributed to low genetic differentiation among populations observed in a previous survey. Thus, strategies for maintenance of gene flow among remnant populations should be considered in order to avoid degrading effects of population fragmentation on the evolution of A. angustifolia.

  14. High-throughput interpretation of gene structure changes in human and nonhuman resequencing data, using ACE.

    Science.gov (United States)

    Majoros, William H; Campbell, Michael S; Holt, Carson; DeNardo, Erin K; Ware, Doreen; Allen, Andrew S; Yandell, Mark; Reddy, Timothy E

    2017-05-15

    The accurate interpretation of genetic variants is critical for characterizing genotype-phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE ('Assessing Changes to Exons') converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. ACE is written in open-source C++ and Perl and is available from geneprediction.org/ACE. Contact: myandell@genetics.utah.edu or tim.reddy@duke.edu. Supplementary information is available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.

  15. Diversity in copy number and structure of a silkworm morphogenetic gene as a result of domestication.

    Science.gov (United States)

    Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

    2011-03-01

    The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. © 2011 by the Genetics Society of America

  16. Structural analysis of the α subunit of Na(+)/K(+) ATPase genes in invertebrates.

    Science.gov (United States)

    Thabet, Rahma; Rouault, J-D; Ayadi, Habib; Leignel, Vincent

    2016-01-01

    The Na(+)/K(+) ATPase is a ubiquitous pump coordinating the transport of Na(+) and K(+) across the membrane of cells and its role is fundamental to cellular functions. It is a heteromer in eukaryotes, comprising two or three subunits (α, β and γ, the last being specific to vertebrates). The catalytic functions of the enzyme have been attributed to the α subunit. Several complete α protein sequences are available, but only a few gene structures have been characterized. We identified the genomic sequences coding the α-subunit of the Na(+)/K(+) ATPase, from the whole-genome shotgun contigs (WGS), NCBI Genomes (chromosome), Genomic Survey Sequences (GSS) and High Throughput Genomic Sequences (HTGS) databases across distinct phyla. One copy of the α subunit gene was found in Annelida, Arthropoda, Cnidaria, Echinodermata, Hemichordata, Mollusca, Placozoa, Porifera, Platyhelminthes, Urochordata, but the nematodes seem to possess 2 to 4 copies. The number of introns varied from 0 (Platyhelminthes) to 26 (Porifera), and their localization and length are also highly variable. Molecular phylogenies (Maximum Likelihood and Maximum Parsimony methods) showed some clusters constituted by (Chordata/(Echinodermata/Hemichordata)) or (Platyhelminthes/(Annelida/Mollusca)) and a basal position for Porifera. These structural analyses increase our knowledge about the evolutionary events of the α subunit genes in the invertebrates. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Structure and gene cluster of the O-antigen of Escherichia coli O54.

    Science.gov (United States)

    Naumenko, Olesya I; Guo, Xi; Senchenkova, Sof'ya N; Geng, Peng; Perepelov, Andrei V; Shashkov, Alexander S; Liu, Bin; Knirel, Yuriy A

    2018-06-15

    Mild acid hydrolysis of the lipopolysaccharide of Escherichia coli O54 afforded an O-polysaccharide, which was studied by sugar analysis, solvolysis with anhydrous trifluoroacetic acid, and 1H and 13C NMR spectroscopy. Solvolysis cleaved predominantly the linkage of β-d-Ribf and, to a lesser extent, that of β-d-GlcpNAc, whereas the other linkages, including the linkage of α-l-Rhap, were stable under selected conditions (40 °C, 5 h). The following structure of the O-polysaccharide was established: →4)-α-d-GalpA-(1→2)-α-l-Rhap-(1→2)-β-d-Ribf-(1→4)-β-d-Galp-(1→3)-β-d-GlcpNAc-(1→. The O-antigen gene cluster of E. coli O54 was analyzed and found to be consistent in general with the O-polysaccharide structure established but there were two exceptions: i) in the cluster, there were genes for phosphoserine phosphatase and serine transferase, which have no apparent role in the O-polysaccharide synthesis, and ii) no ribofuranosyltransferase gene was present in the cluster. Both uncommon features are shared by some other enteric bacteria. Copyright © 2018 Elsevier Ltd. All rights reserved.

  18. Generation of antigenic diversity in Plasmodium falciparum by structured rearrangement of Var genes during mitosis.

    Directory of Open Access Journals (Sweden)

    Antoine Claessens

    2014-12-01

    The most polymorphic gene family in P. falciparum is the ∼60 var genes distributed across parasite chromosomes, both in the subtelomeres and in internal regions. They encode hypervariable surface proteins known as P. falciparum erythrocyte membrane protein 1 (PfEMP1) that are critical for pathogenesis and immune evasion in Plasmodium falciparum. How var gene sequence diversity is generated is not currently completely understood. To address this, we constructed large clone trees and performed whole genome sequence analysis to study the generation of novel var gene sequences in asexually replicating parasites. While single nucleotide polymorphisms (SNPs) were scattered across the genome, structural variants (deletions, duplications, translocations) were focused in and around var genes, with considerable variation in frequency between strains. Analysis of more than 100 recombination events involving var exon 1 revealed that the average nucleotide sequence identity of two recombining exons was only 63% (range: 52.7-72.4%) yet the crossovers were error-free and occurred in such a way that the resulting sequence was in frame and domain architecture was preserved. Var exon 1, which encodes the immunologically exposed part of the protein, recombined in up to 0.2% of infected erythrocytes in vitro per life cycle. The high rate of var exon 1 recombination indicates that millions of new antigenic structures could potentially be generated each day in a single infected individual. We propose a model whereby var gene sequence polymorphism is mainly generated during the asexual part of the life cycle.

  19. The small heat shock proteins from Acidithiobacillus ferrooxidans: gene expression, phylogenetic analysis, and structural modeling

    Directory of Open Access Journals (Sweden)

    Ribeiro Daniela A

    2011-12-01

    Background: Acidithiobacillus ferrooxidans is an acidophilic, chemolithoautotrophic bacterium that has been successfully used in metal bioleaching. In this study, an analysis of the A. ferrooxidans ATCC 23270 genome revealed the presence of three sHSP genes, Afe_1009, Afe_1437 and Afe_2172, that encode proteins from the HSP20 family, a class of intracellular multimers that is especially important in extremophile microorganisms. Results: The expression of the sHSP genes was investigated in A. ferrooxidans cells submitted to a heat shock at 40°C for 15, 30 and 60 minutes. After 60 minutes, the gene on locus Afe_1437 was about 20-fold more highly expressed than the gene on locus Afe_2172. Bioinformatic and phylogenetic analyses showed that the sHSPs from A. ferrooxidans are possible non-paralogous proteins, and are regulated by the σ32 factor, a common transcription factor of heat shock proteins. Structural studies using homology molecular modeling indicated that the proteins encoded by Afe_1009 and Afe_1437 have a conserved α-crystallin domain and share similar structural features with the sHSP from Methanococcus jannaschii, suggesting that their biological assembly involves 24 molecules and resembles a hollow spherical shell. Conclusion: We conclude that the sHSPs encoded by the Afe_1437 and Afe_1009 genes are more likely to act as molecular chaperones in the A. ferrooxidans heat shock response. In addition, the three sHSPs from A. ferrooxidans are not recent paralogs, and the Afe_1437 and Afe_1009 genes could be inherited horizontally by A. ferrooxidans.

  20. Structural characteristics of ScBx genes controlling the biosynthesis of hydroxamic acids in rye (Secale cereale L.).

    Science.gov (United States)

    Bakera, Beata; Makowska, Bogna; Groszyk, Jolanta; Niziołek, Michał; Orczyk, Wacław; Bolibok-Brągoszewska, Hanna; Hromada-Judycka, Aneta; Rakoczy-Trojanowska, Monika

    2015-08-01

    Benzoxazinoids (BX) are major secondary metabolites of gramineous plants that play an important role in disease resistance and allelopathy. They also have many other unique properties including anti-bacterial and anti-fungal activity, and the ability to reduce alpha-amylase activity. The biosynthesis and modification of BX are controlled by the genes Bx1 to Bx10, GT and glu, and the majority of these Bx genes have been mapped in maize, wheat and rye. However, the genetic basis of BX biosynthesis remains largely uncharacterized apart from some data from maize and wheat. The aim of this study was to isolate, sequence and characterize five genes (ScBx1, ScBx2, ScBx3, ScBx4 and ScBx5) encoding enzymes involved in the synthesis of DIBOA, an important defense compound of rye. Using a modified 3D procedure of BAC library screening, seven BAC clones containing all of the ScBx genes were isolated and sequenced. Bioinformatic analyses of the resulting contigs were used to examine the structure and other features of these genes, including their promoters, introns and 3'UTRs. Comparative analysis showed that the ScBx genes are similar to those of other Poaceae species, especially to the TaBx genes. The polymorphisms present both in the coding sequences and non-coding regions of ScBx in relation to other Bx genes are predicted to have an impact on the expression, structure and properties of the encoded proteins.

  1. The 3D chromatin structure of the mouse β-haemoglobin gene cluster

    NARCIS (Netherlands)

    M.P.C. van de Corput (Mariëtte); T.A. Knoch (Tobias); E. de Boer (Ernie); W.A. van Cappellen (Gert); M. Lesnussa (Michael); H.J.F.M.M. Eussen (Bert)

    2010-01-01

    Here we show a 3D DNA-FISH method to visualize the 3D structure of the β-globin locus. Geometric size and shape measurements of the 3D rendered signals (128Kb) show that the volume of the β-globin locus decreases almost two fold upon gene activation. A decrease in length and a

  2. Phylogenetic and structural diversity in the feline leukemia virus env gene.

    Directory of Open Access Journals (Sweden)

    Shinya Watanabe

    Feline leukemia virus (FeLV) belongs to the genus Gammaretrovirus, and causes a variety of neoplastic and non-neoplastic diseases in cats. Alteration of viral env sequences is thought to be associated with disease specificity, but the way in which genetic diversity of FeLV contributes to the generation of such variants in nature is poorly understood. We isolated FeLV env genes from naturally infected cats in Japan and analyzed the evolutionary dynamics of these genes. Phylogenetic reconstructions separated our FeLV samples into three distinct genetic clusters, termed Genotypes I, II, and III. Genotype I is a major genetic cluster and can be further classified into Clades 1-7 in Japan. Genotypes were correlated with geographical distribution; Genotypes I and II were distributed within Japan, whilst FeLV samples from outside Japan belonged to Genotype III. These results may be due to geographical isolation of FeLVs in Japan. The observed structural diversity of the FeLV env gene appears to be caused primarily by mutation, deletion, insertion and recombination, and these variants may be generated de novo in individual cats. FeLV interference assay revealed that FeLV genotypes did not correlate with known FeLV receptor subgroups. We have identified the genotypes which we consider to be reliable for evaluating phylogenetic relationships of FeLV, which embrace the high structural diversity observed in our sample. Overall, these findings extend our understanding of Gammaretrovirus evolutionary patterns in the field, and may provide a useful basis for assessing the emergence of novel strains and understanding the molecular mechanisms of FeLV transmission in cats.

  3. HFE gene: Structure, function, mutations, and associated iron abnormalities.

    Science.gov (United States)

    Barton, James C; Edwards, Corwin Q; Acton, Ronald T

    2015-12-15

    The hemochromatosis gene HFE was discovered in 1996, more than a century after clinical and pathologic manifestations of hemochromatosis were reported. Linked to the major histocompatibility complex (MHC) on chromosome 6p, HFE encodes the MHC class I-like protein HFE that binds beta-2 microglobulin. HFE influences iron absorption by modulating the expression of hepcidin, the main controller of iron metabolism. Common HFE mutations account for ~90% of hemochromatosis phenotypes in whites of western European descent. We review HFE mapping and cloning, structure, promoters and controllers, and coding region mutations, HFE protein structure, cell and tissue expression and function, mouse Hfe knockouts and knockins, and HFE mutations in other mammals with iron overload. We describe the pertinence of the HFE gene and the HFE protein to mechanisms of iron homeostasis, the origin and fixation of HFE polymorphisms in European and other populations, and the genetic and biochemical basis of HFE hemochromatosis and iron overload. Copyright © 2015 Elsevier B.V. All rights reserved.

  4. Functional gene array-based analysis of microbial community structure in groundwaters with a gradient of contaminant levels

    Energy Technology Data Exchange (ETDEWEB)

    Waldron, P.J.; Wu, L.; Van Nostrand, J.D.; Schadt, C.W.; Watson, D.B.; Jardine, P.M.; Palumbo, A.V.; Hazen, T.C.; Zhou, J.

    2009-06-15

    To understand how contaminants affect microbial community diversity, heterogeneity, and functional structure, six groundwater monitoring wells from the Field Research Center of the U.S. Department of Energy Environmental Remediation Science Program (ERSP; Oak Ridge, TN), with a wide range of pH, nitrate, and heavy metal contamination were investigated. DNA from the groundwater community was analyzed with a functional gene array containing 2006 probes to detect genes involved in metal resistance, sulfate reduction, organic contaminant degradation, and carbon and nitrogen cycling. Microbial diversity decreased in relation to the contamination levels of the wells. Highly contaminated wells had lower gene diversity but greater signal intensity than the pristine well. The microbial composition was heterogeneous, with 17-70% overlap between different wells. Metal-resistant and metal-reducing microorganisms were detected in both contaminated and pristine wells, suggesting the potential for successful bioremediation of metal-contaminated groundwaters. In addition, results of Mantel tests and canonical correspondence analysis indicate that nitrate, sulfate, pH, uranium, and technetium have a significant (p < 0.05) effect on microbial community structure. This study provides an overall picture of microbial community structure in contaminated environments with functional gene arrays by showing that diversity and heterogeneity can vary greatly in relation to contamination.

  5. Primary structure and promoter analysis of leghemoglobin genes of the stem-nodulated tropical legume Sesbania rostrata: conserved coding sequences, cis-elements and trans-acting factors

    DEFF Research Database (Denmark)

    Metz, B A; Welters, P; Hoffmann, H J

    1988-01-01

    The primary structure of a leghemoglobin (lb) gene from the stem-nodulated, tropical legume Sesbania rostrata and two lb gene promoter regions was analysed. The S. rostrata lb gene structure and Lb amino acid composition were found to be highly conserved with previously described lb genes and Lb ...

  6. Genetic structure and gene flows within horses: a genealogical study at the french population scale.

    Science.gov (United States)

    Pirault, Pauline; Danvy, Sophy; Verrier, Etienne; Leroy, Grégoire

    2013-01-01

    Since horse breeds constitute populations submitted to variable and multiple outcrossing events, we analyzed the genetic structure and gene flows considering horses raised in France. We used genealogical data, with a reference population of 547,620 horses born in France between 2002 and 2011, grouped according to 55 breed origins. On average, individuals had 6.3 equivalent generations known. Considering different population levels, fixation index decreased from an overall species FIT of 1.37%, to an average [Formula: see text] of -0.07% when considering the 55 origins, showing that most horse breeds constitute populations without genetic structure. We illustrate the complexity of gene flows existing among horse breeds, a few populations being closed to foreign influence, most, however, being submitted to various levels of introgression. In particular, Thoroughbred and Arab breeds are largely used as introgression sources, since those two populations explain together 26% of founder origins within the overall horse population. When compared with molecular data, breeds with a small level of coancestry also showed low genetic distance; the gene pool of the breeds was probably impacted by their reproducer exchanges.

  7. Chromatin structure of ribosomal RNA genes in dipterans and its relationship to the location of nucleolar organizers.

    Science.gov (United States)

    Madalena, Christiane Rodriguez Gutierrez; Díez, José Luís; Gorab, Eduardo

    2012-01-01

    Nucleoli, nuclear organelles in which ribosomal RNA is synthesized and processed, emerge from nucleolar organizers (NORs) located in distinct chromosomal regions. In polytene nuclei of dipterans, nucleoli of some species can be observed under light microscopy exhibiting distinctive morphology: Drosophila and chironomid species display well-formed nucleoli in contrast to the fragmented and dispersed nucleoli seen in sciarid flies. The available data show no apparent relationship between nucleolar morphology and location of NORs in Diptera. The regulation of rRNA transcription involves controlling both the transcription rate per gene as well as the proportion of rRNA genes adopting a proper chromatin structure for transcription, since active and inactive rRNA gene copies coexist in NORs. Transcription units organized in nucleosomes and those lacking canonical nucleosomes can be analyzed by the method termed psoralen gel retarding assay (PGRA), allowing inferences on the ratio of active to inactive rRNA gene copies. In this work, possible connections between chromosomal location of NORs and proportion of active rRNA genes were studied in Drosophila melanogaster, and in chironomid and sciarid species. The data suggested a link between location of NORs and proportion of active rRNA genes since the copy number showing nucleosomal organization predominates when NORs are located in the pericentric heterochromatin. The results presented in this work are in agreement with previous data on the chromatin structure of rRNA genes from distantly related eukaryotes, as assessed by the PGRA.

  8. Chromatin structure of ribosomal RNA genes in dipterans and its relationship to the location of nucleolar organizers.

    Directory of Open Access Journals (Sweden)

    Christiane Rodriguez Gutierrez Madalena

    Nucleoli, nuclear organelles in which ribosomal RNA is synthesized and processed, emerge from nucleolar organizers (NORs) located in distinct chromosomal regions. In polytene nuclei of dipterans, nucleoli of some species can be observed under light microscopy exhibiting distinctive morphology: Drosophila and chironomid species display well-formed nucleoli in contrast to the fragmented and dispersed nucleoli seen in sciarid flies. The available data show no apparent relationship between nucleolar morphology and location of NORs in Diptera. The regulation of rRNA transcription involves controlling both the transcription rate per gene as well as the proportion of rRNA genes adopting a proper chromatin structure for transcription, since active and inactive rRNA gene copies coexist in NORs. Transcription units organized in nucleosomes and those lacking canonical nucleosomes can be analyzed by the method termed psoralen gel retarding assay (PGRA), allowing inferences on the ratio of active to inactive rRNA gene copies. In this work, possible connections between chromosomal location of NORs and proportion of active rRNA genes were studied in Drosophila melanogaster, and in chironomid and sciarid species. The data suggested a link between location of NORs and proportion of active rRNA genes since the copy number showing nucleosomal organization predominates when NORs are located in the pericentric heterochromatin. The results presented in this work are in agreement with previous data on the chromatin structure of rRNA genes from distantly related eukaryotes, as assessed by the PGRA.

  9. Structural Diversification of Lyngbyatoxin A by Host-Dependent Heterologous Expression of the tleABC Biosynthetic Gene Cluster.

    Science.gov (United States)

    Zhang, Lihan; Hoshino, Shotaro; Awakawa, Takayoshi; Wakimoto, Toshiyuki; Abe, Ikuro

    2016-08-03

    Natural products have enormous structural diversity, yet little is known about how such diversity is achieved in nature. Here we report the structural diversification of a cyanotoxin, lyngbyatoxin A, and its biosynthetic intermediates by heterologous expression of the Streptomyces-derived tleABC biosynthetic gene cluster in three different Streptomyces hosts: S. lividans, S. albus, and S. avermitilis. Notably, the isolated lyngbyatoxin derivatives, including four new natural products, were biosynthesized by crosstalk between the heterologous tleABC gene cluster and the endogenous host enzymes. The simple strategy described here has expanded the structural diversity of lyngbyatoxin A and its biosynthetic intermediates, and provides opportunities for investigation of the currently underestimated hidden biosynthetic crosstalk. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Analysis of the functional gene structure and metabolic potential of microbial community in high arsenic groundwater.

    Science.gov (United States)

    Li, Ping; Jiang, Zhou; Wang, Yanhong; Deng, Ye; Van Nostrand, Joy D; Yuan, Tong; Liu, Han; Wei, Dazhun; Zhou, Jizhong

    2017-10-15

    Microbial functional potential in high arsenic (As) groundwater ecosystems remains largely unknown. In this study, the microbial community functional composition of nineteen groundwater samples was investigated using a functional gene array (GeoChip 5.0). Samples were divided into low and high As groups based on the clustering analysis of geochemical parameters and microbial functional structures. The results showed that As related genes (arsC, arrA), sulfate related genes (dsrA and dsrB), nitrogen cycling related genes (ureC, amoA, and hzo) and methanogen genes (mcrA, hdrB) in groundwater samples were correlated with As, SO₄²⁻, NH₄⁺ or CH₄ concentrations, respectively. Canonical correspondence analysis (CCA) results indicated that some geochemical parameters including As, total organic content, SO₄²⁻, NH₄⁺, oxidation-reduction potential (ORP) and pH were important factors shaping the functional microbial community structures. Alkaline and reducing conditions with relatively low SO₄²⁻, ORP, and high NH₄⁺, as well as SO₄²⁻ and Fe reduction and ammonification involved in microbially-mediated geochemical processes could be associated with As enrichment in groundwater. This study provides an overall picture of functional microbial communities in high As groundwater aquifers, and also provides insights into the critical role of microorganisms in As biogeochemical cycling. Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. Efficacy of double-stranded RNA against white spot syndrome virus (WSSV) non-structural (orf89, wsv191) and structural (vp28, vp26) genes in the Pacific white shrimp Litopenaeus vannamei

    Directory of Open Access Journals (Sweden)

    César M. Escobedo-Bonilla

    2015-04-01

    White spot syndrome virus (WSSV) is a major pathogen in shrimp aquaculture. RNA interference (RNAi) is a promising tool against viral infections. Previous work with RNAi showed different antiviral efficacies depending on the silenced gene. This work evaluated the antiviral efficacy of double-stranded (ds) RNA against two non-structural (orf89, wsv191) WSSV genes compared to structural (vp26, vp28) genes to inhibit an experimental WSSV infection. Gene orf89 encodes a putative regulatory protein and gene white spot virus (wsv)191 encodes a nonspecific nuclease, whereas genes vp26 and vp28 encode envelope proteins. Molecules of dsRNA against each of the WSSV genes were intramuscularly injected (4 μg per shrimp) into a group of shrimp 48 h before a WSSV challenge. The highest antiviral activity occurred with dsRNA against orf89, vp28 and vp26 (cumulative mortalities 10%, 10% and 21%, respectively). In contrast, the least effective treatment was wsv191 dsRNA (cumulative mortality 83%). All dead animals were WSSV-positive by one-step PCR, whereas reverse-transcription PCR of all surviving shrimp confirmed inhibition of virus replication. This study showed that dsRNA against WSSV genes orf89, vp28 and vp26 was highly effective at inhibiting virus replication, suggesting that these genes play an essential role in WSSV infection. Non-structural WSSV genes such as orf89 can be used as novel targets to design therapeutic RNAi molecules against WSSV infection.

  12. Comparative analyses of microbial structures and gene copy numbers in the anaerobic digestion of various types of sewage sludge.

    Science.gov (United States)

    Hidaka, Taira; Tsushima, Ikuo; Tsumori, Jun

    2018-04-01

    Anaerobic co-digestion of various sewage sludges is a promising approach for greater recovery of energy, but the process is more complicated than mono-digestion of sewage sludge. The applicability of microbial structure analyses and gene quantification to understand microbial conditions was evaluated. The results show that information from gene analyses is useful in managing anaerobic co-digestion and damaged microbes in addition to conventional parameters like total solids, pH and biogas production. Total bacterial 16S rRNA gene copy numbers are the most useful tools for evaluating unstable anaerobic digestion of sewage sludge, rather than mcrA and total archaeal 16S rRNA gene copy numbers, and high-throughput sequencing. First order decay rates of gene copy numbers during pH failure were higher than typical decay rates of microbes in stable operation. The sequencing analyses, including multidimensional scaling, showed very different microbial structure shifts, but the results were not consistent. Copyright © 2017 Elsevier Ltd. All rights reserved.
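    The first-order decay rate mentioned in this abstract can be illustrated with a minimal sketch. It assumes the standard first-order model N(t) = N0·exp(-k·t); the function name and the copy numbers used below are hypothetical and are not taken from the study.

```python
import math

def first_order_decay_rate(n0, nt, elapsed_days):
    """Estimate k (per day) from two gene copy numbers, assuming N(t) = N0 * exp(-k * t)."""
    if n0 <= 0 or nt <= 0 or elapsed_days <= 0:
        raise ValueError("copy numbers and elapsed time must be positive")
    return math.log(n0 / nt) / elapsed_days

# Hypothetical example: a tenfold drop in 16S rRNA gene copies over 10 days.
k = first_order_decay_rate(1e10, 1e9, 10)
print(f"k = {k:.3f} per day")
```

    Comparing a k value estimated this way during a pH failure with one from stable operation is the kind of comparison the abstract describes.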

  13. Genome-wide identification, phylogenetic classification, and exon-intron structure characterisation of the tubulin and actin genes in flax (Linum usitatissimum).

    Science.gov (United States)

    Pydiura, Nikolay; Pirko, Yaroslav; Galinousky, Dmitry; Postovoitova, Anastasiia; Yemets, Alla; Kilchevsky, Aleksandr; Blume, Yaroslav

    2018-06-08

    Flax (Linum usitatissimum L.) is a valuable food and fiber crop cultivated for its quality fiber and seed oil. α-, β-, γ-tubulins and actins are the main structural proteins of the cytoskeleton. α- and γ-tubulin and actin genes have not been characterized yet in the flax genome. In this study, we have identified 6 α-tubulin genes, 13 β-tubulin genes, 2 γ-tubulin genes, and 15 actin genes in the flax genome and analysed the phylogenetic relationships between flax and A. thaliana tubulin and actin genes. Six α-tubulin genes are represented by 3 paralogous pairs, among 13 β-tubulin genes 7 different isotypes can be distinguished, 6 of which are encoded by two paralogous genes each. γ-tubulin is represented by a paralogous pair of genes one of which may be not functional. Fifteen actin genes represent 7 paralogous pairs - 7 actin isotypes and a sequentially duplicated copy of one of the genes of one of the isotypes. Exon-intron structure analysis has shown intron length polymorphism within the β-tubulin genes and intron number variation among the α-tubulin gene: 3 or 4 introns are found in two or four genes, respectively. Intron positioning occurs at conservative sites, as observed in numerous other plant species. Flax actin genes show both intron length polymorphisms and variation in the number of intron that may be 2 or 3. These data will be useful to support further studies on the specificity, functioning, regulation and evolution of the flax cytoskeleton proteins. This article is protected by copyright. All rights reserved.

  14. Circumpolar Genetic Structure and Recent Gene Flow of Polar Bears: A Reanalysis.

    Science.gov (United States)

    Malenfant, René M; Davis, Corey S; Cullingham, Catherine I; Coltman, David W

    2016-01-01

    Recently, an extensive study of 2,748 polar bears (Ursus maritimus) from across their circumpolar range was published in PLOS ONE, which used microsatellites and mitochondrial haplotypes to apparently show altered population structure and a dramatic change in directional gene flow towards the Canadian Archipelago, an area believed to be a future refugium for polar bears as their southernmost habitats decline under climate change. Although this study represents a major international collaborative effort and promised to be a baseline for future genetics work, methodological shortcomings and errors of interpretation undermine some of the study's main conclusions. Here, we present a reanalysis of this data in which we address some of these issues, including: (1) highly unbalanced sample sizes and large amounts of systematically missing data; (2) incorrect calculation of FST and of significance levels; (3) misleading estimates of recent gene flow resulting from non-convergence of the program BayesAss. In contrast to the original findings, in our reanalysis we find six genetic clusters of polar bears worldwide: the Hudson Bay Complex, the Western and Eastern Canadian Arctic Archipelago, the Western and Eastern Polar Basin, and, importantly, we reconfirm the presence of a unique and possibly endangered cluster of bears in Norwegian Bay near Canada's expected last sea-ice refugium. Although polar bears' abundance, distribution, and population structure will certainly be negatively affected by ongoing, and increasingly rapid, loss of Arctic sea ice, these genetic data provide no evidence of strong directional gene flow in response to recent climate change.

  15. Circumpolar Genetic Structure and Recent Gene Flow of Polar Bears: A Reanalysis.

    Directory of Open Access Journals (Sweden)

    René M Malenfant

    Recently, an extensive study of 2,748 polar bears (Ursus maritimus) from across their circumpolar range was published in PLOS ONE, which used microsatellites and mitochondrial haplotypes to apparently show altered population structure and a dramatic change in directional gene flow towards the Canadian Archipelago, an area believed to be a future refugium for polar bears as their southernmost habitats decline under climate change. Although this study represents a major international collaborative effort and promised to be a baseline for future genetics work, methodological shortcomings and errors of interpretation undermine some of the study's main conclusions. Here, we present a reanalysis of this data in which we address some of these issues, including: (1) highly unbalanced sample sizes and large amounts of systematically missing data; (2) incorrect calculation of FST and of significance levels; (3) misleading estimates of recent gene flow resulting from non-convergence of the program BayesAss. In contrast to the original findings, in our reanalysis we find six genetic clusters of polar bears worldwide: the Hudson Bay Complex, the Western and Eastern Canadian Arctic Archipelago, the Western and Eastern Polar Basin, and, importantly, we reconfirm the presence of a unique and possibly endangered cluster of bears in Norwegian Bay near Canada's expected last sea-ice refugium. Although polar bears' abundance, distribution, and population structure will certainly be negatively affected by ongoing, and increasingly rapid, loss of Arctic sea ice, these genetic data provide no evidence of strong directional gene flow in response to recent climate change.

  16. Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing.

    Science.gov (United States)

    Seoane-Zonjic, Pedro; Cañas, Rafael A; Bautista, Rocío; Gómez-Maldonado, Josefa; Arrillaga, Isabel; Fernández-Pozo, Noé; Claros, M Gonzalo; Cánovas, Francisco M; Ávila, Concepción

    2016-02-27

    In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82% of the gene structures, and a high proportion (85%) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. 
A bioinformatics algorithm (GeneAssembler) is also provided as a

  17. Structure of the gene for human butyrylcholinesterase. Evidence for a single copy

    International Nuclear Information System (INIS)

    Arpagaus, M.; Kott, M.; Vatsis, K.P.; Bartels, C.F.; La Du, B.N.; Lockridge, O.

    1990-01-01

    The authors have isolated five genomic clones for human butyrylcholinesterase (BChE), using cDNA probes encoding the catalytic subunit of the hydrophilic tetramer. The BChE gene is at least 73 kb long and contains four exons. Exon 1 contains untranslated sequences and two potential translation initiation sites at codons -69 and -47. Exon 2 (1525 bp) contains 83% of the coding sequence for the mature protein, including the N-terminal and the active-site serine, and a third possible translation initiation site (likely functional), at codon -28. Exon 3 is 167 nucleotides long. Exon 4 (604 bp) codes for the C-terminus of the protein and the 3' untranslated region where two polyadenylation signals were identified. Intron 1 is 6.5 kb long, and the minimal sizes of introns 2 and 3 are estimated to be 32 kb each. Southern blot analysis of total human genomic DNA is in complete agreement with the gene structure established by restriction endonuclease mapping of the genomic clones: this strongly suggests that the BChE gene is present in a single copy.
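    The "at least 73 kb" figure in this abstract can be checked against the individual sizes it quotes. The short sketch below sums them, reading the intron sizes as kilobases (an assumption, though the only reading consistent with a gene of that length) and leaving out exon 1, whose length the abstract does not give. The variable and function names are illustrative only.

```python
# Exon sizes quoted in the abstract, in base pairs. Exon 1 is omitted: its length is not stated.
EXON_LENGTHS_BP = {"exon 2": 1525, "exon 3": 167, "exon 4": 604}

# Intron sizes read as kilobases (assumption); introns 2 and 3 are minimal estimates.
INTRON_LENGTHS_KB = {"intron 1": 6.5, "intron 2": 32.0, "intron 3": 32.0}

def minimum_gene_length_kb(exons_bp, introns_kb):
    """Lower bound on gene length in kb from the listed exon and intron sizes."""
    exon_total_kb = sum(exons_bp.values()) / 1000.0
    return exon_total_kb + sum(introns_kb.values())

lower_bound_kb = minimum_gene_length_kb(EXON_LENGTHS_BP, INTRON_LENGTHS_KB)
print(f"Lower bound on gene length: {lower_bound_kb:.1f} kb")
```

    The total comes to roughly 72.8 kb before exon 1 is counted, which agrees with the stated minimum of 73 kb.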

  18. Sulfamethoxazole and COD increase abundance of sulfonamide resistance genes and change bacterial community structures within sequencing batch reactors.

    Science.gov (United States)

    Guo, Xueping; Pang, Weihai; Dou, Chunling; Yin, Daqiang

    2017-05-01

    The abundant microbial community in biological treatment processes in wastewater treatment plants (WWTPs) may potentially enhance the horizontal gene transfer of antibiotic resistance genes in the presence of antibiotics. A lab-scale sequencing batch reactor was designed to investigate the response of sulfonamide resistance genes (sulI, sulII) and bacterial communities to various concentrations of sulfamethoxazole (SMX) and chemical oxygen demand (COD) of wastewater. The SMX concentrations (0.001 mg/L, 0.1 mg/L and 10 mg/L) decreased with treatment time, and higher SMX levels were more difficult to remove. The presence of SMX also significantly reduced the removal efficiency of ammonia nitrogen, affecting the normal function of WWTPs. All three concentrations of SMX raised the abundance of both sulI and sulII genes, with higher concentrations producing greater increases. The abundance of sul genes was positively correlated with treatment time and followed the second-order reaction kinetic model. Interestingly, these two genes have rather similar activity. SulI and sulII gene abundance also showed a similar response to COD. The Simpson index and Shannon-Wiener index did not show changes in the microbial community diversity. However, the 16S rRNA gene cloning and sequencing results showed that the bacterial community structures varied during different stages. The results demonstrated that antibiotics entering WWTPs may facilitate selection of ARGs and affect conventional wastewater treatment as well as the bacterial community structures. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Morquio A syndrome: Cloning, sequence, and structure of the human N-acetylgalactosamine 6-sulfatase (GALNS) gene

    Energy Technology Data Exchange (ETDEWEB)

    Morris, C.P.; Guo, Xiao-Hui; Apostolou, S. [Adelaide Children's Hospital, North Adelaide (Australia)] [and others]

    1994-08-01

    Deficiency of the lysosomal enzyme, N-acetylgalactosamine 6-sulfatase (GALNS; EC 3.1.6.4), results in the storage of the glycosaminoglycans, keratan sulfate and chondroitin 6-sulfate, which leads to the lysosomal storage disorder Morquio A syndrome. Four overlapping genomic clones derived from a chromosome 16-specific gridded cosmid library containing the entire GALNS gene were isolated. The structure of the gene and the sequence of the exon/intron boundaries and the 5′ promoter region were determined. The GALNS gene is split into 14 exons spanning approximately 40 kb. The potential promoter for GALNS lacks a TATA box but contains GC box consensus sequences, consistent with its role as a housekeeping gene. The GALNS gene contains an Alu repeat in intron 5 and a VNTR-like sequence in intron 6. 12 refs., 3 figs., 1 tab.

  20. The structure and organization of the human carnitine/acylcarnitine translocase (CACT) gene

    NARCIS (Netherlands)

    Iacobazzi, V.; Naglieri, M. A.; Stanley, C. A.; Wanders, R. J.; Palmieri, F.

    1998-01-01

    The carnitine/acylcarnitine translocase (CACT) transports acylcarnitines into mitochondria in exchange for free carnitine and it is, therefore, essential for the fatty acid beta-oxidation pathway. We have determined the exon-intron structure of the human CACT gene, which is responsible for a genetic

  1. No effect of schizophrenia risk genes MIR137, TCF4, and ZNF804A on macroscopic brain structure

    NARCIS (Netherlands)

    Cousijn, H.; Eissing, M.; Fernandez, G.S.E.; Fisher, S.E.; Franke, B.; Zwiers, M.P.; Harrison, P.J.; Arias Vasquez, A.

    2014-01-01

    Single nucleotide polymorphisms (SNPs) within the MIR137, TCF4, and ZNF804A genes show genome-wide association to schizophrenia. However, the biological basis for the associations is unknown. Here, we tested the effects of these genes on brain structure in 1300 healthy adults. Using volumetry and

  2. The Use of Gene Modification and Advanced Molecular Structure Analyses towards Improving Alfalfa Forage.

    Science.gov (United States)

    Lei, Yaogeng; Hannoufa, Abdelali; Yu, Peiqiang

    2017-01-29

    Alfalfa is one of the most important legume forage crops in the world. In spite of its agronomic and nutritive advantages, alfalfa has some limitations in the usage of pasture forage and hay supplement. High rapid degradation of protein in alfalfa poses a risk of rumen bloat to ruminants which could cause huge economic losses for farmers. Coupled with the relatively high lignin content, which impedes the degradation of carbohydrate in rumen, alfalfa has unbalanced and asynchronous degradation ratio of nitrogen to carbohydrate (N/CHO) in rumen. Genetic engineering approaches have been used to manipulate the expression of genes involved in important metabolic pathways for the purpose of improving the nutritive value, forage yield, and the ability to resist abiotic stress. Such gene modification could bring molecular structural changes in alfalfa that are detectable by advanced structural analytical techniques. These structural analyses have been employed in assessing alfalfa forage characteristics, allowing for rapid, convenient and cost-effective analysis of alfalfa forage quality. In this article, we review two major obstacles facing alfalfa utilization, namely poor protein utilization and relatively high lignin content, and highlight genetic studies that were performed to overcome these drawbacks, as well as to introduce other improvements to alfalfa quality. We also review the use of advanced molecular structural analysis in the assessment of alfalfa forage for its potential usage in quality selection in alfalfa breeding.

  3. The Use of Gene Modification and Advanced Molecular Structure Analyses towards Improving Alfalfa Forage

    Energy Technology Data Exchange (ETDEWEB)

    Lei, Yaogeng; Hannoufa, Abdelali; Yu, Peiqiang

    2017-01-29

    Alfalfa is one of the most important legume forage crops in the world. In spite of its agronomic and nutritive advantages, alfalfa has some limitations in the usage of pasture forage and hay supplement. High rapid degradation of protein in alfalfa poses a risk of rumen bloat to ruminants which could cause huge economic losses for farmers. Coupled with the relatively high lignin content, which impedes the degradation of carbohydrate in rumen, alfalfa has unbalanced and asynchronous degradation ratio of nitrogen to carbohydrate (N/CHO) in rumen. Genetic engineering approaches have been used to manipulate the expression of genes involved in important metabolic pathways for the purpose of improving the nutritive value, forage yield, and the ability to resist abiotic stress. Such gene modification could bring molecular structural changes in alfalfa that are detectable by advanced structural analytical techniques. These structural analyses have been employed in assessing alfalfa forage characteristics, allowing for rapid, convenient and cost-effective analysis of alfalfa forage quality. In this article, we review two major obstacles facing alfalfa utilization, namely poor protein utilization and relatively high lignin content, and highlight genetic studies that were performed to overcome these drawbacks, as well as to introduce other improvements to alfalfa quality. We also review the use of advanced molecular structural analysis in the assessment of alfalfa forage for its potential usage in quality selection in alfalfa breeding.

  4. Genetic structure and gene flows within horses: a genealogical study at the French population scale.

    Directory of Open Access Journals (Sweden)

    Pauline Pirault

    Since horse breeds constitute populations submitted to variable and multiple outcrossing events, we analyzed the genetic structure and gene flows considering horses raised in France. We used genealogical data, with a reference population of 547,620 horses born in France between 2002 and 2011, grouped according to 55 breed origins. On average, individuals had 6.3 equivalent generations known. Considering different population levels, fixation index decreased from an overall species FIT of 1.37%, to an average [Formula: see text] of -0.07% when considering the 55 origins, showing that most horse breeds constitute populations without genetic structure. We illustrate the complexity of gene flows existing among horse breeds, a few populations being closed to foreign influence, most, however, being submitted to various levels of introgression. In particular, Thoroughbred and Arab breeds are largely used as introgression sources, since those two populations explain together 26% of founder origins within the overall horse population. When compared with molecular data, breeds with a small level of coancestry also showed low genetic distance; the gene pool of the breeds was probably impacted by their reproducer exchanges.

  5. Structural organization and chromosomal assignment of the mouse embryonic TEA domain-containing factor (ETF) gene.

    Science.gov (United States)

    Suzuki, K; Yasunami, M; Matsuda, Y; Maeda, T; Kobayashi, H; Terasaki, H; Ohkubo, H

    1996-09-01

    Embryonic TEA domain-containing factor (ETF) belongs to the family of proteins structurally related to transcriptional enhancer factor-1 (TEF-1) and is implicated in neural development. Isolation and characterization of the cosmid clones encoding the mouse ETF gene (Etdf) revealed that Etdf spans approximately 17.9 kb and consists of 12 exons. The exon-intron structure of Etdf closely resembles that of the Drosophila scalloped gene, indicating that these genes may have evolved from a common ancestor. The multiple transcription initiation sites revealed by S1 protection and primer extension analyses are consistent with the absence of the canonical TATA and CAAT boxes in the 5'-flanking region, which contains many potential regulatory sequences, such as the E-box, N-box, Sp1 element, GATA-1 element, TAATGARAT element, and B2 short interspersed element (SINE) as well as several direct and inverted repeat sequences. The Etdf locus was assigned to the proximal region of mouse chromosome 7 using fluorescence in situ hybridization and linkage mapping analyses. These results provide the molecular basis for studying the regulation, in vivo function, and evolution of Etdf.

  6. Gene structure and functional characterization of growth hormone in dogfish, Squalus acanthias.

    Science.gov (United States)

    Moriyama, Shunsuke; Oda, Mayumi; Yamazaki, Tomohide; Yamaguchi, Kiyoko; Amiya, Noriko; Takahashi, Akiyoshi; Amano, Masafumi; Goto, Tomoaki; Nozaki, Masumi; Meguro, Hiroshi; Kawauchi, Hiroshi

    2008-06-01

    Dogfish (Squalus acanthias) growth hormone (GH) was identified by cDNA cloning and protein purification from the pituitary gland. Dogfish GH cDNA encoded a prehormone of 210 amino acids (aa). Sequence analysis of purified GH revealed that the prehormone is composed of a signal peptide of 27 aa and a mature protein of 183 aa. Dogfish GH showed 94% sequence identity with blue shark GH, and also showed 37-66%, 26%, and 48-67% sequence identity with GH from osteichthyes, an agnathan, and tetrapods, respectively. The site of production was identified through immunocytochemistry to be cells of the proximal pars distalis of the pituitary gland. Dogfish GH stimulates both insulin-like growth factor-I and II mRNA levels in dogfish liver in vitro. The dogfish GH gene consisted of five exons and four introns, the same as in lamprey, teleosts such as cypriniforms and siluriforms, and tetrapods. The 5'-flanking region within 1082 bp of the transcription start site contained consensus sequences for the TATA box, Pit-1/GHF-1, CRE, TRE, and ERE. These results show that the endocrine mechanism for growth stimulation by the GH-IGF axis was established at an early stage of vertebrate evolution, and that the 5-exon-type gene organization might reflect the structure of the ancestral gene for the GH gene family.

  7. Primary structure and mapping of the hupA gene of Salmonella typhimurium.

    Science.gov (United States)

    Higgins, N P; Hillyard, D

    1988-01-01

    In bacteria, the complex nucleoid structure is folded and maintained by negative superhelical tension and a set of type II DNA-binding proteins, also called histonelike proteins. The most abundant type II DNA-binding protein is HU. Southern blot analysis showed that Salmonella typhimurium contained two HU genes that corresponded to Escherichia coli genes hupA (encoding HU-2 protein) and hupB (encoding HU-1). Salmonella hupA was cloned, and the nucleotide sequence of the gene was determined. Comparison of hupA of E. coli and S. typhimurium revealed that the HU-2 proteins were identical and that there was high conservation of nucleotide sequences outside the coding frames of the genes. A 300-member genomic library of S. typhimurium was constructed by using random transposition of MudP, a specialized chimeric P22-Mu phage that packages chromosomal DNA unidirectionally from its insertion point. Oligonucleotide hybridization against the library identified one MudP insertion that lies within 28 kilobases of hupA; the MudP was 12% linked to purH at 90.5 min on the standard map. Plasmids expressing HU-2 had a surprising phenotype; they caused growth arrest when they were introduced into E. coli strains bearing a himA or hip mutation. These results suggest that IHF and HU have interactive roles in bacteria. PMID: 3056912

  8. Characterizing the pathotype structure of barley powdery mildew and effectiveness of resistance genes to this pathogen in Kazakhstan.

    Science.gov (United States)

    Rsaliyev, Aralbek; Pahratdinova, Zhazira; Rsaliyev, Shynbolat

    2017-11-14

    Powdery mildew of barley is caused by a wind-borne, obligate biotrophic pathogen that ranks among the most widespread barley pathogens worldwide. However, until now no targeted research had been conducted in Kazakhstan on the structure of barley powdery mildew populations, their virulence, or the effectiveness of specific resistance genes against the infection. This paper is the first to describe characteristics of the pathotype structure of the Blumeria graminis f.sp. hordei (Bgh) population and the effectiveness of resistance genes in two regions of barley cultivation in the republic. One hundred and seven isolates of Bgh were obtained from seven populations occurring on cultivated barley at two geographic locations in Kazakhstan during 2015 and 2016. Their virulence frequency was determined on 17 Pallas differential lines. All isolates were virulent on the resistance gene Mla8 and avirulent for the resistance genes Mla9, Mla1 + MlaAl2, Mla6 + Mla14, Mla13 + MlRu3, Mla7 + MlNo3, Mla10 + MlDu2, Mla13 + MlRu3 and Mlo-5. The frequencies of isolates overcoming the genes Mla3, Mla22, Mlat, Mlg + MlCP and Mla12 + MlEm2 were 0.0-33.33%, and frequencies of isolates overcoming the genes Mlra, Mlk, MlLa and Mlh ranged from 10.0 to 78.6%. Based on reactions of differential lines possessing the genes Mla22, Mlra, Mlk, Mlat, MlLa and Mlh, pathotypes were identified. In total, 23 pathotypes with virulence complexity ranging from 1 to 6 were identified. In both years, pathotypes 24 and 64 predominated in all populations of the South Kazakhstan and Zhambyl regions. The data obtained suggest low similarity of Bgh populations in Kazakhstan to European, African, Australian and South-East Asian populations. The present study provides a foundation for future studies on the pathogenic variability within Bgh populations in Kazakhstan and addresses the knowledge gap on the virulence structure of Bgh in Central Asia. Complete effectiveness of the

  9. Mathematical and Biological Modelling of RNA Secondary Structure and Its Effects on Gene Expression

    Directory of Open Access Journals (Sweden)

    T. A. Hughes

    2006-01-01

    Full Text Available Secondary structures within the 5′ untranslated regions of messenger RNAs can have profound effects on the efficiency of translation of their messages and thereby on gene expression. Consequently they can act as important regulatory motifs in both physiological and pathological settings. Current approaches to predicting the secondary structure of these RNA sequences find the structure with the global-minimum free energy. However, since RNA folds progressively from the 5′ end when synthesised or released from the translational machinery, this may not be the most probable structure. We discuss secondary structure prediction based on local-minimisation of free energy with thermodynamic fluctuations as nucleotides are added to the 3′ end and show that these can result in different secondary structures. We also discuss approaches for studying the extent of the translational inhibition specified by structures within the 5′ untranslated region.

  10. Structural organization of the human and mouse laminin beta2 chain genes, and alternative splicing at the 5' end of the human transcript

    DEFF Research Database (Denmark)

    Durkin, M E; Gautam, M; Loechel, F

    1996-01-01

    We have determined the structural organization of the human and mouse genes that encode the laminin beta2 chain (s-laminin), an essential component of the basement membranes of the neuromuscular synapse and the kidney glomerulus. The human and mouse genes have a nearly identical exon-intron organ...

  11. Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

    Directory of Open Access Journals (Sweden)

    Walchli John

    2009-04-01

    Full Text Available Background: With the goal of improving yield and success rates of heterologous protein production for structural studies, we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. Results: In this report, we compare heterologous protein expression levels from native sequences to those of codon-engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38α), a viral polymerase (HCV NS5B), and a bacterial structural protein (FtsZ) was expressed in both E. coli and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. Conclusion: The results consistently demonstrate that protein yields from codon-engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure-guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene-to-structure work, as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene-to-structure research.

  12. Exploration of structural stability in deleterious nsSNPs of the XPA gene: A molecular dynamics approach

    Directory of Open Access Journals (Sweden)

    N NagaSundaram

    2011-01-01

    Full Text Available Background: Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this study, we used existing in silico methods to explore the mutation-structure-function relationship in the XPA gene. Materials and Methods: We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function, and evaluated the impact of mutation on protein stability by molecular dynamics simulations. Results: By comparing the scores of all four in silico methods, the nsSNP with ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPA gene. Conclusion: Based on the in silico method scores, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that the deleterious nsSNP at position C108F would play a significant role in disease caused by the XPA gene. Our approach illustrates the application of in silico tools to understanding functional variation from the perspective of structure, evolution, and phenotype.

  13. Gene order data from a model amphibian (Ambystoma): new perspectives on vertebrate genome structure and evolution

    Directory of Open Access Journals (Sweden)

    Voss S Randal

    2006-08-01

    Full Text Available Background: Because amphibians arise from a branch of the vertebrate evolutionary tree that is juxtaposed between fishes and amniotes, they provide an important comparative perspective for reconstructing character changes that have occurred during vertebrate evolution. Here, we report the first comparative study of vertebrate genome structure that includes a representative amphibian. We used 491 transcribed sequences from a salamander (Ambystoma) genetic map and whole genome assemblies for human, mouse, rat, dog, chicken, zebrafish, and the freshwater pufferfish Tetraodon nigroviridis to compare gene orders and rearrangement rates. Results: Ambystoma has experienced a rate of genome rearrangement that is substantially lower than that of mammalian species but similar to that of chicken and fish. Overall, we found greater conservation of genome structure between Ambystoma and tetrapod vertebrates; nevertheless, 57% of Ambystoma-fish orthologs are found in conserved syntenies of four or more genes. Comparisons between Ambystoma and amniotes reveal extensive conservation of segmental homology for 57% of the presumptive Ambystoma-amniote orthologs. Conclusion: Our analyses suggest relatively constant interchromosomal rearrangement rates from the euteleost ancestor to the origin of mammals and illustrate the utility of amphibian mapping data in establishing ancestral amniote and tetrapod gene orders. Comparisons between Ambystoma and amniotes reveal some of the key events that have structured the human genome since diversification of the ancestral amniote lineage.

  14. Structural and functional characterization of the exonuclease I (sbcB) gene and gene product from Escherichia coli and a Markov chain analysis of DNA sequences

    International Nuclear Information System (INIS)

    Phillips, G.J.

    1987-01-01

    The nucleotide sequence of the structural gene for exonuclease I (sbcB) from Escherichia coli was determined. Two putative promoters for this gene were identified and were predicted to have weak transcription initiation activity. In addition, the sbcB coding region contains many non-optimal codons. These observations are consistent with the suggestion that sbcB is a poorly expressed gene. Several mutant exonuclease I genes were cloned onto pBR322 plasmids. These genes represented both sbcB and xonA mutations. One of the xonA mutations (xonA6) was associated with a 1.2-kb insertion of an IS-30 related mobile genetic element in the 3'-region of the gene. Two of the mutations (xonA2 and xonA6) encode unstable polypeptides. Determination of exonucleolytic activity on single-stranded DNA from cell extracts containing each of the cloned mutant genes revealed no correlation between residual exonucleolytic activity and the phenotypes of sbcB and xonA mutants. A proposal that the exonuclease I protein contains an additional activity besides its ability to degrade single-stranded DNA is presented. Characterization of E. coli strains which overproduce exonuclease I showed increased sensitivity to UV irradiation

  15. Characterization of Clostridium perfringens isolates from healthy turkeys and from turkeys with necrotic enteritis

    DEFF Research Database (Denmark)

    Lyhs, Ulrike; Perko-Mäkelä, P.; Kallio, H.

    2013-01-01

    from 1998 to 2012. Furthermore, C. perfringens isolates from healthy and diseased turkeys were characterized and their genetic diversity was investigated using pulsed-field gel electrophoresis (PFGE). Isolates (n = 212) from birds with necrotic gut lesions and from healthy flocks of 30 commercial...... turkey farms were characterized for the presence of cpa, cpb, iA, etx, cpb2, and cpe and netB genes. A total of 93 C. perfringens isolates, including 55 from birds with necrotic gut lesions and 38 from healthy birds from 13 different farms, were analyzed with PFGE. All contract turkey farmers (n = 48......) of a turkey company that produces 99% of domestic turkey meat in Finland were interviewed about background information, management at the farm, and stress factors related to NE outbreaks. Pulsed-field gel electrophoresis analysis with SmaI restriction enzyme resulted in 30 PFGE patterns among the 92 C...

  16. The Use of Gene Modification and Advanced Molecular Structure Analyses towards Improving Alfalfa Forage

    Directory of Open Access Journals (Sweden)

    Yaogeng Lei

    2017-01-01

    Full Text Available Abstract: Alfalfa is one of the most important legume forage crops in the world. In spite of its agronomic and nutritive advantages, alfalfa has some limitations in its use as pasture forage and hay supplement. Rapid degradation of protein in alfalfa poses a risk of rumen bloat to ruminants, which could cause huge economic losses for farmers. Coupled with the relatively high lignin content, which impedes the degradation of carbohydrate in the rumen, alfalfa has an unbalanced and asynchronous degradation ratio of nitrogen to carbohydrate (N/CHO) in the rumen. Genetic engineering approaches have been used to manipulate the expression of genes involved in important metabolic pathways for the purpose of improving the nutritive value, forage yield, and the ability to resist abiotic stress. Such gene modification could bring molecular structural changes in alfalfa that are detectable by advanced structural analytical techniques. These structural analyses have been employed in assessing alfalfa forage characteristics, allowing for rapid, convenient and cost-effective analysis of alfalfa forage quality. In this article, we review two major obstacles facing alfalfa utilization, namely poor protein utilization and relatively high lignin content, and highlight genetic studies that were performed to overcome these drawbacks, as well as to introduce other improvements to alfalfa quality. We also review the use of advanced molecular structural analysis in the assessment of alfalfa forage for its potential usage in quality selection in alfalfa breeding.

  17. Disturbance of cardiac gene expression and cardiomyocyte structure predisposes Mecp2-null mice to arrhythmias

    Science.gov (United States)

    Hara, Munetsugu; Takahashi, Tomoyuki; Mitsumasu, Chiaki; Igata, Sachiyo; Takano, Makoto; Minami, Tomoko; Yasukawa, Hideo; Okayama, Satoko; Nakamura, Keiichiro; Okabe, Yasunori; Tanaka, Eiichiro; Takemura, Genzou; Kosai, Ken-ichiro; Yamashita, Yushiro; Matsuishi, Toyojiro

    2015-01-01

    Methyl-CpG-binding protein 2 (MeCP2) is an epigenetic regulator of gene expression that is essential for normal brain development. Mutations in MeCP2 lead to disrupted neuronal function and can cause Rett syndrome (RTT), a neurodevelopmental disorder. Previous studies reported cardiac dysfunction, including arrhythmias in both RTT patients and animal models of RTT. In addition, recent studies indicate that MeCP2 may be involved in cardiac development and dysfunction, but its role in the developing and adult heart remains unknown. In this study, we found that Mecp2-null ESCs could differentiate into cardiomyocytes, but the development and further differentiation of cardiovascular progenitors were significantly affected in MeCP2 deficiency. In addition, we revealed that loss of MeCP2 led to dysregulation of endogenous cardiac genes and myocardial structural alterations, although Mecp2-null mice did not exhibit obvious cardiac functional abnormalities. Furthermore, we detected methylation of the CpG islands in the Tbx5 locus, and showed that MeCP2 could target these sequences. Taken together, these results suggest that MeCP2 is an important regulator of the gene-expression program responsible for maintaining normal cardiac development and cardiomyocyte structure. PMID:26073556

  18. Diversity in Copy Number and Structure of a Silkworm Morphogenetic Gene as a Result of Domestication

    OpenAIRE

    Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

    2011-01-01

    The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strain...

  19. Structure of the neutral capsular polysaccharide of Acinetobacter baumannii NIPH146 that carries the KL37 capsule gene cluster.

    Science.gov (United States)

    Arbatsky, Nikolay P; Shneider, Mikhail M; Kenyon, Johanna J; Shashkov, Alexander S; Popova, Anastasiya V; Miroshnikov, Konstantin A; Volozhantsev, Nikolay V; Knirel, Yuriy A

    2015-09-02

    Capsular polysaccharide (CPS) was isolated from Acinetobacter baumannii NIPH146, and the following structure of branched pentasaccharide repeating unit was established by sugar analyses along with 1D and 2D NMR spectroscopy: In comparison to most other known capsular polysaccharides of A. baumannii, the CPS studied is neutral and lacks any specific monosaccharide component. The synthesis, assembly and export of this structure could be attributed to genes in a novel capsule biosynthesis gene cluster, designated KL37, which was found in the NIPH146 genome. The CPS of A. baumannii NIPH146 shares the α-d-Galp-(1→6)-β-d-Glcp-(1→3)-d-GalpNAc-(1→ trisaccharide fragment with the CPS units of several A. baumannii strains, including ATCC 17978 and LUH 5537 that carry the KL3 and KL22 gene clusters, respectively. KL37 contains two genes for glycosyltransferases that are related to two glycosyltransferase genes present in both KL3 and KL22, and the encoded proteins could be tentatively assigned to linkages between sugars in the CPS repeat. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. The genomic structure of the DMBT1 gene

    DEFF Research Database (Denmark)

    Mollenhauer, J; Holmskov, U; Wiemann, S

    1999-01-01

    Increasing evidence has accumulated for an involvement of the inactivation of tumour suppressor genes at chromosome 10q in the carcinogenesis of brain tumours, melanomas, and carcinomas of the lung, the prostate, the pancreas, and the endometrium. The gene DMBT1 (Deleted in Malignant Brain Tumours...... 1) is located at chromosome 10q25.3-q26.1, within one of the putative intervals for tumour suppressor genes. DMBT1 is a member of the scavenger-receptor cysteine-rich (SRCR) superfamily and displays homozygous deletions or lack of expression in glioblastoma multiforme, medulloblastoma......, and in gastrointestinal and lung cancers. Based on these properties, DMBT1 has been proposed to be a candidate tumour suppressor gene. We have determined the genomic sequence of DMBT1 to allow analyses of mutations. The gene has at least 54 exons that span a genomic region of about 80 kb. We have identified a putative...

  1. Evidence-based gene models for structural and functional annotations of the oil palm genome.

    Science.gov (United States)

    Chan, Kuang-Lim; Tatarinova, Tatiana V; Rosli, Rozana; Amiruddin, Nadzirah; Azizi, Norazah; Halim, Mohd Amin Ab; Sanusi, Nik Shazana Nik Mohd; Jayanthi, Nagappan; Ponomarenko, Petr; Triska, Martin; Solovyev, Victor; Firdaus-Raih, Mohd; Sambanthamurthi, Ravigadevi; Murphy, Denis; Low, Eng-Ti Leslie

    2017-09-08

    Oil palm is an important source of edible oil. The importance of the crop, as well as its long breeding cycle (10-12 years) has led to the sequencing of its genome in 2013 to pave the way for genomics-guided breeding. Nevertheless, the first set of gene predictions, although useful, had many fragmented genes. Classification and characterization of genes associated with traits of interest, such as those for fatty acid biosynthesis and disease resistance, were also limited. Lipid-, especially fatty acid (FA)-related genes are of particular interest for the oil palm as they specify oil yields and quality. This paper presents the characterization of the oil palm genome using different gene prediction methods and comparative genomics analysis, identification of FA biosynthesis and disease resistance genes, and the development of an annotation database and bioinformatics tools. Using two independent gene-prediction pipelines, Fgenesh++ and Seqping, 26,059 oil palm genes with transcriptome and RefSeq support were identified from the oil palm genome. These coding regions of the genome have a characteristic broad distribution of GC3 (fraction of cytosine and guanine in the third position of a codon) with over half the GC3-rich genes (GC3 ≥ 0.75286) being intronless. In comparison, only one-seventh of the oil palm genes identified are intronless. Using comparative genomics analysis, characterization of conserved domains and active sites, and expression analysis, 42 key genes involved in FA biosynthesis in oil palm were identified. For three of them, namely EgFABF, EgFABH and EgFAD3, segmental duplication events were detected. Our analysis also identified 210 candidate resistance genes in six classes, grouped by their protein domain structures. We present an accurate and comprehensive annotation of the oil palm genome, focusing on analysis of important categories of genes (GC3-rich and intronless), as well as those associated with important functions, such as FA
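
    The GC3 measure used in this abstract (the fraction of G or C at third codon positions) can be illustrated with a minimal Python sketch. The function names gc3 and is_gc3_rich are hypothetical and are not taken from Fgenesh++, Seqping, or the paper; the sketch assumes an in-frame coding sequence given as a plain string, and only the 0.75286 cutoff comes from the abstract.

```python
GC3_RICH_THRESHOLD = 0.75286  # cutoff quoted in the abstract


def gc3(cds: str) -> float:
    """Fraction of G or C among third codon positions of an in-frame CDS."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    third_positions = seq[2:usable:3]  # bases at indices 2, 5, 8, ...
    if not third_positions:
        raise ValueError("sequence contains no complete codon")
    gc_count = sum(base in "GC" for base in third_positions)
    return gc_count / len(third_positions)


def is_gc3_rich(cds: str) -> bool:
    """True when the sequence meets the GC3-rich cutoff from the abstract."""
    return gc3(cds) >= GC3_RICH_THRESHOLD
```

    For the toy sequence ATGGCCAAG the third positions are G, C and G, so gc3 returns 1.0; real annotation pipelines would also need to handle ambiguous bases and frame detection, which this sketch leaves out.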

  2. The cartilage-derived, C-type lectin (CLECSF1): structure of the gene and chromosomal location.

    Science.gov (United States)

    Neame, P J; Tapp, H; Grimm, D R

    1999-09-03

    Cartilage is a tissue that is primarily extracellular matrix, the bulk of which consists of proteoglycan aggregates constrained within a collagen framework. Candidate components that organize the extracellular assembly of the matrix consist of collagens, proteoglycans and multimeric glycoproteins. We describe the human gene structure of a potential organizing factor, a cartilage-derived member of the C-type lectin superfamily (CLECSF1; C-type lectin superfamily) related to the serum protein, tetranectin. We show by Northern analysis that this protein is restricted to cartilage and locate the gene on chromosome 16q23. We have characterized 10.9 kb of sequence upstream of the first exon. Similarly to human tetranectin, there are three exons. The residues that are conserved between CLECSF1 and tetranectin suggest that the cartilage-derived protein forms a trimeric structure similar to that of tetranectin, with three N-terminal alpha-helical domains aggregating through hydrophobic faces. The globular, C-terminal domain that has been shown to bind carbohydrate in some members of the family and plasminogen in tetranectin, is likely to have a similar overall structure to that of tetranectin.

  3. Investigation of energy gene expressions and community structures of free and attached acidophilic bacteria in chalcopyrite bioleaching.

    Science.gov (United States)

    Zhu, Jianyu; Jiao, Weifeng; Li, Qian; Liu, Xueduan; Qin, Wenqing; Qiu, Guanzhou; Hu, Yuehua; Chai, Liyuan

    2012-12-01

    In order to better understand the bioleaching mechanism, expression of genes involved in energy conservation and community structure of free and attached acidophilic bacteria in chalcopyrite bioleaching were investigated. Using quantitative real-time PCR, we studied the expression of genes involved in energy conservation in free and attached Acidithiobacillus ferrooxidans during bioleaching of chalcopyrite. Sulfur oxidation genes of attached A. ferrooxidans were up-regulated while ferrous iron oxidation genes were down-regulated compared with free A. ferrooxidans in the solution. The up-regulation may be induced by elemental sulfur on the mineral surface. This conclusion was supported by the results of HPLC analysis. Sulfur-oxidizing Acidithiobacillus thiooxidans and ferrous-oxidizing Leptospirillum ferrooxidans were the members of the mixed culture in chalcopyrite bioleaching. Study of the community structure of free and attached bacteria showed that A. thiooxidans dominated the attached bacteria while L. ferrooxidans dominated the free bacteria. With respect to available energy sources during bioleaching of chalcopyrite, sulfur-oxidizers tend to be on the mineral surfaces whereas ferrous iron-oxidizers tend to be suspended in the aqueous phase. Taken together, these results indicate that the main role of attached acidophilic bacteria was to oxidize elemental sulfur and dissolution of chalcopyrite involved chiefly an indirect bioleaching mechanism.

  4. Full structure and insight into the gene cluster of the O-specific polysaccharide of Yersinia intermedia H9-36/83 (O:17).

    Science.gov (United States)

    Sizova, Olga V; Shashkov, Alexander S; Kondakova, Anna N; Knirel, Yuriy A; Shaikhutdinova, Rima Z; Ivanov, Sergei A; Kislichkina, Angelina A; Kadnikova, Lidia A; Bogun, Aleksandr G; Dentovskaya, Svetlana V

    2018-05-02

    Lipopolysaccharide was isolated from bacteria Yersinia intermedia H9-36/83 (O:17) and degraded with mild acid to give an O-specific polysaccharide, which was isolated by GPC on Sephadex G-50 and studied by sugar analysis and 1D and 2D NMR spectroscopy. The polysaccharide was found to contain 3-deoxy-3-[(R)-3-hydroxybutanoylamino]-d-fucose (d-Fuc3NR3Hb) and the following structure of the heptasaccharide repeating unit was established: The structure established is consistent with the gene content of the O-antigen gene cluster. The O-polysaccharide structure and gene cluster of Y. intermedia are related to those of Hafnia alvei 1211 and Escherichia coli O:103. Copyright © 2018 Elsevier Ltd. All rights reserved.

  5. Ultra high-resolution gene centric genomic structural analysis of a non-syndromic congenital heart defect, Tetralogy of Fallot.

    Directory of Open Access Journals (Sweden)

    Douglas C Bittel

    Full Text Available Tetralogy of Fallot (TOF) is one of the most common severe congenital heart malformations. Great progress has been made in identifying key genes that regulate heart development, yet approximately 70% of TOF cases are sporadic and nonsyndromic with no known genetic cause. We created an ultra high-resolution gene centric comparative genomic hybridization (gcCGH) microarray based on 591 genes with a validated association with cardiovascular development or function. We used our gcCGH array to analyze the genomic structure of 34 infants with sporadic TOF without a deletion on chromosome 22q11.2 (n male = 20; n female = 14; age range of 2 to 10 months). Using our custom-made gcCGH microarray platform, we identified a total of 613 copy number variations (CNVs) ranging in size from 78 base pairs to 19.5 Mb. We identified 16 subjects with 33 CNVs that contained 13 different genes which are known to be directly associated with heart development. Additionally, there were 79 genes from the broader list of genes that were partially or completely contained in a CNV. All 34 individuals examined had at least one CNV involving these 79 genes. Furthermore, we had available whole genome exon arrays from right ventricular tissue in 13 of our subjects. We analyzed these for correlations between copy number and gene expression level. Surprisingly, we could detect only one clear association between CNVs and expression (GSTT1) for any of the 591 focal genes on the gcCGH array. The expression levels of GSTT1 were correlated with copy number in all cases examined (r = 0.95, p = 0.001). We identified a large number of small CNVs in genes with varying associations with heart development. Our results illustrate the complexity of human genome structural variation and underscore the need for multifactorial assessment of potential genetic/genomic factors that contribute to congenital heart defects.
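
    The correlation between copy number and expression reported above is quoted as an r value. Assuming that r denotes Pearson's correlation coefficient (the abstract does not name the statistic), a minimal Python sketch of that calculation follows; the function name pearson_r is hypothetical and the sketch does not reproduce the authors' analysis or data.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and of cross-products about the means.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return sxy / math.sqrt(sxx * syy)
```

    In this setting xs would hold per-subject copy numbers of a gene and ys the matching expression levels; a value near 1 indicates that expression rises almost linearly with copy number.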

  6. Structure of Rot, a global regulator of virulence genes in Staphylococcus aureus.

    Science.gov (United States)

    Zhu, Yuwei; Fan, Xiaojiao; Zhang, Xu; Jiang, Xuguang; Niu, Liwen; Teng, Maikun; Li, Xu

    2014-09-01

    Staphylococcus aureus is a highly versatile pathogen that can infect human tissue by producing a large arsenal of virulence factors that are tightly regulated by a complex regulatory network. Rot, which shares sequence similarity with SarA homologues, is a global regulator that regulates numerous virulence genes. However, the recognition model of Rot for the promoter region of target genes and the putative regulation mechanism remain elusive. In this study, the 1.77 Å resolution X-ray crystal structure of Rot is reported. The structure reveals that two Rot molecules form a compact homodimer, each of which contains a typical helix-turn-helix module and a β-hairpin motif connected by a flexible loop. Fluorescence polarization results indicate that Rot preferentially recognizes AT-rich dsDNA with ~30-base-pair nucleotides and that the conserved positively charged residues on the winged-helix motif are vital for binding to the AT-rich dsDNA. It is proposed that the DNA-recognition model of Rot may be similar to that of SarA, SarR and SarS, in which the helix-turn-helix motifs of each monomer interact with the major grooves of target dsDNA and the winged motifs contact the minor grooves. Interestingly, the structure shows that Rot adopts a novel dimerization model that differs from that of other SarA homologues. As expected, perturbation of the dimer interface abolishes the dsDNA-binding ability of Rot, suggesting that Rot functions as a dimer. In addition, the results have been further confirmed in vivo by measuring the transcriptional regulation of α-toxin, a major virulence factor produced by most S. aureus strains.

  7. Search for 5'-leader regulatory RNA structures based on gene annotation aided by the RiboGap database.

    Science.gov (United States)

    Naghdi, Mohammad Reza; Smail, Katia; Wang, Joy X; Wade, Fallou; Breaker, Ronald R; Perreault, Jonathan

    2017-03-15

    The discovery of noncoding RNAs (ncRNAs) and their importance for gene regulation led us to develop bioinformatics tools to pursue the discovery of novel ncRNAs. Finding ncRNAs de novo is challenging, first due to the difficulty of retrieving large numbers of sequences for given gene activities, and second due to exponential demands on calculation needed for comparative genomics on a large scale. Recently, several tools for the prediction of conserved RNA secondary structure were developed, but many of them are not designed to uncover new ncRNAs, or are too slow for conducting analyses on a large scale. Here we present various approaches using the database RiboGap as a primary tool for finding known ncRNAs and for uncovering simple sequence motifs with regulatory roles. This database also can be used to easily extract intergenic sequences of eubacteria and archaea to find conserved RNA structures upstream of given genes. We also show how to extend analysis further to choose the best candidate ncRNAs for experimental validation. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Population structure of Tor tor inferred from mitochondrial gene cytochrome b.

    Science.gov (United States)

    Pasi, Komal Shyamakant; Lakra, W S; Bhatt, J P; Goswami, M; Malakar, A Kr

    2013-06-01

    Tor tor, commonly called Tor mahseer, is a high-valued food and game fish endemic to the trans-Himalayan region. A 967 bp region of the mitochondrial cytochrome b (cyt b) gene was used to estimate the population structure of T. tor. Three populations of T. tor were collected from the Narmada (Hosangabad), Ken (Madla), and Parbati (Sheopur) rivers in Madhya Pradesh, India. The sequence analysis revealed that the nucleotide diversity (π) was low, ranging from 0.000 to 0.0150. Haplotype diversity (h) ranged from 0.000 to 1.000. The analysis of molecular variance indicated significant genetic divergence among the three populations of T. tor. The neighbor-joining tree also showed that all individuals from the three populations clustered into three distinct clades. The data generated by the cyt b marker revealed interesting insights into the population structure of T. tor, which would serve as baseline data for conservation and management of the mahseer fishery.
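
    The two summary statistics quoted above, haplotype diversity (h) and nucleotide diversity (π), can be sketched in Python as follows. The abstract does not say which estimators or software were used, so this sketch assumes the commonly used definitions: h = n/(n-1) * (1 - sum of squared haplotype frequencies), and π as the mean number of pairwise differences per site. The function names are hypothetical, and the input is assumed to be a list of aligned sequences of equal length.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """h = n/(n-1) * (1 - sum(p_i^2)) over distinct haplotypes."""
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1 - sum(p * p for p in freqs))


def nucleotide_diversity(seqs):
    """Mean pairwise differences per site across all sequence pairs."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned and non-empty")
    pairs = list(combinations(seqs, 2))
    # Count mismatching sites for every unordered pair of sequences.
    total_diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total_diffs / (len(pairs) * length)
```

    Under these definitions a sample of identical sequences gives h = 0 and π = 0, while a sample in which every sequence is a distinct haplotype gives h = 1, which matches the 0.000 to 1.000 range reported for h.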

  9. Population structure from NOS genes correlates with geographical differences in coronary incidence across Europe.

    Science.gov (United States)

    Carreras-Torres, Robert; Ferran, Albert; Zanetti, Daniela; Esteban, Esther; Varesi, Laurent; Pojskic, Naris; Coia, Valentina; Chaabani, Hassen; Via, Marc; Moral, Pedro

    2016-12-01

    The population analysis of cardiovascular risk and non-risk genetic variation can help to identify adaptive or random demographic processes that shaped coronary incidence variation across geography. In this study, 114 single nucleotide polymorphisms and 17 tandem repeat polymorphisms from Nitric Oxide Synthases (NOS) regions were analyzed in 1686 individuals from 35 populations from Europe, North Africa, and the Middle East. NOS genes encode key enzymes for nitric oxide availability, which is involved in several cardiovascular processes. These genetic variations were used to test for selection and to infer the population structure of NOS regions. Moreover, we tested whether the variation in the incidence of coronary events and in the levels of classical risk factors in 11 of these European populations could be explained by the population structure estimates. Our results supported, first, the absence of clear signs of selection for NOS genetic variants associated with cardiovascular diseases, and second, the presence of a continuous genetic pattern of variation across European and North African populations without a Mediterranean barrier for gene flow. Finally, population structure estimates from NOS regions are closely correlated with coronary event rates and classical risk parameters (explaining 39-98%) among European populations. Our results reinforce the hypothesis that genetic bases of cardiovascular diseases and associated complex phenotypes could be geographically shaped by random demographic processes. © 2016 Wiley Periodicals, Inc.

  10. Fractal structure in the volumetric contrast enhancement of malignant gliomas as a marker of oxidative metabolic pathway gene expression

    NARCIS (Netherlands)

    Miller, Kai J.; Berendsen, Sharon; Seute, Tatjana; Yeom, Kristen; Gephardt, Melanie H.; Grant, Gerald A.; Robe, Pierre A.

    2017-01-01

    Background: Fractal structure is found throughout many processes in nature, and often arises from sets of simple rules. We examined MRI contrast enhancement patterns from glioblastoma patients for evidence of fractal structure and correlated these with gene expression patterns. Methods: For 39

  11. Analysis of TCRAD gene recombination: radio-induced rearrangement and signal joint structure

    International Nuclear Information System (INIS)

    Touvrey, C.

    2005-09-01

    We have shown that irradiation of pre-TCR-deficient CD3ε-/- mice restores thymocyte differentiation through both a p53-dependent and a p53-independent pathway. Events that are normally associated during thymocyte development are dissociated in response to radiation exposure. Both of these pathways require LAT expression. Therefore, radiation exposure activates pre-TCR-like signals. TCRA gene rearrangement is induced following radiation exposure. The signal joints resulting from TCRA gene rearrangement have the same structure as those found in wild-type mice. All signal joints analyzed in unmanipulated wild-type mice exhibit junctional diversity. This diversity results mainly from TdT activity. We present evidence that proteins involved in DNA repair and genomic stability participate in signal joint (SJ) formation. We propose that signal joint diversity is not an aberrant process but a key feature of V(D)J recombination. Taken together, this work increases our understanding of the molecular events associated with V(D)J recombination. (author)

  12. Structuring osteosarcoma knowledge: an osteosarcoma-gene association database based on literature mining and manual annotation.

    Science.gov (United States)

    Poos, Kathrin; Smida, Jan; Nathrath, Michaela; Maugg, Doris; Baumhoer, Daniel; Neumann, Anna; Korsching, Eberhard

    2014-01-01

    Osteosarcoma (OS) is the most common primary bone cancer exhibiting high genomic instability. This genomic instability affects multiple genes and microRNAs to a varying extent depending on patient and tumor subtype. Massive research is ongoing to identify genes including their gene products and microRNAs that correlate with disease progression and might be used as biomarkers for OS. However, the genomic complexity hampers the identification of reliable biomarkers. Up to now, clinico-pathological factors are the key determinants to guide prognosis and therapeutic treatments. Each day, new studies about OS are published and complicate the acquisition of information to support biomarker discovery and therapeutic improvements. Thus, it is necessary to provide a structured and annotated view on the current OS knowledge that is quickly and easily accessible to researchers in the field. Therefore, we developed a publicly available database and Web interface that serves as a resource for OS-associated genes and microRNAs. Genes and microRNAs were collected using an automated dictionary-based gene recognition procedure followed by manual review and annotation by experts in the field. In total, 911 genes and 81 microRNAs related to 1331 PubMed abstracts were collected (last update: 29 October 2013). Users can evaluate genes and microRNAs according to their potential prognostic and therapeutic impact, the experimental procedures, the sample types, the biological contexts and microRNA target gene interactions. Additionally, a pathway enrichment analysis of the collected genes highlights different aspects of OS progression. OS requires pathways commonly deregulated in cancer but also features OS-specific alterations like deregulated osteoclast differentiation. To our knowledge, this is the first effort of an OS database containing manually reviewed and annotated up-to-date OS knowledge. It might be a useful resource especially for the bone tumor research community, as specific

  13. Structure and chromosomal localization of the human lymphotoxin gene

    International Nuclear Information System (INIS)

    Nedwin, G.E.; Jarrett-Nedwin, J.; Smith, D.H.; Naylor, S.L.; Sakaguchi, A.Y.; Goeddel, D.V.; Gray, P.W.

    1987-01-01

    The authors have isolated, sequenced, and determined the chromosomal localization of the gene encoding human lymphotoxin (LT). The single copy gene was isolated from a human genomic library using a ³²P-labeled 116 bp synthetic DNA fragment whose sequence was based on the NH₂-terminal amino acid sequence of LT. The gene spans 3 kb of DNA and is interrupted by three intervening sequences. The LT gene is located on human chromosome 6, as determined by Southern blot analysis of human-murine hybrid DNA. Putative transcriptional control regions and areas of homology with the promoters of interferon and other genes are identified.

  14. Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

    Science.gov (United States)

    Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

    2009-04-21

    To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease
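    The back-translation step described in this abstract (protein sequence to codon-engineered gene sequence, using a codon usage table with a minimal usage threshold) can be illustrated with a minimal Python sketch. This is not Gene Composer's implementation: the function names, the `min_usage` parameter, and the codon frequencies below are hypothetical and chosen only for illustration, and the sketch simply picks the most frequent codon that passes the threshold.

```python
# Hypothetical codon usage table: amino acid -> {codon: relative frequency}.
# The frequencies are illustrative assumptions, not values from a real
# organism and not data from Gene Composer.
CODON_USAGE = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.47, "TTA": 0.14, "TTG": 0.13,
          "CTT": 0.12, "CTC": 0.10, "CTA": 0.04},
    "E": {"GAA": 0.68, "GAG": 0.32},
    "*": {"TAA": 0.61, "TGA": 0.30, "TAG": 0.09},
}


def back_translate(protein, usage=CODON_USAGE, min_usage=0.10):
    """Return a DNA sequence encoding `protein`.

    For each residue, codons whose relative frequency is below
    `min_usage` are discarded and the most frequent remaining codon
    is chosen.
    """
    codons = []
    for residue in protein:
        allowed = {c: f for c, f in usage[residue].items() if f >= min_usage}
        if not allowed:
            raise ValueError(f"no codon for {residue!r} passes the threshold")
        codons.append(max(allowed, key=allowed.get))
    return "".join(codons)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


gene = back_translate("MKLE*")
print(gene, round(gc_percent(gene), 1))
```

    A fuller design would also have to balance G:C content and avoid unwanted sequence features through synonymous codon choice, as the abstract describes; the sketch only reports the G:C percentage of the result.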

  15. Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

    Directory of Open Access Journals (Sweden)

    Mixon Mark

    2009-04-01

    Full Text Available Background: To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results: An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion: We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene

  16. Duplication of the IGFBP-2 gene in teleost fish: protein structure and functionality conservation and gene expression divergence.

    Directory of Open Access Journals (Sweden)

    Jianfeng Zhou

    Full Text Available BACKGROUND: Insulin-like growth factor binding protein-2 (IGFBP-2) is a secreted protein that binds and regulates IGF actions in controlling growth, development, reproduction, and aging. Elevated expression of IGFBP-2 is often associated with progression of many types of cancers. METHODOLOGY/PRINCIPAL FINDINGS: We report the identification and characterization of two IGFBP-2 genes in zebrafish and four other teleost fish. Comparative genomics and structural analyses suggest that they are co-orthologs of the human IGFBP-2 gene. Biochemical assays show that both zebrafish igfbp-2a and -2b encode secreted proteins that bind IGFs. These two genes exhibit distinct spatiotemporal expression patterns. During embryogenesis, IGFBP-2a mRNA is initially detected in the lens, then in the brain boundary vasculature, and subsequently becomes highly expressed in the liver. In the adult stage, liver has the highest levels of IGFBP-2a mRNA, followed by the brain. Low levels of IGFBP-2a mRNA were detected in muscle and in the gonad in male adults only. IGFBP-2b mRNA is detected initially in all tissues at low levels, but later becomes abundant in the liver. In adult males, IGFBP-2b mRNA is only detected in the liver. In adult females, it is also found in the gut, kidney, ovary, and muscle. To gain insights into how the IGFBP-2 genes may have evolved through partitioning of ancestral functions, functional and mechanistic studies were carried out. Expression of zebrafish IGFBP-2a and -2b caused significant decreases in the growth and developmental rates and their effects are comparable to those of human IGFBP-2. IGFBP-2 mutants with altered IGF binding-, RGD-, and heparin-binding sites were generated and their actions examined. While mutating the RGD and heparin binding sites had little effect, altering the IGF binding site abolished its biological activity. CONCLUSIONS/SIGNIFICANCE: These results suggest that IGFBP-2 is a conserved regulatory protein and it inhibits

  17. The primary structures of two leghemoglobin genes from soybean

    DEFF Research Database (Denmark)

    Hyldig-Nielsen, J J; Jensen, E O; Paludan, K

    1982-01-01

    We present the complete nucleotide sequences of two leghemoglobin genes isolated from soybean DNA. Both genes contain three intervening sequences which interrupt the two coding sequences in identical positions. The 5' and 3' flanking sequences in both genes contain conserved sequences similar...

  18. Structure and expression of the chicken calmodulin I gene

    DEFF Research Database (Denmark)

    Ye, Q; Berchtold, M W

    1997-01-01

    The chicken calmodulin I (CaMI) gene has been isolated and characterized at the level of cDNA and genomic DNA. The deduced amino acid (aa) sequence is identical to that of chicken CaMII, which consists of 148 aa. The CaMI gene contains six exons. Its intron/exon organization is identical to that of the chicken CaMII and the CaMI and CaMIII genes of rat and human. Expression of the CaMI gene was detected in all chicken tissues examined, although at varying levels. The gene is transcribed into four mRNAs of 0.8, 1.4, 1.7 and 4.4 kb as determined by Northern blot analysis. Our results demonstrate that the "multigene-one-protein" principle of CaM synthesis is not only applicable to mammals whose CaM is encoded by three different genes, but also to chickens.

  19. Independent Gene Discovery and Testing

    Science.gov (United States)

    Palsule, Vrushalee; Coric, Dijana; Delancy, Russell; Dunham, Heather; Melancon, Caleb; Thompson, Dennis; Toms, Jamie; White, Ashley; Shultz, Jeffry

    2010-01-01

    A clear understanding of basic gene structure is critical when teaching molecular genetics, the central dogma and the biological sciences. We sought to create a gene-based teaching project to improve students' understanding of gene structure and to integrate this into a research project that can be implemented by instructors at the secondary level…

  20. Preservation of bone mass and structure in hibernating black bears (Ursus americanus) through elevated expression of anabolic genes.

    Science.gov (United States)

    Fedorov, Vadim B; Goropashnaya, Anna V; Tøien, Øivind; Stewart, Nathan C; Chang, Celia; Wang, Haifang; Yan, Jun; Showe, Louise C; Showe, Michael K; Donahue, Seth W; Barnes, Brian M

    2012-06-01

    Physical inactivity reduces mechanical load on the skeleton, which leads to losses of bone mass and strength in non-hibernating mammalian species. Although bears are largely inactive during hibernation, they show no loss in bone mass and strength. To obtain insight into molecular mechanisms preventing disuse bone loss, we conducted a large-scale screen of transcriptional changes in trabecular bone comparing winter hibernating and summer non-hibernating black bears using a custom 12,800 probe cDNA microarray. A total of 241 genes were differentially expressed (P 1.4) in the ilium bone of bears between winter and summer. The Gene Ontology and Gene Set Enrichment Analysis showed an elevated proportion in hibernating bears of overexpressed genes in six functional sets of genes involved in anabolic processes of tissue morphogenesis and development including skeletal development, cartilage development, and bone biosynthesis. Apoptosis genes demonstrated a tendency for downregulation during hibernation. No coordinated directional changes were detected for genes involved in bone resorption, although some genes responsible for osteoclast formation and differentiation (Ostf1, Rab9a, and c-Fos) were significantly underexpressed in bone of hibernating bears. Elevated expression of multiple anabolic genes without induction of bone resorption genes, and the down regulation of apoptosis-related genes, likely contribute to the adaptive mechanism that preserves bone mass and structure through prolonged periods of immobility during hibernation.

  1. Bioinformatics study of the mangrove actin genes

    Science.gov (United States)

    Basyuni, M.; Wasilah, M.; Sumardi

    2017-01-01

    This study describes bioinformatics methods used to analyze eight actin genes from mangrove plants deposited in DDBJ/EMBL/GenBank and to predict their structure, composition, subcellular localization, similarity, and phylogeny. The physical and chemical properties varied among the eight mangrove actin genes. The percentage of secondary structure elements followed the order α-helix > random coil > extended chain structure for BgActl, KcActl, RsActl, and A. corniculatum Act. In contrast, the remaining actin genes followed the order random coil > extended chain structure > α-helix. The prediction of secondary structure was therefore performed to obtain the necessary structural information. The values for chloroplast target, signal peptide, or mitochondrial target were too small, indicating that mangrove actin genes have no chloroplast or mitochondrial transit peptide and no signal peptide of the secretion pathway. These results suggest the importance of understanding the diversity and functional properties of the different amino acids in mangrove actin genes. To clarify the relationships among the mangrove actin genes, a phylogenetic tree was constructed. Three groups of mangrove actin genes were formed: the first group contains B. gymnorrhiza BgAct and R. stylosa RsActl; the second cluster, the largest group, consists of five actin genes; and the last branch consists of one gene, B. sexangula Act. The present study therefore supports previous results showing that plant actin genes form distinct clusters in the tree.

  2. Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

    Science.gov (United States)

    Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

    2017-04-01

    There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion, and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra- and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.

  3. Structural and functional analysis of mouse Msx1 gene promoter: sequence conservation with human MSX1 promoter points at potential regulatory elements.

    Science.gov (United States)

    Gonzalez, S M; Ferland, L H; Robert, B; Abdelhay, E

    1998-06-01

    Vertebrate Msx genes are related to one of the most divergent homeobox genes of Drosophila, the muscle segment homeobox (msh) gene, and are expressed in a well-defined pattern at sites of tissue interactions. This pattern of expression is conserved in vertebrates as diverse as quail, zebrafish, and mouse in a range of sites including neural crest, appendages, and craniofacial structures. In the present work, we performed structural and functional analyses in order to identify potential cis-acting elements that may be regulating Msx1 gene expression. To this end, a 4.9-kb segment of the 5'-flanking region was sequenced and analyzed for transcription-factor binding sites. Four regions showing a high concentration of these sites were identified. Transfection assays with fragments of regulatory sequences driving the expression of the bacterial lacZ reporter gene showed that a region of 4 kb upstream of the transcription start site contains positive and negative elements responsible for controlling gene expression. Interestingly, a fragment of 130 bp seems to contain the minimal elements necessary for gene expression, as its removal completely abolishes gene expression in cultured cells. These results are reinforced by comparison of this region with the human Msx1 gene promoter, which shows extensive conservation, including many consensus binding sites, suggesting a regulatory role for them.

  4. Isolation, structural analysis, and expression characteristics of the maize nuclear factor Y gene families

    International Nuclear Information System (INIS)

    Zhang, Zhongbao; Li, Xianglong; Zhang, Chun; Zou, Huawen; Wu, Zhongyi

    2016-01-01

    NUCLEAR FACTOR-Y (NF-Y) has been shown to play an important role in growth, development, and response to environmental stress. A NF-Y complex, which consists of three subunits, NF-YA, NF-YB, and NF-YC, binds to CCAAT sequences in a promoter to control the expression of target genes. Although NF-Y proteins have been reported in Arabidopsis and rice, a comprehensive and systematic analysis of ZmNF-Y genes has not yet been performed. To examine the functions of ZmNF-Y genes in this family, we isolated and characterized 50 ZmNF-Y (14 ZmNF-YA, 18 ZmNF-YB, and 18 ZmNF-YC) genes in an analysis of the maize genome. The 50 ZmNF-Y genes were distributed on all 10 maize chromosomes, and 12 paralogs were identified. Multiple alignments showed that maize ZmNF-Y family proteins had conserved regions and relatively variable N-terminal or C-terminal domains. The comparative syntenic map illustrated 40 paralogous NF-Y gene pairs among the 10 maize chromosomes. Microarray data showed that the ZmNF-Y genes had tissue-specific expression patterns in various maize developmental stages and in response to biotic and abiotic stresses. The results suggested that ZmNF-YB2, 4, 8, 10, 13, and 16 and ZmNF-YC6, 8, and 15 were induced, while ZmNF-YA1, 3, 4, 6, 7, 10, 12, and 13, ZmNF-YB15, and ZmNF-YC3 and 9 were suppressed by drought stress. ZmNF-YA3, ZmNF-YA8 and ZmNF-YA12 were upregulated after infection by the three pathogens, while ZmNF-YA1 and ZmNF-YB2 were suppressed. These results indicate that the ZmNF-Ys may have significant roles in the response to abiotic and biotic stresses. - Highlights: • We identified a total of 50 members of the ZmNF-Y gene family in the maize genome. • We analyzed the gene structure and protein architecture of ZmNF-Y genes. • Evolution patterns and phylogenetic relationships were analyzed among the 50 ZmNF-Y genes. • Expression patterns of ZmNF-Ys were detected in various maize tissues. • Transcript levels of ZmNF-Ys were measured under various abiotic and biotic stresses.

  5. The structure of an unusual leghemoglobin gene from soybean

    DEFF Research Database (Denmark)

    Wiborg, O; Hyldig-Nielsen, J J; Jensen, E O

    1983-01-01

    A clone containing an unusual leghemoglobin (Lb) gene was isolated from a soybean DNA library present in Charon 4A phage. DNA sequence analysis revealed that the isolated Lb gene has three intervening sequences (IVS-1, IVS-2 and IVS-3) located in the same positions as those found in other Lb gene...... is mutated in two regions which seem to be important for transcription. It is, therefore, tentatively suggested that the isolated Lb gene is non-functional, and consequently is an Lb pseudogene.

  6. Population structure of barley landrace populations and gene-flow with modern varieties.

    Directory of Open Access Journals (Sweden)

    Elisa Bellucci

    Full Text Available Landraces are heterogeneous plant varieties that are reproduced by farmers as populations that are subject to both artificial and natural selection. Landraces are distinguished by farmers due to their specific traits, and different farmers often grow different populations of the same landrace. We used simple sequence repeats (SSRs) to analyse 12 barley landrace populations from Sardinia from two collections spanning 10 years. We analysed the population structure, and compared the population diversity of the landraces that were collected at field level (population). We used a representative pool of barley varieties for diversity comparisons and to analyse the effects of gene flow from modern varieties. We found that the Sardinian landraces are a distinct gene pool from those of both two-row and six-row barley varieties. There is also a low, but significant, mean level and population-dependent level of introgression from the modern varieties into the Sardinian landraces. Moreover, we show that the Sardinian landraces have the same level of gene diversity as the representative sample of modern commercial varieties grown in Italy in the last decades, even at the within-population level. Thus, these populations represent crucial sources of germplasm that will be useful for crop improvement and for population genomics studies and association mapping, to identify genes, loci and genome regions responsible for adaptive variations. Our data also suggest that landraces are a source of valuable germplasm for sustainable agriculture in the context of future climate change, and that in-situ conservation strategies based on farmer use can preserve the genetic identity of landraces while allowing adaptation to local environments.

  7. Activation of the alpha-globin gene expression correlates with dramatic upregulation of nearby non-globin genes and changes in local and large-scale chromatin spatial structure.

    Science.gov (United States)

    Ulianov, Sergey V; Galitsyna, Aleksandra A; Flyamer, Ilya M; Golov, Arkadiy K; Khrameeva, Ekaterina E; Imakaev, Maxim V; Abdennur, Nezar A; Gelfand, Mikhail S; Gavrilov, Alexey A; Razin, Sergey V

    2017-07-11

    In homeotherms, the alpha-globin gene clusters are located within permanently open genome regions enriched in housekeeping genes. Terminal erythroid differentiation results in dramatic upregulation of alpha-globin genes making their expression comparable to the rRNA transcriptional output. Little is known about the influence of the erythroid-specific alpha-globin gene transcription outburst on adjacent, widely expressed genes and large-scale chromatin organization. Here, we have analyzed the total transcription output, the overall chromatin contact profile, and CTCF binding within the 2.7 Mb segment of chicken chromosome 14 harboring the alpha-globin gene cluster in cultured lymphoid cells and cultured erythroid cells before and after induction of terminal erythroid differentiation. We found that, similarly to mammalian genomes, the chicken genome is organized into TADs and compartments. Full activation of the alpha-globin gene transcription in differentiated erythroid cells is correlated with upregulation of several adjacent housekeeping genes and the emergence of abundant intergenic transcription. An extended chromosome region encompassing the alpha-globin cluster becomes significantly decompacted in differentiated erythroid cells, and depleted in CTCF binding and CTCF-anchored chromatin loops, while the sub-TAD harboring the alpha-globin gene cluster and the upstream major regulatory element (MRE) becomes highly enriched with chromatin interactions as compared to lymphoid and proliferating erythroid cells. The alpha-globin gene domain and the neighboring loci reside within the A-like chromatin compartment in both lymphoid and erythroid cells and become further segregated from the upstream gene desert upon terminal erythroid differentiation. Our findings demonstrate that the effects of tissue-specific transcription activation are not restricted to the host genomic locus but affect the overall chromatin structure and transcriptional output of the encompassing

  8. The UDP-glucuronate decarboxylase gene family in Populus: structure, expression, and association genetics.

    Directory of Open Access Journals (Sweden)

    Qingzhang Du

    Full Text Available In woody crop plants, the oligosaccharide components of the cell wall are essential for important traits such as bioenergy content, growth, and structural wood properties. UDP-glucuronate decarboxylase (UXS) is a key enzyme in the synthesis of UDP-xylose for the formation of xylans during cell wall biosynthesis. Here, we isolated a multigene family of seven members (PtUXS1-7) encoding UXS from Populus tomentosa, the first investigation of UXSs in a tree species. Analysis of gene structure and phylogeny showed that the PtUXS family could be divided into three groups (PtUXS1/4, PtUXS2/5, and PtUXS3/6/7), consistent with the tissue-specific expression patterns of each PtUXS. We further evaluated the functional consequences of nucleotide polymorphisms in PtUXS1. In total, 243 single-nucleotide polymorphisms (SNPs) were identified, with a high frequency of SNPs (1/18 bp) and nucleotide diversity (πT = 0.01033, θw = 0.01280). Linkage disequilibrium (LD) analysis showed that LD did not extend over the entire gene (r² < 0.1, P < 0.001, within 700 bp). SNP- and haplotype-based association analysis showed that nine SNPs (Q < 0.10) and 12 haplotypes (P < 0.05) were significantly associated with growth and wood property traits in the association population (426 individuals), with 2.70% to 12.37% of the phenotypic variation explained. Four significant single-marker associations (Q < 0.10) were validated in a linkage mapping population of 1200 individuals. Also, variation in RNA transcript accumulation among genotypic classes of SNP10 was further confirmed in the association population. This is the first comprehensive study of the UXS gene family in woody plants, and lays the foundation for genetic improvements of wood properties and growth in trees using genetic engineering or marker-assisted breeding.
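    The θw statistic reported in this abstract is Watterson's estimator of nucleotide diversity, which in its standard per-site form is S / (a_n · L), where S is the number of segregating sites, L the number of sites surveyed, and a_n the sum of 1/i for i from 1 to n−1 for n sampled sequences. The Python sketch below shows that calculation under these standard definitions. The abstract does not state the number of sequences or the length surveyed, so the inputs in the usage line are hypothetical and the output is not meant to reproduce the reported value.

```python
def a_n(n_sequences):
    """Watterson's correction factor: sum of 1/i for i = 1 .. n-1."""
    return sum(1.0 / i for i in range(1, n_sequences))


def watterson_theta(segregating_sites, n_sequences, n_sites):
    """Per-site Watterson estimator: S / (a_n * L)."""
    if n_sequences < 2:
        raise ValueError("at least two sequences are required")
    if n_sites <= 0:
        raise ValueError("n_sites must be positive")
    return segregating_sites / (a_n(n_sequences) * n_sites)


# Hypothetical inputs for illustration only (not taken from the study):
# 10 segregating sites among 4 sequences across 1000 aligned sites.
print(watterson_theta(segregating_sites=10, n_sequences=4, n_sites=1000))
```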

  9. Effect of Flooding and the nosZ Gene in Bradyrhizobia on Bradyrhizobial Community Structure in the Soil.

    Science.gov (United States)

    Saeki, Yuichi; Nakamura, Misato; Mason, Maria Luisa T; Yano, Tsubasa; Shiro, Sokichi; Sameshima-Saito, Reiko; Itakura, Manabu; Minamisawa, Kiwamu; Yamamoto, Akihiro

    2017-06-24

    We investigated the effects of the water status (flooded or non-flooded) and presence of the nosZ gene in bradyrhizobia on the bradyrhizobial community structure in a factorial experiment that examined three temperature levels (20°C, 25°C, and 30°C) and two soil types (andosol and gray lowland soil) using microcosm incubations. All microcosms were inoculated with Bradyrhizobium japonicum USDA6ᵀ, B. japonicum USDA123, and B. elkanii USDA76ᵀ, which do not possess the nosZ gene, and then half received B. diazoefficiens USDA110ᵀ wt (wt for the wild-type) and the other half received B. diazoefficiens USDA110ΔnosZ. USDA110ᵀ wt possesses the nosZ gene, which encodes N₂O reductase; 110ΔnosZ, a mutant variant, does not. Changes in the community structure after 30- and 60-d incubations were investigated by denaturing-gradient gel electrophoresis and an image analysis. USDA6ᵀ and 76ᵀ strains slightly increased in non-flooded soil regardless of which USDA110ᵀ strain was present. In flooded microcosms with the USDA110ᵀ wt strain, USDA110ᵀ wt became dominant, whereas in microcosms with the USDA110ΔnosZ, a similar change in the community structure occurred to that in non-flooded microcosms. These results suggest that possession of the nosZ gene confers a competitive advantage to B. diazoefficiens USDA110ᵀ in flooded soil. We herein demonstrated that the dominance of B. diazoefficiens USDA110ᵀ wt within the soil bradyrhizobial population may be enhanced by periods of flooding or waterlogging systems such as paddy-soybean rotations because it appears to have the ability to thrive in moderately anaerobic soil.

  10. Use of deep whole-genome sequencing data to identify structure risk variants in breast cancer susceptibility genes.

    Science.gov (United States)

    Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong

    2018-03-01

    Functional disruptions of susceptibility genes by large genomic structural variant (SV) deletions in germlines are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (> 30x) whole-genome sequencing (WGS) data generated in blood samples from 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early-onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of the BRCA1 (approximately 80 kb, in two patients) and TP53 (∼1.6 kb, in two patients) genes, and of intronic regions (∼1 kb) of the PALB2 (one patient), PTEN (three patients) and RAD51C (one patient) genes. We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes, and the identification of such SV deletions may improve clinical testing.
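
    The "detected by at least three tools" rule described above can be sketched as a simple vote count. This is a minimal illustration under stated assumptions: the call records and gene labels (GENE_A, GENE_B, GENE_C) are invented placeholders, and agreement is judged at the gene level, whereas a real pipeline would more likely compare breakpoint coordinates.

```python
from collections import defaultdict

MIN_CALLERS = 3  # threshold taken from the abstract: at least three tools agree

# Hypothetical (caller, gene) deletion calls; not results from the study.
calls = [
    ("GenomeSTRiP", "GENE_A"), ("Delly", "GENE_A"), ("Manta", "GENE_A"),
    ("Delly", "GENE_B"), ("Manta", "GENE_B"),
    ("Pindel", "GENE_B"), ("BreakDancer", "GENE_B"),
    ("Pindel", "GENE_C"), ("Delly", "GENE_C"),
    ("Delly", "GENE_C"),  # duplicate call from the same tool counts once
]


def consensus_genes(call_records, min_callers=MIN_CALLERS):
    """Return genes whose deletion is supported by >= min_callers distinct tools."""
    callers_by_gene = defaultdict(set)
    for caller, gene in call_records:
        callers_by_gene[gene].add(caller)
    return sorted(
        gene for gene, callers in callers_by_gene.items()
        if len(callers) >= min_callers
    )


supported = consensus_genes(calls)
print(supported)
```

    Storing callers in a set rather than a counter keeps repeated calls from one tool from inflating the support for a gene.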

  11. Structure of the horseradish peroxidase isozyme C genes.

    Science.gov (United States)

    Fujiyama, K; Takemura, H; Shibayama, S; Kobayashi, K; Choi, J K; Shinmyo, A; Takano, M; Yamada, Y; Okada, H

    1988-05-02

    We have isolated, cloned and characterized three cDNAs and two genomic DNAs corresponding to the mRNAs and genes for the horseradish (Armoracia rusticana) peroxidase isoenzyme C (HRP C). The amino acid sequence of HRP C1, deduced from the nucleotide sequence of one of the cDNA clones, pSK1, contained the same primary sequence as that of the purified enzyme established by Welinder [FEBS Lett. 72, 19-23 (1976)] with additional sequences at the N and C termini. All three inserts in the cDNA clones, pSK1, pSK2 and pSK3, coded for peptides of the same size (308 amino acid residues) if these are processed in the same way, and the amino acid sequences were 91-94% homologous to each other. Functional amino acids, including His40, His170, Tyr185 and Arg183 and S-S-bond-forming Cys, were conserved in the three isozymes, but a few N-glycosylation sites were not the same. Two HRP C isoenzyme genomic genes, prxC1 and prxC2, were arranged in tandem on the chromosomal DNA and each gene consisted of four exons and three introns. The positions in the exons interrupted by introns were the same in the two genes. We observed a putative promoter sequence 5' upstream and a poly(A) signal 3' downstream in both genes. The gene product of prxC1 might be processed with a signal sequence of 30 amino acid residues at the N terminus and a peptide consisting of 15 amino acid residues at the C terminus.

  12. Gene structure, transcripts and calciotropic effects of the PTH family of peptides in Xenopus and chicken

    Directory of Open Access Journals (Sweden)

    Power Deborah M

    2010-12-01

    Full Text Available Background: Parathyroid hormone (PTH) and PTH-related peptide (PTHrP) belong to a family of endocrine factors that share a highly conserved N-terminal region (amino acids 1-34) and play key roles in calcium homeostasis, bone formation and skeletal development. Recently, PTH-like peptide (PTH-L) was identified in teleost fish, raising questions about the evolution of these proteins. Although PTH and PTHrP have been intensively studied in mammals, their function in other vertebrates is poorly documented. Amphibians and birds occupy unique phylogenetic positions, the former at the transition of aquatic to terrestrial life and the latter at the transition to homeothermy. Moreover, both organisms have characteristics indicative of a complex system in calcium regulation. This study investigated PTH family evolution in vertebrates with special emphasis on Xenopus and chicken. Results: The PTH-L gene is present throughout the vertebrates with the exception of placental mammals. Gene structure of PTH and PTH-L seems to be conserved in vertebrates while PTHrP gene structure is divergent and has acquired new exons and alternative promoters. Splice variants of PTHrP and PTH-L are common in Xenopus and chicken and transcripts of the former have a widespread tissue distribution, although PTH-L is more restricted. PTH is widely expressed in fish tissue but from Xenopus to mammals becomes largely restricted to the parathyroid gland. The N-terminal (1-34) region of PTH, PTHrP and PTH-L in Xenopus and chicken share high sequence conservation and the capacity to modify calcium fluxes across epithelia, suggesting a conserved role in calcium metabolism possibly via similar receptors. Conclusions: The parathyroid hormone family contains 3 principal members, PTH, PTHrP and the recently identified PTH-L. In teleosts there are 5 genes which encode PTHrP (2), PTH (2) and PTH-L, and in tetrapods there are 3 genes (PTHrP, PTH and PTH-L); the exception is placental mammals which

  13. Fine Physical Bin Mapping of the Powdery Mildew Resistance Gene Pm21 Based on Chromosomal Structural Variations in Wheat

    Directory of Open Access Journals (Sweden)

    Shanying Zhu

    2018-02-01

    Full Text Available Pm21, derived from the wheat wild relative Dasypyrum villosum, is one of the most effective powdery mildew resistance genes and has been widely applied in wheat breeding in China. Mapping and cloning Pm21 are important for understanding its resistance mechanism. In the present study, physical mapping was performed using different genetic stocks involving structural variations of chromosome 6VS carrying Pm21. The data showed that 6VS could be divided into eight distinguishable chromosomal bins, and Pm21 was mapped to the bin FLb4–b5/b6, closely flanked by the markers 6VS-08.6 and 6VS-10.2. Comparative genomic mapping indicated that the orthologous regions of FLb4–b5/b6 carrying Pm21 were narrowed to a 117.7 kb genomic region harboring 19 genes in Brachypodium and a 37.7 kb region harboring 5 genes in rice. The result was consistent with that given by recent genetic mapping in diploid D. villosum. In conclusion, this study demonstrated that physical mapping based on chromosomal structural variations is an efficient method for locating alien genes in a wheat background.

  14. Structure-function correlation of chloroquine and analogues as transgene expression enhancers in nonviral gene delivery.

    Science.gov (United States)

    Cheng, Jianjun; Zeidan, Ryan; Mishra, Swaroop; Liu, Aijie; Pun, Suzie H; Kulkarni, Rajan P; Jensen, Gregory S; Bellocq, Nathalie C; Davis, Mark E

    2006-11-02

    To understand how chloroquine (CQ) enhances transgene expression in polycation-based, nonviral gene delivery systems, a number of CQ analogues with variations in the aliphatic amino side chain or in the aromatic ring are synthesized and investigated. Our studies indicate that the aliphatic amino moiety of CQ is essential to provide increased gene expression. Further, the enhancements are more dramatically affected by changes to the aromatic ring and are positively correlated to the strength of intercalation between DNA and the CQ analogues. Quinacrine (QC), a CQ analogue with a fused acridinyl structure that can strongly intercalate DNA, enhances transfection similarly to CQ at a concentration 10 times lower, while N(4)-(4-pyridinyl)-N(1),N(1)-diethyl-1,4-pentanediamine (CP), a CQ analogue that has a weakly intercalating pyridinyl ring, shows no effect on gene expression. Subtle changes to the 7-substituent of the chloroquine aromatic structure can also greatly affect the ability of the CQ analogues to enhance transgene expression. Transfection in the presence of N(4)-(7-trifluoromethyl-4-quinolinyl)-N(1),N(1)-diethyl-1,4-pentanediamine (CQ7a) shows expression efficiency 10 times higher than in the presence of CQ at the same concentration, while transfection in the presence of N(4)-(4-quinolinyl)-N(1),N(1)-diethyl-1,4-pentanediamine (CQ7b) does not reveal any enhancing effects on expression. Through a number of comparative studies with CQ and its analogues, we conclude that there are at least three mechanistic features of CQ that lead to the enhancement in gene expression: (i) pH buffering in endocytic vesicles, (ii) displacement of polycations from the nucleic acids in polyplexes, and (iii) alteration of the biophysical properties of the released nucleic acid.

  15. The organization structure and regulatory elements of Chlamydomonas histone genes reveal features linking plant and animal genes.

    Science.gov (United States)

    Fabry, S; Müller, K; Lindauer, A; Park, P B; Cornelius, T; Schmitt, R

    1995-09-01

    The genome of the green alga Chlamydomonas reinhardtii contains approximately 15 gene clusters of the nucleosomal (or core) histone H2A, H2B, H3 and H4 genes and at least one histone H1 gene. Seven non-allelic histone gene loci were isolated from a genomic library, physically mapped, and the nucleotide sequences of three isotypes of each core histone gene species and one linked H1 gene determined. The core histone genes are organized in clusters of H2A-H2B and H3-H4 pairs, in which each gene pair shows outwardly divergent transcription from a short (< 300 bp) intercistronic region. These intercistronic regions contain typically conserved promoter elements, namely a TATA-box and the three motifs TGGCCAG-G(G/C)-CGAG, CGTTGACC and CGGTTG. Different from the genes of higher plants, but like those of animals and the related alga Volvox, the 3' untranslated regions contain no poly A signal, but a palindromic sequence (3' palindrome) essential for mRNA processing is present. One single H1 gene was found in close linkage to a H2A-H2B pair. The H1 upstream region contains the octameric promoter element GGTTGACC (also found upstream of the core histone genes) and two specific sequence motifs that are shared only with the Volvox H1 promoters. This suggests differential transcription of the H1 and the core histone genes. The H1 gene is interrupted by two introns. Unlike Volvox H3 genes, the three sequenced H3 isoforms are intron-free. Primer-directed PCR of genomic DNA demonstrated, however, that at least 8 of the about 15 H3 genes do contain one intron at a conserved position. In synchronized C. reinhardtii cells, H4 mRNA levels (representative of all core histone mRNAs) peak during cell division, suggesting strict replication-dependent gene control. The derived peptide sequences place C. reinhardtii core histones closer to plants than to animals, except that the H2A histones are more animal-like. The peptide sequence of histone H1 is closely related to the V. carteri VH1-II

  16. Structural evolution of the 4/1 genes and proteins in non-vascular and lower vascular plants.

    Science.gov (United States)

    Morozov, Sergey Y; Milyutina, Irina A; Bobrova, Vera K; Ryazantsev, Dmitry Y; Erokhina, Tatiana N; Zavriev, Sergey K; Agranovsky, Alexey A; Solovyev, Andrey G; Troitsky, Alexey V

    2015-12-01

    The 4/1 protein of unknown function is encoded by a single-copy gene in most higher plants. The 4/1 protein of Nicotiana tabacum (Nt-4/1 protein) has been shown to be alpha-helical and predominantly expressed in conductive tissues. Here, we report the analysis of 4/1 genes and the encoded proteins of lower land plants. Sequences of a number of 4/1 genes from liverworts, lycophytes, ferns and gymnosperms were determined and analyzed together with sequences available in databases. Most of the vascular plants were found to encode Magnoliophyta-like 4/1 proteins exhibiting previously described gene structure and protein properties. Identification of the 4/1-like proteins in hornworts, liverworts and charophyte algae (sister lineage to all land plants) but not in mosses suggests that 4/1 proteins are likely important for plant development but not required for a primary metabolic function of the plant cell.

  17. The complete chloroplast genome sequence of Podocarpus lambertii: genome structure, evolutionary aspects, gene content and SSR detection.

    Directory of Open Access Journals (Sweden)

    Leila do Nascimento Vieira

    Full Text Available BACKGROUND: Podocarpus lambertii (Podocarpaceae) is a native conifer from the Brazilian Atlantic Forest Biome, which is considered one of the 25 biodiversity hotspots in the world. The advancement of next-generation sequencing technologies has enabled the rapid acquisition of whole chloroplast (cp) genome sequences at low cost. Several studies have proven the potential of cp genomes as tools to understand enigmatic and basal phylogenetic relationships at different taxonomic levels, as well as further probe the structural and functional evolution of plants. In this work, we present the complete cp genome sequence of P. lambertii. METHODOLOGY/PRINCIPAL FINDINGS: The P. lambertii cp genome is 133,734 bp in length, and similar to other sequenced cupressophytes, it lacks one of the large inverted repeat regions (IR). It contains 118 unique genes and one duplicated tRNA (trnN-GUU), which occurs as an inverted repeat sequence. The rps16 gene was not found, which was previously reported for the plastid genome of another Podocarpaceae (Nageia nagi) and Araucariaceae (Agathis dammara). Structurally, P. lambertii shows 4 inversions of a large DNA fragment ∼20,000 bp compared to the Podocarpus totara cp genome. These unexpected characteristics may be attributed to geographical distance and different adaptive needs. The P. lambertii cp genome presents a total of 28 tandem repeats and 156 SSRs, with homo- and dipolymers being the most common and tri-, tetra-, penta-, and hexapolymers occurring with less frequency. CONCLUSION: The complete cp genome sequence of P. lambertii revealed significant structural changes, even in species from the same genus. These results reinforce the apparent loss of the rps16 gene in the Podocarpaceae cp genome. In addition, several SSRs in the P. lambertii cp genome are likely intraspecific polymorphism sites, which may allow highly sensitive phylogeographic and population structure studies, as well as phylogenetic studies of species of

  18. Genome-wide analysis of Epstein-Barr virus identifies variants and genes associated with gastric carcinoma and population structure.

    Science.gov (United States)

    Yao, Youyuan; Xu, Miao; Liang, Liming; Zhang, Haojiong; Xu, Ruihua; Feng, Qisheng; Feng, Lin; Luo, Bing; Zeng, Yi-Xin

    2017-10-01

    Epstein-Barr virus is a ubiquitous virus and is associated with several human malignancies, including a significant subset of gastric carcinoma, Epstein-Barr virus-associated gastric carcinoma. Some Epstein-Barr virus-associated diseases are uniquely prevalent in populations with different geographic origins. However, the features of the disease and geographically associated Epstein-Barr virus genetic variation as well as the roles that the variation plays in carcinogenesis and evolution remain unclear. Therefore, in this study, we sequenced 95 geographically distinct Epstein-Barr virus isolates from Epstein-Barr virus-associated gastric carcinoma biopsies and saliva of healthy donors to detect variants and genes associated with gastric carcinoma and population structure from a genome-wide spectrum. We demonstrated that Epstein-Barr virus genomes reveal population structure between North China and South China. In addition, we observed population stratification between Epstein-Barr virus strains from gastric carcinoma and healthy controls, indicating that certain Epstein-Barr virus subtypes are associated with different gastric carcinoma risks. We identified that the BRLF1, BBRF3, and BBLF2/BBLF3 genes had significant associations with gastric carcinoma. LMP1 and BNLF2a genes were strongly geographically associated genes in Epstein-Barr virus. Our study provides insights into the genetic basis of oncogenic Epstein-Barr virus for gastric carcinoma, and the genetic variants associated with gastric carcinoma can serve as biomarkers for oncogenic Epstein-Barr virus.

  19. Characterization of chicken riboflavin carrier protein gene structure ...

    Indian Academy of Sciences (India)

    The chicken riboflavin carrier protein (RCP) is an estrogen induced egg yolk and white protein. Eggs from hens which have a splice mutation in RCP gene fail to hatch, indicating an absolute requirement of RCP for the transport of riboflavin to the oocyte. In order to understand the mechanism of regulation of this gene by ...

  20. Structure and chromosomal localization of the human renal kallikrein gene

    International Nuclear Information System (INIS)

    Evans, B.A.; Yun, Z.X.; Close, J.A.

    1988-01-01

    Glandular kallikreins are a family of proteases encoded by a variable number of genes in different mammalian species. In all species examined, however, one particular kallikrein is functionally conserved in its capacity to release the vasoactive peptide, Lys-bradykinin, from low molecular weight kininogen. This kallikrein is found in the kidney, pancreas, and salivary gland, showing a unique pattern of tissue-specific expression relative to other members of the family. The authors have isolated a genomic clone carrying the human renal kallikrein gene and compared the nucleotide sequence of its promoter region with those of the mouse renal kallikrein gene and another mouse kallikrein gene expressed in a distinct cell type. They find four sequence elements conserved between renal kallikrein genes from the two species. They have also shown that the human gene is localized to 19q13, a position analogous to that of the kallikrein gene family on mouse chromosome 7

  1. Enrichment of HP1a on Drosophila chromosome 4 genes creates an alternate chromatin structure critical for regulation in this heterochromatic domain.

    Directory of Open Access Journals (Sweden)

    Nicole C Riddle

    2012-09-01

    Full Text Available Chromatin environments differ greatly within a eukaryotic genome, depending on expression state, chromosomal location, and nuclear position. In genomic regions characterized by high repeat content and high gene density, chromatin structure must silence transposable elements but permit expression of embedded genes. We have investigated one such region, chromosome 4 of Drosophila melanogaster. Using chromatin-immunoprecipitation followed by microarray (ChIP-chip) analysis, we examined enrichment patterns of 20 histone modifications and 25 chromosomal proteins in S2 and BG3 cells, as well as the changes in several marks resulting from mutations in key proteins. Active genes on chromosome 4 are distinct from those in euchromatin or pericentric heterochromatin: while there is a depletion of silencing marks at the transcription start sites (TSSs), HP1a and H3K9me3, but not H3K9me2, are enriched strongly over gene bodies. Intriguingly, genes on chromosome 4 are less frequently associated with paused polymerase. However, when the chromatin is altered by depleting HP1a or POF, the RNA pol II enrichment patterns of many chromosome 4 genes shift, showing a significant decrease over gene bodies but not at TSSs, accompanied by lower expression of those genes. Chromosome 4 genes have a low incidence of TRL/GAGA factor binding sites and a low Tm downstream of the TSS, characteristics that could contribute to a low incidence of RNA polymerase pausing. Our data also indicate that EGG and POF jointly regulate H3K9 methylation and promote HP1a binding over gene bodies, while HP1a targeting and H3K9 methylation are maintained at the repeats by an independent mechanism. The HP1a-enriched, POF-associated chromatin structure over the gene bodies may represent one type of adaptation for genes embedded in repetitive DNA.

  2. Gene organization in rice revealed by full-length cDNA mapping and gene expression analysis through microarray.

    Directory of Open Access Journals (Sweden)

    Kouji Satoh

    Full Text Available Rice (Oryza sativa L.) is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA) sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE) genes, 33K annotated non-expressed (ANE) genes, and 5.5K non-annotated expressed (NAE) genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions, were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.
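
    The AE/ANE/NAE classification described above amounts to set operations on two locus lists: loci predicted in the genome annotation and loci supported by FL-cDNA clones. The sketch below assumes that reading; the locus identifiers are hypothetical placeholders, and only the rounded totals in the final check come from the abstract.

```python
def classify_loci(annotated, expressed):
    """Split loci into the three classes named in the abstract."""
    return {
        "AE": annotated & expressed,   # annotated and backed by a cDNA clone
        "ANE": annotated - expressed,  # annotated, no cDNA clone
        "NAE": expressed - annotated,  # cDNA clone, not in the annotation
    }


# Hypothetical locus identifiers, for illustration only.
annotated_loci = {"locus1", "locus2", "locus3", "locus4"}
cdna_loci = {"locus3", "locus4", "locus5"}

classes = classify_loci(annotated_loci, cdna_loci)
counts = {name: len(members) for name, members in classes.items()}
print(counts)

# Consistency check on the rounded totals reported in the abstract (thousands).
ae_k, ane_k, nae_k = 23.0, 33.0, 5.5
predicted_total_k = ae_k + ane_k   # loci predicted in the assembly
cdna_total_k = ae_k + nae_k        # loci defined by FL-cDNA mapping
print(predicted_total_k, cdna_total_k)
```

    The reported class sizes add up as expected: AE plus ANE gives the 56K predicted loci, and AE plus NAE gives the 28.5K loci defined by FL-cDNA mapping.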

  3. Localization to Chromosomes of Structural Genes for the Major Protease Inhibitors of Barley Grains

    DEFF Research Database (Denmark)

    Hejgaard, Jørn; Bjørn, S.E.; Nielsen, Gunnar Gissel

    1984-01-01

    Wheat-barley chromosome addition lines were compared by isoelectric focusing of protein extracts to identify chromosomes carrying loci for the major immunochemically distinct protease inhibitors of barley grains. Structural genes for the following inhibitors were localized: an inhibitor of both...... endogenous α-amylase 2 and subtilisin (ASI) on chromosome 2, two chymotrypsin/subtilisin inhibitors (CI-1 and CI-2) on chromosome 5 (long arm) and the major trypsin inhibitor (TI-1) on chromosome 3....

  4. Memory functions reveal structural properties of gene regulatory networks

    Science.gov (United States)

    Perez-Carrasco, Ruben

    2018-01-01

    Gene regulatory networks (GRNs) control cellular function and decision making during tissue development and homeostasis. Mathematical tools based on dynamical systems theory are often used to model these networks, but the size and complexity of these models mean that their behaviour is not always intuitive and the underlying mechanisms can be difficult to decipher. For this reason, methods that simplify and aid exploration of complex networks are necessary. To this end we develop a broadly applicable form of the Zwanzig-Mori projection. By first converting a thermodynamic state ensemble model of gene regulation into mass action reactions we derive a general method that produces a set of time evolution equations for a subset of components of a network. The influence of the rest of the network, the bulk, is captured by memory functions that describe how the subnetwork reacts to its own past state via components in the bulk. These memory functions provide probes of near-steady state dynamics, revealing information not easily accessible otherwise. We illustrate the method on a simple cross-repressive transcriptional motif to show that memory functions not only simplify the analysis of the subnetwork but also have a natural interpretation. We then apply the approach to a GRN from the vertebrate neural tube, a well characterised developmental transcriptional network composed of four interacting transcription factors. The memory functions reveal the function of specific links within the neural tube network and identify features of the regulatory structure that specifically increase the robustness of the network to initial conditions. Taken together, the study provides evidence that Zwanzig-Mori projections offer powerful and effective tools for simplifying and exploring the behaviour of GRNs. PMID:29470492
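
    To make the idea of a memory function concrete, the sketch below works through the simplest case one could assume: a linear two-variable system in which x is the subnetwork and y is the bulk. Eliminating y gives dx/dt = A·x + ∫ M(t−s)·x(s) ds + B·exp(D·t)·y(0), with memory function M(τ) = B·C·exp(D·τ). The rate constants are hypothetical, and this linear toy stands in for, rather than reproduces, the nonlinear formulation developed in the paper.

```python
import math

# Hypothetical rate constants for dx/dt = A*x + B*y, dy/dt = C*x + D*y.
A, B, C, D = -1.0, 0.5, 0.8, -2.0
X0, Y0 = 1.0, 0.0
DT, STEPS = 0.002, 1000  # forward-Euler step and number of steps (t_end = 2.0)


def memory(tau):
    """Memory function of the projected (x-only) equation: M(tau) = B*C*exp(D*tau)."""
    return B * C * math.exp(D * tau)


def simulate_full():
    """Integrate subnetwork and bulk together."""
    x, y = X0, Y0
    for _ in range(STEPS):
        x, y = x + DT * (A * x + B * y), y + DT * (C * x + D * y)
    return x


def simulate_reduced():
    """Integrate x alone; the bulk enters only through the memory integral."""
    xs = [X0]
    for n in range(STEPS):
        t = n * DT
        # Left Riemann sum for the integral of M(t - s) * x(s) over [0, t].
        history = DT * sum(memory(t - k * DT) * xs[k] for k in range(n))
        # Contribution of the bulk's initial condition (zero here since Y0 = 0).
        forcing = B * math.exp(D * t) * Y0
        xs.append(xs[n] + DT * (A * xs[n] + history + forcing))
    return xs[-1]


x_full = simulate_full()
x_reduced = simulate_reduced()
print(f"full: {x_full:.4f}  reduced: {x_reduced:.4f}")
```

    The two trajectories agree up to discretization error, which illustrates the point made in the abstract: the memory function describes how the subnetwork responds to its own past via the bulk, without tracking the bulk explicitly.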

  5. Expression regulation of design process gene in product design

    DEFF Research Database (Denmark)

    Li, Bo; Fang, Lusheng; Li, Bo

    2011-01-01

    To improve the design process efficiency, this paper proposes the principle and methodology that design process gene controls the characteristics of design process under the framework of design process reuse and optimization based on design process gene. First, the concept of design process gene...... is proposed and analyzed, as well as its three categories i.e., the operator gene, the structural gene and the regulator gene. Second, the trigger mechanism that design objectives and constraints trigger the operator gene is constructed. Third, the expression principle of structural gene is analyzed...... with the example of design management gene. Last, the regulation mode that the regulator gene regulates the expression of the structural gene is established and it is illustrated by taking the design process management gene as an example. © (2011) Trans Tech Publications....

  6. The Genetic Diversity and Structure of Linkage Disequilibrium of the MTHFR Gene in Populations of Northern Eurasia.

    Science.gov (United States)

    Trifonova, E A; Eremina, E R; Urnov, F D; Stepanov, V A

    2012-01-01

    The structure of the haplotypes and linkage disequilibrium (LD) of the methylenetetrahydrofolate reductase gene (MTHFR) in 9 population groups from Northern Eurasia and populations of the international HapMap project was investigated in the present study. The data suggest that the architecture of LD in the human genome is largely determined by the evolutionary history of populations; however, the results of phylogenetic and haplotype analyses seem to suggest that there may in fact be a common "old" mechanism for the formation of certain patterns of LD. Variability in the structure of LD and the level of diversity of MTHFR haplotypes cause a certain set of tagSNPs with an established prognostic significance for each population. In our opinion, the results obtained in the present study are of considerable interest for understanding multiple genetic phenomena: namely, the association of interpopulation differences in the patterns of LD with structures possessing a genetic susceptibility to complex diseases, and the functional significance of the pleiotropic MTHFR gene effect. Summarizing the results of this study, a conclusion can be made that the genetic variability analysis with emphasis on the structure of LD in human populations is a powerful tool that can make a significant contribution to such areas of biomedical science as human evolutionary biology, functional genomics, genetics of complex diseases, and pharmacogenomics.
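
    For readers unfamiliar with how pairwise LD is quantified, the following sketch computes the standard coefficients D and r² for two biallelic sites from phased haplotype counts. The counts are hypothetical and the function name is illustrative; the abstract does not state which LD measure or software the authors used.

```python
def ld_statistics(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, r2) for two biallelic loci from the four haplotype counts."""
    total = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / total            # frequency of the A-B haplotype
    p_A = (n_AB + n_Ab) / total    # frequency of allele A at locus 1
    p_B = (n_AB + n_aB) / total    # frequency of allele B at locus 2
    d = p_AB - p_A * p_B           # deviation from independent association
    r2 = d * d / (p_A * (1 - p_A) * p_B * (1 - p_B))
    return d, r2


# Hypothetical counts: 100 haplotypes with an excess of AB and ab.
d, r2 = ld_statistics(40, 10, 10, 40)
print(f"D = {d:.3f}, r2 = {r2:.3f}")
```

    An r² near 0 indicates nearly independent sites and an r² of 1 means one site fully predicts the other, which is why tagSNP sets chosen on LD can differ between populations.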

  7. Gene expression in chicken reveals correlation with structural genomic features and conserved patterns of transcription in the terrestrial vertebrates.

    Directory of Open Access Journals (Sweden)

    Haisheng Nie

    Full Text Available BACKGROUND: The chicken is an important agricultural and avian-model species. A survey of gene expression in a range of different tissues will provide a benchmark for understanding expression levels under normal physiological conditions in birds. With expression data for birds being very scant, this benchmark is of particular interest for comparative expression analysis among various terrestrial vertebrates. METHODOLOGY/PRINCIPAL FINDINGS: We carried out a gene expression survey in eight major chicken tissues using whole genome microarrays. A global picture of gene expression is presented for the eight tissues, and tissue-specific as well as common gene expression were identified. A Gene Ontology (GO) term enrichment analysis showed that tissue-specific genes are enriched with GO terms reflecting the physiological functions of the specific tissue, and housekeeping genes are enriched with GO terms related to essential biological functions. Comparisons of structural genomic features between tissue-specific genes and housekeeping genes show that housekeeping genes are more compact. Specifically, coding sequences and particularly introns are shorter than in genes that display more variation in expression between tissues, and intergenic space is also shorter. Meanwhile, housekeeping genes are more likely to co-localize with other abundantly or highly expressed genes on the same chromosomal regions. Furthermore, comparisons of gene expression in a panel of five common tissues between birds, mammals and amphibians showed that the expression patterns across tissues are highly similar for orthologous genes compared to random gene pairs within each pair-wise comparison, indicating a high degree of functional conservation in gene expression among terrestrial vertebrates. CONCLUSIONS: The housekeeping genes identified in this study have shorter gene length, shorter coding sequence length, shorter introns, and shorter intergenic regions, there seems

  8. Genomic survey, gene expression analysis and structural modeling suggest diverse roles of DNA methyltransferases in legumes.

    Directory of Open Access Journals (Sweden)

    Rohini Garg

    Full Text Available DNA methylation plays a crucial role in development through inheritable gene silencing. Plants possess three types of DNA methyltransferases (MTases), namely Methyltransferase (MET), Chromomethylase (CMT) and Domains Rearranged Methyltransferase (DRM), which maintain methylation at CG, CHG and CHH sites. DNA MTases have not been studied in legumes so far. Here, we report the identification and analysis of putative DNA MTases in five legumes, including chickpea, soybean, pigeonpea, Medicago and Lotus. MTases in legumes could be classified into the known MET, CMT, DRM and DNA nucleotide methyltransferase (DNMT2) subfamilies based on their domain organization. The first three represent DNA MTases, whereas DNMT2 represents a transfer RNA (tRNA) MTase. A structural comparison of all the MTases in plants with known MTases in mammalian and plant systems has been reported to assign structural features in the context of the biological functions of these proteins. The structure analysis clearly specified regions crucial for protein-protein interactions and regions important for nucleosome binding in various domains of CMT and MET proteins. In addition, the structural model of DRM suggested that circular permutation of motifs does not have any effect on the overall structure of the DNA methyltransferase domain. These results provide valuable insights into the role of various domains in molecular recognition and should facilitate mechanistic understanding of their function in mediating specific methylation patterns. Further, the comprehensive gene expression analyses of MTases in legumes provided evidence of their role in various developmental processes throughout the plant life cycle and in the response to various abiotic stresses. Overall, our study will be very helpful in establishing the specific functions of DNA MTases in legumes.

  9. Structure, tissue distribution, and chromosomal localization of the prepronociceptin gene.

    Science.gov (United States)

    Mollereau, C; Simons, M J; Soularue, P; Liners, F; Vassart, G; Meunier, J C; Parmentier, M

    1996-08-06

    Nociceptin (orphanin FQ), the newly discovered natural agonist of the opioid receptor-like (ORL1) receptor, is a neuropeptide that is endowed with pronociceptive activity in vivo. Nociceptin is derived from a larger precursor, prepronociceptin (PPNOC), whose human, mouse, and rat genes we have now isolated. The PPNOC gene is highly conserved in the three species and displays organizational features that are strikingly similar to those of the genes of preproenkephalin, preprodynorphin, and preproopiomelanocortin, the precursors to endogenous opioid peptides, suggesting the four genes belong to the same family, i.e., have a common evolutionary origin. The PPNOC gene encodes a single copy of nociceptin as well as of other peptides whose sequence is strictly conserved across murine and human species; hence it is likely to be neurophysiologically significant. Northern blot analysis shows that the PPNOC gene is predominantly transcribed in the central nervous system (brain and spinal cord) and, albeit weakly, in the ovary, the sole peripheral organ expressing the gene. By using a radiation hybrid cell line panel, the PPNOC gene was mapped to the short arm of human chromosome 8 (8p21), between sequence-tagged site markers WI-5833 and WI-1172, in close proximity to the locus encoding the neurofilament light chain NEFL. Analysis of yeast artificial chromosome clones belonging to the WC8.4 contig covering the 8p21 region did not detect the gene on these yeast artificial chromosomes, suggesting a gap in the coverage within this contig.

  10. Novel Nucleotide Variations, Haplotypes Structure and Associations with Growth Related Traits of Goat AT Motif-Binding Factor (ATBF1) Gene

    Directory of Open Access Journals (Sweden)

    Xiaoyan Zhang

    2015-10-01

    Full Text Available The AT motif-binding factor (ATBF1) not only interacts with protein inhibitor of activated signal transducer and activator of transcription 3 (STAT3) (PIAS3) to suppress STAT3 signaling regulating early embryo development and cell differentiation, but is also required for early activation of the pituitary specific transcription factor 1 (Pit1) gene (also known as POU1F1), critically affecting mammalian growth and development. The goal of this study was to detect novel nucleotide variations and the haplotype structure of the ATBF1 gene, as well as to test their associations with growth-related traits in goats. Herein, a total of seven novel single nucleotide polymorphisms (SNPs) (SNP 1-7) within this gene were found in two well-known Chinese native goat breeds. Haplotype structure analysis demonstrated that there were four haplotypes in Hainan black goat and seventeen haplotypes in Xinong Saanen dairy goat, and the two breeds shared only one haplotype (hap1). Association testing revealed that the SNP2, SNP5, SNP6, and SNP7 loci were significantly associated with growth-related traits in goats. Moreover, one diplotype in Xinong Saanen dairy goats was significantly linked to growth-related traits. These preliminary findings not only would extend the spectrum of genetic variations of the goat ATBF1 gene, but also would contribute to implementing marker-assisted selection in genetics and breeding in goats.

  11. Learning gene regulatory networks from gene expression data using weighted consensus

    KAUST Repository

    Fujii, Chisato; Kuwahara, Hiroyuki; Yu, Ge; Guo, Lili; Gao, Xin

    2016-01-01

    An accurate determination of the network structure of gene regulatory systems from high-throughput gene expression data is an essential yet challenging step in studying how the expression of endogenous genes is controlled through a complex interaction of gene products and DNA. While numerous methods have been proposed to infer the structure of gene regulatory networks, none of them seem to work consistently over different data sets with high accuracy. A recent study to compare gene network inference methods showed that an average-ranking-based consensus method consistently performs well under various settings. Here, we propose a linear programming-based consensus method for the inference of gene regulatory networks. Unlike the average-ranking-based one, which treats the contribution of each individual method equally, our new consensus method assigns a weight to each method based on its credibility. As a case study, we applied the proposed consensus method on synthetic and real microarray data sets, and compared its performance to that of the average-ranking-based consensus and individual inference methods. Our results show that our weighted consensus method achieves superior performance over the unweighted one, suggesting that assigning weights to different individual methods rather than giving them equal weights improves the accuracy. © 2016 Elsevier B.V.
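
The contrast between the average-ranking consensus and a weighted consensus can be sketched as follows. This is a minimal illustration only: the abstract does not give the linear program that derives the weights, so here the weights are simply supplied by the caller, and the names `ranks` and `consensus` are hypothetical.

```python
from typing import Dict, List, Tuple

Edge = Tuple[str, str]  # (regulator, target)


def ranks(scores: Dict[Edge, float]) -> Dict[Edge, int]:
    """Rank edges by descending confidence score (1 = most confident)."""
    ordered = sorted(scores, key=lambda e: (-scores[e], e))
    return {edge: i + 1 for i, edge in enumerate(ordered)}


def consensus(methods: List[Dict[Edge, float]], weights: List[float]) -> List[Edge]:
    """Order edges by weighted mean rank across inference methods.

    Equal weights reproduce an average-ranking consensus. The weights are
    an assumed input here; the paper obtains them by linear programming.
    """
    total = sum(weights)
    per_method = [ranks(m) for m in methods]
    edges = list(per_method[0])
    mean_rank = {
        e: sum(w * r[e] for w, r in zip(weights, per_method)) / total
        for e in edges
    }
    # Ties are broken alphabetically so the output is deterministic.
    return sorted(mean_rank, key=lambda e: (mean_rank[e], e))


# Two hypothetical inference methods scoring the same three candidate edges.
method_a = {("A", "B"): 0.9, ("B", "C"): 0.5, ("A", "C"): 0.1}
method_b = {("A", "B"): 0.2, ("B", "C"): 0.3, ("A", "C"): 0.8}

unweighted = consensus([method_a, method_b], [1.0, 1.0])
weighted = consensus([method_a, method_b], [3.0, 1.0])
```

With equal weights the two methods cancel each other out and every edge gets the same mean rank; trusting `method_a` three times as much moves its top edge to the front.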

  12. Learning gene regulatory networks from gene expression data using weighted consensus

    KAUST Repository

    Fujii, Chisato

    2016-08-25

    An accurate determination of the network structure of gene regulatory systems from high-throughput gene expression data is an essential yet challenging step in studying how the expression of endogenous genes is controlled through a complex interaction of gene products and DNA. While numerous methods have been proposed to infer the structure of gene regulatory networks, none of them seem to work consistently over different data sets with high accuracy. A recent study to compare gene network inference methods showed that an average-ranking-based consensus method consistently performs well under various settings. Here, we propose a linear programming-based consensus method for the inference of gene regulatory networks. Unlike the average-ranking-based one, which treats the contribution of each individual method equally, our new consensus method assigns a weight to each method based on its credibility. As a case study, we applied the proposed consensus method on synthetic and real microarray data sets, and compared its performance to that of the average-ranking-based consensus and individual inference methods. Our results show that our weighted consensus method achieves superior performance over the unweighted one, suggesting that assigning weights to different individual methods rather than giving them equal weights improves the accuracy. © 2016 Elsevier B.V.

  13. LINE FUSION GENES: a database of LINE expression in human genes

    Directory of Open Access Journals (Sweden)

    Park Hong-Seog

    2006-06-01

    Full Text Available Abstract Background Long Interspersed Nuclear Elements (LINEs) are the most abundant retrotransposons in humans. About 79% of human genes are estimated to contain at least one segment of LINE per transcription unit. Recent studies have shown that LINE elements can affect protein sequences, splicing patterns and expression of human genes. Description We have developed a database, LINE FUSION GENES, for elucidating LINE expression throughout the human gene database. We searched the 28,171 genes listed in the NCBI database for LINE elements and analyzed their structures and expression patterns. The results show that the mRNA sequences of 1,329 genes were affected by LINE expression. The LINE expression types were classified on the basis of LINEs in the 5' UTR, exon or 3' UTR sequences of the mRNAs. Our database provides further information, such as the tissue distribution and chromosomal location of the genes, and the domain structure that is changed by LINE integration. We have linked all the accession numbers to the NCBI data bank to provide mRNA sequences for subsequent users. Conclusion We believe that our work will interest genome scientists and might help them to gain insight into the implications of LINE expression for human evolution and disease. Availability http://www.primate.or.kr/line

  14. Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

    Science.gov (United States)

    Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

    2012-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provides guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.
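
Why combining effects without cancellation matters can be shown with a toy calculation. The abstract does not state the exact statistic, so the sum of squared per-SNP scores below is an assumption chosen for illustration; it also ignores LD, which the authors explicitly account for. The function names and the gene labels are hypothetical.

```python
from typing import Dict, List


def gene_statistic(snp_scores: List[float]) -> float:
    """Combine per-SNP scores into one gene-level value by summing squares.

    Squaring is an assumed choice: it keeps opposite-signed effects from
    cancelling, which a plain sum would allow.
    """
    return sum(z * z for z in snp_scores)


def gene_set_statistic(genes: Dict[str, List[float]]) -> float:
    """Combine gene-level values into one gene-set value."""
    return sum(gene_statistic(scores) for scores in genes.values())


# Hypothetical gene with one risk-increasing and one protective SNP.
scores = [2.0, -2.0]
naive_sum = sum(scores)            # signed effects cancel
quadratic = gene_statistic(scores)  # both effects still contribute

toy_set = {"geneX": [2.0, -2.0], "geneY": [1.0]}
set_value = gene_set_statistic(toy_set)
```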

  15. Mechanisms of Action and Cell Death Associated with Clostridium perfringens Toxins

    Directory of Open Access Journals (Sweden)

    Mauricio A. Navarro

    2018-05-01

    Full Text Available Clostridium perfringens uses its large arsenal of protein toxins to produce histotoxic, neurologic and intestinal infections in humans and animals. The major toxins involved in diseases are alpha (CPA), beta (CPB), epsilon (ETX), iota (ITX), enterotoxin (CPE), and necrotic B-like (NetB) toxins. CPA is the main virulence factor involved in gas gangrene in humans, whereas its role in animal diseases is limited and controversial. CPB is responsible for necrotizing enteritis and enterotoxemia, mostly in neonatal individuals of many animal species, including humans. ETX is the main toxin involved in enterotoxemia of sheep and goats. ITX has been implicated in cases of enteritis in rabbits and other animal species; however, its specific role in causing disease has not been proved. CPE is responsible for human food-poisoning and non-foodborne C. perfringens-mediated diarrhea. NetB is the cause of necrotic enteritis in chickens. In most cases, host–toxin interaction starts on the plasma membrane of target cells via specific receptors, resulting in the activation of intracellular pathways with a variety of effects, commonly including cell death. In general, the molecular mechanisms of cell death associated with C. perfringens toxins involve features of apoptosis, necrosis and/or necroptosis.

  16. Methanogenesis and methane genes

    International Nuclear Information System (INIS)

    Reeve, J.N.; Shref, B.A.

    1991-01-01

    An overview of the pathways leading to methane biosynthesis is presented. The steps investigated to date by gene cloning and DNA sequencing procedures are identified and discussed. The primary structures of component C of methyl coenzyme M reductase encoded by mcr operons in different methanogens are compared. The experimentally determined primary structures of the genes encoding F420-reducing hydrogenase (frhABG) and methyl viologen-reducing hydrogenase (mvhDGA) in Methanobacterium thermoautotrophicum strain H are compared with each other and with eubacterial hydrogenase-encoding genes. A biotechnological use for hydrogenases from hyperthermophilic archaebacteria is suggested. (author)

  17. Structure-function analysis of RBP-J-interacting and tubulin-associated (RITA) reveals regions critical for repression of Notch target genes.

    Science.gov (United States)

    Tabaja, Nassif; Yuan, Zhenyu; Oswald, Franz; Kovall, Rhett A

    2017-06-23

    The Notch pathway is a cell-to-cell signaling mechanism that is essential for tissue development and maintenance, and aberrant Notch signaling has been implicated in various cancers, congenital defects, and cardiovascular diseases. Notch signaling activates the expression of target genes, which are regulated by the transcription factor CSL (CBF1/RBP-J, Su(H), Lag-1). CSL interacts with both transcriptional corepressor and coactivator proteins, functioning as both a repressor and activator, respectively. Although Notch activation complexes are relatively well understood at the structural level, less is known about how CSL interacts with corepressors. Recently, a new RBP-J (mammalian CSL ortholog)-interacting protein termed RITA has been identified and shown to export RBP-J out of the nucleus, thereby leading to the down-regulation of Notch target gene expression. However, the molecular details of RBP-J/RITA interactions are unclear. Here, using a combination of biochemical/cellular, structural, and biophysical techniques, we demonstrate that endogenous RBP-J and RITA proteins interact in cells, map the binding regions necessary for RBP-J·RITA complex formation, and determine the X-ray structure of the RBP-J·RITA complex bound to DNA. To validate the structure and glean more insights into function, we tested structure-based RBP-J and RITA mutants with biochemical/cellular assays and isothermal titration calorimetry. Whereas our structural and biophysical studies demonstrate that RITA binds RBP-J similarly to the RAM (RBP-J-associated molecule) domain of Notch, our biochemical and cellular assays suggest that RITA interacts with additional regions in RBP-J. Taken together, these results provide molecular insights into the mechanism of RITA-mediated regulation of Notch signaling, contributing to our understanding of how CSL functions as a transcriptional repressor of Notch target genes. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  18. Primary structure of the tms and prs genes of Bacillus subtilis

    DEFF Research Database (Denmark)

    Nilsson, Dan; Hove-Jensen, Bjarne; Arnvig, Kirsten

    1989-01-01

    The nucleotide sequence was determined of a 3211 nucleotide pair EcoRI-PvuII DNA fragment containing the tms and prs genes as well as a part of the ctc gene of Bacillus subtilis. The prs gene encodes phosphoribosylpyrophosphate (PRPP) synthetase, whereas the functioning of the tms and ctc gene...... products remains to be established. The prs gene contains an open reading frame of 317 codons resulting in a subunit Mr of 34,828. An open reading frame comprising the tms gene contained 456 codons resulting in a putative translation product with an Mr of 49,554. Comparison of the deduced B. subtilis PRPP...

  19. Structure and expression of the Xenopus retinoblastoma gene.

    Science.gov (United States)

    Destrée, O H; Lam, K T; Peterson-Maduro, L J; Eizema, K; Diller, L; Gryka, M A; Frebourg, T; Shibuya, E; Friend, S H

    1992-09-01

    We have cloned a Xenopus homologue (XRb1) of the human retinoblastoma susceptibility gene. DNA sequence analysis shows that the XRb1 gene product is highly conserved in many regions. The leucine repeat motif and many of the potential cdc2 phosphorylation sites, as well as potential sites for other kinases, are retained. The region of the protein homologous to the SV40 T antigen binding site and the basic region directly C-terminal to the E1A binding site are both conserved. XRb1 gene expression at the RNA level was studied by Northern blot analysis. Transcripts of 4.2 and 10 kb are present as maternal RNA stores in the oocyte. While the 4.2-kb product is stable until at least the mid-blastula stage, the 10-kb transcript is selectively degraded. Between stages 11 and 13 the 10-kb transcript reappears and a minor product of approximately 11 kb also becomes apparent. Both the 4.2- and the 10-kb transcripts remain present until later stages of development and are also present in all adult tissues examined, although at differing levels. Antibodies raised against human p105Rb, which recognize the protein product of the XRb1 gene, pXRb1, detect the Xenopus 99-kDa protein prior to the mid-blastula stage, but at lower levels than at later stages in development.

  20. Relationship between mRNA secondary structure and sequence variability in Chloroplast genes: possible life history implications.

    Science.gov (United States)

    Krishnan, Neeraja M; Seligmann, Hervé; Rao, Basuthkar J

    2008-01-28

    Synonymous sites are freer to vary because of redundancy in the genetic code. Messenger RNA secondary structure restricts this freedom, as revealed by previous findings in mitochondrial genes that mutations at third codon position nucleotides in helices are more selected against than those in loops. This motivated us to explore the constraints imposed by mRNA secondary structure on evolutionary variability at all codon positions in general, in chloroplast systems. We found that the evolutionary variability and intrinsic secondary structure stability of these sequences share an inverse relationship. Simulations of most likely single nucleotide evolution in Psilotum nudum and Nephroselmis olivacea mRNAs indicate that helix-forming propensities of mutated mRNAs are greater than those of the natural mRNAs for short sequences and vice versa for long sequences. Moreover, helix-forming propensity estimated by the percentage of total mRNA in helices increases gradually with mRNA length, saturating beyond 1000 nucleotides. Protection levels of functionally important sites vary across plants and proteins: r-strategists minimize mutation costs in large genes; K-strategists do the opposite. mRNA length presumably predisposes shorter mRNAs to evolve under different constraints than longer mRNAs. The positive correlation between secondary structure protection and functional importance of sites suggests that some sites might be conserved due to packing-protection constraints at the nucleic acid level in addition to protein level constraints. Consequently, nucleic acid secondary structure a priori biases mutations. The converse (exposure of conserved sites) apparently occurs in a smaller number of cases, indicating a different evolutionary adaptive strategy in these plants. The differences between the protection levels of functionally important sites for r- and K-strategists reflect their respective molecular adaptive strategies. These converge with increasing domestication levels of
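
The measure "percentage of total mRNA in helices" used in the record above can be illustrated with a short sketch. It assumes the secondary structure is already available in dot-bracket notation (paired bases as `(` or `)`, unpaired as `.`), which is a common output format of folding tools but is not stated in the abstract; the function name is hypothetical.

```python
def helix_percentage(dot_bracket: str) -> float:
    """Return the percentage of nucleotides that are base-paired.

    Assumes plain dot-bracket notation: '(' and ')' mark paired
    positions (helices), '.' marks unpaired positions (loops).
    """
    if not dot_bracket:
        raise ValueError("structure string must not be empty")
    paired = sum(1 for symbol in dot_bracket if symbol in "()")
    return 100.0 * paired / len(dot_bracket)


# Toy hairpin: a 2-bp stem closing a 4-nt loop.
hairpin = "((....))"
stem_share = helix_percentage(hairpin)
```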

  1. Phocid seal leptin: tertiary structure and hydrophobic receptor binding site preservation during distinct leptin gene evolution.

    Directory of Open Access Journals (Sweden)

    John A Hammond

    Full Text Available The cytokine hormone leptin is a key signalling molecule in many pathways that control physiological functions. Although leptin demonstrates structural conservation in mammals, there is evidence of positive selection in primates, lagomorphs and chiropterans. We previously reported that the leptin genes of the grey and harbour seals (phocids) have significantly diverged from other mammals. Therefore we further investigated the diversification of leptin in phocids, other marine mammals and terrestrial taxa by sequencing the leptin genes of representative species. Phylogenetic reconstruction revealed that leptin diversification was pronounced within the phocid seals with a high dN/dS ratio of 2.8, indicating positive selection. We found significant evidence of positive selection along the branch leading to the phocids, within the phocid clade, but not over the dataset as a whole. Structural predictions indicate that the individual residues under selection are away from the leptin receptor (LEPR) binding site. Predictions of the surface electrostatic potential indicate that phocid seal leptin is notably different to other mammalian leptins, including the otariids. Cloning the grey seal leptin binding domain of LEPR confirmed that this was structurally conserved. These data, viewed in toto, support a hypothesis that phocid leptin divergence is unlikely to have arisen by random mutation. Based upon these phylogenetic and structural assessments, and considering the comparative physiology and varying life histories among species, we postulate that the unique phocid diving behaviour has produced this selection pressure. The Phocidae includes some of the deepest diving species, yet have the least modified lung structure to cope with pressure and volume changes experienced at depth. Therefore, greater surfactant production is required to facilitate rapid lung re-inflation upon surfacing, while maintaining patent airways. We suggest that this additional

  2. Structural defect linked to nonrandom mutations in the matrix gene of Biken strain subacute sclerosing panencephalitis virus defined by cDNA cloning and expression of chimeric genes

    International Nuclear Information System (INIS)

    Ayata, M.; Hirano, A.; Wong, T.C.

    1989-01-01

    Biken strain, a nonproductive measles virus-like agent isolated from a subacute sclerosing panencephalitis (SSPE) patient, contains a posttranscriptional defect affecting matrix (M) protein. A putative M protein was translated in vitro with RNA from Biken strain-infected cells. A similar protein was detected in vivo by an antiserum against a peptide synthesized from the cloned M gene of Edmonston strain measles virus. By using a novel method, full-length cDNAs of the Biken M gene were selectively cloned. The cloned Biken M gene contained an open reading frame which encoded 8 extra carboxy-terminal amino acid residues and 20 amino acid substitutions predicted to affect both the hydrophobicity and secondary structure of the gene product. The cloned gene was expressed in vitro and in vivo into a 37,500-Mr protein electrophoretically and antigenically distinct from the M protein of Edmonston strain but identical to the M protein in Biken strain-infected cells. Chimeric M proteins synthesized in vitro and in vivo showed that the mutations in the carboxy-proximal region altered the local antigenicity and those in the amino region affected the overall protein conformation. The protein expressed from the Biken M gene was unstable in vivo. Instability was attributed to multiple mutations. These results offer insights into the basis of the defect in Biken strain and pose intriguing questions about the evolutionary origins of SSPE viruses in general.

  3. The relationship between CA repeat polymorphism of the IGF-1 gene and the structure of motor skills in young athletes.

    Science.gov (United States)

    Karpowicz, Krzysztof; Krych, Katarzyna; Karpowicz, Małgorzata; Nowak, Witold; Gronek, Piotr

    2018-01-01

    The map of candidate genes that can potentially affect physical fitness becomes larger every year, and they are associated with such aspects as respiratory and cardiovascular stability; body build and composition, especially muscle mass and strength; carbohydrate and lipid metabolism; response to training; and exercise intolerance. The aim of this study was to analyze the relationship between the CA repeat polymorphism of the P1 promoter of the IGF1 gene and the structure of motor skills in two groups of young Polish athletes in 2007-2009. In this study, 350 young sportsmen representing different sports disciplines were examined (age = 15.5 ± 0.5 years), by genotyping the IGF1 gene and determining the structure of motor skills using the International Physical Fitness Test (IPFT) battery. Multiple stepwise regression was used to determine the impact of the investigated motor skills on the indicator of overall physical fitness, measured by the total IPFT score. The analysis showed some regularity related to the character of the IGF1 gene polymorphism. It can be concluded that the two groups of young male athletes practicing various sports disciplines (kinds of physical exercise) displayed similar associations between the CA repeat polymorphism of the P1 promoter of the IGF1 gene and the level of motor effects. Our results suggest that this polymorphism may be a genetic marker of the physical performance phenotype. We demonstrated that the CA repeat polymorphism of the P1 promoter of the IGF1 gene was associated with strength predispositions in the homozygous and non-carrier groups. In the heterozygous group, it was associated with speed-strength aptitudes.

  4. Structural organization of the genes for rat von Ebner's gland proteins 1 and 2 reveals their close relationship to lipocalins.

    Science.gov (United States)

    Kock, K; Ahlers, C; Schmale, H

    1994-05-01

    The rat von Ebner's gland protein 1 (VEGP 1) is a secretory protein, which is abundantly expressed in the small acinar von Ebner's salivary glands of the tongue. Based on the primary structure of this protein we have previously suggested that it is a member of the lipocalin superfamily of lipophilic-ligand carrier proteins. Although the physiological role of VEGP 1 is not clear, it might be involved in sensory or protective functions in the taste epithelium. Here, we report the purification of VEGP 1 and of a closely related secretory polypeptide, VEGP 2, the isolation of a cDNA clone encoding VEGP 2, and the isolation and structural characterization of the genes for both proteins. Protein purification by gel-filtration and anion-exchange chromatography using Mono Q revealed the presence of two different immunoreactive VEGP species. N-terminal sequence determination of peptide fragments isolated after protease Asp-N digestion allowed the identification of a new VEGP, named VEGP 2, in addition to the previously characterized VEGP 1. The complete VEGP 2 sequence was deduced from a cDNA clone isolated from a von Ebner's gland cDNA library. The VEGP 2 cDNA encodes a protein of 177 amino acids and is 94% identical to VEGP 1. DNA sequence analysis of the rat VEGP 1 and 2 genes isolated from rat genomic libraries revealed that both span about 4.5 kb and contain seven exons. The VEGP 1 and 2 genes are non-allelic distinct genes in the rat genome and probably arose by gene duplication. The high degree of nucleotide sequence identity in introns A-C (94-100%) points to a recent gene conversion event that included the 5' part of the genes. The genomic organization of the rat VEGP genes closely resembles that found in other lipocalins such as beta-lactoglobulin, mouse urinary proteins (MUPs) and prostaglandin D synthase, and therefore provides clear evidence that VEGPs belong to this superfamily of proteins.
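
Identity figures such as the 94% quoted for VEGP 2 versus VEGP 1 are typically computed over an alignment. The sketch below shows one simple convention, matches divided by alignment length for two pre-aligned, equal-length sequences; the abstract does not say which convention or alignment the authors used, and the sequences here are invented placeholders, not VEGP data.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns holding the same residue.

    Assumes both strings come from the same pairwise alignment (equal
    length, '-' for gaps). Columns where either sequence has a gap
    count as mismatches; other conventions exclude them instead.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned and non-empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Invented five-residue example with a single substitution.
identity = percent_identity("MKVLA", "MKILA")
```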

  5. Development of gene diagnosis for diabetes and cholecystis based on gene analysis of CCK-A receptor

    International Nuclear Information System (INIS)

    Kono, Akira

    1998-01-01

    The gene structures of the CCK-A type receptor (CCKAR) in human, rat and mouse were investigated to clarify whether aberration of the gene is involved in the incidence of diabetes and cholecystis. In this fiscal year, 1997, the normal structure of the gene and the accurate base sequence were analyzed using DNA fragments, from a leucocyte gene library, that bound to 32P-labelled human CCKAR cDNA. This gene contained about 2.2 x 10^5 base pairs, and the base sequence was completely determined and registered in the DNA Data Bank of Japan (D85606). In addition, the genome structures and base sequences of mouse and rat CCKAR were analyzed and registered (D85605 and D50608, respectively). The differences in the base sequence of CCKAR among the species were found in the promoter region and the intron regions, suggesting that there might be differences in splicing among species. (M.N.)

  6. Large-scale trends in the evolution of gene structures within 11 animal genomes.

    Directory of Open Access Journals (Sweden)

    Mark Yandell

    2006-03-01

    Full Text Available We have used the annotations of six animal genomes (Homo sapiens, Mus musculus, Ciona intestinalis, Drosophila melanogaster, Anopheles gambiae, and Caenorhabditis elegans) together with the sequences of five unannotated Drosophila genomes to survey changes in protein sequence and gene structure over a variety of timescales, from the less than 5 million years since the divergence of D. simulans and D. melanogaster to the more than 500 million years that have elapsed since the Cambrian explosion. To do so, we have developed a new open-source software library called CGL (for "Comparative Genomics Library"). Our results demonstrate that change in intron-exon structure is gradual, clock-like, and largely independent of coding-sequence evolution. This means that genome annotations can be used in new ways to inform, corroborate, and test conclusions drawn from comparative genomics analyses that are based upon protein and nucleotide sequence similarities.

  7. Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene

    Directory of Open Access Journals (Sweden)

    Herington Adrian C

    2008-10-01

    Full Text Available Abstract Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion, to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach, where sense, GHRL-derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding RNA (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading

  8. Ofd1, a human disease gene, regulates the length and distal structure of centrioles.

    Science.gov (United States)

    Singla, Veena; Romaguera-Ros, Miriam; Garcia-Verdugo, Jose Manuel; Reiter, Jeremy F

    2010-03-16

    Centrosomes and their component centrioles represent the principal microtubule organizing centers of animal cells. Here, we show that the gene underlying orofaciodigital syndrome 1, Ofd1, is a component of the distal centriole that controls centriole length. In the absence of Ofd1, distal regions of centrioles, but not procentrioles, elongate abnormally. These long centrioles are structurally similar to normal centrioles but contain destabilized microtubules with abnormal posttranslational modifications. Ofd1 is also important for centriole distal appendage formation and centriolar recruitment of the intraflagellar transport protein Ift88. To model OFD1 syndrome in embryonic stem cells, we replaced the Ofd1 gene with missense alleles from human OFD1 patients. Distinct disease-associated mutations cause different degrees of excessive or decreased centriole elongation, all of which are associated with diminished ciliogenesis. Our results indicate that Ofd1 acts at the distal centriole to build distal appendages, recruit Ift88, and stabilize centriolar microtubules at a defined length. Copyright 2010 Elsevier Inc. All rights reserved.

  9. Structural and quantitative characterisation of canine RAGE gene transcripts and evaluation of canine HMG genes and proteins for the establishment of therapeutic strategies

    OpenAIRE

    Sterenczak, Katharina

    2011-01-01

    Cancer is the leading cause of death in economically strong countries, and a large number of in vivo and in vitro models of human cancer have been established to date. The dog has attracted scientific interest because neoplasias seen in dogs share many characteristics with their human counterparts. The aim of this thesis was the analysis of the molecular structure and/or expression pattern of cancer-associated genes and proteins in canine neoplasias including the receptor RAGE and members of the...

  10. Genetic Structure and Gene Flows within Horses: A Genealogical Study at the French Population Scale

    OpenAIRE

    Pirault, Pauline; Danvy, Sophy; Verrier, Etienne; Leroy, Grégoire

    2013-01-01

    Since horse breeds constitute populations subject to variable and multiple outcrossing events, we analyzed the genetic structure and gene flows considering horses raised in France. We used genealogical data, with a reference population of 547,620 horses born in France between 2002 and 2011, grouped according to 55 breed origins. On average, individuals had 6.3 equivalent generations known. Considering different population levels, fixation index decreased from an overall species FIT of 1.37%...

  11. High frequency of rare copy number variants affecting functionally related genes in patients with structural brain malformations

    DEFF Research Database (Denmark)

    Kariminejad, Roxana; Lind-Thomsen, Allan; Tümer, Zeynep

    2011-01-01

    ) to investigate copy number variants (CNVs) in a cohort of 169 patients with various structural brain malformations including lissencephaly, polymicrogyria, focal cortical dysplasia, and corpus callosum agenesis. The majority of the patients had intellectual disabilities (ID) and suffered from symptomatic...... that genes involved in "axonal transport," "cation transmembrane transporter activity," and the "c-Jun N-terminal kinase (JNK) cascade" play a significant role in the etiology of brain malformations. This is to the best of our knowledge the first systematic study of CNVs in patients with structural brain...

  12. Telomere structure and maintenance gene variants and risk of five cancer types

    Science.gov (United States)

    Karami, Sara; Han, Younghun; Pande, Mala; Cheng, Iona; Rudd, James; Pierce, Brandon L.; Nutter, Ellen L.; Schumacher, Fredrick R.; Kote-Jarai, Zsofia; Lindstrom, Sara; Witte, John S.; Fang, Shenying; Han, Jiali; Kraft, Peter; Hunter, David; Song, Fengju; Hung, Rayjean J.; McKay, James; Gruber, Stephen B.; Chanock, Stephen J.; Risch, Angela; Shen, Hongbing; Haiman, Christopher A.; Boardman, Lisa; Ulrich, Cornelia M.; Casey, Graham; Peters, Ulrike; Al Olama, Ali Amin; Berchuck, Andrew; Berndt, Sonja I.; Bezieau, Stephane; Brennan, Paul; Brenner, Hermann; Brinton, Louise; Caporaso, Neil; Chan, Andrew T.; Chang-Claude, Jenny; Christiani, David C.; Cunningham, Julie M.; Easton, Douglas; Eeles, Rosalind A.; Eisen, Timothy; Gala, Manish; Gallinger, Steven J.; Gayther, Simon A.; Goode, Ellen L.; Grönberg, Henrik; Henderson, Brian E.; Houlston, Richard; Joshi, Amit D.; Küry, Sébastien; Landi, Mari T.; Le Marchand, Loic; Muir, Kenneth; Newcomb, Polly A.; Permuth-Wey, Jenny; Pharoah, Paul; Phelan, Catherine; Potter, John D.; Ramus, Susan J.; Risch, Harvey; Schildkraut, Joellen; Slattery, Martha L.; Song, Honglin; Wentzensen, Nicolas; White, Emily; Wiklund, Fredrik; Zanke, Brent W.; Sellers, Thomas A.; Zheng, Wei; Chatterjee, Nilanjan; Amos, Christopher I.; Doherty, Jennifer A.

    2016-01-01

    Telomeres cap chromosome ends, protecting them from degradation, double-strand breaks, and end-to-end fusions. Telomeres are maintained by telomerase, a reverse transcriptase encoded by TERT, and an RNA template encoded by TERC. Loci in the TERT and adjoining CLPTM1L region are associated with risk of multiple cancers. We therefore investigated associations between variants in 22 telomere structure and maintenance gene regions and colorectal, breast, prostate, ovarian, and lung cancer risk. We performed subset-based meta-analyses of 204,993 directly-measured and imputed SNPs among 61,851 cancer cases and 74,457 controls of European descent. Independent associations for SNP minor alleles were identified using sequential conditional analysis (with gene-level P-value cutoffs ≤3.08×10−5). Of the thirteen independent SNPs observed to be associated with cancer risk, novel findings were observed for seven loci. Across the TERT-CLPTM1L region, rs12655062 was associated positively with prostate cancer, and inversely with colorectal and ovarian cancers, and rs115960372 was associated positively with prostate cancer. Across the TERC region, rs75316749 was positively associated with colorectal, breast, ovarian, and lung cancers. Across the DCLRE1B region, rs974404 and rs12144215 were inversely associated with prostate and lung cancers, and colorectal, breast, and ovarian cancers, respectively. Near POT1, rs116895242 was inversely associated with colorectal, ovarian, and lung cancers, and RTEL1 rs34978822 was inversely associated with prostate and lung cancers. The complex association patterns in telomere-related genes across cancer types may provide insight into mechanisms through which telomere dysfunction in different tissues influences cancer risk. PMID:27459707

  13. Telomere structure and maintenance gene variants and risk of five cancer types.

    Science.gov (United States)

    Karami, Sara; Han, Younghun; Pande, Mala; Cheng, Iona; Rudd, James; Pierce, Brandon L; Nutter, Ellen L; Schumacher, Fredrick R; Kote-Jarai, Zsofia; Lindstrom, Sara; Witte, John S; Fang, Shenying; Han, Jiali; Kraft, Peter; Hunter, David J; Song, Fengju; Hung, Rayjean J; McKay, James; Gruber, Stephen B; Chanock, Stephen J; Risch, Angela; Shen, Hongbing; Haiman, Christopher A; Boardman, Lisa; Ulrich, Cornelia M; Casey, Graham; Peters, Ulrike; Amin Al Olama, Ali; Berchuck, Andrew; Berndt, Sonja I; Bezieau, Stephane; Brennan, Paul; Brenner, Hermann; Brinton, Louise; Caporaso, Neil; Chan, Andrew T; Chang-Claude, Jenny; Christiani, David C; Cunningham, Julie M; Easton, Douglas; Eeles, Rosalind A; Eisen, Timothy; Gala, Manish; Gallinger, Steven J; Gayther, Simon A; Goode, Ellen L; Grönberg, Henrik; Henderson, Brian E; Houlston, Richard; Joshi, Amit D; Küry, Sébastien; Landi, Mari T; Le Marchand, Loic; Muir, Kenneth; Newcomb, Polly A; Permuth-Wey, Jenny; Pharoah, Paul; Phelan, Catherine; Potter, John D; Ramus, Susan J; Risch, Harvey; Schildkraut, Joellen; Slattery, Martha L; Song, Honglin; Wentzensen, Nicolas; White, Emily; Wiklund, Fredrik; Zanke, Brent W; Sellers, Thomas A; Zheng, Wei; Chatterjee, Nilanjan; Amos, Christopher I; Doherty, Jennifer A

    2016-12-15

    Telomeres cap chromosome ends, protecting them from degradation, double-strand breaks, and end-to-end fusions. Telomeres are maintained by telomerase, a reverse transcriptase encoded by TERT, and an RNA template encoded by TERC. Loci in the TERT and adjoining CLPTM1L region are associated with risk of multiple cancers. We therefore investigated associations between variants in 22 telomere structure and maintenance gene regions and colorectal, breast, prostate, ovarian, and lung cancer risk. We performed subset-based meta-analyses of 204,993 directly-measured and imputed SNPs among 61,851 cancer cases and 74,457 controls of European descent. Independent associations for SNP minor alleles were identified using sequential conditional analysis (with gene-level p value cutoffs ≤3.08 × 10−5). Of the thirteen independent SNPs observed to be associated with cancer risk, novel findings were observed for seven loci. Across the TERT-CLPTM1L region, rs12655062 was associated positively with prostate cancer, and inversely with colorectal and ovarian cancers, and rs115960372 was associated positively with prostate cancer. Across the TERC region, rs75316749 was positively associated with colorectal, breast, ovarian, and lung cancers. Across the DCLRE1B region, rs974404 and rs12144215 were inversely associated with prostate and lung cancers, and colorectal, breast, and prostate cancers, respectively. Near POT1, rs116895242 was inversely associated with colorectal, ovarian, and lung cancers, and RTEL1 rs34978822 was inversely associated with prostate and lung cancers. The complex association patterns in telomere-related genes across cancer types may provide insight into mechanisms through which telomere dysfunction in different tissues influences cancer risk. © 2016 UICC.

  14. Genetic diversity and population structure of Lantana camara in India indicates multiple introductions and gene flow.

    Science.gov (United States)

    Ray, A; Quader, S

    2014-05-01

    Lantana camara is a highly invasive plant, which has spread over 60 countries and island groups of Asia, Africa and Australia. In India, it was introduced in the early nineteenth century, since when it has expanded and gradually established itself in almost every available ecosystem. We investigated the genetic diversity and population structure of this plant in India in order to understand its introduction, subsequent range expansion and gene flow. A total of 179 individuals were sequenced at three chloroplast loci and 218 individuals were genotyped for six nuclear microsatellites. Both chloroplasts (nine haplotypes) and microsatellites (83 alleles) showed high genetic diversity. Besides, each type of marker confirmed the presence of private polymorphism. We uncovered low to medium population structure in both markers, and found a faint signal of isolation by distance with microsatellites. Bayesian clustering analyses revealed multiple divergent genetic clusters. Taken together, these findings (i.e. high genetic diversity with private alleles and multiple genetic clusters) suggest that Lantana was introduced multiple times and gradually underwent spatial expansion with recurrent gene flow. © 2013 German Botanical Society and The Royal Botanical Society of the Netherlands.

  15. Toward a suitable structural analysis of gene delivery carrier based on polycationic carbohydrates by electron transfer dissociation tandem mass spectrometry

    International Nuclear Information System (INIS)

    Przybylski, Cédric; Benito, Juan M.; Bonnet, Véronique; Mellet, Carmen Ortiz; García Fernández, José M.

    2016-01-01

    Polycationic carbohydrates represent an attractive class of biomolecules for several applications, particularly as non-viral gene delivery vectors. In this case, establishing structure-biological activity relationships requires sensitive and accurate characterization tools to both control and achieve fine structural deciphering. Electrospray tandem mass spectrometry (ESI-MS/MS) appears to be a suitable approach to address these questions. In this study, we investigated the usefulness of electron transfer dissociation (ETD) for obtaining structural data on five polycationic carbohydrates shown to be promising gene delivery agents. Particular attention was paid to the influence of charge state, fluoranthene reaction time and supplementary activation (SA) on the production of charge-reduced species and on fragmentation yield, which varied from 2 to 62%, and to obtaining the highest diversity and intensity of fragments for each charge state and target compound. ETD fragmentation appeared to be directed mainly toward pendant groups rather than the carbohydrate cyclic scaffold, leading to partial sequencing of building blocks when amino groups are close to the carbohydrate core, but allowing complete structural deciphering of some of them, such as those including a dithioureidocysteaminyl group, which was not possible with CID only. These findings highlight the potential of ETD to guide the rational choice of suitable analytical conditions according to the nature of the polycationic gene delivery molecules. Moreover, our ETD-MS/MS approach opens the way to fine sequencing/identification of grafted groups carried on various sets of oligo-/polysaccharides in fields such as glycobiology or nanomaterials, even with unknown or questionable extraction, synthesis or modification steps. - Highlights: • The first ETD-MS/MS characterization of polycationic carbohydrate based non-viral gene delivery

  16. Toward a suitable structural analysis of gene delivery carrier based on polycationic carbohydrates by electron transfer dissociation tandem mass spectrometry

    Energy Technology Data Exchange (ETDEWEB)

    Przybylski, Cédric, E-mail: cedric.przybylski@upmc.fr [Université d’Evry-Val-d’Essonne, Laboratoire Analyse et Modélisation pour la Biologie et l’Environnement, CNRS UMR 8587, Bâtiment Maupertuis, Bld F. Mitterrand, F-91025 Evry (France); Benito, Juan M. [Instituto de Investigaciones Químicas (IIQ), CSIC−Universidad de Sevilla, Américo Vespucio 49, Isla de la Cartuja, E-41092 Sevilla (Spain); Bonnet, Véronique [Université de Picardie Jules Verne, Laboratoire de Glycochimie, des Antimicrobiens et des Agroressources, CNRS UMR 7378, 80039 Amiens (France); Mellet, Carmen Ortiz [Departamento de Química Orgánica, Facultad de Química, Universidad de Sevilla, E-41012 Sevilla (Spain); García Fernández, José M. [Instituto de Investigaciones Químicas (IIQ), CSIC−Universidad de Sevilla, Américo Vespucio 49, Isla de la Cartuja, E-41092 Sevilla (Spain)

    2016-12-15

    Polycationic carbohydrates represent an attractive class of biomolecules for several applications, particularly as non-viral gene delivery vectors. In this case, establishing structure-biological activity relationships requires sensitive and accurate characterization tools to both control and achieve fine structural deciphering. Electrospray tandem mass spectrometry (ESI-MS/MS) appears to be a suitable approach to address these questions. In this study, we investigated the usefulness of electron transfer dissociation (ETD) for obtaining structural data on five polycationic carbohydrates shown to be promising gene delivery agents. Particular attention was paid to the influence of charge state, fluoranthene reaction time and supplementary activation (SA) on the production of charge-reduced species and on fragmentation yield, which varied from 2 to 62%, and to obtaining the highest diversity and intensity of fragments for each charge state and target compound. ETD fragmentation appeared to be directed mainly toward pendant groups rather than the carbohydrate cyclic scaffold, leading to partial sequencing of building blocks when amino groups are close to the carbohydrate core, but allowing complete structural deciphering of some of them, such as those including a dithioureidocysteaminyl group, which was not possible with CID only. These findings highlight the potential of ETD to guide the rational choice of suitable analytical conditions according to the nature of the polycationic gene delivery molecules. Moreover, our ETD-MS/MS approach opens the way to fine sequencing/identification of grafted groups carried on various sets of oligo-/polysaccharides in fields such as glycobiology or nanomaterials, even with unknown or questionable extraction, synthesis or modification steps. - Highlights: • The first ETD-MS/MS characterization of polycationic carbohydrate based non-viral gene delivery

  17. Structure and function of the human metallothionein gene family: Final technical report

    International Nuclear Information System (INIS)

    Karin, M.

    1986-01-01

    The full nucleotide sequence of two additional human metallothionein (hMT) genes has been determined. These genes, hMT-IB and hMT-IF, are located within the MT-I gene cluster we described originally. The hMT-IF gene is the first hMT-I gene whose amino acid sequence is in complete agreement with the published sequence of the human MT-I proteins. Therefore it is likely to be an active gene encoding a functional protein. However, since we have just completed the sequence analysis, we have not yet characterized this gene further. The hMT-IB gene is closely linked to the hMT-IA gene, and two pseudogenes, hMT-IC and hMT-ID, separate the two. From its nucleotide sequence, hMT-IB seems to be an active gene encoding a functional protein, even though it differs in four positions from the published sequence of human MT-I proteins. This gene is expressed in a human hepatoma cell line, HepG2, and its expression is stimulated by Cd2+. Using gene fusions to the viral thymidine kinase gene, we find that hMT-IB, like the hMT-IA and hMT-IIA genes, contains a heavy-metal-responsive promoter/regulatory element within its 5' flanking region. We analyzed the level of hMT-IB mRNA in a variety of human cell lines by the S1 nuclease technique and compared it to the expression of the hMT-IIA gene. While the hMT-IIA gene was expressed in all of the cell lines analyzed, the hMT-IB gene was expressed only in liver- and kidney-derived cell lines. This suggests that the expression of the hMT-IB gene is controlled in a tissue-specific manner. 13 refs

  18. Functional and structural analysis of the DNA sequence conferring glucocorticoid inducibility to the mouse mammary tumor virus gene

    International Nuclear Information System (INIS)

    Skroch, P.

    1987-05-01

    In the first part of my thesis I show that the DNA element conferring glucocorticoid inducibility to the Mouse Mammary Tumor Virus (the hormone response element, HRE) has enhancer properties. It activates a heterologous promoter, that of the β-globin gene, independently of distance, position and orientation. These properties, however, have to be regarded in relation to the remaining regulatory elements of the activated gene, as the recombinants between the HRE and the TK gene have demonstrated. In the second part of my thesis I investigated the biological significance of certain sequence motifs of the HRE that are notable for their interaction with trans-acting factors or for their sequence homologies with other regulatory DNA elements. I was able to confirm the generally postulated modular structure of enhancers for the HRE and to relate the individual subdomains to the function of the element. (orig.) [de]

  19. Disconnect between alcohol-induced alterations in chromatin structure and gene transcription in a mouse embryonic stem cell model of exposure.

    Science.gov (United States)

    Veazey, Kylee J; Wang, Haiqing; Bedi, Yudhishtar S; Skiles, William M; Chang, Richard Cheng-An; Golding, Michael C

    2017-05-01

    Alterations to chromatin structure induced by environmental insults have become an attractive explanation for the persistence of exposure effects into subsequent life stages. However, a growing body of work examining the epigenetic impact that alcohol and other drugs of abuse exert consistently notes a disconnection between induced changes in chromatin structure and patterns of gene transcription. Thus, an important question is whether perturbations in the 'histone code' induced by prenatal exposures to alcohol implicitly subvert gene expression, or whether the hierarchy of cellular signaling networks driving development is such that they retain control over the transcriptional program. To address this question, we examined the impact of ethanol exposure in mouse embryonic stem cells cultured under 2i conditions, where the transcriptional program is rigidly enforced through the use of small molecule inhibitors. We find that ethanol-induced changes in post-translational histone modifications are dose-dependent, unique to the chromatin modification under investigation, and that the extent and direction of the change differ between the period of exposure and the recovery phase. Similar to in vivo models, we find post-translational modifications affecting histone 3 lysine 9 are the most profoundly impacted, with the signature of exposure persisting long after alcohol has been removed. These changes in chromatin structure associate with dose-dependent alterations in the levels of transcripts encoding Dnmt1, Uhrf1, Tet1, Tet2, Tet3, and Polycomb complex members Eed and Ezh2. However, in this model, ethanol-induced changes to the chromatin template do not consistently associate with changes in gene transcription, impede the process of differentiation, or affect the acquisition of monoallelic patterns of expression for the imprinted gene Igf2R. These findings question the inferred universal relevance of epigenetic changes induced by drugs of abuse and suggest that changes

  20. Characterization of Bombyx mori nucleopolyhedrovirus orf68 gene that encodes a novel structural protein of budded virus.

    Science.gov (United States)

    Iwanaga, Masashi; Kurihara, Masaaki; Kobayashi, Masahiko; Kang, WonKyung

    2002-05-25

    All lepidopteran baculovirus genomes sequenced to date encode a homolog of the Bombyx mori nucleopolyhedrovirus (BmNPV) orf68 gene, suggesting that it performs an important role in the virus life cycle. In this article we describe the characterization of BmNPV orf68 gene. Northern and Western analyses demonstrated that orf68 gene was expressed as a late gene and encoded a structural protein of budded virus (BV). Immunohistochemical analysis by confocal microscopy showed that ORF68 protein was localized mainly in the nucleus of infected cells. To examine the function of orf68 gene, we constructed orf68 deletion mutant (BmD68) and characterized it in BmN cells and larvae of B. mori. BV production was delayed in BmD68-infected cells. The larval bioassays also demonstrated that deletion of orf68 did not reduce the infectivity, but mutant virus took 70 h longer to kill the host than wild-type BmNPV. In addition, dot-blot analysis showed viral DNA accumulated more slowly in mutant infected cells. Further examination suggested that BmD68 was less efficient in entry and budding from cells, although it seemed to possess normal attachment ability. These results suggest that ORF68 is a BV-associated protein involved in secondary infection from cell-to-cell. (c) 2002 Elsevier Science (USA).

  1. Novel Structural and Functional Motifs in cellulose synthase (CesA) Genes of Bread Wheat (Triticum aestivum L.).

    Directory of Open Access Journals (Sweden)

    Simerjeet Kaur

    Full Text Available Cellulose is the primary determinant of mechanical strength in plant tissues. Late-season lodging is inversely related to the amount of cellulose in a unit length of the stem. Wheat is the most widely grown of all the crops globally, yet information on its CesA gene family is limited. We have identified 22 CesA genes from bread wheat, which include homoeologs from each of the three genomes, and named them as TaCesAXA, TaCesAXB or TaCesAXD, where X denotes the gene number and the last suffix stands for the respective genome. Sequence analyses of the CESA proteins from wheat and their orthologs from barley, maize, rice, and several dicot species (Arabidopsis, beet, cotton, poplar, potato, rose gum and soybean) revealed motifs unique to monocots (Poales) or dicots. Novel structural motifs CQIC and SVICEXWFA were identified, which distinguished the CESAs involved in the formation of primary and secondary cell wall (PCW and SCW) in all the species. We also identified several new motifs specific to monocots or dicots. The conserved motifs identified in this study possibly play functional roles specific to PCW or SCW formation. The new insights from this study advance our knowledge about the structure, function and evolution of the CesA family in plants in general and wheat in particular. This information will be useful in improving culm strength to reduce lodging or alter wall composition to improve biofuel production.

  2. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    Directory of Open Access Journals (Sweden)

    Zhimin Dai

    Full Text Available Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  3. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    Science.gov (United States)

    Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  4. Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

    Science.gov (United States)

    Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417

  5. An empirical comparison of popular structure learning algorithms with a view to gene network inference

    Czech Academy of Sciences Publication Activity Database

    Djordjilović, V.; Chiogna, M.; Vomlel, Jiří

    2017-01-01

    Vol. 88, No. 1 (2017), pp. 602-613 ISSN 0888-613X R&D Projects: GA ČR(CZ) GA16-12010S Institutional support: RVO:67985556 Keywords: Bayesian networks * Structure learning * Reverse engineering * Gene networks Subject RIV: JD - Computer Applications, Robotics OECD field: Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8) Impact factor: 2.845, year: 2016 http://library.utia.cas.cz/separaty/2017/MTR/vomlel-0477168.pdf

  6. Aspects of gene structure and functional regulation of the isozymes of Na,K-ATPase

    DEFF Research Database (Denmark)

    Jorgensen, P.L.

    2001-01-01

    genomes, the genes of four alpha-subunit and at least three beta-subunit isoforms of Na,K-ATPase are identified and two gamma-subunits are expressed in kidney. The isoforms combine in a number of Na,K-ATPase isozymes that are expressed in a tissue and cell specific manner. Models of the molecular...... mechanism of regulation of these isozymes have become more reliable due to progress in understanding the three-dimensional protein structure and conformational transitions mediating transfer of energy from the P-domain to intramembrane Na+ and K+ binding sites....

  7. Translation of the flavivirus kunjin NS3 gene in cis but not its RNA sequence or secondary structure is essential for efficient RNA packaging.

    Science.gov (United States)

    Pijlman, Gorben P; Kondratieva, Natasha; Khromykh, Alexander A

    2006-11-01

    Our previous studies using trans-complementation analysis of Kunjin virus (KUN) full-length cDNA clones harboring in-frame deletions in the NS3 gene demonstrated the inability of these defective complemented RNAs to be packaged into virus particles (W. J. Liu, P. L. Sedlak, N. Kondratieva, and A. A. Khromykh, J. Virol. 76:10766-10775). In this study we aimed to establish whether this requirement for NS3 in RNA packaging is determined by the secondary RNA structure of the NS3 gene or by the essential role of the translated NS3 gene product. Multiple silent mutations of three computer-predicted stable RNA structures in the NS3 coding region of KUN replicon RNA aimed at disrupting RNA secondary structure without affecting amino acid sequence did not affect RNA replication and packaging into virus-like particles in the packaging cell line, thus demonstrating that the predicted conserved RNA structures in the NS3 gene do not play a role in RNA replication and/or packaging. In contrast, double frameshift mutations in the NS3 coding region of full-length KUN RNA, producing scrambled NS3 protein but retaining secondary RNA structure, resulted in the loss of ability of these defective RNAs to be packaged into virus particles in complementation experiments in KUN replicon-expressing cells. Furthermore, the more robust complementation-packaging system based on established stable cell lines producing large amounts of complemented replicating NS3-deficient replicon RNAs and infection with KUN virus to provide structural proteins also failed to detect any secreted virus-like particles containing packaged NS3-deficient replicon RNAs. These results have now firmly established the requirement of KUN NS3 protein translated in cis for genome packaging into virus particles.

  8. Heterogeneic dynamics of the structures of multiple gene clusters in two pathogenetically different lines originating from the same phytoplasma.

    Science.gov (United States)

    Arashida, Ryo; Kakizawa, Shigeyuki; Hoshi, Ayaka; Ishii, Yoshiko; Jung, Hee-Young; Kagiwada, Satoshi; Yamaji, Yasuyuki; Oshima, Kenro; Namba, Shigetou

    2008-04-01

    Phytoplasmas are phloem-limited plant pathogens that are transmitted by insect vectors and are associated with diseases in hundreds of plant species. Despite their small sizes, phytoplasma genomes have repeat-rich sequences, which are due to several genes that are encoded as multiple copies. These multiple genes exist in a gene cluster, the potential mobile unit (PMU). PMUs are present at several distinct regions in the phytoplasma genome. The multicopy genes encoded by PMUs (herein named mobile unit genes [MUGs]) and similar genes elsewhere in the genome (herein named fundamental genes [FUGs]) are likely to have the same function based on their annotations. In this manuscript we show evidence that MUGs and FUGs do not cluster together within the same clade. Each MUG is in a cluster with a short branch length, suggesting that MUGs are recently diverged paralogs, whereas the origin of FUGs is different from that of MUGs. We also compared the genome structures around the lplA gene in two derivative lines of the 'Candidatus Phytoplasma asteris' OY strain, the severe-symptom line W (OY-W) and the mild-symptom line M (OY-M). The gene organizations of the nucleotide sequences upstream of the lplA genes of OY-W and OY-M were dramatically different. The tra5 insertion sequence, an element of PMUs, was found only in this region in OY-W. These results suggest that transposition of entire PMUs and PMU sections has occurred frequently in the OY phytoplasma genome. The difference in the pathogenicities of OY-W and OY-M might be caused by the duplication and transposition of PMUs, followed by genome rearrangement.

  9. Pseudomonas community structure and antagonistic potential in the rhizosphere : insights gained by combining phylogenetic and functional gene-based analyses

    NARCIS (Netherlands)

    Costa, Rodrigo; Gomes, Newton C. M.; Kroegerrecklenfort, Ellen; Opelt, Katja; Berg, Gabriele; Smalla, Kornelia

    The Pseudomonas community structure and antagonistic potential in the rhizospheres of strawberry and oilseed rape (host plants of the fungal phytopathogen Verticillium dahliae) were assessed. The use of a new PCR-DGGE system, designed to target Pseudomonas-specific gacA gene fragments in

  10. Genetic characterization of the non-structural protein-3 gene of bluetongue virus serotype-2 isolate from India

    Directory of Open Access Journals (Sweden)

    Raghavendra Sumanth Pudupakam

    2017-03-01

    Full Text Available Aim: Sequence analysis and phylogenetic studies based on non-structural protein-3 (NS3) gene are important in understanding the evolution and epidemiology of bluetongue virus (BTV). This study was aimed at characterizing the NS3 gene sequence of Indian BTV serotype-2 (BTV2) to elucidate its genetic relationship to global BTV isolates. Materials and Methods: The NS3 gene of BTV2 was amplified from infected BHK-21 cell cultures, cloned and subjected to sequence analysis. The generated NS3 gene sequence was compared with the corresponding sequences of different BTV serotypes across the world, and a phylogenetic relationship was established. Results: The NS3 gene of BTV2 showed moderate levels of variability in comparison to different BTV serotypes, with nucleotide sequence identities ranging from 81% to 98%. The region showed high sequence homology of 93-99% at amino acid level with various BTV serotypes. The PPXY/PTAP late domain motifs, glycosylation sites, hydrophobic domains, and the amino acid residues critical for virus-host interactions were conserved in NS3 protein. Phylogenetic analysis revealed that BTV isolates segregate into four topotypes and that the Indian BTV2 in subclade IA is closely related to Asian and Australian origin strains. Conclusion: Analysis of the NS3 gene indicated that Indian BTV2 isolate is closely related to strains from Asia and Australia, suggesting a common origin of infection. Although the pattern of evolution of BTV2 isolate is different from other global isolates, the deduced amino acid sequence of NS3 protein demonstrated high molecular stability.

  11. Genetic characterization of the non-structural protein-3 gene of bluetongue virus serotype-2 isolate from India.

    Science.gov (United States)

    Pudupakam, Raghavendra Sumanth; Raghunath, Shobana; Pudupakam, Meghanath; Daggupati, Sreenivasulu

    2017-03-01

    Sequence analysis and phylogenetic studies based on non-structural protein-3 (NS3) gene are important in understanding the evolution and epidemiology of bluetongue virus (BTV). This study was aimed at characterizing the NS3 gene sequence of Indian BTV serotype-2 (BTV2) to elucidate its genetic relationship to global BTV isolates. The NS3 gene of BTV2 was amplified from infected BHK-21 cell cultures, cloned and subjected to sequence analysis. The generated NS3 gene sequence was compared with the corresponding sequences of different BTV serotypes across the world, and a phylogenetic relationship was established. The NS3 gene of BTV2 showed moderate levels of variability in comparison to different BTV serotypes, with nucleotide sequence identities ranging from 81% to 98%. The region showed high sequence homology of 93-99% at amino acid level with various BTV serotypes. The PPXY/PTAP late domain motifs, glycosylation sites, hydrophobic domains, and the amino acid residues critical for virus-host interactions were conserved in NS3 protein. Phylogenetic analysis revealed that BTV isolates segregate into four topotypes and that the Indian BTV2 in subclade IA is closely related to Asian and Australian origin strains. Analysis of the NS3 gene indicated that Indian BTV2 isolate is closely related to strains from Asia and Australia, suggesting a common origin of infection. Although the pattern of evolution of BTV2 isolate is different from other global isolates, the deduced amino acid sequence of NS3 protein demonstrated high molecular stability.

  12. Structure of gene and pseudogenes of human apoferritin H

    Energy Technology Data Exchange (ETDEWEB)

    Costanzo, F; Colombo, M; Staempfli, S; Santoro, C; Marone, M; Frank, K; Delius, H; Cortese, R

    1986-01-24

    Ferritin is composed of two subunits, H and L. cDNAs coding for these proteins from human liver, lymphocytes and from the monocyte-like cell line U937 have been cloned and sequenced. Southern blot analysis on total human DNA reveals that there are many DNA segments hybridizing to the apoferritin H and L cDNA probes. In view of the tissue heterogeneity of ferritin molecules, it appeared possible that apoferritin molecules could be coded by a family of genes differentially expressed in various tissues. In this paper, the authors describe the cloning and sequencing of the gene coding for human apoferritin H. This gene has three introns; the exon sequence is identical to that of cDNAs isolated from human liver, lymphocytes, HeLa cells and endothelial cells. In addition they show that at least 15 intronless pseudogenes exist, with features suggesting that they originated by reverse transcription and insertion. On the basis of these results they conclude that only one gene is responsible for the synthesis of the majority of apoferritin H mRNA in the various tissues examined, and that probably all the other DNA segments hybridizing with apoferritin cDNA are pseudogenes.

  13. A Deconvolution Protocol for ChIP-Seq Reveals Analogous Enhancer Structures on the Mouse and Human Ribosomal RNA Genes

    Directory of Open Access Journals (Sweden)

    Jean-Clement Mars

    2018-01-01

    Full Text Available The combination of Chromatin Immunoprecipitation and Massively Parallel Sequencing, or ChIP-Seq, has greatly advanced our genome-wide understanding of chromatin and enhancer structures. However, its resolution at any given genetic locus is limited by several factors. In applying ChIP-Seq to the study of the ribosomal RNA genes, we found that a major limitation to resolution was imposed by the underlying variability in sequence coverage that very often dominates the protein–DNA interaction profiles. Here, we describe a simple numerical deconvolution approach that, in large part, corrects for this variability, and significantly improves both the resolution and quantitation of protein–DNA interaction maps deduced from ChIP-Seq data. This approach has allowed us to determine the in vivo organization of the RNA polymerase I preinitiation complexes that form at the promoters and enhancers of the mouse (Mus musculus) and human (Homo sapiens) ribosomal RNA genes, and to reveal a phased binding of the HMG-box factor UBF across the rDNA. The data identify and map a “Spacer Promoter” and associated stalled polymerase in the intergenic spacer of the human ribosomal RNA genes, and reveal a very similar enhancer structure to that found in rodents and lower vertebrates.

  14. A rapid pathway toward a superb gene delivery system: programming structural and functional diversity into a supramolecular nanoparticle library.

    Science.gov (United States)

    Wang, Hao; Liu, Kan; Chen, Kuan-Ju; Lu, Yujie; Wang, Shutao; Lin, Wei-Yu; Guo, Feng; Kamei, Ken-ichiro; Chen, Yi-Chun; Ohashi, Minori; Wang, Mingwei; Garcia, Mitch André; Zhao, Xing-Zhong; Shen, Clifton K-F; Tseng, Hsian-Rong

    2010-10-26

    Nanoparticles are regarded as promising transfection reagents for effective and safe delivery of nucleic acids into a specific type of cells or tissues providing an alternative manipulation/therapy strategy to viral gene delivery. However, the current process of searching novel delivery materials is limited due to conventional low-throughput and time-consuming multistep synthetic approaches. Additionally, conventional approaches are frequently accompanied with unpredictability and continual optimization refinements, impeding flexible generation of material diversity creating a major obstacle to achieving high transfection performance. Here we have demonstrated a rapid developmental pathway toward highly efficient gene delivery systems by leveraging the powers of a supramolecular synthetic approach and a custom-designed digital microreactor. Using the digital microreactor, broad structural/functional diversity can be programmed into a library of DNA-encapsulated supramolecular nanoparticles (DNA⊂SNPs) by systematically altering the mixing ratios of molecular building blocks and a DNA plasmid. In vitro transfection studies with DNA⊂SNPs library identified the DNA⊂SNPs with the highest gene transfection efficiency, which can be attributed to cooperative effects of structures and surface chemistry of DNA⊂SNPs. We envision such a rapid developmental pathway can be adopted for generating nanoparticle-based vectors for delivery of a variety of loads.

  15. Population structure of the malaria vector Anopheles sinensis (Diptera: Culicidae) in China: two gene pools inferred by microsatellites.

    Directory of Open Access Journals (Sweden)

    Yajun Ma

    Full Text Available BACKGROUND: Anopheles sinensis is a competent malaria vector in China. An understanding of vector population structure is important to the vector-based malaria control programs. However, adequate data on A. sinensis population genetics are not yet available. METHODOLOGY/PRINCIPAL FINDINGS: This study used 5 microsatellite loci to estimate population genetic diversity, genetic differentiation and demographic history of A. sinensis from 14 representative localities in China. All 5 microsatellite loci were highly polymorphic across populations, with high allelic richness and heterozygosity. Hardy-Weinberg disequilibrium was found in 12 populations associated with heterozygote deficits, which was likely caused by the presence of null alleles and the Wahlund effect. Bayesian clustering analysis revealed two gene pools, grouping samples into two population clusters; one includes six and the other includes eight populations. Out of 14 samples, six samples were mixed with individuals from both gene pools, indicating the coexistence of two genetic units in the areas sampled. The overall differentiation between the two genetic pools was moderate (F(ST) = 0.156). Pairwise differentiation between populations was lower within clusters (F(ST) = 0.008-0.028 in cluster I and F(ST) = 0.004-0.048 in cluster II) than between clusters (F(ST) = 0.120-0.201). A reduced gene flow (Nm = 1-1.7) was detected between clusters. No evidence of isolation by distance was detected among populations either within or between the two clusters. There are differences in effective population size (Ne = 14.3-infinite) across sampled populations. CONCLUSIONS/SIGNIFICANCE: Two genetic pools with moderate genetic differentiation were identified in the A. sinensis populations in China. The population divergence was not correlated with geographic distance or barrier in the range. Variable effective population size and other demographic effects of historical population

  16. COGNATE: comparative gene annotation characterizer.

    Science.gov (United States)

    Wilbrandt, Jeanne; Misof, Bernhard; Niehuis, Oliver

    2017-07-17

    The comparison of gene and genome structures across species has the potential to reveal major trends of genome evolution. However, such a comparative approach is currently hampered by a lack of standardization (e.g., Elliott TA, Gregory TR, Philos Trans Royal Soc B: Biol Sci 370:20140331, 2015). For example, testing the hypothesis that the total amount of coding sequences is a reliable measure of potential proteome diversity (Wang M, Kurland CG, Caetano-Anollés G, PNAS 108:11954, 2011) requires the application of standardized definitions of coding sequence and genes to create both comparable and comprehensive data sets and corresponding summary statistics. However, such standard definitions either do not exist or are not consistently applied. These circumstances call for a standard at the descriptive level using a minimum of parameters as well as an undeviating use of standardized terms, and for software that infers the required data under these strict definitions. The acquisition of a comprehensive, descriptive, and standardized set of parameters and summary statistics for genome publications and further analyses can thus greatly benefit from the availability of an easy to use standard tool. We developed a new open-source command-line tool, COGNATE (Comparative Gene Annotation Characterizer), which uses a given genome assembly and its annotation of protein-coding genes for a detailed description of the respective gene and genome structure parameters. Additionally, we revised the standard definitions of gene and genome structures and provide the definitions used by COGNATE as a working draft suggestion for further reference. Complete parameter lists and summary statistics are inferred using this set of definitions to allow down-stream analyses and to provide an overview of the genome and gene repertoire characteristics. COGNATE is written in Perl and freely available at the ZFMK homepage ( https://www.zfmk.de/en/COGNATE ) and on github ( https

  17. Differential structural status of the RNA counterpart of an undecamer quasi-palindromic DNA sequence present in LCR of human β-globin gene cluster.

    Science.gov (United States)

    Kaushik, Mahima; Kukreti, Shrikant

    2015-01-01

    Our previous work on structural polymorphism shown at a single nucleotide polymorphism (SNP) (A → G) site located on HS4 region of locus control region (LCR) of β-globin gene has established a hairpin → duplex equilibrium corresponding to A → B like DNA transition (Kaushik M, Kukreti, R., Grover, D., Brahmachari, S.K. and Kukreti S. Nucleic Acids Res. 2003; Kaushik M, Kukreti S. Nucleic Acids Res. 2006). The G-allele of A → G SNP has been shown to be significantly associated with the occurrence of β-thalassemia. Considering the significance of this 11-nt long quasi-palindromic sequence [5'-TGGGG(G/A)CCCCA; HP(G/A)11] of β-globin gene LCR, we further explored the differential behavior of the same DNA sequence with its RNA counterpart, using various biophysical and biochemical techniques. In contrast to its DNA counterpart, which exhibits an A → B structural transition and an equilibrium between duplex and hairpin forms, the studied RNA oligonucleotide sequence [5'-UGGGG(G/A)CCCCA; RHP(G/A)11] existed only in duplex form (A-conformation) and did not form a hairpin. The single residue difference from A to G led to the unusual thermal stability of the RNA structure formed by the studied sequence. Since naturally occurring mutations and various SNP sites may stabilize or destabilize the local DNA/RNA secondary structures, these structural transitions may affect gene expression by changing the protein-DNA recognition patterns.

  18. Waterborne fluoride exposure changed the structure and the expressions of steroidogenic-related genes in gonads of adult zebrafish (Danio rerio).

    Science.gov (United States)

    Li, MeiYan; Cao, Jinling; Chen, Jianjie; Song, Jie; Zhou, Bingrui; Feng, Cuiping; Wang, Jundong

    2016-02-01

    Excessive fluoride in natural water ecosystems has been demonstrated to have adverse effects on the reproductive system in humans and mammals, while the most vulnerable aquatic organisms have been ignored. In this study, the effects of waterborne fluoride on growth performance, sex steroid hormones, histological structure, and the transcriptional profiles of sex steroid related genes were examined in both female and male zebrafish exposed to fluoride concentrations of 0.79, 18.60, and 36.83 mg L(-1) for 30 and 60 d to investigate the effects of fluoride on the reproductive system and the underlying toxic mechanisms. The results showed that body weight was remarkably decreased, the structures of the ovary and testis were seriously injured, and the T and E2 levels were significantly reduced in male zebrafish. The transcriptional profiles of steroidogenic related genes displayed pronounced alterations: the expressions of pgr and cyp19a1a were significantly up-regulated, while the transcriptional levels of er, ar and hsd3β were decreased in both the ovary and testis, and hsd17β8 was down-regulated only in males. Taken together, these results demonstrated that fluoride could significantly inhibit the growth of zebrafish, and notably affect the reproductive system in zebrafish of both sexes by impairing the structure of the ovary and testis and altering steroid hormone levels and the expression of steroidogenic genes related to the synthesis of sex hormones. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Assessment of genetic diversity, population structure, and gene flow of tigers (Panthera tigris tigris) across Nepal's Terai Arc Landscape.

    Science.gov (United States)

    Thapa, Kanchan; Manandhar, Sulochana; Bista, Manisha; Shakya, Jivan; Sah, Govind; Dhakal, Maheshwar; Sharma, Netra; Llewellyn, Bronwyn; Wultsch, Claudia; Waits, Lisette P; Kelly, Marcella J; Hero, Jean-Marc; Hughes, Jane; Karmacharya, Dibesh

    2018-01-01

    With fewer than 200 tigers (Panthera tigris tigris) left in Nepal, generally confined to five protected areas across the Terai Arc Landscape, genetic studies are needed to provide crucial information on diversity and connectivity for devising an effective country-wide tiger conservation strategy. As part of the Nepal Tiger Genome Project, we studied landscape change, genetic variation, population structure, and gene flow of tigers across the Terai Arc Landscape by conducting Nepal's first comprehensive and systematic scat-based, non-invasive genetic survey. Of the 770 scat samples collected opportunistically from five protected areas and six presumed corridors, 412 were tiger (57%). Out of ten microsatellite loci, we retained eight markers that were used in identifying 78 individual tigers. We used this dataset to examine population structure, genetic variation, contemporary gene flow, and potential population bottlenecks of tigers in Nepal. We detected three genetic clusters consistent with three demographic sub-populations and found moderate levels of genetic variation (He = 0.61, AR = 3.51) and genetic differentiation (FST = 0.14) across the landscape. We detected 3-7 migrants, confirming the potential for dispersal-mediated gene flow across the landscape. We found evidence of a bottleneck signature likely caused by large-scale land-use change documented in the last two centuries in the Terai forest. Securing tiger habitat including functional forest corridors is essential to enhance gene flow across the landscape and ensure long-term tiger survival. This requires cooperation among multiple stakeholders and careful conservation planning to prevent detrimental effects of anthropogenic activities on tigers.
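
    The summary statistics quoted in this abstract (He, FST) follow from allele frequencies. The Python sketch below is illustrative only: it uses the textbook definitions He = 1 - sum(p_i^2) and FST = (HT - HS) / HT with equally weighted subpopulations, and the toy frequencies are invented. The study itself most likely used sample-size-corrected estimators from dedicated population-genetics software, which this sketch does not reproduce.

```python
from statistics import mean


def expected_heterozygosity(freqs):
    """Expected heterozygosity at one locus: He = 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in freqs)


def fst(subpop_freqs):
    """Textbook FST = (HT - HS) / HT for one locus.

    subpop_freqs: one allele-frequency list per subpopulation, all listing
    the same alleles in the same order. Subpopulations are weighted equally
    (an assumption made for illustration).
    """
    # HS: mean within-subpopulation expected heterozygosity
    hs = mean(expected_heterozygosity(f) for f in subpop_freqs)
    # HT: expected heterozygosity of the pooled (averaged) frequencies
    n_alleles = len(subpop_freqs[0])
    pooled = [mean(f[a] for f in subpop_freqs) for a in range(n_alleles)]
    ht = expected_heterozygosity(pooled)
    return (ht - hs) / ht


# Hypothetical two-allele locus in two subpopulations (made-up numbers)
toy = [[0.8, 0.2], [0.4, 0.6]]
print(round(fst(toy), 4))
```

    Identical frequencies in every subpopulation give FST = 0, and subpopulations fixed for different alleles give FST = 1, which is why intermediate values such as 0.14 are read as moderate differentiation.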

  20. SITEX 2.0: Projections of protein functional sites on eukaryotic genes. Extension with orthologous genes.

    Science.gov (United States)

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2017-04-01

    Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the mutual projection of the exon structure of genes and the primary and tertiary structures of encoded proteins remains a topical problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through the database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs expanded the possibilities of using SitEx to solve problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .

  1. Macronuclear genome structure of the ciliate Nyctotherus ovalis: Single-gene chromosomes and tiny introns

    Directory of Open Access Journals (Sweden)

    Landweber Laura F

    2008-12-01

    Full Text Available Background: Nyctotherus ovalis is a single-celled eukaryote that has hydrogen-producing mitochondria and lives in the hindgut of cockroaches. Like all members of the ciliate taxon, it has two types of nuclei, a micronucleus and a macronucleus. N. ovalis generates its macronuclear chromosomes by forming polytene chromosomes that subsequently develop into macronuclear chromosomes by DNA elimination and rearrangement. Results: We examined the structure of these gene-sized macronuclear chromosomes in N. ovalis. We determined the telomeres, subtelomeric regions, UTRs, coding regions and introns by sequencing a large set of macronuclear DNA sequences (4,242) and cDNAs (5,484) and comparing them with each other. The telomeres consist of repeats CCC(AAAACCCC)n, similar to those in spirotrichous ciliates such as Euplotes, Sterkiella (Oxytricha) and Stylonychia. Per sequenced chromosome we found evidence for either a single protein-coding gene, a single tRNA, or the complete ribosomal RNAs cluster. Hence the chromosomes appear to encode single transcripts. In the short subtelomeric regions we identified a few overrepresented motifs that could be involved in gene regulation, but there is no consensus polyadenylation site. The introns are short (21–29 nucleotides), and a significant fraction (1/3) of the tiny introns is conserved in the distantly related ciliate Paramecium tetraurelia. As has been observed in P. tetraurelia, the N. ovalis introns tend to contain in-frame stop codons or have a length that is not divisible by three. This pattern causes premature termination of mRNA translation in the event of intron retention, and potentially degradation of unspliced mRNAs by the nonsense-mediated mRNA decay pathway. Conclusion: The combination of short leaders, tiny introns and single genes leads to very minimal macronuclear chromosomes. The smallest we identified contained only 150 nucleotides.
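
    The intron pattern described in this abstract (an in-frame stop codon, or a length that is not a multiple of three) can be checked mechanically. The Python sketch below is a hypothetical illustration, not the authors' pipeline: the function names, the `offset` parameter and the example sequences are invented, and the default stop-codon set is the standard genetic code, which is an assumption because many ciliates use reassigned codes.

```python
# Standard-code stop codons; pass a different set for a reassigned code.
STANDARD_STOPS = frozenset({"TAA", "TAG", "TGA"})


def causes_frameshift(intron: str) -> bool:
    """A retained intron shifts the reading frame if its length % 3 != 0."""
    return len(intron) % 3 != 0


def has_in_frame_stop(intron: str, offset: int = 0,
                      stop_codons=STANDARD_STOPS) -> bool:
    """Scan complete codons inside the intron for a stop codon.

    offset: index (0, 1 or 2) of the first intron base that starts a full
    codon in the upstream reading frame. Codons that span the exon-intron
    boundary are ignored in this simplified sketch.
    """
    seq = intron.upper()
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return any(codon in stop_codons for codon in codons)


def retention_disrupts_translation(intron: str, offset: int = 0,
                                   stop_codons=STANDARD_STOPS) -> bool:
    """True if retaining the intron would introduce a stop or a frameshift."""
    return (causes_frameshift(intron)
            or has_in_frame_stop(intron, offset, stop_codons))


# Made-up 21- and 22-nt example introns
print(retention_disrupts_translation("GTCTTTCTTCTTCTTCTTCAG"))   # 21 nt, no in-frame stop
print(retention_disrupts_translation("GTATTTCTTCTTCTTCTTTAG"))   # 21 nt, ends in in-frame TAG
print(retention_disrupts_translation("GTCTTTCTTCTTCTTCTTTCAG"))  # 22 nt, frameshift
```

    Only an intron whose length is a multiple of three and that lacks an in-frame stop would be read through silently if retained; either feature alone is enough to flag the transcript.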

  2. Function and structure in social brain regions can link oxytocin-receptor genes with autistic social behavior.

    Science.gov (United States)

    Yamasue, Hidenori

    2013-02-01

    Difficulties in appropriate social and communicative behaviors are the most prevalent and core symptoms of autism spectrum disorders (ASDs). Although recent intensive research has focused on the neurobiological background of these difficulties, many aspects have not yet been elucidated. Recent studies have employed multimodal magnetic resonance imaging (MRI) indices as intermediate phenotypes of this behavioral phenotype to link candidate genes with the autistic social difficulty. As MRI indices, functional MRI (fMRI), structural MRI, and MR-spectroscopy have been examined in subjects with autism spectrum disorders. Among candidate genes, this mini-review focuses on oxytocin-receptor genes (OXTR), since recent studies have repeatedly reported their associations with normal variations in social cognition and behavior as well as with their extremes, autistic social dysfunction. Across a growing number of studies, the medial prefrontal cortex, hypothalamus and amygdala have repeatedly been identified by multimodal MRI as neural correlates of autistic social behavior and linked to OXTR. For further development of this research area, this mini-review integrates recent accumulating evidence about human behavioral and neural correlates of OXTR. Copyright © 2012 The Japanese Society of Child Neurology. Published by Elsevier B.V. All rights reserved.

  3. Population structure and virulence gene profiles of Streptococcus agalactiae collected from different hosts worldwide.

    Science.gov (United States)

    Morach, Marina; Stephan, Roger; Schmitt, Sarah; Ewers, Christa; Zschöck, Michael; Reyes-Velez, Julian; Gilli, Urs; Del Pilar Crespo-Ortiz, María; Crumlish, Margaret; Gunturu, Revathi; Daubenberger, Claudia A; Ip, Margaret; Regli, Walter; Johler, Sophia

    2018-03-01

    Streptococcus agalactiae is a leading cause of morbidity and mortality among neonates and causes severe infections in pregnant women and nonpregnant predisposed adults, in addition to various animal species worldwide. Still, information on the population structure of S. agalactiae and the geographical distribution of different clones is limited. Further data are urgently needed to identify particularly successful clones and obtain insights into possible routes of transmission within one host species and across species borders. We aimed to determine the population structure and virulence gene profiles of S. agalactiae strains from a diverse set of sources and geographical origins. To this end, 373 S. agalactiae isolates obtained from humans and animals from five different continents were typed by DNA microarray profiling. A total of 242 different S. agalactiae strains were identified and further analyzed. Particularly successful clonal lineages, hybridization patterns, and strains were identified that were spread across different continents and/or were present in more than one host species. In particular, several strains were detected in both humans and cattle, and several canine strains were also detected in samples from human, bovine, and porcine hosts. The findings of our study suggest that although S. agalactiae is well adapted to various hosts including humans, cattle, dogs, rodents, and fish, interspecies transmission is possible and occurs between humans and cows, dogs, and rabbits. The virulence and resistance gene profiles presented enable new insights into interspecies transmission and make a crucial contribution to the identification of suitable targets for therapeutic agents and vaccines.

  4. Personality in chimpanzees (Pan troglodytes): exploring the hierarchical structure and associations with the vasopressin V1A receptor gene.

    Directory of Open Access Journals (Sweden)

    Robert D Latzman

    Full Text Available One of the major contributions of recent personality psychology is the finding that traits are related to each other in an organized hierarchy. To date, however, researchers have yet to investigate this hierarchy in nonhuman primates. Such investigations are critical in confirming the cross-species nature of trait personality helping to illuminate personality as neurobiologically-based and evolutionarily-derived dimensions of primate disposition. Investigations of potential genetic polymorphisms associated with hierarchical models of personality among nonhuman primates represent a critical first step. The current study examined the hierarchical structure of chimpanzee personality as well as sex-specific associations with a polymorphism in the promoter region of the vasopressin V1a receptor gene (AVPR1A), a gene associated with dispositional traits, among 174 chimpanzees. Results confirmed a hierarchical structure of personality across species and, despite differences in early rearing experiences, suggest a sexually dimorphic role of AVPR1A polymorphisms on hierarchical personality profiles at a higher-order level.

  5. Personality in Chimpanzees (Pan troglodytes): Exploring the Hierarchical Structure and Associations with the Vasopressin V1A Receptor Gene

    Science.gov (United States)

    Latzman, Robert D.; Hopkins, William D.; Keebaugh, Alaine C.; Young, Larry J.

    2014-01-01

    One of the major contributions of recent personality psychology is the finding that traits are related to each other in an organized hierarchy. To date, however, researchers have yet to investigate this hierarchy in nonhuman primates. Such investigations are critical in confirming the cross-species nature of trait personality helping to illuminate personality as neurobiologically-based and evolutionarily-derived dimensions of primate disposition. Investigations of potential genetic polymorphisms associated with hierarchical models of personality among nonhuman primates represent a critical first step. The current study examined the hierarchical structure of chimpanzee personality as well as sex-specific associations with a polymorphism in the promoter region of the vasopressin V1a receptor gene (AVPR1A), a gene associated with dispositional traits, among 174 chimpanzees. Results confirmed a hierarchical structure of personality across species and, despite differences in early rearing experiences, suggest a sexually dimorphic role of AVPR1A polymorphisms on hierarchical personality profiles at a higher-order level. PMID:24752497

  6. Discovery and replication of gene influences on brain structure using LASSO regression

    Directory of Open Access Journals (Sweden)

    Omid Kohannim

    2012-08-01

    Full Text Available We implemented LASSO (least absolute shrinkage and selection operator) regression to evaluate gene effects in genome-wide association studies (GWAS) of brain images, using an MRI-derived temporal lobe volume measure from 729 subjects scanned as part of the Alzheimer’s Disease Neuroimaging Initiative (ADNI). Sparse groups of SNPs in individual genes were selected by LASSO, which identifies efficient sets of variants influencing the data. These SNPs were considered jointly when assessing their association with neuroimaging measures. We discovered 22 genes that passed genome-wide significance for influencing temporal lobe volume. This was a substantially greater number of significant genes compared to those found with standard, univariate GWAS. These top genes are all expressed in the brain and include genes previously related to brain function or neuropsychiatric disorders such as MACROD2, SORCS2, GRIN2B, MAGI2, NPAS3, CLSTN2, GABRG3, NRXN3, PRKAG2, GAS7, RBFOX1, ADARB2, CHD4 and CDH13. The top genes we identified with this method also displayed significant and widespread post-hoc effects on voxelwise, tensor-based morphometry (TBM) maps of the temporal lobes. The most significantly associated gene was an autism susceptibility gene known as MACROD2. We were able to successfully replicate the effect of the MACROD2 gene in an independent cohort of 564 young, Australian healthy adult twins and siblings scanned with MRI (mean age: 23.8±2.2 SD years). In exploratory analyses, three selected SNPs in the MACROD2 gene were also significantly associated with performance intelligence quotient (PIQ). Our approach powerfully complements univariate techniques in detecting influences of genes on the living brain.
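
    To illustrate how LASSO yields the sparse SNP sets mentioned in this abstract, the Python sketch below fits the objective (1/2n)·||y - Xb||² + λ·||b||₁ by cyclic coordinate descent with soft-thresholding. It is a generic textbook implementation under stated assumptions, not the authors' pipeline: the function names, the penalty value and the tiny centered toy predictors are invented, and no intercept is fitted.

```python
def soft_threshold(rho, lam):
    """Shrink rho toward zero by lam; values within [-lam, lam] become 0."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0


def lasso_coordinate_descent(X, y, lam, n_iter=100):
    """Minimise (1/2n)*||y - Xb||^2 + lam*||b||_1 by cyclic coordinate descent.

    X: list of rows (each a list of predictor values, assumed centered),
    y: list of responses (assumed centered), lam: L1 penalty strength.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0  # covariance of predictor j with the partial residual
            z = 0.0    # mean squared value of predictor j
            for i in range(n):
                pred_without_j = sum(X[i][k] * beta[k]
                                     for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - pred_without_j)
                z += X[i][j] ** 2
            rho /= n
            z /= n
            beta[j] = soft_threshold(rho, lam) / z if z > 0 else 0.0
    return beta


# Toy data: three orthogonal, centered predictors standing in for SNPs.
# The response depends strongly on the first and only weakly on the second.
X = [[1, 1, 1], [1, -1, -1], [-1, 1, -1], [-1, -1, 1]]
y = [2.1, 1.9, -1.9, -2.1]
print(lasso_coordinate_descent(X, y, lam=0.5))
```

    With this penalty the weak second predictor and the irrelevant third one are set exactly to zero while the strong first predictor is kept with a shrunken coefficient, which is the selection behaviour that makes LASSO useful for picking a few variants per gene.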

  7. Planting increases the abundance and structure complexity of soil core functional genes relevant to carbon and nitrogen cycling.

    Science.gov (United States)

    Wang, Feng; Liang, Yuting; Jiang, Yuji; Yang, Yunfeng; Xue, Kai; Xiong, Jinbo; Zhou, Jizhong; Sun, Bo

    2015-09-23

    Plants have an important impact on soil microbial communities and their functions. However, how plants determine the microbial composition and network interactions is still poorly understood. During a four-year field experiment, we investigated the functional gene composition of three types of soils (Phaeozem, Cambisols and Acrisol) under maize planting and bare fallow regimes located in cold temperate, warm temperate and subtropical regions, respectively. The core genes were identified using high-throughput functional gene microarray (GeoChip 3.0), and functional molecular ecological networks (fMENs) were subsequently developed with the random matrix theory (RMT)-based conceptual framework. Our results demonstrated that planting significantly (P soils and 83.5% of microbial alpha-diversity can be explained by the plant factor. Moreover, planting had significant impacts on the microbial community structure and the network interactions of the microbial communities. The calculated network complexity was higher under maize planting than under bare fallow regimes. The increase of the functional genes led to an increase in both soil respiration and nitrification potential with maize planting, indicating that changes in the soil microbial communities and network interactions influenced ecological functioning.

  8. Conserved intron positions in FGFR genes reflect the modular structure of FGFR and reveal stepwise addition of domains to an already complex ancestral FGFR.

    Science.gov (United States)

    Rebscher, Nicole; Deichmann, Christina; Sudhop, Stefanie; Fritzenwanker, Jens Holger; Green, Stephen; Hassel, Monika

    2009-10-01

    We have analyzed the evolution of fibroblast growth factor receptor (FGFR) tyrosine kinase genes throughout a wide range of animal phyla. No evidence for an FGFR gene was found in Porifera, but we tentatively identified an FGFR gene in the placozoan Trichoplax adhaerens. The gene encodes a protein with three immunoglobulin-like domains, a single-pass transmembrane, and a split tyrosine kinase domain. By superimposing intron positions of 20 FGFR genes from Placozoa, Cnidaria, Protostomia, and Deuterostomia over the respective protein domain structure, we identified ten ancestral introns and three conserved intron groups. Our analysis shows (1) that the position of ancestral introns correlates to the modular structure of FGFRs, (2) that the acidic domain very likely evolved in the last common ancestor of triploblasts, (3) that splicing of IgIII was enabled by a triploblast-specific insertion, and (4) that IgI is subject to substantial loss or duplication particularly in quickly evolving genomes. Moreover, intron positions in the catalytic domain of FGFRs map to the borders of protein subdomains highly conserved in other serine/threonine kinases. Nevertheless, these introns were introduced in metazoan receptor tyrosine kinases exclusively. Our data support the view that protein evolution dating back to the Cambrian explosion took place in such a short time window that only subtle changes in the domain structure are detectable in extant representatives of animal phyla. We propose that the first multidomain FGFR originated in the last common ancestor of Placozoa, Cnidaria, and Bilateria. Additional domains were introduced mainly in the ancestor of triploblasts and in the Ecdysozoa.

  9. Limited gene dispersal and spatial genetic structure as stabilizing factors in an ant-plant mutualism.

    Science.gov (United States)

    Malé, P-J G; Leroy, C; Humblot, P; Dejean, A; Quilichini, A; Orivel, J

    2016-12-01

    Comparative studies of the population genetics of closely associated species are necessary to properly understand the evolution of these relationships because gene flow between populations affects the partners' evolutionary potential at the local scale. As a consequence (at least for antagonistic interactions), asymmetries in the strength of the genetic structures of the partner populations can result in one partner having a co-evolutionary advantage. Here, we assess the population genetic structure of partners engaged in a species-specific and obligatory mutualism: the Neotropical ant-plant, Hirtella physophora, and its ant associate, Allomerus decemarticulatus. Although the ant cannot complete its life cycle elsewhere than on H. physophora and the plant cannot live for long without the protection provided by A. decemarticulatus, these species also have antagonistic interactions: the ants have been shown to benefit from castrating their host plant and the plant is able to retaliate against too virulent ant colonies. We found similar short dispersal distances for both partners, resulting in the local transmission of the association and, thus, inbred populations in which too virulent castrating ants face the risk of local extinction due to the absence of H. physophora offspring. On the other hand, we show that the plant populations probably experienced greater gene flow than did the ant populations, thus enhancing the evolutionary potential of the plants. We conclude that such levels of spatial structure in the partners' populations can increase the stability of the mutualistic relationship. Indeed, the local transmission of the association enables partial alignments of the partners' interests, and population connectivity allows the plant retaliation mechanisms to be locally adapted to the castration behaviour of their symbionts. Journal of Evolutionary Biology © 2016 European Society For Evolutionary Biology.

  10. Genetic structure of Bemisia tabaci Med populations from home-range countries, inferred by nuclear and cytoplasmic markers: impact on the distribution of the insecticide resistance genes.

    Science.gov (United States)

    Gauthier, Nathalie; Clouet, Cécile; Perrakis, Andreas; Kapantaidaki, Despoina; Peterschmitt, Michel; Tsagkarakou, Anastasia

    2014-10-01

    Insecticide resistance management in Bemisia tabaci is one of the main issues facing agricultural production today. An extensive survey was undertaken in five Mediterranean countries to examine the resistance status of Med B. tabaci species in its range of geographic origin and the relationship between population genetic structure and the distribution of resistance genes. The investigation combined molecular diagnostic tests, sequence and microsatellite polymorphism studies and monitoring of endosymbionts. High frequencies of pyrethroid (L925I and T929V, VGSC gene) and organophosphate (F331W, ace1 gene) resistance mutations were found in France, Spain and Greece, but not in Morocco or Tunisia. Sequence analyses of the COI gene delineated two closely related mitochondrial groups (Q1 and Q2), which were found either sympatrically (Spain) or separately (France). Only Q1 was observed in Greece, Morocco and Tunisia. Bayesian analyses based on microsatellite loci revealed three geographically delineated genetic groups (France, Spain, Morocco/Greece/Tunisia) and high levels of genetic differentiation even between neighbouring samples. Evidence was also found for hybridisation and asymmetrical gene flow between Q1 and Q2. Med B. tabaci is more diverse and structured than reported so far. On a large geographic scale, resistance is affected by population genetic structure, whereas on a local scale, agricultural practices appear to play a major role. © 2014 Society of Chemical Industry.

  11. A Regulatory Network Analysis of Orphan Genes in Arabidopsis Thaliana

    Science.gov (United States)

    Singh, Pramesh; Chen, Tianlong; Arendsee, Zebulun; Wurtele, Eve S.; Bassler, Kevin E.

    Orphan genes, which are genes unique to each particular species, have recently drawn significant attention for their potential usefulness for organismal robustness. Their origin and regulatory interaction patterns remain largely undiscovered. Recently, methods that use the context likelihood of relatedness to infer a network followed by modularity maximizing community detection algorithms on the inferred network to find the functional structure of regulatory networks were shown to be effective. We apply improved versions of these methods to gene expression data from Arabidopsis thaliana, identify groups (clusters) of interacting genes with related patterns of expression and analyze the structure within those groups. Focusing on clusters that contain orphan genes, we compare the identified clusters to gene ontology (GO) terms, regulons, and pathway designations and analyze their hierarchical structure. We predict new regulatory interactions and unravel the structure of the regulatory interaction patterns of orphan genes. Work supported by the NSF through Grants DMR-1507371 and IOS-1546858.

  12. UniGene Tabulator: a full parser for the UniGene format.

    Science.gov (United States)

    Lenzi, Luca; Frabetti, Flavia; Facchin, Federica; Casadei, Raffaella; Vitale, Lorenza; Canaider, Silvia; Carinci, Paolo; Zannotti, Maria; Strippoli, Pierluigi

    2006-10-15

    UniGene Tabulator 1.0 provides a solution for full parsing of UniGene flat file format; it implements a structured graphical representation of each data field present in UniGene following import into a common database managing system usable in a personal computer. This database includes related tables for sequence, protein similarity, sequence-tagged site (STS) and transcript map interval (TXMAP) data, plus a summary table where each record represents a UniGene cluster. UniGene Tabulator enables full local management of UniGene data, allowing parsing, querying, indexing, retrieving, exporting and analysis of UniGene data in a relational database form, usable on Macintosh (OS X 10.3.9 or later) and Windows (2000, with service pack 4, XP, with service pack 2 or later) operating systems-based computers. The current release, including both the FileMaker runtime applications, is freely available at http://apollo11.isto.unibo.it/software/
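
    The kind of parsing this record describes can be sketched in a few lines. The sketch below is not UniGene Tabulator (which is distributed as FileMaker runtime applications). It assumes a flat-file layout in which each line starts with a field tag followed by whitespace and a value, repeated tags are allowed, sub-fields are written as `KEY=value` pairs separated by semicolons, and each cluster record ends with a `//` line. The sample text uses placeholder identifiers, not real UniGene content.

```python
def parse_flat_file(text):
    """Split flat-file text into records: one dict of tag -> list of values each."""
    records, current = [], {}
    for line in text.splitlines():
        line = line.rstrip()
        if not line:
            continue
        if line == "//":  # assumed record terminator
            if current:
                records.append(current)
            current = {}
            continue
        tag, _, value = line.partition(" ")
        current.setdefault(tag, []).append(value.strip())
    if current:  # tolerate a missing final terminator
        records.append(current)
    return records


def split_subfields(value):
    """Turn 'ACC=X; SEQTYPE=Y' into {'ACC': 'X', 'SEQTYPE': 'Y'}."""
    fields = {}
    for part in value.split(";"):
        key, sep, val = part.strip().partition("=")
        if sep:
            fields[key] = val
    return fields


# Placeholder records for illustration only.
SAMPLE = """\
ID          Xx.1
TITLE       Placeholder cluster one
SEQUENCE    ACC=AA000001.1; SEQTYPE=mRNA
SEQUENCE    ACC=AA000002.1; SEQTYPE=EST
//
ID          Xx.2
TITLE       Placeholder cluster two
//
"""

records = parse_flat_file(SAMPLE)
for record in records:
    sequences = [split_subfields(v) for v in record.get("SEQUENCE", [])]
    print(record["ID"][0], record["TITLE"][0], sequences)
```

    Keeping every tag as a list of values mirrors the related-table design mentioned in the abstract: one summary row per cluster, with repeated fields such as sequences going into a separate table keyed by the cluster ID.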

  13. Structure and expression of sulfatase and sulfatase modifying factor genes in the diamondback moth, Plutella xylostella.

    Science.gov (United States)

    Ma, Xiao-Li; He, Wei-Yi; Chen, Wei; Xu, Xue-Jiao; Qi, Wei-Ping; Zou, Ming-Min; You, Yan-Chun; Baxter, Simon W; Wang, Ping; You, Min-Sheng

    2017-06-01

    The diamondback moth, Plutella xylostella (L.), uses sulfatases (SULF) to counteract the glucosinolate-myrosinase defensive system that cruciferous plants have evolved to deter insect feeding. Sulfatase activity is regulated by post-translational modification of a cysteine residue by sulfatase modifying factor 1 (SUMF1). We identified 12 SULF genes (PxylSulfs) and two SUMF1 genes (PxylSumf1s) in the P. xylostella genome. Phylogenetic analysis of SULFs and SUMFs from P. xylostella, Bombyx mori, Manduca sexta, Heliconius melpomene, Danaus plexippus, Drosophila melanogaster, Tetranychus urticae and Homo sapiens showed that the SULFs were clustered into five groups, and the SUMFs could be divided into two groups. Profiling of the expression of PxylSulfs and PxylSumfs by RNA-seq and by quantitative real-time polymerase chain reaction showed that two glucosinolate sulfatase genes (GSS), PxylSulf2 and PxylSulf3, were primarily expressed in the midgut of 3rd- and 4th-instar larvae. Moreover, expression of the sulfatases PxylSulf2, PxylSulf3 and PxylSulf4 was correlated with expression of the sulfatase modifying factor PxylSumf1a. The findings from this study provide new insights into the structure and expression of SUMF1 and PxylSulf genes that are considered to be key factors for the evolutionary success of P. xylostella as a specialist herbivore of cruciferous plants. © 2017 Institute of Zoology, Chinese Academy of Sciences.

  14. Revised genomic structure of the human ghrelin gene and identification of novel exons, alternative splice variants and natural antisense transcripts

    Directory of Open Access Journals (Sweden)

    Herington Adrian C

    2007-08-01

    Background: Ghrelin is a multifunctional peptide hormone expressed in a range of normal tissues and pathologies. It has been reported that the human ghrelin gene consists of five exons which span 5 kb of genomic DNA on chromosome 3 and includes a 20 bp non-coding first exon (20 bp exon 0). The availability of bioinformatic tools enabling comparative analysis and the finalisation of the human genome prompted us to re-examine the genomic structure of the ghrelin locus. Results: We have demonstrated the presence of an additional novel exon (exon -1) and 5' extensions to exon 0 and 1 using comparative in silico analysis and have demonstrated their existence experimentally using RT-PCR and 5' RACE. A revised exon-intron structure demonstrates that the human ghrelin gene spans 7.2 kb and consists of six rather than five exons. Several ghrelin gene-derived splice forms were detected in a range of human tissues and cell lines. We have demonstrated ghrelin gene-derived mRNA transcripts that do not code for ghrelin, but instead may encode the C-terminal region of full-length preproghrelin (C-ghrelin, which contains the coding region for obestatin) and a transcript encoding obestatin-only. Splice variants that differed in their 5' untranslated regions were also found, suggesting a role of these regions in the post-transcriptional regulation of preproghrelin translation. Finally, several natural antisense transcripts, termed ghrelinOS (ghrelin opposite strand) transcripts, were demonstrated via orientation-specific RT-PCR, 5' RACE and in silico analysis of ESTs and cloned amplicons. Conclusion: The sense and antisense alternative transcripts demonstrated in this study may function as non-coding regulatory RNA, or code for novel protein isoforms. This is the first demonstration of putative obestatin and C-ghrelin specific transcripts and these findings suggest that these ghrelin gene-derived peptides may also be produced independently of preproghrelin.

  15. Population Structure and Adaptive Divergence in a High Gene Flow Marine Fish: The Small Yellow Croaker (Larimichthys polyactis).

    Directory of Open Access Journals (Sweden)

    Bing-Jian Liu

    The spatial distribution of genetic diversity has been long considered as a key component of policy development for management and conservation of marine fishes. However, unraveling the population genetic structure of migratory fish species is challenging due to high potential for gene flow. Despite the shallow population differentiation revealed by putatively neutral loci, the higher genetic differentiation with panels of putatively adaptive loci could provide greater resolution for stock identification. Here, patterns of population differentiation of small yellow croaker (Larimichthys polyactis) were investigated by genotyping 15 highly polymorphic microsatellites in 337 individuals of 15 geographic populations collected from both spawning and overwintering grounds. Outlier analyses indicated that the locus Lpol03 might be under directional selection, which showed a strong homology with Grid2 gene encoding the glutamate receptor δ2 protein (GluRδ2). Based on Lpol03, two distinct clusters were identified by both STRUCTURE and PCoA analyses, suggesting that there were two overwintering aggregations of L. polyactis. A novel migration pattern was suggested for L. polyactis, which was inconsistent with results of previous studies based on historical fishing yield statistics. These results provided new perspectives on the population genetic structure and migratory routes of L. polyactis, which could have significant implications for sustainable management and utilization of this important fishery resource.

  16. Usher Syndrome Type III: Revised Genomic Structure of the USH3 Gene and Identification of Novel Mutations

    Science.gov (United States)

    Fields, Randall R.; Zhou, Guimei; Huang, Dali; Davis, Jack R.; Möller, Claes; Jacobson, Samuel G.; Kimberling, William J.; Sumegi, Janos

    2002-01-01

    Usher syndrome type III is an autosomal recessive disorder characterized by progressive sensorineural hearing loss, vestibular dysfunction, and retinitis pigmentosa. The disease gene was localized to 3q25 and recently was identified by positional cloning. In the present study, we have revised the structure of the USH3 gene, including a new translation start site, 5′ untranslated region, and a transcript encoding a 232–amino acid protein. The mature form of the protein is predicted to contain three transmembrane domains and 204 residues. We have found four new disease-causing mutations, including one that appears to be relatively common in the Ashkenazi Jewish population. We have also identified mouse (chromosome 3) and rat (chromosome 2) orthologues, as well as two human paralogues on chromosomes 4 and 10. PMID:12145752

  17. Sites of instability in the human TCF3 (E2A) gene adopt G-quadruplex DNA structures in vitro

    Science.gov (United States)

    Williams, Jonathan D.; Fleetwood, Sara; Berroyer, Alexandra; Kim, Nayun; Larson, Erik D.

    2015-01-01

    The formation of highly stable four-stranded DNA, called G-quadruplex (G4), promotes site-specific genome instability. G4 DNA structures fold from repetitive guanine sequences, and increasing experimental evidence connects G4 sequence motifs with specific gene rearrangements. The human transcription factor 3 (TCF3) gene (also termed E2A) is subject to genetic instability associated with severe disease, most notably a common translocation event t(1;19) associated with acute lymphoblastic leukemia. The sites of instability in TCF3 are not randomly distributed, but focused to certain sequences. We asked if G4 DNA formation could explain why TCF3 is prone to recombination and mutagenesis. Here we demonstrate that sequences surrounding the major t(1;19) break site and a region associated with copy number variations both contain G4 sequence motifs. The motifs identified readily adopt G4 DNA structures that are stable enough to interfere with DNA synthesis in physiological salt conditions in vitro. When introduced into the yeast genome, TCF3 G4 motifs promoted gross chromosomal rearrangements in a transcription-dependent manner. Our results provide a molecular rationale for the site-specific instability of human TCF3, suggesting that G4 DNA structures contribute to oncogenic DNA breaks and recombination. PMID:26029241
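
    A G4 sequence motif of the kind mentioned above is often located with a simple pattern search. The sketch below uses a commonly used heuristic (four runs of three or more guanines separated by loops of one to seven bases). The exact motif definition used in this study is not given in the abstract, so the pattern, the function name and the toy sequence are assumptions for illustration. Only the G-rich strand is scanned, and matches are non-overlapping.

```python
import re

# Heuristic G4 motif: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+ (an assumed, generic definition).
G4_PATTERN = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")


def find_g4_motifs(sequence):
    """Return (start_index, matched_text) for each non-overlapping motif hit."""
    sequence = sequence.upper()
    return [(m.start(), m.group()) for m in G4_PATTERN.finditer(sequence)]


# Toy sequence, not taken from TCF3.
hits = find_g4_motifs("TTGGGAGGGTGGGCGGGTT")
print(hits)
```

    A pattern hit only flags a candidate. Whether the sequence actually folds into a stable G4 structure has to be tested experimentally, as the authors did in vitro.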

  18. Chitosan in Non-Viral Gene Delivery: Role of Structure, Characterization Methods, and Insights in Cancer and Rare Diseases Therapies

    Directory of Open Access Journals (Sweden)

    Beatriz Santos-Carballal

    2018-04-01

    Non-viral gene delivery vectors have lagged far behind viral ones in the current pipeline of clinical trials of gene therapy nanomedicines. Even though non-viral nanovectors pose fewer safety risks than viruses do, their efficacy is much lower. Since the early studies to deliver pDNA, chitosan has been regarded as a highly attractive biopolymer to deliver nucleic acids intracellularly and induce a transgenic response resulting in either upregulation of protein expression (for pDNA, mRNA) or its downregulation (for siRNA or microRNA). This is explained as the consequence of a multi-step process involving condensation of nucleic acids, protection against degradation, stabilization in physiological conditions, cellular internalization, release from the endolysosome (“proton sponge” effect), unpacking and enabling the trafficking of pDNA to the nucleus or the siRNA to the RNA interference silencing complex (RISC). Given the multiple steps and complexity involved in the gene transfection process, there is a dearth of understanding of the role of chitosan’s structural features (Mw and degree of acetylation, DA%) on each step that dictates the net transfection efficiency and its kinetics. The use of fully characterized chitosan samples along with the utilization of complementary biophysical and biological techniques is key to bridging this gap of knowledge and identifying the optimal chitosans for delivering a specific gene. Other aspects such as cell type and administration route are also at play. At the same time, the role of chitosan structural features on the morphology, size and surface composition of synthetic virus-like particles has barely been addressed. The ongoing revolution brought about by the recent discovery of CRISPR-Cas9 technology will undoubtedly be a game changer in this field in the short term. In the field of rare diseases, gene therapy is perhaps where the greatest potential lies and we anticipate that chitosans will be key players.

  19. The drug target genes show higher evolutionary conservation than non-target genes.

    Science.gov (United States)

    Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie

    2016-01-26

    Although evidence indicates that drug target genes share some common evolutionary features, few studies have analyzed the evolutionary features of drug targets as a whole. We therefore investigated the evolutionary characteristics of drug target genes. We compared the evolutionary conservation of human drug target genes and non-target genes by combining evolutionary features with network topological properties in the human protein-protein interaction network. The evolutionary rate, conservation score and percentage of orthologous genes in 21 species were included in our study. In addition, four topological features (average shortest path length, betweenness centrality, clustering coefficient and degree) were compared. We obtained four results. Compared with non-drug target genes, (1) drug target genes had lower evolutionary rates; (2) drug target genes had higher conservation scores; (3) drug target genes had higher percentages of orthologous genes; and (4) drug target genes had a tighter network structure, with higher degrees, betweenness centrality and clustering coefficients and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in the evolutionary conservation of drug targets.
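
    Three of the four topological features named above can be computed with a few lines of code. The sketch below uses a hypothetical five-protein interaction network (nodes A to E) rather than the human protein-protein interaction network used in the study, and the function names are illustrative. Betweenness centrality is omitted to keep the example short.

```python
from collections import deque
from itertools import combinations


def build_graph(edges):
    """Undirected adjacency sets from a list of (u, v) interaction pairs."""
    graph = {}
    for u, v in edges:
        graph.setdefault(u, set()).add(v)
        graph.setdefault(v, set()).add(u)
    return graph


def degree(graph, node):
    """Number of direct interaction partners."""
    return len(graph[node])


def clustering_coefficient(graph, node):
    """Fraction of pairs of neighbours that are themselves connected."""
    neighbours = graph[node]
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(neighbours, 2) if b in graph[a])
    return 2.0 * links / (k * (k - 1))


def average_shortest_path(graph, source):
    """Mean breadth-first-search distance from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in graph[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    others = [d for node, d in dist.items() if node != source]
    return sum(others) / len(others) if others else 0.0


# Hypothetical toy network: a triangle A-B-C with a tail C-D-E.
ppi = build_graph([("A", "B"), ("A", "C"), ("B", "C"), ("C", "D"), ("D", "E")])
for protein in sorted(ppi):
    print(protein, degree(ppi, protein),
          round(clustering_coefficient(ppi, protein), 3),
          average_shortest_path(ppi, protein))
```

    In this toy network the hub C has the highest degree and the shortest average path to the other nodes, which is the pattern the abstract reports for drug target genes relative to non-target genes.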

  20. Genetic structure, mating system, and long-distance gene flow in heart of palm (Euterpe edulis Mart.).

    Science.gov (United States)

    Gaiotto, F A; Grattapaglia, D; Vencovsky, R

    2003-01-01

    We report a detailed analysis of the population genetic structure, mating system, and gene flow of heart of palm (Euterpe edulis Mart.-Arecaceae) in central Brazil. This palm is considered a keystone species because it supplies fruits for birds and rodents all year and is intensively harvested for culinary purposes. Two populations of this palm tree were examined, using 18 microsatellite loci. The species displays a predominantly outcrossed mating system (tm = 0.94), with a probability of full sibship greater than 70% within open-pollinated families. The following estimates of interpopulation genetic variation were calculated and found significant: FIT = 0.17, FIS = 0.12, FST = 0.06, and RST = 0.07. This low but significant level of interpopulation genetic variation indicates high levels of gene flow. Two adult trees were identified as likely seed parents (P > 99.9%) of juveniles located at a distance of 22 km. Gene flow over such distances has not been reported before for tropical tree species. The establishment and management of in situ genetic reserves or ex situ conservation and breeding populations for E. edulis should contemplate the collection of several hundred open-pollinated maternal families from relatively few distant populations to maximize the genetic sampling of a larger number of pollen parents.
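
    The three fixation indices reported above can be checked against each other using Wright's standard relation between F-statistics. The abstract itself does not state this relation, so the worked check below is an added illustration using only the rounded values given in the abstract.

```latex
\[
(1 - F_{IT}) = (1 - F_{IS})\,(1 - F_{ST})
\]
\[
(1 - 0.12)(1 - 0.06) = 0.88 \times 0.94 = 0.8272
\quad\Rightarrow\quad
F_{IT} \approx 1 - 0.8272 = 0.1728 \approx 0.17
\]
```

    The reported FIT = 0.17 is therefore consistent with the reported FIS and FST, with most of the total inbreeding coefficient arising within populations rather than between them.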

  1. Structure and regulated expression of bovine prolactin and bovine growth hormone genes

    International Nuclear Information System (INIS)

    Rottman, F.; Camper, S.; Goodwin, E.; Hampson, R.; Lyons, R.

    1986-01-01

    This paper presents a description of several studies which utilize the transfection of cloned chimeric genes in an attempt to analyze the regulatory signals found in the bPRL and bGH genes. Examination of the 5' flanking region of PRL genes reveals a high degree of sequence homology between the bovine, human, and rat species. In order to assess the existence of possible regulatory sequences in a more direct manner, the authors transfected homologous and heterologous cells with chimeric gene constructs containing possible regulatory sequences derived from both the bPRL and bGH genes. An analysis is presented of the polyadenylation signal contained in the bGH 3' flanking sequence.

  2. Chromatin loops, gene positioning, and gene expression

    NARCIS (Netherlands)

    Holwerda, S.; de Laat, W.

    2012-01-01

    Technological developments and intense research over the last years have led to a better understanding of the 3D structure of the genome and its influence on genome function inside the cell nucleus. We will summarize topological studies performed on four model gene loci: the alpha- and beta-globin

  3. Imaging the impact of genes on Parkinson's disease

    DEFF Research Database (Denmark)

    van der Vegt, J P M; van Nuenen, B F L; Bloem, B R

    2009-01-01

    by the discovery of mutations in single genes that can cause autosomal dominant (alpha-synuclein (SNCA)) and leucine rich repeat kinase 2 (LRRK2) gene) or recessive (Parkin, PTEN-induced putative kinase 1 (PINK1), DJ-1, and ATP13A2 gene) forms of PD. Here, we review how structural and functional neuroimaging...... of individuals carrying a mutation in one of the PD genes has offered a unique avenue of research into the pathogenesis of PD. In symptomatic mutation carriers (i.e. those with overt disease), brain mapping can help to link the molecular pathogenesis of PD more directly with functional and structural changes...... monogenic forms of PD, common polymorphisms in genes that influence mono-aminergic signaling or synaptic plasticity may have modifying effects on distinct aspects of PD. We also discuss how functional and structural neuroimaging can be used to better characterize these genotype-phenotype correlations....

  4. Spatial genetic structure and asymmetrical gene flow within the Pacific walrus

    Science.gov (United States)

    Sonsthagen, Sarah A.; Jay, Chadwick V.; Fischbach, Anthony S.; Sage, George K.; Talbot, Sandra L.

    2012-01-01

    Pacific walruses (Odobenus rosmarus divergens) occupying shelf waters of Pacific Arctic seas migrate during spring and summer from 3 breeding areas in the Bering Sea to form sexually segregated nonbreeding aggregations. We assessed genetic relationships among 2 putative breeding populations and 6 nonbreeding aggregations. Analyses of mitochondrial DNA (mtDNA) control region sequence data suggest that males are distinct among breeding populations (ΦST=0.051), and between the eastern Chukchi and other nonbreeding aggregations (ΦST=0.336–0.449). Nonbreeding female aggregations were genetically distinct across marker types (microsatellite FST=0.019; mtDNA ΦST=0.313), as was eastern Chukchi and all other nonbreeding aggregations (microsatellite FST=0.019–0.035; mtDNA ΦST=0.386–0.389). Gene flow estimates are asymmetrical from St. Lawrence Island into the southeastern Bering breeding population for both sexes. Partitioning of haplotype frequencies among breeding populations suggests that individuals exhibit some degree of philopatry, although weak. High levels of genetic differentiation among eastern Chukchi and all other nonbreeding aggregations, but considerably lower genetic differentiation between breeding populations, suggest that at least 1 genetically distinct breeding population remained unsampled. Limited genetic structure at microsatellite loci between assayed breeding areas can emerge from several processes, including male-mediated gene flow, or population admixture following a decrease in census size (i.e., due to commercial harvest during 1880–1950s) and subsequent recovery. Nevertheless, high levels of genetic diversity in the Pacific walrus, which withstood prolonged decreases in census numbers with little impact on neutral genetic diversity, may reflect resiliency in the face of past environmental challenges.

  5. Signalign: An Ontology of DNA as Signal for Comparative Gene Structure Prediction Using Information-Coding-and-Processing Techniques.

    Science.gov (United States)

    Yu, Ning; Guo, Xuan; Gu, Feng; Pan, Yi

    2016-03-01

    Conventional character-analysis-based techniques in genome analysis have three main shortcomings: inefficiency, inflexibility, and incompatibility. In our previous research, a general framework called DNA As X was proposed for character-analysis-free techniques to overcome these shortcomings, where X is an intermediate representation such as a digit, code, signal, vector, tree, or graph network. In this paper, we further implement an ontology of DNA As Signal by designing a tool named Signalign for comparative gene structure analysis, in which DNA sequences are converted into signal series, processed by a modified method of dynamic time warping, and measured by signal-to-noise ratio (SNR). The ontology of DNA As Signal integrates the principles and concepts of other disciplines, including information coding theory and signal processing, into sequence analysis and processing. Compared with conventional character-analysis-based methods, Signalign not only achieves equivalent or superior performance but also enriches the tools and the knowledge library of computational biology by extending the domain from character/string to diverse areas. The evaluation results validate the success of the character-analysis-free technique for improved performance in comparative gene structure prediction.
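
    The general idea of treating DNA as a signal and aligning it by dynamic time warping can be sketched as follows. This is textbook dynamic time warping, not Signalign's modified method, and the nucleotide-to-number encoding (A=1, C=2, G=3, T=4) is an arbitrary assumption made for illustration. The signal-to-noise measurement mentioned in the abstract is not reproduced here.

```python
def encode(sequence, mapping=None):
    """Convert a DNA string into a numeric signal (hypothetical encoding)."""
    mapping = mapping or {"A": 1.0, "C": 2.0, "G": 3.0, "T": 4.0}
    return [mapping[base] for base in sequence.upper()]


def dtw_distance(a, b):
    """Classic dynamic time warping cost between two numeric series."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = best cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(cost[i - 1][j],      # a[i-1] also matched earlier in b
                                    cost[i][j - 1],      # b[j-1] also matched earlier in a
                                    cost[i - 1][j - 1])  # both series advance together
    return cost[n][m]


print(dtw_distance(encode("ACGT"), encode("ACGT")))
print(dtw_distance(encode("ACGT"), encode("AACGT")))
print(dtw_distance(encode("ACGT"), encode("ACTT")))
```

    Because warping lets one sample align with several neighbours, a duplicated base costs nothing in this sketch, whereas a substitution does. That tolerance to stretching is what makes the approach attractive for comparing gene structures of different lengths.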

  6. Influence of secondary water supply systems on microbial community structure and opportunistic pathogen gene markers.

    Science.gov (United States)

    Li, Huan; Li, Shang; Tang, Wei; Yang, Yang; Zhao, Jianfu; Xia, Siqing; Zhang, Weixian; Wang, Hong

    2018-06-01

    Secondary water supply systems (SWSSs) refer to the in-building infrastructures (e.g., water storage tanks) used to supply water pressure beyond the main distribution systems. The purpose of this study was to investigate the influence of SWSSs on microbial community structure and the occurrence of opportunistic pathogens, the latter of which are an emerging public health concern. Higher numbers of bacterial 16S rRNA genes, Legionella and mycobacterial gene markers were found in public building taps served by SWSSs relative to the mains, regardless of the flushing practice (P water retention time, warm temperature and loss of disinfectant residuals promoted microbial growth and colonization of potential pathogens in SWSSs. Varied levels of microbial community shifts were found in different types of SWSSs during water transportation from the distribution main to taps, highlighting the critical role of SWSSs in shaping the drinking water microbiota. Overall, the results provided insight to factors that might aid in controlling pathogen proliferation in real-world water systems using SWSSs. Copyright © 2018 Elsevier Ltd. All rights reserved.

  7. Phosphagen kinase in Schistosoma japonicum: characterization of its enzymatic properties and determination of its gene structure.

    Science.gov (United States)

    Tokuhiro, Shinji; Uda, Kouji; Yano, Hiroko; Nagataki, Mitsuru; Jarilla, Blanca R; Suzuki, Tomohiko; Agatsuma, Takeshi

    2013-04-01

    Phosphagen kinases (PKs) play a major role in the regulation of energy metabolism in animals. Creatine kinase (CK) is the sole PK in vertebrates, whereas several PKs are present in invertebrates. Here, we report the enzymatic properties and gene structure of PK in the trematode Schistosoma japonicum (Sj). SjPK has a unique contiguous dimeric structure comprising domain 1 (D1) and domain 2 (D2). The three states of the recombinant SjPK (D1, D2, and D1D2) show a specific activity for the substrate taurocyamine. The comparison of the two domains of SjPK revealed that D1 had a high turnover rate (kcat=52.91) and D2 exhibited a high affinity for taurocyamine (Km(Tauro) =0.53±0.06). The full-length protein exhibited higher affinity for taurocyamine (Km(Tauro) =0.47±0.03) than the truncated domains (D1=1.30±0.10, D2=0.53±0.06). D1D2 also exhibited higher catalytic efficiency (kcat/Km(Tauro) =82.98) than D1 (40.70) and D2 (29.04). These results demonstrated that both domains of SjTKD1D2 interacted efficiently and remained functional. The three-dimensional structure of SjPKD1 was constructed by homology modeling based on the transition state analog complex state of Limulus AK. This protein model of SjPKD1 suggests that the overall structure is almost conserved between SjPKD1 and Limulus AK except for the flexible loops, in particular the guanidino-specificity (GS) region, which is associated with the recognition of the corresponding guanidino substrate. The constructed NJ tree and the comparison of exon/intron organization suggest that SjTK has evolved from an arginine kinase (AK) gene. SjTK has potential as a novel antihelminthic drug target as it is absent in mammals and its strong activity may imply a significant role for this protein in the energy metabolism of the parasite. Copyright © 2013 Elsevier B.V. All rights reserved.
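
    The catalytic efficiency quoted above is the ratio kcat/Km. A minimal sketch of that arithmetic follows, using only the D1 figures given in the abstract (kcat = 52.91, Km = 1.30); the function name is illustrative, and units are omitted because the abstract does not state them.

```python
def catalytic_efficiency(kcat, km):
    """Return kcat / Km; both values must share the units used in the source."""
    if km <= 0:
        raise ValueError("Km must be positive")
    return kcat / km


if __name__ == "__main__":
    # D1 domain values as reported in the abstract (units not stated there).
    d1 = catalytic_efficiency(52.91, 1.30)
    print(round(d1, 2))  # agrees with the reported D1 efficiency of 40.70
```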

  8. Nested PCR Biases in Interpreting Microbial Community Structure in 16S rRNA Gene Sequence Datasets.

    Science.gov (United States)

    Yu, Guoqin; Fadrosh, Doug; Goedert, James J; Ravel, Jacques; Goldstein, Alisa M

    2015-01-01

    Sequencing of the PCR-amplified 16S rRNA gene has become a common approach to microbial community investigations in the fields of human health and environmental sciences. This approach, however, is difficult when the amount of DNA is too low to be amplified by standard PCR. Nested PCR can be employed as it can amplify samples with DNA concentration several-fold lower than standard PCR. However, potential biases with nested PCRs that could affect measurement of community structure have received little attention. In this study, we used 17 DNAs extracted from vaginal swabs and 12 DNAs extracted from stool samples to study the influence of nested PCR amplification of the 16S rRNA gene on the estimation of microbial community structure using Illumina MiSeq sequencing. Nested and standard PCR methods were compared on alpha- and beta-diversity metrics and relative abundances of bacterial genera. The effects of number of cycles in the first round of PCR (10 vs. 20) and microbial diversity (relatively low in vagina vs. high in stool) were also investigated. Vaginal swab samples showed no significant difference in alpha diversity or community structure between nested PCR and standard PCR (one round of 40 cycles). Stool samples showed significant differences in alpha diversity (except Shannon's index) and relative abundance of 13 genera between nested PCR with 20 cycles in the first round and standard PCR (Pnested PCR with 10 cycles in the first round and standard PCR. Operational taxonomic units (OTUs) that had low relative abundance (sum of relative abundance 27% of total OTUs in stool). Nested PCR introduced bias in estimated diversity and community structure. The bias was more significant for communities with relatively higher diversity and when more cycles were applied in the first round of PCR. We conclude that nested PCR could be used when standard PCR does not work. However, rare taxa detected by nested PCR should be validated by other technologies.
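
    For readers unfamiliar with the alpha-diversity metrics mentioned above, the following is a minimal sketch of two common ones, observed richness and Shannon's index, computed from a vector of per-OTU read counts. It is a generic textbook calculation, not the pipeline used in the study, and the function names are illustrative.

```python
import math


def observed_richness(counts):
    """Number of OTUs with at least one read."""
    return sum(1 for c in counts if c > 0)


def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over OTUs with non-zero counts."""
    total = sum(counts)
    if total == 0:
        raise ValueError("sample has no reads")
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


if __name__ == "__main__":
    even_sample = [10, 10, 10, 10]   # reads spread evenly over four OTUs
    skewed_sample = [37, 1, 1, 1]    # same richness, one dominant OTU
    print(observed_richness(even_sample), shannon_index(even_sample))
    print(observed_richness(skewed_sample), shannon_index(skewed_sample))
```

    The two samples have the same richness but different Shannon values, which illustrates how a richness-based metric can shift when rare OTUs appear or disappear while an evenness-weighted index moves much less.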

  9. Influence of mutations in some structural genes of heat-shock proteins on radiation resistance of Escherichia coli

    International Nuclear Information System (INIS)

    Verbenko, V.N.; Kuznetsova, L.V.; Bikineeva, E.G.; Kalinin, V.L.

    1992-01-01

    Lethal effects of γ-irradiation were studied in Escherichia coli strains with normal repair genotype and in radiation-resistant Gam r strains, both carrying additional mutations in the structural genes dnaK, grpE, groES or groEL. The null mutation ΔdnaK52::Cm r enhanced radiation sensitivity of wild-type cells and abolished the effect of heat-induced radiation resistance (ETIRR) and elevated radiation resistance of the Gam r strains

  10. The research and application of TPO's gene

    International Nuclear Information System (INIS)

    Xing Yan

    2002-01-01

    Thyro-peroxidase (TPO) is a glycosylated protein bound to the apical plasma membrane of thyrocytes. It is the key enzyme in the synthesis of thyroid hormones. Its gene structure and transcriptional regulation have been studied in depth. The author reviews the development of research on TPO's gene structure, function and transcriptional regulation, and the relationship between TPO and thyroid diseases and radioactive iodide therapy

  11. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.

    Science.gov (United States)

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-03-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.

  12. MicroRNA-target binding structures mimic microRNA duplex structures in humans.

    Directory of Open Access Journals (Sweden)

    Xi Chen

    Full Text Available Traditionally, researchers match a microRNA guide strand to mRNA sequences using sequence comparisons to predict its potential target genes. However, many of the predictions can be false positives due to limitations in sequence comparison alone. In this work, we consider the association of two related RNA structures that share a common guide strand: the microRNA duplex and the microRNA-target binding structure. We have analyzed thousands of such structure pairs and found that many of them share high structural similarity. Therefore, we conclude that when predicting microRNA target genes, considering just the microRNA guide strand matches to gene sequences may not be sufficient; the microRNA duplex structure formed by the guide strand and its companion passenger strand must also be considered. We have developed software to translate RNA binding structure into encoded representations, and we have also created novel automatic comparison methods utilizing such encoded representations to determine RNA structure similarity. Our software and methods can be utilized in other RNA secondary structure comparisons as well.

  13. Extracellular Matrix, Nuclear and Chromatin Structure and Gene Expression in Normal Tissues and Malignant Tumors: A Work in Progress

    Energy Technology Data Exchange (ETDEWEB)

    Spencer, Virginia A.; Xu, Ren; Bissell, Mina J.

    2006-08-01

    Almost three decades ago, we presented a model where the extracellular matrix (ECM) was postulated to influence gene expression and tissue-specificity through the action of ECM receptors and the cytoskeleton. This hypothesis implied that ECM molecules could signal to the nucleus and that the unit of function in higher organisms was not the cell alone, but the cell plus its microenvironment. We now know that ECM invokes changes in tissue and organ architecture and that tissue, cell, nuclear, and chromatin structure are changed profoundly as a result of and during malignant progression. Whereas some evidence has been generated for a link between ECM-induced alterations in tissue architecture and changes in both nuclear and chromatin organization, the manner by which these changes actively induce or repress gene expression in normal and malignant cells is a topic in need of further attention. Here, we will discuss some key findings that may provide insights into mechanisms through which ECM could influence gene transcription and how tumor cells acquire the ability to overcome these levels of control.

  14. Plant-based food and feed protein structure changes induced by gene-transformation, heating and bio-ethanol processing: a synchrotron-based molecular structure and nutrition research program.

    Science.gov (United States)

    Yu, Peiqiang

    2010-11-01

    Unlike traditional "wet" analytical methods, which during processing for analysis often result in destruction or alteration of the intrinsic protein structures, advanced synchrotron radiation-based Fourier transform infrared microspectroscopy has been developed as a rapid, nondestructive bioanalytical technique. This cutting-edge synchrotron-based bioanalytical technology, taking advantage of synchrotron light brightness (a million times brighter than the sun), is capable of exploring the molecular chemistry or structure of a biological tissue without destroying inherent structures at ultra-spatial resolutions. In this article, a novel approach is introduced to show the potential of the advanced synchrotron-based analytical technology, which can be used to study plant-based food or feed protein molecular structure in relation to nutrient utilization and availability. Recent progress was reported on using the synchrotron-based bioanalytical techniques of synchrotron radiation-based Fourier transform infrared microspectroscopy and diffused reflectance infrared Fourier transform spectroscopy to detect the effects of gene-transformation (Application 1), autoclaving (Application 2), and bio-ethanol processing (Application 3) on plant-based food and feed protein structure changes on a molecular basis. The synchrotron-based technology provides a new approach for plant-based protein structure research at ultra-spatial resolutions at cellular and molecular levels.

  15. Parthenocarpic potential in Capsicum annuum L. is enhanced by carpelloid structures and controlled by a single recessive gene

    Directory of Open Access Journals (Sweden)

    Xue Lin B

    2011-10-01

    Full Text Available Abstract Background Parthenocarpy is a desirable trait in Capsicum annuum production because it improves fruit quality and results in a more regular fruit set. Previously, we identified several C. annuum genotypes that already show a certain level of parthenocarpy, and the seedless fruits obtained from these genotypes often contain carpel-like structures. In the Arabidopsis bel1 mutant ovule integuments are transformed into carpels, and we therefore carefully studied ovule development in C. annuum and correlated aberrant ovule development and carpelloid transformation with parthenocarpic fruit set. Results We identified several additional C. annuum genotypes with a certain level of parthenocarpy, and confirmed a positive correlation between parthenocarpic potential and the development of carpelloid structures. Investigations into the source of these carpel-like structures showed that while the majority of the ovules in C. annuum gynoecia are unitegmic and anatropous, several abnormal ovules were observed, abundant at the top and base of the placenta, with altered integument growth. Abnormal ovule primordia arose from the placenta and most likely transformed into carpelloid structures in analogy to the Arabidopsis bel1 mutant. When pollination was present fruit weight was positively correlated with seed number, but in the absence of seeds, fruit weight proportionally increased with the carpelloid mass and number. Capsicum genotypes with high parthenocarpic potential always showed stronger carpelloid development. The parthenocarpic potential appeared to be controlled by a single recessive gene, but no variation in coding sequence was observed in a candidate gene CaARF8. Conclusions Our results suggest that in the absence of fertilization most C. annuum genotypes, have parthenocarpic potential and carpelloid growth, which can substitute developing seeds in promoting fruit development.

  16. Parthenocarpic potential in Capsicum annuum L. is enhanced by carpelloid structures and controlled by a single recessive gene

    Science.gov (United States)

    2011-01-01

    Background Parthenocarpy is a desirable trait in Capsicum annuum production because it improves fruit quality and results in a more regular fruit set. Previously, we identified several C. annuum genotypes that already show a certain level of parthenocarpy, and the seedless fruits obtained from these genotypes often contain carpel-like structures. In the Arabidopsis bel1 mutant ovule integuments are transformed into carpels, and we therefore carefully studied ovule development in C. annuum and correlated aberrant ovule development and carpelloid transformation with parthenocarpic fruit set. Results We identified several additional C. annuum genotypes with a certain level of parthenocarpy, and confirmed a positive correlation between parthenocarpic potential and the development of carpelloid structures. Investigations into the source of these carpel-like structures showed that while the majority of the ovules in C. annuum gynoecia are unitegmic and anatropous, several abnormal ovules were observed, abundant at the top and base of the placenta, with altered integument growth. Abnormal ovule primordia arose from the placenta and most likely transformed into carpelloid structures in analogy to the Arabidopsis bel1 mutant. When pollination was present fruit weight was positively correlated with seed number, but in the absence of seeds, fruit weight proportionally increased with the carpelloid mass and number. Capsicum genotypes with high parthenocarpic potential always showed stronger carpelloid development. The parthenocarpic potential appeared to be controlled by a single recessive gene, but no variation in coding sequence was observed in a candidate gene CaARF8. Conclusions Our results suggest that in the absence of fertilization most C. annuum genotypes, have parthenocarpic potential and carpelloid growth, which can substitute developing seeds in promoting fruit development. PMID:22018057

  17. Pathways to age of onset of heroin use: a structural model approach exploring the relationship of the COMT gene, impulsivity and childhood trauma.

    Science.gov (United States)

    Li, Ting; Du, Jiang; Yu, Shunying; Jiang, Haifeng; Fu, Yingmei; Wang, Dongxiang; Sun, Haiming; Chen, Hanhui; Zhao, Min

    2012-01-01

    The interaction of dopamine genes, impulsivity and childhood trauma with substance abuse remains unclear. This study aimed to clarify the impacts and the interactions of the catechol-O-methyltransferase (COMT) gene, impulsivity and childhood trauma on the age of onset of heroin use among heroin-dependent patients in China. 202 male and 248 female inpatients who met DSM-IV criteria for heroin dependence were enrolled. Impulsivity and childhood trauma were measured using the BIS-11 (Barratt Impulsiveness Scale-11) and the ETISR-SF (Early Trauma Inventory Self Report-Short Form). The single nucleotide polymorphism (SNP) rs737866 on the COMT gene, which has previously been associated with heroin abuse, was genotyped using a DNA sequence detection system. A structural equation model was used to assess the interaction paths between these factors and the age of onset of heroin use. A chi-square test indicated that individuals with the TT genotype had an earlier age of onset of heroin use than those with CT or CC. In the correlation analysis, the severity of childhood trauma was positively correlated with the impulsivity score, but both were negatively related to the age of onset of heroin use. In the structural equation model, both the COMT gene and childhood trauma affected the age of onset of heroin use directly or via impulsive personality. Our findings indicate that the COMT gene, impulsive personality traits and childhood trauma experience interact to affect the age of onset of heroin use, and play a critical role in the development of heroin dependence. The impact of the environmental factor was greater than that of the COMT gene in the development of heroin dependence.

  18. Cell-bound lipases from Burkholderia sp. ZYB002: gene sequence analysis, expression, enzymatic characterization, and 3D structural model.

    Science.gov (United States)

    Shu, Zhengyu; Lin, Hong; Shi, Shaolei; Mu, Xiangduo; Liu, Yanru; Huang, Jianzhong

    2016-05-03

    The whole-cell lipase from Burkholderia cepacia has been used as a biocatalyst in organic synthesis. However, there is no report in the literature on the components or the gene sequence of the cell-bound lipase from this species. Qualitative analysis of the cell-bound lipase would help to illuminate the regulation mechanism of gene expression and further improve the yield of the cell-bound lipase by gene engineering. Three predicted cell-bound lipases, lipA, lipC21 and lipC24, from Burkholderia sp. ZYB002 were cloned and expressed in E. coli. Both LipA and LipC24 displayed lipase activity. LipC24 was a novel mesophilic enzyme and displayed a preference for medium-chain-length acyl groups (C10-C14). The 3D structural model of LipC24 revealed an open Y-type active site. LipA displayed 96 % amino acid sequence identity with the known extracellular lipase. lipA-inactivation and lipC24-inactivation decreased the total cell-bound lipase activity of Burkholderia sp. ZYB002 by 42 % and 14 %, respectively. The cell-bound lipase activity from Burkholderia sp. ZYB002 originated from a multi-enzyme mixture with LipA as the main component. LipC24 was a novel lipase and displayed enzymatic characteristics and a structural model different from those of LipA. Besides LipA and LipC24, other types of cell-bound lipases (or esterases) should exist.

  19. [Mechanisms of endogenous drug resistance acquisition by spontaneous chromosomal gene mutation].

    Science.gov (United States)

    Fukuda, H; Hiramatsu, K

    1997-05-01

    Endogenous resistance in bacteria is caused by a change or loss of function and is generally genetically recessive. However, this type of resistance acquisition is now prevalent in clinical settings. The chromosomal genes that confer endogenous resistance are those related to the drug target, to drug-inactivating enzymes, and to the permeability of molecules, including antibacterial agents. Endogenous alteration of the drug target is mediated by spontaneous mutation of its structural gene. Such a mutation greatly lowers the affinity of the drug for the target. Expression of inactivating enzymes, such as class C beta-lactamase, is generally controlled by regulatory genes. Spontaneous mutations in the regulatory genes cause constitutive enzyme production and confer resistance to agents that are usually stable to such enzymes. Spontaneous mutation in the structural gene gives the enzyme extra-spectrum substrate specificity, as in ESBL (Extra-Spectrum-beta-Lactamase). Expression of the structural genes encoding permeability systems is also controlled by regulatory genes. Spontaneous mutation of these regulatory genes reduces the amount of porin protein, which greatly lowers influx of the drug into the cell. Spontaneous mutation in the promoter region of the structural gene of an efflux protein has also been observed. This mutation raised gene transcription and led to overproduction of the efflux protein, which promotes drug efflux from the cell.

  20. Cone structure in patients with usher syndrome type III and mutations in the Clarin 1 gene.

    Science.gov (United States)

    Ratnam, Kavitha; Västinsalo, Hanna; Roorda, Austin; Sankila, Eeva-Marja K; Duncan, Jacque L

    2013-01-01

    To study macular structure and function in patients with Usher syndrome type III (USH3) caused by mutations in the Clarin 1 gene (CLRN1). High-resolution macular images were obtained by adaptive optics scanning laser ophthalmoscopy and spectral domain optical coherence tomography in 3 patients with USH3 and were compared with those of age-similar control subjects. Vision function measures included best-corrected visual acuity, kinetic and static perimetry, and full-field electroretinography. Coding regions of the CLRN1 gene were sequenced. CLRN1 mutations were present in all the patients; a 20-year-old man showed compound heterozygous mutations (p.N48K and p.S188X), and 2 unrelated women aged 25 and 32 years had homozygous mutations (p.N48K). Best-corrected visual acuity ranged from 20/16 to 20/40, with scotomas beginning at 3° eccentricity. The inner segment-outer segment junction or the inner segment ellipsoid band was disrupted within 1° to 4° of the fovea, and the foveal inner and outer segment layers were significantly thinner than normal. Cones near the fovea in patients 1 and 2 showed normal spacing, and the preserved region ended abruptly. Retinal pigment epithelial cells were visible in patient 3 where cones were lost. Cones were observed centrally but not in regions with scotomas, and retinal pigment epithelial cells were visible in regions without cones in patients with CLRN1 mutations. High-resolution measures of retinal structure demonstrate patterns of cone loss associated with CLRN1 mutations. These findings provide insight into the effect of CLRN1 mutations on macular cone structure, which has implications for the development of treatments for USH3. clinicaltrials.gov Identifier: NCT00254605.

  1. Cone Structure in Patients With Usher Syndrome Type III and Mutations in the Clarin 1 Gene

    Science.gov (United States)

    Ratnam, Kavitha; Västinsalo, Hanna; Roorda, Austin; Sankila, Eeva-Marja K.; Duncan, Jacque L.

    2015-01-01

    Objective To study macular structure and function in patients with Usher syndrome type III (USH3) caused by mutations in the Clarin 1 gene (CLRN1). Methods High-resolution macular images were obtained by adaptive optics scanning laser ophthalmoscopy and spectral domain optical coherence tomography in 3 patients with USH3 and were compared with those of age-similar control subjects. Vision function measures included best-corrected visual acuity, kinetic and static perimetry, and full-field electroretinography. Coding regions of the CLRN1 gene were sequenced. Results CLRN1 mutations were present in all the patients; a 20-year-old man showed compound heterozygous mutations (p.N48K and p.S188X), and 2 unrelated women aged 25 and 32 years had homozygous mutations (p.N48K). Best-corrected visual acuity ranged from 20/16 to 20/40, with scotomas beginning at 3° eccentricity. The inner segment-outer segment junction or the inner segment ellipsoid band was disrupted within 1° to 4° of the fovea, and the foveal inner and outer segment layers were significantly thinner than normal. Cones near the fovea in patients 1 and 2 showed normal spacing, and the preserved region ended abruptly. Retinal pigment epithelial cells were visible in patient 3 where cones were lost. Conclusions Cones were observed centrally but not in regions with scotomas, and retinal pigment epithelial cells were visible in regions without cones in patients with CLRN1 mutations. High-resolution measures of retinal structure demonstrate patterns of cone loss associated with CLRN1 mutations. Clinical Relevance These findings provide insight into the effect of CLRN1 mutations on macular cone structure, which has implications for the development of treatments for USH3. Trial Registration clinicaltrials.gov Identifier: NCT00254605 PMID:22964989

  2. Gene design, cloning and protein-expression methods for high-value targets at the Seattle Structural Genomics Center for Infectious Disease

    International Nuclear Information System (INIS)

    Raymond, Amy; Haffner, Taryn; Ng, Nathan; Lorimer, Don; Staker, Bart; Stewart, Lance

    2011-01-01

    An overview of one salvage strategy for high-value SSGCID targets is given. Any structural genomics endeavor, particularly an ambitious one such as the NIAID-funded Seattle Structural Genomics Center for Infectious Disease (SSGCID) or the Center for Structural Genomics of Infectious Disease (CSGID), faces technical challenges at all points of the production pipeline. One salvage strategy employed by SSGCID is combined gene engineering and structure-guided construct design to overcome challenges at the levels of protein expression and protein crystallization. Multiple constructs of each target are cloned in parallel using Polymerase Incomplete Primer Extension cloning and small-scale expressions of these are rapidly analyzed by capillary electrophoresis. Using the methods reported here, which have proven particularly useful for high-value targets, otherwise intractable targets can be resolved

  3. Glucokinase gene mutations: structural and genotype-phenotype analyses in MODY children from South Italy.

    Directory of Open Access Journals (Sweden)

    Nadia Tinto

    Full Text Available BACKGROUND: Maturity onset diabetes of the young type 2 (or GCK MODY) is a genetic form of diabetes mellitus provoked by mutations in the glucokinase gene (GCK). METHODOLOGY/PRINCIPAL FINDINGS: We screened the GCK gene by direct sequencing in 30 patients from South Italy with suspected MODY. The mutation-induced structural alterations in the protein were analyzed by molecular modeling. The patients' biochemical, clinical and anamnestic data were obtained. Mutations were detected in 16/30 patients (53%); 9 of the 12 mutations identified were novel (p.Glu70Asp, p.Phe123Leu, p.Asp132Asn, p.His137Asp, p.Gly162Asp, p.Thr168Ala, p.Arg392Ser, p.Glu290X, p.Gln106_Met107delinsLeu) and are in regions involved in structural rearrangements required for catalysis. The prevalence of mutation sites was higher in the small domain (7/12: approximately 59%) than in the large (4/12: 33%) domain or in the connection (1/12: 8%) region of the protein. Mild diabetic phenotypes were detected in almost all patients [mean (SD) OGTT = 7.8 mMol/L (1.8)] and mean triglyceride levels were lower in mutated than in unmutated GCK patients (p = 0.04). CONCLUSIONS: The prevalence of GCK MODY is high in southern Italy, and the GCK small domain is a hot spot for MODY mutations. Both the severity of the GCK mutation and the genetic background seem to play a relevant role in the GCK MODY phenotype. Indeed, a partial genotype-phenotype correlation was identified in related patients (3 pairs of siblings) but not in two unrelated children bearing the same mutation. Thus, the molecular approach allows the physician to confirm the diagnosis and to predict severity of the mutation.

  4. Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

    Science.gov (United States)

    Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

    2018-02-23

    Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate that the given gene products carry out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from the massive set of terms defined by GO is a difficult challenge. To address this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash first measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and that it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.

  5. The Drosophila melanogaster methuselah gene: a novel gene with ancient functions.

    Directory of Open Access Journals (Sweden)

    Ana Rita Araújo

    Full Text Available The Drosophila melanogaster G protein-coupled receptor gene, methuselah (mth), has been described as a novel gene that is less than 10 million years old. Nevertheless, it shows a highly specific expression pattern in embryos, larvae, and adults, and has been implicated in larval development, stress resistance, and in the setting of adult lifespan, among others. Although mth belongs to a gene subfamily with 16 members in D. melanogaster, there is no evidence for functional redundancy in this subfamily. Therefore, it is surprising that a novel gene influences so many traits. Here, we explore the alternative hypothesis that mth is an old gene. Under this hypothesis, in species distantly related to D. melanogaster, there should be a gene with features similar to those of mth. By performing detailed phylogenetic, synteny, protein structure, and gene expression analyses we show that the D. virilis GJ12490 gene is the ortholog of mth in species distantly related to D. melanogaster. We also show that, in D. americana (a species of the virilis group of Drosophila), a common amino acid polymorphism at the GJ12490 orthologous gene is significantly associated with developmental time, size, and lifespan differences. Our results imply that GJ12490 orthologous genes are candidates for developmental time and lifespan differences in Drosophila in general.

  6. Analysis of phylogeny and codon usage bias and relationship of GC content, amino acid composition with expression of the structural nif genes.

    Science.gov (United States)

    Mondal, Sunil Kanti; Kundu, Sudip; Das, Rabindranath; Roy, Sujit

    2016-08-01

    Bacteria and archaea have evolved the ability to fix atmospheric dinitrogen in the form of ammonia, catalyzed by the nitrogenase enzyme complex, which is encoded by three structural genes, nifK, nifD and nifH. nifK and nifD encode the beta and alpha subunits, respectively, of component 1, while nifH encodes component 2 of nitrogenase. Phylogeny based on nifDHK indicated that Cyanobacteria are closer to Proteobacteria alpha and gamma, but this was not supported by the tree based on 16S rRNA. The evolutionary ancestor for the different trees was also different. The GC1 and GC2% analysis showed more consistency than GC3%, which appeared to be low for Firmicutes, Cyanobacteria and Euryarchaeota while highest in Proteobacteria beta, and clearly showed a proportional effect on codon usage with a few exceptions. A few genes from Firmicutes, Euryarchaeota, Proteobacteria alpha and delta were found to be under mutational pressure. These nif genes with low and high GC3% from different classes of organisms showed similar expected number of codons. Distribution of the genes and codons, based on codon usage, demonstrated opposite patterns for different orientations of the mirror plane when compared with each other. Overall, our results provide a comprehensive analysis of the evolutionary relationship of the three structural nif genes, nifK, nifD and nifH, in the context of codon usage bias, GC content relationship and amino acid composition of the encoded proteins, and an exploration of a crucial statistical method for the analysis of positive data with non-constant variance to identify the shape factors of codon adaptation index.
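
    The GC1, GC2 and GC3 percentages discussed above are the G+C fractions at the first, second and third codon positions of a coding sequence. A minimal sketch of that calculation is given below; the function name and the toy input are illustrative and are not taken from the study.

```python
def gc_by_codon_position(cds):
    """Return (GC1%, GC2%, GC3%) for an in-frame coding sequence."""
    cds = cds.upper()
    if len(cds) == 0 or len(cds) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    n_codons = len(cds) // 3
    percentages = []
    for position in range(3):
        # Every third base starting at this offset belongs to one codon position.
        bases = cds[position::3]
        gc_count = sum(1 for base in bases if base in "GC")
        percentages.append(100.0 * gc_count / n_codons)
    return tuple(percentages)


if __name__ == "__main__":
    # Toy two-codon input (ATG GCC), used only to show the call.
    print(gc_by_codon_position("ATGGCC"))
```

    GC3 is usually the most variable of the three because many third-position changes are synonymous, which is why it is commonly read as an indicator of mutational bias.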

  7. Inferred vs realized patterns of gene flow: an analysis of population structure in the Andros Island Rock Iguana.

    Science.gov (United States)

    Colosimo, Giuliano; Knapp, Charles R; Wallace, Lisa E; Welch, Mark E

    2014-01-01

    Ecological data, the primary source of information on patterns and rates of migration, can be integrated with genetic data to more accurately describe the realized connectivity between geographically isolated demes. In this paper we implement this approach and discuss its implications for managing populations of the endangered Andros Island Rock Iguana, Cyclura cychlura cychlura. This iguana is endemic to Andros, a highly fragmented landmass of large islands and smaller cays. Field observations suggest that geographically isolated demes were panmictic due to high, inferred rates of gene flow. We expand on these observations using 16 polymorphic microsatellites to investigate the genetic structure and rates of gene flow from 188 Andros Iguanas collected across 23 island sites. Bayesian clustering of specimens assigned individuals to three distinct genotypic clusters. An analysis of molecular variance (AMOVA) indicates that allele frequency differences are responsible for a significant portion of the genetic variance across the three defined clusters (Fst =  0.117, p<0.01). These clusters are associated with larger islands and satellite cays isolated by broad water channels with strong currents. These findings imply that broad water channels present greater obstacles to gene flow than was inferred from field observation alone. Additionally, rates of gene flow were indirectly estimated using BAYESASS 3.0. The proportion of individuals originating from within each identified cluster varied from 94.5 to 98.7%, providing further support for local isolation. Our assessment reveals a major disparity between inferred and realized gene flow. We discuss our results in a conservation perspective for species inhabiting highly fragmented landscapes.
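
    The Fst value reported here comes from an AMOVA over 16 microsatellite loci. As a much simpler stand-in, and not the authors' method, the sketch below computes Wright's Fst for a single biallelic locus from subpopulation allele frequencies, assuming equal subpopulation sizes. The function name and the frequencies are hypothetical.

```python
def wright_fst(allele_freqs):
    """Wright's Fst = (H_T - H_S) / H_T for one biallelic locus.

    `allele_freqs` holds the frequency of one allele in each subpopulation.
    Subpopulations are assumed to be of equal size (an assumption of this sketch).
    """
    n = len(allele_freqs)
    # H_S: mean expected heterozygosity within subpopulations
    h_s = sum(2 * p * (1 - p) for p in allele_freqs) / n
    # H_T: expected heterozygosity of the pooled population
    p_bar = sum(allele_freqs) / n
    h_t = 2 * p_bar * (1 - p_bar)
    return 0.0 if h_t == 0 else (h_t - h_s) / h_t


# Hypothetical frequencies for three clusters, not data from the study.
print(round(wright_fst([0.2, 0.5, 0.8]), 3))
```

    Values near 0 indicate little differentiation between clusters, while larger values indicate that allele frequencies differ between them.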

  8. Inferred vs Realized Patterns of Gene Flow: An Analysis of Population Structure in the Andros Island Rock Iguana

    Science.gov (United States)

    Colosimo, Giuliano; Knapp, Charles R.; Wallace, Lisa E.; Welch, Mark E.

    2014-01-01

    Ecological data, the primary source of information on patterns and rates of migration, can be integrated with genetic data to more accurately describe the realized connectivity between geographically isolated demes. In this paper we implement this approach and discuss its implications for managing populations of the endangered Andros Island Rock Iguana, Cyclura cychlura cychlura. This iguana is endemic to Andros, a highly fragmented landmass of large islands and smaller cays. Field observations suggest that geographically isolated demes were panmictic due to high, inferred rates of gene flow. We expand on these observations using 16 polymorphic microsatellites to investigate the genetic structure and rates of gene flow from 188 Andros Iguanas collected across 23 island sites. Bayesian clustering of specimens assigned individuals to three distinct genotypic clusters. An analysis of molecular variance (AMOVA) indicates that allele frequency differences are responsible for a significant portion of the genetic variance across the three defined clusters (Fst =  0.117, p<0.01). These clusters are associated with larger islands and satellite cays isolated by broad water channels with strong currents. These findings imply that broad water channels present greater obstacles to gene flow than was inferred from field observation alone. Additionally, rates of gene flow were indirectly estimated using BAYESASS 3.0. The proportion of individuals originating from within each identified cluster varied from 94.5 to 98.7%, providing further support for local isolation. Our assessment reveals a major disparity between inferred and realized gene flow. We discuss our results in a conservation perspective for species inhabiting highly fragmented landscapes. PMID:25229344

  9. Inferred vs realized patterns of gene flow: an analysis of population structure in the Andros Island Rock Iguana.

    Directory of Open Access Journals (Sweden)

    Giuliano Colosimo

    Full Text Available Ecological data, the primary source of information on patterns and rates of migration, can be integrated with genetic data to more accurately describe the realized connectivity between geographically isolated demes. In this paper we implement this approach and discuss its implications for managing populations of the endangered Andros Island Rock Iguana, Cyclura cychlura cychlura. This iguana is endemic to Andros, a highly fragmented landmass of large islands and smaller cays. Field observations suggest that geographically isolated demes were panmictic due to high, inferred rates of gene flow. We expand on these observations using 16 polymorphic microsatellites to investigate the genetic structure and rates of gene flow from 188 Andros Iguanas collected across 23 island sites. Bayesian clustering of specimens assigned individuals to three distinct genotypic clusters. An analysis of molecular variance (AMOVA) indicates that allele frequency differences are responsible for a significant portion of the genetic variance across the three defined clusters (Fst =  0.117, p<<0.01). These clusters are associated with larger islands and satellite cays isolated by broad water channels with strong currents. These findings imply that broad water channels present greater obstacles to gene flow than was inferred from field observation alone. Additionally, rates of gene flow were indirectly estimated using BAYESASS 3.0. The proportion of individuals originating from within each identified cluster varied from 94.5 to 98.7%, providing further support for local isolation. Our assessment reveals a major disparity between inferred and realized gene flow. We discuss our results in a conservation perspective for species inhabiting highly fragmented landscapes.

  10. The WRKY Transcription Factor Genes in Lotus japonicus.

    Science.gov (United States)

    Song, Hui; Wang, Pengfei; Nan, Zhibiao; Wang, Xingjun

    2014-01-01

    WRKY transcription factor genes play critical roles in plant growth and development, as well as stress responses. WRKY genes have been examined in various higher plants, but they have not been characterized in Lotus japonicus. The recent release of the L. japonicus whole genome sequence provides an opportunity for a genome wide analysis of WRKY genes in this species. In this study, we identified 61 WRKY genes in the L. japonicus genome. Based on the WRKY protein structure, L. japonicus WRKY (LjWRKY) genes can be classified into three groups (I-III). Investigations of gene copy number and gene clusters indicate that only one gene duplication event occurred on chromosome 4 and no clustered genes were detected on chromosomes 3 or 6. Researchers previously believed that group II and III WRKY domains were derived from the C-terminal WRKY domain of group I. Our results suggest that some WRKY genes in group II originated from the N-terminal domain of group I WRKY genes. Additional evidence to support this hypothesis was obtained by Medicago truncatula WRKY (MtWRKY) protein motif analysis. We found that LjWRKY and MtWRKY group III genes are under purifying selection, suggesting that WRKY genes will become increasingly structured and functionally conserved.

  11. Molecular identification of aiiA homologous gene from endophytic Enterobacter species and in silico analysis of putative tertiary structure of AHL-lactonase.

    Science.gov (United States)

    Rajesh, P S; Rai, V Ravishankar

    2014-01-03

    The aiiA homologous gene is known to encode the AHL-lactonase enzyme, which hydrolyzes the N-acylhomoserine lactone (AHL) quorum sensing signaling molecules produced by Gram-negative bacteria. In this study, the degradation of AHL molecules by cell-free lysate of endophytic Enterobacter species was determined. The percentage of quorum quenching was confirmed and quantified by HPLC method (pEnterobacter asburiae VT65, Enterobacter aerogenes VT66 and Enterobacter ludwigii VT70 strains. Sequence alignment analysis revealed the presence of two zinc binding sites, an "HXHXDH" motif, as well as a tyrosine residue at position 194. Based on a known template available at Swiss-Model, a putative tertiary structure of AHL-lactonase was constructed. The results showed that novel endophytic strains of the genus Enterobacter encode the novel aiiA homologous gene, and indicate its structural importance for future study. Copyright © 2013 Elsevier Inc. All rights reserved.

  12. Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization

    Directory of Open Access Journals (Sweden)

    McDonald Karen

    2011-08-01

    Full Text Available Abstract Background Direct gene synthesis is becoming more popular owing to decreases in gene synthesis pricing. Compared with using natural genes, gene synthesis provides a good opportunity to optimize gene sequence for specific applications. In order to facilitate gene optimization, we have developed a stand-alone software called Visual Gene Developer. Results The software not only provides general functions for gene analysis and optimization along with an interactive user-friendly interface, but also includes unique features such as programming capability, dedicated mRNA secondary structure prediction, artificial neural network modeling, network & multi-threaded computing, and user-accessible programming modules. The software allows a user to analyze and optimize a sequence using main menu functions or specialized module windows. Alternatively, gene optimization can be initiated by designing a gene construct and configuring an optimization strategy. A user can choose several predefined or user-defined algorithms to design a complicated strategy. The software provides expandable functionality as platform software supporting module development using popular script languages such as VBScript and JScript in the software programming environment. Conclusion Visual Gene Developer is useful for both researchers who want to quickly analyze and optimize genes, and those who are interested in developing and testing new algorithms in bioinformatics. The software is available for free download at http://www.visualgenedeveloper.net.

  13. Genetic structure of Octopus vulgaris (Cephalopoda, Octopodidae) in the central Mediterranean Sea inferred from the mitochondrial COIII gene.

    Science.gov (United States)

    Fadhlaoui-Zid, Karima; Knittweis, Leyla; Aurelle, Didier; Nafkha, Chaala; Ezzeddine, Soufia; Fiorentino, Fabio; Ghmati, Hisham; Ceriola, Luca; Jarboui, Othman; Maltagliati, Ferruccio

    2012-01-01

    The polymorphism of the mitochondrial gene cytochrome oxidase III was studied in the Mediterranean octopus, Octopus vulgaris Cuvier, 1797. A total of 202 specimens from seven sampling sites were analysed with the aim of elucidating patterns of genetic structure in the central Mediterranean Sea and to give an insight into the phylogeny of the Octopus genus. Phylogenetic analyses showed that individuals from the central Mediterranean belong to the O. vulgaris species whose limits should nevertheless be clarified. Concerning genetic structure, two high-frequency haplotypes were present in all locations. The overall genetic divergence (Φ(ST)=0.05, P<0.05) indicated a significant genetic structuring in the study area and an AMOVA highlighted a significant break between western and eastern Mediterranean basins (Φ(CT)=0.094, P<0.05). Possible explanations for the observed patterns of genetic structuring are discussed with reference to their relevance for fisheries management. Copyright © 2012. Published by Elsevier SAS.

  14. Zebrafish Expression Ontology of Gene Sets (ZEOGS): A Tool to Analyze Enrichment of Zebrafish Anatomical Terms in Large Gene Sets

    Science.gov (United States)

    Marsico, Annalisa

    2013-01-01

    Abstract The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene

  15. Zebrafish Expression Ontology of Gene Sets (ZEOGS): a tool to analyze enrichment of zebrafish anatomical terms in large gene sets.

    Science.gov (United States)

    Prykhozhij, Sergey V; Marsico, Annalisa; Meijsing, Sebastiaan H

    2013-09-01

    The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. 
    Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene expression
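
    The abstract does not say which statistic ZEOGS uses to decide that an anatomical term is overrepresented. A common choice for this kind of question is a one-sided hypergeometric test, sketched below purely as an assumption. The function name and all counts are hypothetical.

```python
from math import comb


def enrichment_p_value(population: int, annotated: int, sample: int, hits: int) -> float:
    """One-sided hypergeometric p-value P(X >= hits).

    population: number of genes in the background set
    annotated:  background genes annotated with the anatomical term
    sample:     size of the input gene set
    hits:       input genes annotated with the term
    """
    upper = min(annotated, sample)
    total = comb(population, sample)  # all ways to draw the input set
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical counts: 10 background genes, 3 annotated, input set of 2 with 1 hit.
print(enrichment_p_value(population=10, annotated=3, sample=2, hits=1))
```

    In practice such a test would be run once per anatomical term, so a multiple-testing correction would normally follow; that step is omitted here.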

  16. Polyamidoamine-Decorated Nanodiamonds as a Hybrid Gene Delivery Vector and siRNA Structural Characterization at the Charged Interfaces.

    Science.gov (United States)

    Lim, Dae Gon; Rajasekaran, Nirmal; Lee, Dukhee; Kim, Nam Ah; Jung, Hun Soon; Hong, Sungyoul; Shin, Young Kee; Kang, Eunah; Jeong, Seong Hoon

    2017-09-20

    Nanodiamonds have been discovered as a new exogenous material source in biomedical applications. As a new potent form of nanodiamond (ND), polyamidoamine-decorated nanodiamonds (PAMAM-NDs) were prepared for E7 or E6 oncoprotein-suppressing siRNA gene delivery for high risk human papillomavirus-induced cervical cancer, such as types 16 and 18. It is critical to understand the physicochemical properties of siRNA complexes immobilized on cationic solid ND surfaces in the aspect of biomolecular structural and conformational changes, as the new inert carbon material can be extended into the application of a gene delivery vector. A spectral study of siRNA/PAMAM-ND complexes using differential scanning calorimetry and circular dichroism spectroscopy proved that the hydrogen bonding and electrostatic interactions between siRNA and PAMAM-NDs decreased endothermic heat capacity. Moreover, siRNA/PAMAM-ND complexes showed low cell cytotoxicity and significant suppressing effects for forward target E6 and E7 oncogenic genes, proving functional and therapeutic efficacy. The cellular uptake of siRNA/PAMAM-ND complexes at 8 h was visualized by macropinocytes and direct endosomal escape of the siRNA/PAMAM-ND complexes. It is presumed that PAMAM-NDs provided a buffering cushion to adjust the pH and hard mechanical stress to escape endosomes. siRNA/PAMAM-ND complexes provide a potential organic/inorganic hybrid material source for gene delivery carriers.

  17. Reranking candidate gene models with cross-species comparison for improved gene prediction

    Directory of Open Access Journals (Sweden)

    Pereira Fernando CN

    2008-10-01

    Full Text Available Abstract Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc.). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models.
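
    The reranking idea can be illustrated with a small sketch. The abstract does not give the actual scoring rules, so the weighted sum below, the `weight` parameter, the class and field names, and the example scores are all assumptions. Both scores are assumed to be on a comparable 0 to 1 scale.

```python
from dataclasses import dataclass


@dataclass
class CandidateModel:
    name: str
    finder_score: float         # score from the original gene finder (higher is better)
    ortholog_similarity: float  # similarity to a putative ortholog, 0..1


def rerank(candidates, weight=0.5):
    """Sort candidates by a weighted mix of finder score and ortholog similarity.

    weight=0 reproduces the gene finder's own ranking; weight=1 ranks by
    cross-species similarity alone. The linear mix is a hypothetical rule.
    """
    def combined(model):
        return (1 - weight) * model.finder_score + weight * model.ortholog_similarity

    return sorted(candidates, key=combined, reverse=True)


# Hypothetical K=2 candidates for one locus.
candidates = [
    CandidateModel("model_A", finder_score=0.90, ortholog_similarity=0.20),
    CandidateModel("model_B", finder_score=0.85, ortholog_similarity=0.95),
]
print([m.name for m in rerank(candidates)])
```

    With these numbers the candidate ranked second by the gene finder moves to the top once ortholog similarity is taken into account, which is the behaviour the abstract describes.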

  18. Automatic generation of gene finders for eukaryotic species

    DEFF Research Database (Denmark)

    Terkelsen, Kasper Munch; Krogh, A.

    2006-01-01

    Background The number of sequenced eukaryotic genomes is rapidly increasing. This means that over time it will be hard to keep supplying customised gene finders for each genome. This calls for procedures to automatically generate species-specific gene finders and to re-train them as the quantity and quality of reliable gene annotation grows. Results We present a procedure, Agene, that automatically generates a species-specific gene predictor from a set of reliable mRNA sequences and a genome. We apply a Hidden Markov model (HMM) that implements explicit length distribution modelling for all gene structure blocks using acyclic discrete phase type distributions. The state structure of each HMM is generated dynamically from an array of sub-models to include only gene features represented in the training set. Conclusion Acyclic discrete phase type distributions are well suited to model sequence...

  19. Genome-wide analysis of regions similar to promoters of histone genes

    KAUST Repository

    Chowdhary, Rajesh

    2010-05-28

    Background: The purpose of this study is to: i) develop a computational model of promoters of human histone-encoding genes (shortly histone genes), an important class of genes that participate in various critical cellular processes, ii) use the model so developed to identify regions across the human genome that have similar structure as promoters of histone genes; such regions could represent potential genomic regulatory regions, e.g. promoters, of genes that may be coregulated with histone genes, and iii) identify in this way genes that have high likelihood of being coregulated with the histone genes. Results: We successfully developed a histone promoter model using a comprehensive collection of histone genes. Based on leave-one-out cross-validation test, the model produced good prediction accuracy (94.1% sensitivity, 92.6% specificity, and 92.8% positive predictive value). We used this model to predict across the genome a number of genes that shared similar promoter structures with the histone gene promoters. We thus hypothesize that these predicted genes could be coregulated with histone genes. This hypothesis matches well with the available gene expression, gene ontology, and pathways data. Jointly with promoters of the above-mentioned genes, we found a large number of intergenic regions with similar structure as histone promoters. Conclusions: This study represents one of the most comprehensive computational analyses conducted thus far on a genome-wide scale of promoters of human histone genes. Our analysis suggests a number of other human genes that share a high similarity of promoter structure with the histone genes and thus are highly likely to be coregulated, and consequently coexpressed, with the histone genes. We also found that there are a large number of intergenic regions across the genome with their structures similar to promoters of histone genes. These regions may be promoters of yet unidentified genes, or may represent remote control regions that
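
    Sensitivity, specificity and positive predictive value, as quoted in the Results, are all derived from the counts of a confusion matrix. The sketch below shows the standard definitions; the function name and the counts are hypothetical and are not the study's data.

```python
def classification_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Standard confusion-matrix metrics for a binary predictor.

    tp/fn: true promoters predicted as promoter / non-promoter
    tn/fp: non-promoters predicted as non-promoter / promoter
    """
    return {
        "sensitivity": tp / (tp + fn),  # fraction of true promoters recovered
        "specificity": tn / (tn + fp),  # fraction of non-promoters rejected
        "ppv": tp / (tp + fp),          # fraction of positive calls that are correct
    }


# Hypothetical counts, e.g. accumulated over the folds of a leave-one-out run.
print(classification_metrics(tp=8, fp=10, tn=90, fn=2))
```

    In a leave-one-out setting each sequence is predicted by a model trained on all the others, and the four counts are summed over those single-sequence predictions before the ratios are taken.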

  20. Analysis of the structural genes encoding M-factor in the fission yeast Schizosaccharomyces pombe: identification of a third gene, mfm3

    DEFF Research Database (Denmark)

    Kjaerulff, S; Davey, William John; Nielsen, O

    1994-01-01

    We previously identified two genes, mfm1 and mfm2, with the potential to encode the M-factor mating pheromone of the fission yeast Schizosaccharomyces pombe (J. Davey, EMBO J. 11:951-960, 1992), but further analysis revealed that a mutant strain lacking both genes still produced active M-factor. ... that is not rescued by addition of exogenous M-factor. A mutational analysis reveals that all three mfm genes contribute to the production of M-factor. Their transcription is limited to M cells and requires the mat1-Mc and ste11 gene products. Each gene is induced when the cells are starved of nitrogen and further...

  1. Gene-Culture Coevolution in a Social Cetacean: Integrating Acoustic and Genetic Data to Understand Population Structure in the Short-Finned Pilot Whale (Globicephala macrorhynchus)

    Science.gov (United States)

    Van Cise, Amy

    The evolutionary ecology of a species is driven by a combination of random events, ecological and environmental mechanisms, and social behavior. Gene-culture coevolutionary theory attempts to understand the evolutionary trajectory of a species by examining the interactions between these potential drivers. Further, our choice of data type will affect the patterns we observe, therefore by integrating several types of data we achieve a holistic understanding of the various aspects of evolutionary ecology within a species. In order to understand population structure in short-finned pilot whales, I use a combination of genetic and acoustic data to examine structure on evolutionary (genetic) and cultural (acoustic) timescales. I first examine structure among geographic populations in the Pacific Ocean. Using genetic sequences from the mitochondrial control region, I show that two genetically and morphologically distinct types of short-finned pilot whale, described off the coast of Japan, have non-overlapping distributions throughout their range in the Pacific Ocean. Analysis of the acoustic features of their social calls indicates that they are acoustically differentiated, possibly due to limited communication between the two types. This evidence supports the hypothesis that the two types may be separate species or subspecies. Next, I examine structure among island communities and social groups within the Hawaiian Island population of short-finned pilot whales. Using a combination of mitochondrial and nuclear DNA, I showed that the hierarchical social structure in Hawaiian pilot whales is driven by genetic relatedness; individuals remain in groups with their immediate family members, and preferentially associate with relatives. Similarly, social structure affects genetic differentiation, likely by restricting access to mates. Acoustic differentiation among social groups indicates that social structure may also restrict the flow of cultural information, such as vocal

  2. In vitro assembly of a prohead-like structure of the Rhodobacter capsulatus gene transfer agent

    International Nuclear Information System (INIS)

    Spano, Anthony J.; Chen, Frank S.; Goodman, Benjamin E.; Sabat, Agnes E.; Simon, Martha N.; Wall, Joseph S.; Correia, John J.; McIvor, Wilson; Newcomb, William W.; Brown, Jay C.; Schnur, Joel M.; Lebedev, Nikolai

    2007-01-01

    The gene transfer agent (GTA) is a phage-like particle capable of exchanging double-stranded DNA fragments between cells of the photosynthetic bacterium Rhodobacter capsulatus. Here we show that the major capsid protein of GTA, expressed in E. coli, can be assembled into prohead-like structures in the presence of calcium ions in vitro. Transmission electron microscopy (TEM) of uranyl acetate staining material and thin sections of glutaraldehyde-fixed material demonstrates that these associates have spherical structures with diameters in the range of 27-35 nm. The analysis of scanning TEM images revealed particles of mass ∼ 4.3 MDa, representing 101 ± 11 copies of the monomeric subunit. The establishment of this simple and rapid method to form prohead-like particles permits the GTA system to be used for genome manipulation within the photosynthetic bacterium, for specific targeted drug delivery, and for the construction of biologically based distributed autonomous sensors for environmental monitoring

  3. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization.

    Science.gov (United States)

    Seibt, Kathrin M; Wenke, Torsten; Muders, Katja; Truberg, Bernd; Schmidt, Thomas

    2016-05-01

    Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  4. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes

    OpenAIRE

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-01-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix–loop–helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks e...

  5. Analysis of the grape MYB R2R3 subfamily reveals expanded wine quality-related clades and conserved gene structure organization across Vitis and Arabidopsis genomes

    Science.gov (United States)

    Matus, José Tomás; Aquea, Felipe; Arce-Johnson, Patricio

    2008-01-01

    Background The MYB superfamily constitutes the most abundant group of transcription factors described in plants. Members control processes such as epidermal cell differentiation, stomatal aperture, flavonoid synthesis, cold and drought tolerance and pathogen resistance. No genome-wide characterization of this family has been conducted in a woody species such as grapevine. In addition, previous analysis of the recently released grape genome sequence suggested expansion events of several gene families involved in wine quality. Results We describe and classify 108 members of the grape R2R3 MYB gene subfamily in terms of their genomic gene structures and similarity to their putative Arabidopsis thaliana orthologues. Seven gene models were derived and analyzed in terms of gene expression and their DNA binding domain structures. Despite low overall sequence homology in the C-terminus of all proteins, even in those with similar functions across Arabidopsis and Vitis, highly conserved motif sequences and exon lengths were found. The grape epidermal cell fate clade is expanded when compared with the Arabidopsis and rice MYB subfamilies. Two anthocyanin MYBA related clusters were identified in chromosomes 2 and 14, one of which includes the previously described grape colour locus. Tannin related loci were also detected with eight candidate homologues in chromosomes 4, 9 and 11. Conclusion This genome wide transcription factor analysis in Vitis suggests that clade-specific grape R2R3 MYB genes are expanded while other MYB genes could be well conserved compared to Arabidopsis. MYB gene abundance, homology and orientation within particular loci also suggests that expanded MYB clades conferring quality attributes of grapes and wines, such as colour and astringency, could possess redundant, overlapping and cooperative functions. PMID:18647406

  6. Gene expression profiles in skeletal muscle after gene electrotransfer

    DEFF Research Database (Denmark)

    Hojman, Pernille; Zibert, John R; Gissel, Hanne

    2007-01-01

    BACKGROUND: Gene transfer by electroporation (DNA electrotransfer) to muscle results in high level long term transgenic expression, showing great promise for treatment of e.g. protein deficiency syndromes. However little is known about the effects of DNA electrotransfer on muscle fibres. We have...... caused down-regulation of structural proteins e.g. sarcospan and catalytic enzymes. Injection of DNA induced down-regulation of intracellular transport proteins e.g. sentrin. The effects on muscle fibres were transient as the expression profiles 3 weeks after treatment were closely related......) followed by a long low voltage pulse (LV, 100 V/cm, 400 ms); a pulse combination optimised for efficient and safe gene transfer. Muscles were transfected with green fluorescent protein (GFP) and excised at 4 hours, 48 hours or 3 weeks after treatment. RESULTS: Differentially expressed genes were...

  7. Population genetic structure in Sabatieria (Nematoda) reveals intermediary gene flow and admixture between distant cold seeps from the Mediterranean Sea.

    Science.gov (United States)

    De Groote, Annelies; Hauquier, Freija; Vanreusel, Ann; Derycke, Sofie

    2017-07-01

    There is a general lack of information on the dispersal and genetic structuring for populations of small-sized deep-water taxa, including free-living nematodes which inhabit and dominate the seafloor sediments. This is also true for unique and scattered deep-sea habitats such as cold seeps. Given the limited dispersal capacity of marine nematodes, genetic differentiation between such geographically isolated habitat patches is expected to be high. Against this background, we examined genetic variation in both mitochondrial (COI) and nuclear (18S and 28S ribosomal) DNA markers of 333 individuals of the genus Sabatieria, abundantly present in reduced cold-seep sediments. Samples originated from four Eastern Mediterranean cold seeps, separated by hundreds of kilometers, and one seep in the Southeast Atlantic. Individuals from the Mediterranean and Atlantic were divided into two separate but closely-related species clades. Within the Eastern Mediterranean, all specimens belonged to a single species, but with a strong population genetic structure (ΦST = 0.149). The haplotype network of COI contained 19 haplotypes with the most abundant haplotype (52% of the specimens) shared between all four seeps. The number of private haplotypes was high (15), but the number of mutations between haplotypes was low (1-8). These results indicate intermediary gene flow among the Mediterranean Sabatieria populations with no evidence of long-term barriers to gene flow. The presence of shared haplotypes and multiple admixture events indicate that Sabatieria populations from disjunct cold seeps are not completely isolated, with gene flow most likely facilitated through water current transportation of individuals and/or eggs. Genetic structure and molecular diversity indices are comparable to those of epiphytic shallow-water marine nematodes, while no evidence of sympatric cryptic species was found for the cold-seep Sabatieria.

  8. The four hexamerin genes in the honey bee: structure, molecular evolution and function deduced from expression patterns in queens, workers and drones.

    Science.gov (United States)

    Martins, Juliana R; Nunes, Francis M F; Cristino, Alexandre S; Simões, Zilá L P; Bitondi, Márcia M G

    2010-03-26

    Hexamerins are hemocyanin-derived proteins that have lost the ability to bind copper ions and transport oxygen; instead, they became storage proteins. The current study aimed to broaden our knowledge on the hexamerin genes found in the honey bee genome by exploring their structural characteristics, expression profiles, evolution, and functions in the life cycle of workers, drones and queens. The hexamerin genes of the honey bee (hex 70a, hex 70b, hex 70c and hex 110) diverge considerably in structure, so that the overall amino acid identity shared among their deduced protein subunits varies from 30 to 42%. Bioinformatics search for motifs in the respective upstream control regions (UCRs) revealed six overrepresented motifs including a potential binding site for Ultraspiracle (Usp), a target of juvenile hormone (JH). The expression of these genes was induced by topical application of JH on worker larvae. The four genes are highly transcribed by the larval fat body, although with significant differences in transcript levels, but only hex 110 and hex 70a are re-induced in the adult fat body in a caste- and sex-specific fashion, workers showing the highest expression. Transcripts for hex 110, hex 70a and hex 70b were detected in developing ovaries and testes, and hex 110 was highly transcribed in the ovaries of egg-laying queens. A phylogenetic analysis revealed that HEX 110 is located at the most basal position among the holometabola hexamerins, and like HEX 70a and HEX 70c, it shares potential orthology relationship with hexamerins from other hymenopteran species. Striking differences were found in the structure and developmental expression of the four hexamerin genes in the honey bee. The presence of a potential binding site for Usp in the respective 5' UCRs, and the results of experiments on JH level manipulation in vivo support the hypothesis of regulation by JH. Transcript levels and patterns in the fat body and gonads suggest that, in addition to their primary

  9. Combinatorial explosion in model gene networks

    Science.gov (United States)

    Edwards, R.; Glass, L.

    2000-09-01

    The explosive growth in knowledge of the genome of humans and other organisms leaves open the question of how the functioning of genes in interacting networks is coordinated for orderly activity. One approach to this problem is to study mathematical properties of abstract network models that capture the logical structures of gene networks. The principal issue is to understand how particular patterns of activity can result from particular network structures, and what types of behavior are possible. We study idealized models in which the logical structure of the network is explicitly represented by Boolean functions that can be represented by directed graphs on n-cubes, but which are continuous in time and described by differential equations, rather than being updated synchronously via a discrete clock. The equations are piecewise linear, which allows significant analysis and facilitates rapid integration along trajectories. We first give a combinatorial solution to the question of how many distinct logical structures exist for n-dimensional networks, showing that the number increases very rapidly with n. We then outline analytic methods that can be used to establish the existence, stability and periods of periodic orbits corresponding to particular cycles on the n-cube. We use these methods to confirm the existence of limit cycles discovered in a sample of a million randomly generated structures of networks of 4 genes. Even with only 4 genes, at least several hundred different patterns of stable periodic behavior are possible, many of them surprisingly complex. We discuss ways of further classifying these periodic behaviors, showing that small mutations (reversal of one or a few edges on the n-cube) need not destroy the stability of a limit cycle. Although these networks are very simple as models of gene networks, their mathematical transparency reveals relationships between structure and behavior, they suggest that the possibilities for orderly dynamics in such
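
    The model class described in this abstract (Boolean logic on the n-cube, continuous time, piecewise-linear equations) can be illustrated with a minimal Python sketch. The specific form used here is an assumption, not taken from the abstract: each variable obeys dx_i/dt = -x_i + F_i(X), thresholds sit at zero, focal values are +1 or -1, and the 3-gene cyclic repression loop in `focal_point` is a hypothetical example network. Because the equations are linear inside each orthant, the time to the next threshold crossing has a closed form, so the trajectory can be advanced switch by switch without a numerical ODE solver.

```python
import math


def focal_point(state):
    """Boolean logic of a hypothetical 3-gene cyclic repression loop.

    state[i] is True when gene i is above its threshold (x_i > 0).
    Returns the focal point (+1.0 or -1.0 per gene) that the flow
    approaches while the system stays in the current orthant.
    """
    g1_on, g2_on, g3_on = state
    return (
        -1.0 if g3_on else 1.0,  # gene 1 is repressed by gene 3
        -1.0 if g1_on else 1.0,  # gene 2 is repressed by gene 1
        -1.0 if g2_on else 1.0,  # gene 3 is repressed by gene 2
    )


def simulate(x0, n_switches):
    """Integrate dx_i/dt = -x_i + F_i(X) exactly from one switch to the next.

    Returns the list of Boolean states (orthants) visited and the total
    elapsed time. Assumes no coordinate of x0 is exactly zero.
    """
    x = list(x0)
    state = [xi > 0 for xi in x]
    visited = [tuple(state)]
    elapsed = 0.0
    for _ in range(n_switches):
        f = focal_point(state)
        # Inside an orthant x_i(t) = f_i + (x_i - f_i) * exp(-t), so x_i
        # reaches 0 at t = ln(1 - x_i / f_i), but only if the focal point
        # lies on the opposite side of the threshold from the current state.
        times = {
            i: math.log(1.0 - x[i] / f[i])
            for i in range(len(x))
            if (f[i] > 0) != state[i]
        }
        if not times:  # focal point is inside this orthant: no more switches
            break
        k = min(times, key=times.get)  # first variable to hit its threshold
        t = times[k]
        decay = math.exp(-t)
        x = [f[i] + (x[i] - f[i]) * decay for i in range(len(x))]
        x[k] = 0.0  # remove round-off on the crossing variable
        state[k] = not state[k]
        elapsed += t
        visited.append(tuple(state))
    return visited, elapsed


if __name__ == "__main__":
    orthants, total_time = simulate((0.5, -0.3, -0.8), 12)
    for s in orthants:
        print("".join("1" if on else "0" for on in s))
    print("elapsed time:", round(total_time, 3))
```

    For this hypothetical loop, each switch flips a single coordinate, so the trajectory traces a cycle along the edges of the 3-cube. Classifying periodic behavior by such cycles on the n-cube is the kind of analysis the abstract refers to.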

  10. Multi-target parallel processing approach for gene-to-structure determination of the influenza polymerase PB2 subunit.

    Science.gov (United States)

    Armour, Brianna L; Barnes, Steve R; Moen, Spencer O; Smith, Eric; Raymond, Amy C; Fairman, James W; Stewart, Lance J; Staker, Bart L; Begley, Darren W; Edwards, Thomas E; Lorimer, Donald D

    2013-06-28

    Pandemic outbreaks of highly virulent influenza strains can cause widespread morbidity and mortality in human populations worldwide. In the United States alone, an average of 41,400 deaths and 1.86 million hospitalizations are caused by influenza virus infection each year (1). Point mutations in the polymerase basic protein 2 subunit (PB2) have been linked to the adaptation of the viral infection in humans (2). Findings from such studies have revealed the biological significance of PB2 as a virulence factor, thus highlighting its potential as an antiviral drug target. The structural genomics program put forth by the National Institute of Allergy and Infectious Disease (NIAID) provides funding to Emerald Bio and three other Pacific Northwest institutions that together make up the Seattle Structural Genomics Center for Infectious Disease (SSGCID). The SSGCID is dedicated to providing the scientific community with three-dimensional protein structures of NIAID category A-C pathogens. Making such structural information available to the scientific community serves to accelerate structure-based drug design. Structure-based drug design plays an important role in drug development. Pursuing multiple targets in parallel greatly increases the chance of success for new lead discovery by targeting a pathway or an entire protein family. Emerald Bio has developed a high-throughput, multi-target parallel processing pipeline (MTPP) for gene-to-structure determination to support the consortium. Here we describe the protocols used to determine the structure of the PB2 subunit from four different influenza A strains.

  11. Structural defects and variations in the HIV-1 nef gene from rapid, slow and non-progressor children.

    Science.gov (United States)

    Casartelli, Nicoletta; Di Matteo, Gigliola; Argentini, Claudio; Cancrini, Caterina; Bernardi, Stefania; Castelli, Guido; Scarlatti, Gabriella; Plebani, Anna; Rossi, Paolo; Doria, Margherita

    2003-06-13

    Evaluation of sequence evolution as well as structural defects and mutations of the human immunodeficiency virus-type 1 (HIV-1) nef gene in relation to disease progression in infected children. We examined a large number of nef alleles sequentially derived from perinatally HIV-1-infected children with different rates of disease progression: six non-progressors (NPs), four rapid progressors (RPs), and three slow progressors (SPs). Nef alleles (182 total) were isolated from patients' peripheral blood mononuclear cells (PBMCs), sequenced and analysed for their evolutionary pattern, frequency of mutations and occurrence of amino acid variations associated with different stages of disease. The evolution rate of the nef gene apparently correlated with CD4+ decline in all progression groups. Evidence for rapid viral turnover and positive selection for changes were found only in two SPs and two RPs respectively. In NPs, a higher proportion of disrupted sequences and mutations at various functional motifs were observed. Furthermore, NP-derived Nef proteins were often changed at residues localized in the folded core domain at cytotoxic T lymphocytes (CTL) epitopes (E(105), K(106), E(110), Y(132), K(164), and R(200)), while other residues outside the core domain are more often changed in RPs (A(43)) and SPs (N(173) and Y(214)). Our results suggest a link between nef gene functions and the progression rate in HIV-1-infected children. Moreover, non-progressor-associated variations in the core domain of Nef, together with the genetic analysis, suggest that nef gene evolution is shaped by an effective immune system in these patients.

  12. Functions, structure, and read-through alternative splicing of feline APOBEC3 genes

    Science.gov (United States)

    Münk, Carsten; Beck, Thomas; Zielonka, Jörg; Hotz-Wagenblatt, Agnes; Chareza, Sarah; Battenberg, Marion; Thielebein, Jens; Cichutek, Klaus; Bravo, Ignacio G; O'Brien, Stephen J; Lochelt, Martin; Yuhki, Naoya

    2008-01-01

    Background: Over the past years a variety of host restriction genes have been identified in humans and other mammals that modulate retrovirus infectivity, replication, assembly, and/or cross-species transmission. Among these host-encoded restriction factors, the APOBEC3 (A3; apolipoprotein B mRNA-editing catalytic polypeptide 3) proteins are potent inhibitors of retroviruses and retrotransposons. While primates encode seven of these genes (A3A to A3H), rodents carry only a single A3 gene. Results: Here we identified and characterized several A3 genes in the genome of domestic cat (Felis catus) by analyzing the genomic A3 locus. The cat genome presents one A3H gene and three very similar A3C genes (a-c), probably generated after two consecutive gene duplications. In addition to these four one-domain A3 proteins, a fifth A3, designated A3CH, is expressed by read-through alternative splicing. Specific feline A3 proteins selectively inactivated only defined genera of feline retroviruses: Bet-deficient feline foamy virus was mainly inactivated by feA3Ca, feA3Cb, and feA3Cc, while feA3H and feA3CH were only weakly active. The infectivity of Vif-deficient feline immunodeficiency virus and feline leukemia virus was reduced only by feA3H and feA3CH, but not by any of the feA3Cs. Within Felidae, A3C sequences show significant adaptive selection, but unexpectedly, the A3H sequences present more sites that are under purifying selection. Conclusion: Our data support a complex evolutionary history of expansion, divergence, selection and individual extinction of antiviral A3 genes that parallels the early evolution of Placentalia, becoming more intricate in taxa in which the arms race between host and retroviruses is harsher. PMID:18315870

  13. Effects of the nanotopographic surface structure of commercially pure titanium following anodization–hydrothermal treatment on gene expression and adhesion in gingival epithelial cells

    International Nuclear Information System (INIS)

    Takebe, J.; Miyata, K.; Miura, S.; Ito, S.

    2014-01-01

    The long-term stability and maintenance of endosseous implants with anodized–hydrothermally treated commercially pure titanium surfaces and a nanotopographic structure (SA-treated c.p.Ti) depend on the barrier function provided by the interface between the transmucosal portion of the implant surface and the peri-implant epithelium. This study investigated the effects of extracellular and intracellular gene expression in adherent gingival epithelial cells cultured for 1–7 days on SA-treated c.p.Ti implant surfaces compared to anodic oxide (AO) c.p.Ti and c.p.Ti disks. Scanning electron microscopy (SEM) showed filopodium-like extensions bound closely to the nanotopographic structure of SA-treated c.p.Ti at day 7 of culture. Gene expressions of focal adhesion kinase, integrin-α6β4, and laminin-5 (α3, β3, γ2) were significantly higher on SA-treated c.p.Ti than on c.p.Ti or AO c.p.Ti after 7 days (P < 0.05). Our results confirmed that gingival epithelial cells adhere to SA-treated c.p.Ti as the transmucosal portion of an implant, and that this interaction markedly improves expression of focal adhesion molecules and enhances the epithelial cell phenotype. The cellular gene expression responses driving extracellular and intracellular molecular interactions thus play an important role in maintenance at the interface between SA-treated c.p.Ti implant surfaces and the gingival epithelial cells. - Highlights: • SA-treated Ti provides a nanotopographic structure for clinical oral implants. • This could regulate integrin-mediated epithelial cell adhesion and gene expression. • FAK mRNA was significantly higher on SA-treated Ti. • Integrin-α6β4 and laminin-5 mRNA were significantly higher on SA-treated Ti. • Extracellular/intracellular molecular interactions play a key role on SA-treated Ti

  14. Comparisons of Copy Number, Genomic Structure, and Conserved Motifs for α-Amylase Genes from Barley, Rice, and Wheat

    Directory of Open Access Journals (Sweden)

    Qisen Zhang

    2017-10-01

    Barley is an important crop for the production of malt and beer. However, crops such as rice and wheat are rarely used for malting. α-amylase is the key enzyme that degrades starch during malting. In this study, we compared the genomic properties, gene copies, and conserved promoter motifs of α-amylase genes in barley, rice, and wheat. In all three crops, α-amylase consists of four subfamilies designated amy1, amy2, amy3, and amy4. In wheat and barley, members of amy1 and amy2 genes are localized on chromosomes 6 and 7, respectively. In rice, members of amy1 genes are found on chromosomes 1 and 2, and amy2 genes on chromosome 6. The barley genome has six amy1 members and three amy2 members. The wheat B genome contains four amy1 members and three amy2 members, while the rice genome has three amy1 members and one amy2 member. The B genome has mostly amy1 and amy2 members among the three wheat genomes. Amy1 promoters from all three crop genomes contain a GA-responsive complex consisting of a GA-responsive element (CAATAAA), pyrimidine box (CCTTTT) and TATCCAT/C box. This study has shown that amy1 and amy2 from both wheat and barley have similar genomic properties, including exon/intron structures and GA-responsive elements on promoters, but these differ in rice. Like barley, wheat should have sufficient amy activity to degrade starch completely during malting. Other factors, such as high protein with haze issues and the lack of husk causing lautering difficulty, may limit the use of wheat for brewing.
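
    As a rough illustration of how the promoter motifs named in this abstract could be located in a sequence, the following Python sketch scans both strands with regular expressions. The motif strings are copied from the abstract; the function names and the toy promoter sequence are hypothetical, and the abstract does not say which tool or procedure the study itself used.

```python
import re

# Motif sequences as written in the abstract; "TATCCAT/C" is read here as
# TATCCA followed by either T or C.
MOTIFS = {
    "GA-responsive element": "CAATAAA",
    "pyrimidine box": "CCTTTT",
    "TATCCAT/C box": "TATCCA[TC]",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def scan_promoter(seq):
    """List (motif name, strand, start, matched text) for every motif hit.

    Start positions are 0-based. Hits on the '-' strand are reported in
    the coordinates of the reverse-complemented sequence.
    """
    seq = seq.upper()
    hits = []
    for strand, text in (("+", seq), ("-", reverse_complement(seq))):
        for name, pattern in MOTIFS.items():
            # A lookahead with a capture group also reports overlapping hits.
            for match in re.finditer(f"(?=({pattern}))", text):
                hits.append((name, strand, match.start(), match.group(1)))
    return hits


if __name__ == "__main__":
    toy_promoter = "GGCAATAAACGCCTTTTGATATCCACAA"  # made-up sequence, not real data
    for hit in scan_promoter(toy_promoter):
        print(hit)
```

    Exact string matching is the simplest choice for short, fixed motifs like these. A position weight matrix would be the usual alternative when the motif tolerates more variation.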

  15. alpha-Globin genes: thalassemic and structural alterations in a Brazilian population

    Directory of Open Access Journals (Sweden)

    M.R.S.C. Wenning

    2000-09-01

    Seven unrelated patients with hemoglobin (Hb) H disease and 27 individuals with alpha-chain structural alterations were studied to identify the alpha-globin gene mutations present in the population of Southeast Brazil. The -alpha3.7, --MED and -(alpha)20.5 deletions were investigated by PCR, whereas non-deletional alpha-thalassemia (alphaHphalpha, alphaNcoIalpha, aaNcoI, alphaIcalpha and alphaTSaudialpha) was screened with restriction enzymes and by nested PCR. Structural alterations were identified by direct DNA sequencing. Of the seven patients with Hb H disease, all of Italian descent, two had the -(alpha)20.5/-alpha3.7 genotype, one had the --MED/-alpha3.7 genotype, one had the --MED/alphaHphalpha genotype and three showed interaction of the -alpha3.7 deletion with an unusual, unidentified form of non-deletional alpha-thalassemia [-alpha3.7/(aa)T]. Among the 27 patients with structural alterations, 15 (of Italian descent) had Hb Hasharon (alpha47Asp->His) associated with the -alpha3.7 deletion, 4 (of Italian descent) were heterozygous for Hb J-Rovigo (alpha53Ala->Asp), 4 (3 Blacks and 1 Caucasian) were heterozygous for Hb Stanleyville-II (alpha78Asn->Lys) associated with the alpha+-thalassemia, 1 (Black) was heterozygous for Hb G-Pest (alpha74Asp->Asn), 1 (Caucasian) was heterozygous for Hb Kurosaki (alpha7Lys->Glu), 1 (Caucasian) was heterozygous for Hb Westmead (alpha122His->Gln), and 1 (Caucasian) was the carrier of a novel silent variant (Hb Campinas, alpha26Ala->Val). Most of the mutations found reflected the Mediterranean and African origins of the population. Hbs G-Pest and Kurosaki, very rare, and Hb Westmead, common in southern China, were initially described in individuals of ethnic origin differing from those of the carriers reported in the present study and are the first cases to be reported in the Brazilian population.

  16. Activity-regulated genes as mediators of neural circuit plasticity.

    Science.gov (United States)

    Leslie, Jennifer H; Nedivi, Elly

    2011-08-01

    Modifications of neuronal circuits allow the brain to adapt and change with experience. This plasticity manifests during development and throughout life, and can be remarkably long lasting. Evidence has linked activity-regulated gene expression to the long-term structural and electrophysiological adaptations that take place during developmental critical periods, learning and memory, and alterations to sensory map representations in the adult. In all these cases, the cellular response to neuronal activity integrates multiple tightly coordinated mechanisms to precisely orchestrate long-lasting, functional and structural changes in brain circuits. Experience-dependent plasticity is triggered when neuronal excitation activates cellular signaling pathways from the synapse to the nucleus that initiate new programs of gene expression. The protein products of activity-regulated genes then work via a diverse array of cellular mechanisms to modify neuronal functional properties. Synaptic strengthening or weakening can reweight existing circuit connections, while structural changes including synapse addition and elimination create new connections. Posttranscriptional regulatory mechanisms, often also dependent on activity, further modulate activity-regulated gene transcript and protein function. Thus, activity-regulated genes implement varied forms of structural and functional plasticity to fine-tune brain circuit wiring. Copyright © 2011 Elsevier Ltd. All rights reserved.

  17. Alu Elements as Novel Regulators of Gene Expression in Type 1 Diabetes Susceptibility Genes?

    Science.gov (United States)

    Kaur, Simranjeet; Pociot, Flemming

    2015-07-13

    Despite numerous studies implicating Alu repeat elements in various diseases, there is sparse information available with respect to the potential functional and biological roles of the repeat elements in Type 1 diabetes (T1D). Therefore, we performed a genome-wide sequence analysis of T1D candidate genes to identify embedded Alu elements within these genes. We observed significant enrichment of Alu elements within the T1D genes (p-value genes harboring Alus revealed significant enrichment for immune-mediated processes (p-value genes harboring inverted Alus (IRAlus) within their 3' untranslated regions (UTRs) that are known to regulate the expression of host mRNAs by generating double stranded RNA duplexes. Our in silico analysis predicted the formation of duplex structures by IRAlus within the 3'UTRs of T1D genes. We propose that IRAlus might be involved in regulating the expression levels of the host T1D genes.

  18. Genome-Wide Identification of the Alba Gene Family in Plants and Stress-Responsive Expression of the Rice Alba Genes.

    Science.gov (United States)

    Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan

    2018-03-28

    Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa, Zea mays, Sorghum bicolor, Cicer arietinum, and Vitis vinifera, and in the model plant Arabidopsis thaliana along with evolutionarily relevant species such as Chlamydomonas reinhardtii, Physcomitrella patens, and Amborella trichopoda, revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice (OsAlba) showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure-function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants.

  19. Detecting population structure in a high gene-flow species, Atlantic herring (Clupea harengus): direct, simultaneous evaluation of neutral vs putatively selected loci

    DEFF Research Database (Denmark)

    André, C.; Larsson, L. C.; Laikre, L.

    2010-01-01

    In many marine fish species, genetic population structure is typically weak because populations are large, evolutionarily young and have a high potential for gene flow. We tested whether genetic markers influenced by natural selection are more efficient than the presumed neutral genetic markers t...

  20. Homology-dependent Gene Silencing in Paramecium

    Science.gov (United States)

    Ruiz, Françoise; Vayssié, Laurence; Klotz, Catherine; Sperling, Linda; Madeddu, Luisa

    1998-01-01

    Microinjection at high copy number of plasmids containing only the coding region of a gene into the Paramecium somatic macronucleus led to a marked reduction in the expression of the corresponding endogenous gene(s). The silencing effect, which is stably maintained throughout vegetative growth, has been observed for all Paramecium genes examined so far: a single-copy gene (ND7), as well as members of multigene families (centrin genes and trichocyst matrix protein genes) in which all closely related paralogous genes appeared to be affected. This phenomenon may be related to posttranscriptional gene silencing in transgenic plants and quelling in Neurospora and allows the efficient creation of specific mutant phenotypes thus providing a potentially powerful tool to study gene function in Paramecium. For the two multigene families that encode proteins that coassemble to build up complex subcellular structures the analysis presented herein provides the first experimental evidence that the members of these gene families are not functionally redundant. PMID:9529389

  1. Examining the process of de novo gene birth: an educational primer on "integration of new genes into cellular networks, and their structural maturation".

    Science.gov (United States)

    Frietze, Seth; Leatherman, Judith

    2014-03-01

    New genes that arise from modification of the noncoding portion of a genome rather than being duplicated from parent genes are called de novo genes. These genes, identified by their brief evolution and lack of parent genes, provide an opportunity to study the timeframe in which emerging genes integrate into cellular networks, and how the characteristics of these genes change as they mature into bona fide genes. An article by G. Abrusán provides an opportunity to introduce students to fundamental concepts in evolutionary and comparative genetics and to provide a technical background by which to discuss systems biology approaches when studying the evolutionary process of gene birth. Basic background needed to understand the Abrusán study and details on comparative genomic concepts tailored for a classroom discussion are provided, including discussion questions and a supplemental exercise on navigating a genome database.

  2. Global gene expression in Escherichia coli biofilms

    DEFF Research Database (Denmark)

    Schembri, Mark; Kjærgaard, K.; Klemm, Per

    2003-01-01

    It is now apparent that microorganisms undergo significant changes during the transition from planktonic to biofilm growth. These changes result in phenotypic adaptations that allow the formation of highly organized and structured sessile communities, which possess enhanced resistance to antimicr...... the transition to biofilm growth, and these included genes expressed under oxygen-limiting conditions, genes encoding (putative) transport proteins, putative oxidoreductases and genes associated with enhanced heavy metal resistance. Of particular interest was the observation that many of the genes altered...... in expression have no current defined function. These genes, as well as those induced by stresses relevant to biofilm growth such as oxygen and nutrient limitation, may be important factors that trigger enhanced resistance mechanisms of sessile communities to antibiotics and hydrodynamic shear forces....

  3. Analysis of the grape MYB R2R3 subfamily reveals expanded wine quality-related clades and conserved gene structure organization across Vitis and Arabidopsis genomes

    Directory of Open Access Journals (Sweden)

    Arce-Johnson Patricio

    2008-07-01

    Background: The MYB superfamily constitutes the most abundant group of transcription factors described in plants. Members control processes such as epidermal cell differentiation, stomatal aperture, flavonoid synthesis, cold and drought tolerance and pathogen resistance. No genome-wide characterization of this family has been conducted in a woody species such as grapevine. In addition, previous analysis of the recently released grape genome sequence suggested expansion events of several gene families involved in wine quality. Results: We describe and classify 108 members of the grape R2R3 MYB gene subfamily in terms of their genomic gene structures and similarity to their putative Arabidopsis thaliana orthologues. Seven gene models were derived and analyzed in terms of gene expression and their DNA binding domain structures. Despite low overall sequence homology in the C-terminus of all proteins, even in those with similar functions across Arabidopsis and Vitis, highly conserved motif sequences and exon lengths were found. The grape epidermal cell fate clade is expanded when compared with the Arabidopsis and rice MYB subfamilies. Two anthocyanin MYBA related clusters were identified in chromosomes 2 and 14, one of which includes the previously described grape colour locus. Tannin related loci were also detected with eight candidate homologues in chromosomes 4, 9 and 11. Conclusion: This genome-wide transcription factor analysis in Vitis suggests that clade-specific grape R2R3 MYB genes are expanded while other MYB genes could be well conserved compared to Arabidopsis. MYB gene abundance, homology and orientation within particular loci also suggest that expanded MYB clades conferring quality attributes of grapes and wines, such as colour and astringency, could possess redundant, overlapping and cooperative functions.

  4. Functional Analysis of an ATP-Binding Cassette Transporter Gene in Botrytis cinerea by Gene Disruption

    OpenAIRE

    Masami, NAKAJIMA; Junko, SUZUKI; Takehiko, HOSAKA; Tadaaki, HIBI; Katsumi, AKUTSU; School of Agriculture, Ibaraki University; School of Agriculture, Ibaraki University; School of Agriculture, Ibaraki University; Department of Agriculture and Environmental Biology, The University of Tokyo; School of Agriculture, Ibaraki University

    2001-01-01

    The BMR1 gene encoding an ABC transporter was cloned from Botrytis cinerea. To examine the function of BMR1 in B. cinerea, we isolated BMR1-deficient mutants after gene disruption. Disruption vector pBcDF4 was constructed by replacing the BMR1-coding region with a hygromycin B phosphotransferase gene (hph) cassette. The BMR1 disruptants had an increased sensitivity to polyoxin and iprobenfos. Polyoxin and iprobenfos, structurally unrelated compounds, may therefore be substrates of BMR1.

  5. In Silico Screening and Molecular Dynamics Simulation of Disease-Associated nsSNP in TYRP1 Gene and Its Structural Consequences in OCA3

    Directory of Open Access Journals (Sweden)

    Balu Kamaraj

    2013-01-01

    Full Text Available Oculocutaneous albinism type III (OCA3), caused by mutations of the TYRP1 gene, is an autosomal recessive disorder characterized by reduced biosynthesis of melanin pigment in the hair, skin, and eyes. The TYRP1 gene encodes a protein called tyrosinase-related protein-1 (Tyrp1). Tyrp1 is involved in maintaining the stability of tyrosinase protein and modulating its catalytic activity in eumelanin synthesis. Tyrp1 is also involved in maintenance of melanosome structure and affects melanocyte proliferation and cell death. In this work we implemented computational analysis to filter the most probable mutations that might be associated with OCA3. We found R326H and R356Q to be the most deleterious and disease-associated by using the PolyPhen 2.0, SIFT, PANTHER, I-mutant 3.0, PhD-SNP, SNP&GO, Pmut, and Mutpred tools. To understand the atomic arrangement in 3D space, the native and mutant (R326H and R356Q) structures were modelled. Finally, the native and mutant Tyrp1 proteins were analyzed structurally using a molecular dynamics simulation (MDS) approach. MDS results showed more flexibility in the native Tyrp1 structure. Owing to the mutations, the Tyrp1 protein became more rigid, which might disturb the structural conformation and catalytic function of the structure and might also play a significant role in inducing OCA3. The results obtained from this study would facilitate wet-lab research to develop potent drug therapies against OCA3.

  6. Evolutionary dynamics of human autoimmune disease genes and malfunctioned immunological genes

    Directory of Open Access Journals (Sweden)

    Podder Soumita

    2012-01-01

    Full Text Available Abstract. Background: One of the main issues of molecular evolution is to divulge the principles in dictating the evolutionary rate differences among various gene classes. Immunological genes have received considerable attention in evolutionary biology as candidates for local adaptation and for studying functionally important polymorphisms. The normal structure and function of immunological genes will be distorted when they experience mutations leading to immunological dysfunctions. Results: Here, we examined the fundamental differences between the genes which on mutation give rise to autoimmune or other immune system related diseases and the immunological genes that do not cause any disease phenotypes. Although the disease genes examined are analogous to non-disease genes in product, expression, function, and pathway affiliation, a statistically significant decrease in evolutionary rate has been found in autoimmune disease genes relative to all other immune related diseases and non-disease genes. Possible ways of accumulation of mutation in the three steps of the central dogma (DNA-mRNA-Protein) have been studied to trace the mutational effects predisposed to disease consequence and acquiring higher selection pressure. Principal Component Analysis and Multivariate Regression Analysis have established the predominant role of single nucleotide polymorphisms in guiding the evolutionary rate of immunological disease and non-disease genes followed by m-RNA abundance, paralogs number, fraction of phosphorylation residue, alternatively spliced exon, protein residue burial and protein disorder. Conclusions: Our study provides an empirical insight into the etiology of autoimmune disease genes and other immunological diseases. The immediate utility of our study is to help in disease gene identification and may also help in medicinal improvement of immune related disease.

  7. Systematic review, structural analysis, and new theoretical perspectives on the role of serotonin and associated genes in the etiology of psychopathy and sociopathy

    NARCIS (Netherlands)

    Yildirim, B.O.; Derksen, J.J.L.

    2013-01-01

    Since its theoretical inception, psychopathy has been considered by philosophers, clinicians, theorists, and empirical researchers to be substantially and critically explained by genetic factors. In this systematic review and structural analysis, new hypotheses will be introduced regarding gene–gene

  8. Extensive gene content variation in the Brachypodium distachyon pan-genome correlates with population structure.

    Science.gov (United States)

    Gordon, Sean P; Contreras-Moreira, Bruno; Woods, Daniel P; Des Marais, David L; Burgess, Diane; Shu, Shengqiang; Stritt, Christoph; Roulin, Anne C; Schackwitz, Wendy; Tyler, Ludmila; Martin, Joel; Lipzen, Anna; Dochy, Niklas; Phillips, Jeremy; Barry, Kerrie; Geuten, Koen; Budak, Hikmet; Juenger, Thomas E; Amasino, Richard; Caicedo, Ana L; Goodstein, David; Davidson, Patrick; Mur, Luis A J; Figueroa, Melania; Freeling, Michael; Catalan, Pilar; Vogel, John P

    2017-12-19

    While prokaryotic pan-genomes have been shown to contain many more genes than any individual organism, the prevalence and functional significance of differentially present genes in eukaryotes remains poorly understood. Whole-genome de novo assembly and annotation of 54 lines of the grass Brachypodium distachyon yield a pan-genome containing nearly twice the number of genes found in any individual genome. Genes present in all lines are enriched for essential biological functions, while genes present in only some lines are enriched for conditionally beneficial functions (e.g., defense and development), display faster evolutionary rates, lie closer to transposable elements and are less likely to be syntenic with orthologous genes in other grasses. Our data suggest that differentially present genes contribute substantially to phenotypic variation within a eukaryote species, these genes have a major influence in population genetics, and transposable elements play a key role in pan-genome evolution.

  9. Crystal structure of Aquifex aeolicus gene product Aq1627: a putative phosphoglucosamine mutase reveals a unique C-terminal end-to-end disulfide linkage.

    Science.gov (United States)

    Sridharan, Upasana; Kuramitsu, Seiki; Yokoyama, Shigeyuki; Kumarevel, Thirumananseri; Ponnuraj, Karthe

    2017-06-27

    The Aq1627 gene from Aquifex aeolicus, a hyperthermophilic bacterium has been cloned and overexpressed in Escherichia coli. The protein was purified to homogeneity and its X-ray crystal structure was determined to 1.3 Å resolution using multiple wavelength anomalous dispersion phasing. The structural and sequence analysis of Aq1627 is suggestive of a putative phosphoglucosamine mutase. The structural features of Aq1627 further indicate that it could belong to a new subclass of the phosphoglucosamine mutase family. Aq1627 structure contains a unique C-terminal end-to-end disulfide bond, which links two monomers and this structural information can be used in protein engineering to make proteins more stable in different applications.

  10. Alterations of pancreatic islet structure, metabolism and gene expression in diet-induced obese C57BL/6J mice.

    Directory of Open Access Journals (Sweden)

    Regan Roat

    Full Text Available The reduction of functional β cell mass is a key feature of type 2 diabetes. Here, we studied metabolic functions and islet gene expression profiles of C57BL/6J mice with a naturally occurring nicotinamide nucleotide transhydrogenase (NNT) deletion mutation, a widely used model of diet-induced obesity and diabetes. On high fat diet (HF), the mice developed obesity and hyperinsulinemia, while blood glucose levels were only mildly elevated, indicating a substantial capacity to compensate for insulin resistance. The basal serum insulin levels were elevated in HF mice, but insulin secretion in response to glucose load was significantly blunted. Hyperinsulinemia in HF fed mice was associated with an increase in islet mass and size along with higher BrdU incorporation into β cells. The temporal profiles of glucose-stimulated insulin secretion (GSIS) of isolated islets were comparable in HF and normal chow fed mice. Islets isolated from HF fed mice had elevated basal oxygen consumption per islet but failed to increase oxygen consumption further in response to glucose or carbonyl cyanide-4-trifluoromethoxyphenylhydrazone (FCCP). To obtain an unbiased assessment of metabolic pathways in islets, we performed microarray analysis comparing gene expression in islets from HF to normal chow-fed mice. A few genes, for example, those genes involved in the protection against oxidative stress (hypoxia upregulated protein 1) and Pgc1α, were up-regulated in HF islets. In contrast, several genes in extracellular matrix and other pathways were suppressed in HF islets. These results indicate that islets from C57BL/6J mice with NNT deletion mutation develop structural, metabolic and gene expression features consistent with compensation and decompensation in response to HF diet.

  11. Gene structure of the pregnancy-associated glycoprotein-like (PAG-L) in the Eurasian beaver (Castor fiber L.).

    Science.gov (United States)

    Lipka, Aleksandra; Majewska, Marta; Panasiewicz, Grzegorz; Bieniek-Kobuszewska, Martyna; Szafranska, Bozena

    2017-09-01

    The pregnancy-associated glycoprotein-like family (PAG-L) is a large group of chorionic products, expressed in the pre-placental trophoblast and later in the post-implantational chorionic epithelium, and is involved in proper placenta development and embryo-maternal interaction in eutherians. This study describes identification of the PAG-L family in the genome of the Eurasian beaver (Castor fiber L.), named CfPAG-L. We identified 7657 bp of the CfPAG-L gDNA sequence (Acc. No. KX377932), encompassing nine exons (1-9) and eight introns (A-H). The length of the CfPAG-L exons (59-200 bp) was similar to that of the only known counterparts, bPAG1, bPAG2, and pPAG2. The length of the CfPAG-L introns ranged from 288 to 1937 bp and was completely different from previously known PAG introns. The exonic CfPAG-L regions revealed 50.3-72.9% homology with equivalent segments of bPAG1 and pPAG2 structure. Alignments of the intronic CfPAG-L regions revealed a lack of homology. Within the entire CfPAG-L gene, 31 potential single nucleotide variants (SNV: 7 transversions and 24 transitions) were predicted. The identified exonic polymorphic loci did not affect the amino acid sequence of the CfPAG-L polypeptide precursor. This is the first report describing the CfPAG-L gene sequence, structural organization, and SNVs in the Eurasian beaver, one of the largest rodents.
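
    The abstract above reports its SNVs split into transitions and transversions. As a minimal sketch of how that split is defined (a transition swaps purine for purine or pyrimidine for pyrimidine; anything else is a transversion), the following may help. The function names `classify_snv` and `count_snvs` and the example alleles are hypothetical illustrations, not taken from the study.

```python
# Illustrative sketch only: names and example data are assumptions,
# not the method or data of the cited study.
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_snv(ref, alt):
    """Return 'transition' or 'transversion' for a single-base change."""
    ref, alt = ref.upper(), alt.upper()
    # Reject identical alleles and anything that is not a single DNA base.
    if ref == alt or {ref, alt} - (PURINES | PYRIMIDINES):
        raise ValueError(f"not a valid SNV: {ref!r} -> {alt!r}")
    # Both bases in the same chemical class means a transition.
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def count_snvs(pairs):
    """Tally transitions and transversions over (ref, alt) pairs."""
    counts = {"transition": 0, "transversion": 0}
    for ref, alt in pairs:
        counts[classify_snv(ref, alt)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical alleles, chosen only to exercise both classes.
    print(count_snvs([("A", "G"), ("C", "T"), ("A", "C"), ("G", "T")]))
```

    Because each base has one transition partner and two transversion partners, an excess of transitions such as the 24 to 7 reported above is not what uniform random substitution would give.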

  12. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures

    Directory of Open Access Journals (Sweden)

    Hayes Michael L

    2012-05-01

    Full Text Available Abstract. Background: Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. Results: All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3' UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. Conclusion: PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative

  13. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures.

    Science.gov (United States)

    Hayes, Michael L; Giang, Karolyn; Mulligan, R Michael

    2012-05-14

    Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3' UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative selection even in the absence of an editing site target

  14. The duplicated genes database: identification and functional annotation of co-localised duplicated genes across genomes.

    Directory of Open Access Journals (Sweden)

    Marion Ouedraogo

    Full Text Available BACKGROUND: There has been a surge in studies linking genome structure and gene expression, with special focus on duplicated genes. Although initially duplicated from the same sequence, duplicated genes can diverge strongly over evolution and take on different functions or regulated expression. However, information on the function and expression of duplicated genes remains sparse. Identifying groups of duplicated genes in different genomes and characterizing their expression and function would therefore be of great interest to the research community. The 'Duplicated Genes Database' (DGD) was developed for this purpose. METHODOLOGY: Nine species were included in the DGD. For each species, BLAST analyses were conducted on peptide sequences corresponding to the genes mapped on the same chromosome. Groups of duplicated genes were defined based on these pairwise BLAST comparisons and the genomic location of the genes. For each group, Pearson correlations between gene expression data and semantic similarities between functional GO annotations were also computed when the relevant information was available. CONCLUSIONS: The Duplicated Gene Database provides a list of co-localised and duplicated genes for several species with the available gene co-expression level and semantic similarity value of functional annotation. Adding these data to the groups of duplicated genes provides biological information that can prove useful to gene expression analyses. The Duplicated Gene Database can be freely accessed through the DGD website at http://dgd.genouest.org.
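
    The methodology above mentions Pearson correlations between the expression profiles of genes within a duplicated group. A minimal sketch of that one step, assuming each gene has an equal-length list of expression values, could look like the following. The helper names `pearson` and `group_coexpression`, the gene identifiers, and the numbers are hypothetical and are not drawn from the DGD itself.

```python
# Illustrative sketch only: helper names, gene IDs and values are assumptions,
# not the DGD pipeline or its data.
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / sqrt(var_x * var_y)


def group_coexpression(group, expression):
    """Correlate every pair of genes in a group that has expression data."""
    return {
        (g1, g2): pearson(expression[g1], expression[g2])
        for g1, g2 in combinations(group, 2)
        if g1 in expression and g2 in expression
    }


if __name__ == "__main__":
    # Hypothetical expression profiles across four samples.
    expression = {
        "geneA": [1.0, 2.0, 3.0, 4.0],
        "geneB": [2.0, 4.0, 6.0, 8.0],
        "geneC": [4.0, 3.0, 2.0, 1.0],
    }
    for pair, r in group_coexpression(["geneA", "geneB", "geneC"], expression).items():
        print(pair, round(r, 3))
```

    Genes without expression data are skipped rather than treated as uncorrelated, which mirrors the abstract's note that correlations were computed only when the relevant information was available.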

  15. Bioinformatics analysis of the predicted polyprenol reductase genes in higher plants

    Science.gov (United States)

    Basyuni, M.; Wati, R.

    2018-03-01

    The present study uses bioinformatics methods to analyze twenty-four predicted polyprenol reductase genes from higher plants in GenBank and to predict their structure, composition, similarity, subcellular localization, and phylogeny. The physicochemical properties of plant polyprenol reductases showed diversity among the observed genes. The secondary structure of plant polyprenol reductases followed the proportion order α helix > random coil > extended chain structure. The values for chloroplast transit peptides, but not for signal peptides, were low, indicating that few chloroplast transit peptides occur in plant polyprenol reductase genes. The likelihood of a potential transit peptide varied among the plant polyprenol reductases, suggesting the importance of understanding the variety of peptide components of plant polyprenol reductase genes. To clarify this finding, a phylogenetic tree was drawn. The tree shows several branches, suggesting that plant polyprenol reductase genes group into divergent clusters.

  16. Genetic and epigenetic alteration among three homoeologous genes of a class E MADS box gene in hexaploid wheat.

    Science.gov (United States)

    Shitsukawa, Naoki; Tahira, Chikako; Kassai, Ken-Ichiro; Hirabayashi, Chizuru; Shimizu, Tomoaki; Takumi, Shigeo; Mochida, Keiichi; Kawaura, Kanako; Ogihara, Yasunari; Murai, Koji

    2007-06-01

    Bread wheat (Triticum aestivum) is a hexaploid species with A, B, and D ancestral genomes. Most bread wheat genes are present in the genome as triplicated homoeologous genes (homoeologs) derived from the ancestral species. Here, we report that both genetic and epigenetic alterations have occurred in the homoeologs of a wheat class E MADS box gene. Two class E genes are identified in wheat, wheat SEPALLATA (WSEP) and wheat LEAFY HULL STERILE1 (WLHS1), which are homologs of Os MADS45 and Os MADS1 in rice (Oryza sativa), respectively. The three wheat homoeologs of WSEP showed similar genomic structures and expression profiles. By contrast, the three homoeologs of WLHS1 showed genetic and epigenetic alterations. The A genome WLHS1 homoeolog (WLHS1-A) had a structural alteration that contained a large novel sequence in place of the K domain sequence. A yeast two-hybrid analysis and a transgenic experiment indicated that the WLHS1-A protein had no apparent function. The B and D genome homoeologs, WLHS1-B and WLHS1-D, respectively, had an intact MADS box gene structure, but WLHS1-B was predominantly silenced by cytosine methylation. Consequently, of the three WLHS1 homoeologs, only WLHS1-D functions in hexaploid wheat. This is a situation where three homoeologs are differentially regulated by genetic and epigenetic mechanisms.

  17. Pollen-mediated gene flow and fine-scale spatial genetic structure in Olea europaea subsp. europaea var. sylvestris.

    Science.gov (United States)

    Beghè, D; Piotti, A; Satovic, Z; de la Rosa, R; Belaj, A

    2017-03-01

    Wild olive (Olea europaea subsp. europaea var. sylvestris) is important from an economic and ecological point of view. The effects of anthropogenic activities may lead to the genetic erosion of its genetic patrimony, which has high value for breeding programmes. In particular, the consequences of the introgression from cultivated stands are strongly dependent on the extent of gene flow and therefore this work aims at quantitatively describing contemporary gene flow patterns in wild olive natural populations. The studied wild population is located in an undisturbed forest, in southern Spain, considered one of the few extant hotspots of true oleaster diversity. A total of 225 potential father trees and seeds issued from five mother trees were genotyped by eight microsatellite markers. Levels of contemporary pollen flow, in terms of both pollen immigration rates and within-population dynamics, were measured through paternity analyses. Moreover, the extent of fine-scale spatial genetic structure (SGS) was studied to assess the relative importance of seed and pollen dispersal in shaping the spatial distribution of genetic variation. The results showed that the population under study is characterized by a high genetic diversity, a relatively high pollen immigration rate (0.57), an average within-population pollen dispersal of about 107 m and weak but significant SGS up to 40 m. The population is a mosaic of several intermingled genetic clusters that is likely to be generated by spatially restricted seed dispersal. Moreover, wild oleasters were found to be self-incompatible and preferential mating between some genotypes was revealed. Knowledge of the within-population genetic structure and gene flow dynamics will lead to identifying possible strategies aimed at limiting the effect of anthropogenic activities and improving breeding programmes for the conservation of olive tree forest genetic resources. © The Author 2016. Published by Oxford University Press on behalf

  18. Search for missing schizophrenia genes will require a new ...

    Indian Academy of Sciences (India)

    2013-08-06

    Aug 6, 2013 ... causal gene(s)?. The successful search for disease genes is based on a ..... 2010 Mobile interspersed repeats are major structural variants in ... Petronis A., Paterson A. D. and Kennedy J. L. 1999 Schizophrenia: an epigenetic ...

  19. Assessment of orthologous splicing isoforms in human and mouse orthologous genes

    Directory of Open Access Journals (Sweden)

    Horner David S

    2010-10-01

    Full Text Available Abstract. Background: Recent discoveries have highlighted the fact that alternative splicing and alternative transcripts are the rule, rather than the exception, in metazoan genes. Since multiple transcript and protein variants expressed by the same gene are, by definition, structurally distinct and need not be functionally equivalent, the concept of gene orthology should be extended to the transcript level in order to describe evolutionary relationships between structurally similar transcript variants. In other words, the identification of true orthology relationships between gene products now should progress beyond primary sequence, and "splicing orthology", consisting in ancestrally shared exon-intron structures, is required to define orthologous isoforms at transcript level. Results: As a starting step in this direction, in this work we performed a large-scale human-mouse gene comparison with a twofold goal: first, to assess whether and to what extent traditional gene annotations such as RefSeq capture genuine splicing orthology; second, to provide a more detailed annotation and quantification of true human-mouse orthologous transcripts, defined as transcripts of orthologous genes exhibiting the same splicing patterns. Conclusions: We observed an identical exon/intron structure for 32% of human and mouse orthologous genes. This figure increases to 87% using less stringent criteria for gene structure similarity, thus implying that for about 13% of the human RefSeq annotated genes (and about 25% of the corresponding transcripts) we could not identify any mouse transcript showing sufficient similarity to be confidently assigned as a splicing ortholog. Our data suggest that current gene and transcript data may still be rather incomplete - with several splicing variants still unknown. The observation that alternative splicing produces large numbers of alternative transcripts and proteins, some of them conserved across species and others truly species

  20. Consequences of Marfan mutations to expression of fibrillin gene and to the structure of microfibrils

    Energy Technology Data Exchange (ETDEWEB)

    Peltonen, L.; Karttunen, L.; Rantamaeki, T. [NPHI, Helsinki (Finland)] [and others

    1994-09-01

    Marfan syndrome (MFS) is a dominantly inherited connective tissue disorder which is caused by mutations in the fibrillin-1 gene (FBN1). Over 40 family-specific FBN1 mutations have been identified. We have characterized 18 different heterozygous mutations including amino acid substitutions, premature stop, and splicing defects leading to deletions or one insertion, and one compound heterozygote with two differently mutated FBN1 alleles inherited from his affected parents. To unravel the consequences of FBN1 mutations for the transcription of the FBN1 gene, we have measured the steady state levels of mRNA transcribed from the normal and mutated alleles. The missense mutations do not affect the transcription of the allele, while the nonsense mutation leads to a lower steady state amount of the mutated allele. For the dissection of molecular pathogenesis of FBN1 mutations we have performed rotary shadowing of the microfibrils produced by the cell cultures from MFS patients. The cells from the neonatal patients with established mutations produced only disorganized fibrillin aggregates, and no clearly defined microfibrils could be detected, suggesting a major role of the gene region coding for exons 24-26 in stabilization and organization of the bead structure of microfibrils. In the cells of a rare compound heterozygote case carrying two different mutations, no microfibrils could be detected, whereas the cells of his parents with heterozygous mutations were able to form identifiable but disorganized microfibrils. In the cells of an MFS case caused by a premature stop removing the C-terminus of fibrillin, the microfibril assembly takes place but the appropriate packing of the microfibrils is disturbed, suggesting that C-termini are actually located within the interbead domain of the microfibrils.

  1. Related structures of neutral capsular polysaccharides of Acinetobacter baumannii isolates that carry related capsule gene clusters KL43, KL47, and KL88.

    Science.gov (United States)

    Shashkov, Alexander S; Kenyon, Johanna J; Arbatsky, Nikolay P; Shneider, Mikhail M; Popova, Anastasiya V; Miroshnikov, Konstantin A; Hall, Ruth M; Knirel, Yuriy A

    2016-11-29

    Capsular polysaccharides were recovered from four Acinetobacter baumannii isolates, and related structures of oligosaccharide repeating units were established by sugar analyses along with 1D and 2D 1H and 13C NMR spectroscopy for NIPH 60 and LUH5544 (K43) and for NIPH 601 (K47) [structure diagrams not reproduced]. The K locus for capsule biosynthesis in the genome sequences available for NIPH 60 and LUH5544, designated KL43, was found to be related to gene clusters KL47 in NIPH 601 and KL88 in LUH5548. The three clusters share most gene content, differing in only a small portion that includes additional glycosyltransferase genes in KL47 and KL88, as well as genes encoding distinct Wzy polymerases that were found to form the same α-d-GlcpNAc-(1 → 6)-α-d-GlcpNAc linkage in K43 and K47. Copyright © 2016 Elsevier Ltd. All rights reserved.

  2. Integrative characterization of germ cell-specific genes from mouse spermatocyte UniGene library

    Directory of Open Access Journals (Sweden)

    Eddy Edward M

    2007-07-01

    Full Text Available Abstract. Background: The primary regulator of spermatogenesis, a highly ordered and tightly regulated developmental process, is an intrinsic genetic program involving male germ cell-specific genes. Results: We analyzed the mouse spermatocyte UniGene library containing 2155 gene-oriented transcript clusters. We predict that 11% of these genes are testis-specific and systematically identified 24 authentic genes specifically and abundantly expressed in the testis via in silico and in vitro approaches. Northern blot analysis disclosed various transcript characteristics, such as expression level, size and the presence of isoforms. Expression analysis revealed developmentally regulated and stage-specific expression patterns in all of the genes. We further analyzed the genes at the protein and cellular levels. Transfection assays performed using GC-2 cells provided information on the cellular characteristics of the gene products. In addition, antibodies were generated against proteins encoded by some of the genes to facilitate their identification and characterization in spermatogenic cells and sperm. Our data suggest that a number of the gene products are implicated in transcriptional regulation, nuclear integrity, sperm structure and motility, and fertilization. In particular, we found for the first time that Mm.333010, predicted to contain a trypsin-like serine protease domain, is a sperm acrosomal protein. Conclusion: We identify 24 authentic genes with spermatogenic cell-specific expression, and provide comprehensive information about the genes. Our findings establish a new basis for future investigation into molecular mechanisms underlying male reproduction.

  3. Genes and inheritance.

    Science.gov (United States)

    Middelton, L A; Peters, K F

    2001-10-01

    The information gained from the Human Genome Project and related genetic research will undoubtedly create significant changes in healthcare practice. It is becoming increasingly clear that nurses in all areas of clinical practice will require a fundamental understanding of basic genetics. This article provides the oncology nurse with an overview of basic genetic concepts, including inheritance patterns of single gene conditions, pedigree construction, chromosome aberrations, and the multifactorial basis underlying the common diseases of adulthood. Normal gene structure and function are introduced and the biochemistry of genetic errors is described.

  4. Isolation and identification of differentially expressed genes between ...

    African Journals Online (AJOL)

    Plants have evolved sophisticated molecular defense mechanisms in order to survive disease conditions. So far, a number of pathogen resistance (R) genes have been reported in plants. These R genes are thought to be involved in activating the signals that lead to disease resistance. The structural specificity of R genes ...

  5. Medusa structure of the gene regulatory network: dominance of transcription factors in cancer subtype classification.

    Science.gov (United States)

    Guo, Yuchun; Feng, Ying; Trivedi, Niraj S; Huang, Sui

    2011-05-01

    Gene expression profiles consisting of tens of thousands of transcripts are used for clustering of tissue, such as tumors, into subtypes, often without considering the underlying reason that the distinct patterns of expression arise because of constraints in the realization of gene expression profiles imposed by the gene regulatory network. The topology of this network has been suggested to consist of a regulatory core of genes, represented most prominently by transcription factors (TFs) and microRNAs, that influence the expression of other genes, and of a periphery of 'enslaved' effector genes that are regulated but not regulating. This 'medusa' architecture implies that the core genes are much stronger determinants of the realized gene expression profiles. To test this hypothesis, we examined the clustering of gene expression profiles into known tumor types to quantitatively demonstrate that TFs, and even more pronouncedly microRNAs, are much stronger discriminators of tumor type specific gene expression patterns than the same number of randomly selected or metabolic genes. These findings lend support to the hypothesis of a medusa architecture and of the canalizing nature of regulation by microRNAs. They also reveal the degree of freedom for the expression of peripheral genes that are less stringently associated with a tissue type specific global gene expression profile.

  6. Genetic structure of Quechua-speakers of the Central Andes and geographic patterns of gene frequencies in South Amerindian populations.

    Science.gov (United States)

    Luiselli, D; Simoni, L; Tarazona-Santos, E; Pastor, S; Pettener, D

    2000-09-01

    A sample of 141 Quechua-speaking individuals of the population of Tayacaja, in the Peruvian Central Andes, was typed for the following 16 genetic systems: ABO, Rh, MNSs, P, Duffy, AcP1, EsD, GLOI, PGM1, AK, 6-PGD, Hp, Gc, Pi, C3, and Bf. The genetic structure of the population was analyzed in relation to the allele frequencies available for other South Amerindian populations, using a combination of multivariate and multivariable techniques. Spatial autocorrelation analysis was performed independently for 13 alleles to identify patterns of gene flow in South America as a whole and in more specific geographic regions. We found a longitudinal cline for the AcP1*a and EsD*1 alleles, which we interpreted as the result of an ancient longitudinal expansion of a putative ancestral population of modern Amerindians. Monmonier's algorithm, used to identify areas of sharp genetic discontinuity, suggested a clear east-west differentiation of native South American populations, which was confirmed by analysis of the distribution of genetic distances. We suggest that this pattern of genetic structures is the consequence of the independent peopling of western and eastern South America or of low levels of gene flow between these regions, related to different environmental and demographic histories. Copyright 2000 Wiley-Liss, Inc.

  7. The structure of gene product 6 of bacteriophage T4, the hinge-pin of the baseplate.

    Science.gov (United States)

    Aksyuk, Anastasia A; Leiman, Petr G; Shneider, Mikhail M; Mesyanzhinov, Vadim V; Rossmann, Michael G

    2009-06-10

    The baseplate of bacteriophage T4 is a multicomponent protein complex, which controls phage attachment to the host. It assembles from six wedges and a central hub. During infection the baseplate undergoes a large conformational change from a dome-shaped to a flat, star-shaped structure. We report the crystal structure of the C-terminal half of gene product (gp) 6 and investigate its motion with respect to the other proteins during the baseplate rearrangement. Six gp6 dimers interdigitate, forming a ring that maintains the integrity of the baseplate in both conformations. One baseplate wedge contains an N-terminal dimer of gp6, whereas neighboring wedges are tied together through the C-terminal dimer of gp6. The dimeric interactions are preserved throughout the rearrangement of the baseplate. However, the hinge angle between the N- and C-terminal parts of gp6 changes by approximately 15 degrees, accounting for a 10 Å radial increase in the diameter of the gp6 ring.

  8. Sponge non-metastatic Group I Nme gene/protein - structure and function is conserved from sponges to humans

    Science.gov (United States)

    2011-01-01

    Background: Nucleoside diphosphate kinases (NDPK) are evolutionarily conserved enzymes present in Bacteria, Archaea and Eukarya, with human Nme1 the most studied representative of the family and the first identified metastasis suppressor. Sponges (Porifera) are simple metazoans without tissues, closest to the common ancestor of all animals. They changed little during evolution and probably provide the best insight into the metazoan ancestor's genomic features. Recent studies show that sponges have a wide repertoire of genes, many of which are involved in diseases in more complex metazoans. The original function of those genes and the way it has evolved in the animal lineage are largely unknown. Here we report new results on the metastasis suppressor gene/protein homolog from the marine sponge Suberites domuncula, NmeGp1Sd. The purpose of this study was to investigate the properties of the sponge Group I Nme gene and protein, and compare it to its human homolog in order to elucidate the evolution of the structure and function of Nme. Results: We found that sponge genes coding for Group I Nme protein are intron-rich. Furthermore, we discovered that the sponge NmeGp1Sd protein has a similar level of kinase activity as its human homolog Nme1, does not cleave negatively supercoiled DNA and shows nonspecific DNA-binding activity. The sponge NmeGp1Sd forms a hexamer, like human Nme1 and all other eukaryotic Nme proteins. NmeGp1Sd interacts with human Nme1 in human cells and exhibits the same subcellular localization. Stable clones expressing sponge NmeGp1Sd inhibited the migratory potential of CAL 27 cells, as already reported for human Nme1, which suggests that Nme's function in migratory processes was engaged long before the composition of true tissues. Conclusions: This study suggests that the ancestor of all animals possessed a NmeGp1 protein with properties and functions similar to evolutionarily recent versions of the protein, even before the appearance of true tissues.

  9. Process and genes for expression and overexpression of active [FeFe] hydrogenases

    Science.gov (United States)

    Seibert, Michael; King, Paul W; Ghirardi, Maria Lucia; Posewitz, Matthew C; Smolinski, Sharon L

    2014-09-16

    A process for expression of active [FeFe]-hydrogenase in a host organism that does not contain either the structural gene(s) for [FeFe]-hydrogenases and/or homologues for the maturation genes HydE, HydF and HydG, comprising: cloning the structural hydrogenase gene(s) and/or the maturation genes HydE, HydF and HydG from an organism that contains these genes into expression plasmids; transferring the plasmids into an organism that lacks a native [FeFe]-hydrogenase or that has a disrupted [FeFe]-hydrogenase and culturing it aerobically; and inducing anaerobiosis to provide [FeFe]-hydrogenase biosynthesis and H2 production.

  10. Physical Factors Correlate to Microbial Community Structure and Nitrogen Cycling Gene Abundance in a Nitrate Fed Eutrophic Lagoon.

    Science.gov (United States)

    Highton, Matthew P; Roosa, Stéphanie; Crawshaw, Josie; Schallenberg, Marc; Morales, Sergio E

    2016-01-01

    Nitrogenous run-off from farmed pastures contributes to the eutrophication of Lake Ellesmere, a large shallow lagoon/lake on the east coast of New Zealand. Tributaries periodically deliver high loads of nitrate to the lake, which likely affect microbial communities therein. We hypothesized that a nutrient gradient would form from the potential sources (tributaries), creating a disturbance resulting in changes in microbial community structure. To test this, we first determined the existence of such a gradient but found only a weak nitrogen (TN) and phosphorus (DRP) gradient. Changes in microbial communities were determined by measuring functional potential (quantification of nitrogen cycling genes via nifH, nirS, nosZI, and nosZII using qPCR), potential activity (via denitrification enzyme activity), as well as using changes in total community (via 16S rRNA gene amplicon sequencing). Our results demonstrated that changes in microbial communities at a phylogenetic (relative abundance) and functional level (proportion of the microbial community carrying nifH and nosZI genes) were most strongly associated with physical gradients (e.g., lake depth, sediment grain size, sediment porosity) and not nutrient concentrations. Low nitrate influx at the time of sampling is proposed as a factor contributing to the observed patterns.

  11. Physical factors correlate to microbial community structure and nitrogen cycling gene abundance in a nitrate fed eutrophic lagoon

    Directory of Open Access Journals (Sweden)

    Matthew Paul Highton

    2016-10-01

    Full Text Available Nitrogenous run-off from farmed pastures contributes to the eutrophication of Lake Ellesmere, a large shallow lagoon/lake on the east coast of New Zealand. Tributaries periodically deliver high loads of nitrate to the lake, which likely affect microbial communities therein. We hypothesized that a nutrient gradient would form from the potential sources (tributaries), creating a disturbance resulting in changes in microbial community structure. To test this, we first determined the existence of such a gradient but found only a weak nitrogen (TN) and phosphorus (DRP) gradient. Changes in microbial communities were determined by measuring functional potential (quantification of nitrogen cycling genes via nifH, nirS, nosZI and nosZII using qPCR), potential activity (via denitrification enzyme activity), as well as using changes in total community (via 16S rRNA gene amplicon sequencing). Our results demonstrated that changes in microbial communities at a phylogenetic (relative abundance) and functional level (proportion of the microbial community carrying nifH and nosZI genes) were most strongly associated with physical gradients (e.g. lake depth, sediment grain size, sediment porosity) and not nutrient concentrations. Low nitrate influx at the time of sampling is proposed as a factor contributing to the observed patterns.

  12. Genome-Wide Analyses of the NAC Transcription Factor Gene Family in Pepper (Capsicum annuum L.): Chromosome Location, Phylogeny, Structure, Expression Patterns, Cis-Elements in the Promoter, and Interaction Network

    Directory of Open Access Journals (Sweden)

    Weiping Diao

    2018-03-01

    Full Text Available The NAM, ATAF1/2, and CUC2 (NAC) transcription factors form a large plant-specific gene family, which is involved in the regulation of tissue development in response to biotic and abiotic stress. To date, there have been no comprehensive studies investigating chromosomal location, gene structure, gene phylogeny, conserved motifs, or gene expression of NAC in pepper (Capsicum annuum L.). The recent release of the complete genome sequence of pepper allowed us to perform a genome-wide investigation of Capsicum annuum L. NAC (CaNAC) proteins. In the present study, a comprehensive analysis of the CaNAC gene family in pepper was performed, and a total of 104 CaNAC genes were identified. Genome mapping analysis revealed that CaNAC genes were enriched on four chromosomes (chromosomes 1, 2, 3, and 6). In addition, phylogenetic analysis of the NAC domains from pepper, potato, Arabidopsis, and rice showed that CaNAC genes could be clustered into three groups (I, II, and III). Group III, which contained 24 CaNAC genes, was exclusive to the Solanaceae plant family. Gene structure and protein motif analyses showed that these genes were relatively conserved within each subgroup. The number of introns in CaNAC genes varied from 0 to 8, with 83 (78.9%) of CaNAC genes containing two or fewer introns. Promoter analysis confirmed that CaNAC genes are involved in pepper growth, development, and biotic or abiotic stress responses. Further, the expression of 22 selected CaNAC genes in response to seven different biotic and abiotic stresses [salt, heat shock, drought, Phytophthora capsici, abscisic acid, salicylic acid (SA), and methyl jasmonate (MeJA)] was evaluated by quantitative RT-PCR to determine their stress-related expression patterns. Several putative stress-responsive CaNAC genes, including CaNAC72 and CaNAC27, which are orthologs of the known stress-responsive Arabidopsis gene ANAC055 and potato gene StNAC30, respectively, were highly regulated by treatment with

  13. The complete chloroplast genome sequence of an endemic monotypic genus Hagenia (Rosaceae): structural comparative analysis, gene content and microsatellite detection

    Directory of Open Access Journals (Sweden)

    Andrew W. Gichira

    2017-01-01

    Full Text Available Hagenia is an endangered monotypic genus endemic to the tropical mountains of Africa. The only species, Hagenia abyssinica (Bruce) J.F. Gmel, is an important medicinal plant producing bioactive compounds that have been traditionally used by African communities as a remedy for gastrointestinal ailments in both humans and animals. Complete chloroplast genomes have been applied in resolving phylogenetic relationships within plant families. We employed high-throughput sequencing technologies to determine the complete chloroplast genome sequence of H. abyssinica. The genome is a circular molecule of 154,961 base pairs (bp), with a pair of Inverted Repeats (IR) of 25,971 bp each, separated by two single-copy regions: a large single copy (LSC, 84,320 bp) and a small single copy (SSC, 18,696 bp). H. abyssinica's chloroplast genome has a 37.1% GC content and encodes 112 unique genes, 78 of which code for proteins, 30 are tRNA genes and four are rRNA genes. A comparative analysis with twenty other species, sequenced to date from the family Rosaceae, revealed similarities in structural organization, gene content and arrangement. The observed size differences are attributed to the contraction/expansion of the inverted repeats. The translational initiation factor gene (infA), which had been previously reported in other chloroplast genomes, was conspicuously missing in H. abyssinica. A total of 172 microsatellites and 49 large repeat sequences were detected in the chloroplast genome. A Maximum Likelihood analysis of 71 protein-coding genes placed Hagenia in Rosoideae. The availability of a complete chloroplast genome, the first in the Sanguisorbeae tribe, is beneficial for further molecular studies on taxonomic and phylogenomic resolution within the Rosaceae family.

  14. The complete chloroplast genome sequence of an endemic monotypic genus Hagenia (Rosaceae): structural comparative analysis, gene content and microsatellite detection.

    Science.gov (United States)

    Gichira, Andrew W; Li, Zhizhong; Saina, Josphat K; Long, Zhicheng; Hu, Guangwan; Gituru, Robert W; Wang, Qingfeng; Chen, Jinming

    2017-01-01

    Hagenia is an endangered monotypic genus endemic to the tropical mountains of Africa. The only species, Hagenia abyssinica (Bruce) J.F. Gmel, is an important medicinal plant producing bioactive compounds that have been traditionally used by African communities as a remedy for gastrointestinal ailments in both humans and animals. Complete chloroplast genomes have been applied in resolving phylogenetic relationships within plant families. We employed high-throughput sequencing technologies to determine the complete chloroplast genome sequence of H. abyssinica. The genome is a circular molecule of 154,961 base pairs (bp), with a pair of Inverted Repeats (IR) of 25,971 bp each, separated by two single-copy regions: a large single copy (LSC, 84,320 bp) and a small single copy (SSC, 18,696 bp). H. abyssinica's chloroplast genome has a 37.1% GC content and encodes 112 unique genes, 78 of which code for proteins, 30 are tRNA genes and four are rRNA genes. A comparative analysis with twenty other species, sequenced to date from the family Rosaceae, revealed similarities in structural organization, gene content and arrangement. The observed size differences are attributed to the contraction/expansion of the inverted repeats. The translational initiation factor gene (infA), which had been previously reported in other chloroplast genomes, was conspicuously missing in H. abyssinica. A total of 172 microsatellites and 49 large repeat sequences were detected in the chloroplast genome. A Maximum Likelihood analysis of 71 protein-coding genes placed Hagenia in Rosoideae. The availability of a complete chloroplast genome, the first in the Sanguisorbeae tribe, is beneficial for further molecular studies on taxonomic and phylogenomic resolution within the Rosaceae family.

  15. The four hexamerin genes in the honey bee: structure, molecular evolution and function deduced from expression patterns in queens, workers and drones

    Directory of Open Access Journals (Sweden)

    Martins Juliana R

    2010-03-01

    Full Text Available Abstract Background: Hexamerins are hemocyanin-derived proteins that have lost the ability to bind copper ions and transport oxygen; instead, they became storage proteins. The current study aimed to broaden our knowledge on the hexamerin genes found in the honey bee genome by exploring their structural characteristics, expression profiles, evolution, and functions in the life cycle of workers, drones and queens. Results: The hexamerin genes of the honey bee (hex 70a, hex 70b, hex 70c and hex 110) diverge considerably in structure, so that the overall amino acid identity shared among their deduced protein subunits varies from 30 to 42%. Bioinformatics search for motifs in the respective upstream control regions (UCRs) revealed six overrepresented motifs including a potential binding site for Ultraspiracle (Usp), a target of juvenile hormone (JH). The expression of these genes was induced by topical application of JH on worker larvae. The four genes are highly transcribed by the larval fat body, although with significant differences in transcript levels, but only hex 110 and hex 70a are re-induced in the adult fat body in a caste- and sex-specific fashion, workers showing the highest expression. Transcripts for hex 110, hex 70a and hex 70b were detected in developing ovaries and testes, and hex 110 was highly transcribed in the ovaries of egg-laying queens. A phylogenetic analysis revealed that HEX 110 is located at the most basal position among the holometabola hexamerins, and like HEX 70a and HEX 70c, it shares potential orthology relationship with hexamerins from other hymenopteran species. Conclusions: Striking differences were found in the structure and developmental expression of the four hexamerin genes in the honey bee. The presence of a potential binding site for Usp in the respective 5' UCRs, and the results of experiments on JH level manipulation in vivo support the hypothesis of regulation by JH. Transcript levels and patterns in the fat body

  16. The four hexamerin genes in the honey bee: structure, molecular evolution and function deduced from expression patterns in queens, workers and drones

    Science.gov (United States)

    2010-01-01

    Background: Hexamerins are hemocyanin-derived proteins that have lost the ability to bind copper ions and transport oxygen; instead, they became storage proteins. The current study aimed to broaden our knowledge on the hexamerin genes found in the honey bee genome by exploring their structural characteristics, expression profiles, evolution, and functions in the life cycle of workers, drones and queens. Results: The hexamerin genes of the honey bee (hex 70a, hex 70b, hex 70c and hex 110) diverge considerably in structure, so that the overall amino acid identity shared among their deduced protein subunits varies from 30 to 42%. Bioinformatics search for motifs in the respective upstream control regions (UCRs) revealed six overrepresented motifs including a potential binding site for Ultraspiracle (Usp), a target of juvenile hormone (JH). The expression of these genes was induced by topical application of JH on worker larvae. The four genes are highly transcribed by the larval fat body, although with significant differences in transcript levels, but only hex 110 and hex 70a are re-induced in the adult fat body in a caste- and sex-specific fashion, workers showing the highest expression. Transcripts for hex 110, hex 70a and hex 70b were detected in developing ovaries and testes, and hex 110 was highly transcribed in the ovaries of egg-laying queens. A phylogenetic analysis revealed that HEX 110 is located at the most basal position among the holometabola hexamerins, and like HEX 70a and HEX 70c, it shares potential orthology relationship with hexamerins from other hymenopteran species. Conclusions: Striking differences were found in the structure and developmental expression of the four hexamerin genes in the honey bee. The presence of a potential binding site for Usp in the respective 5' UCRs, and the results of experiments on JH level manipulation in vivo support the hypothesis of regulation by JH. Transcript levels and patterns in the fat body and gonads suggest that

  17. Death and resurrection of the human IRGM gene.

    Directory of Open Access Journals (Sweden)

    Cemalettin Bekpen

    2009-03-01

    Full Text Available Immunity-related GTPases (IRG) play an important role in defense against intracellular pathogens. One member of this gene family in humans, IRGM, has been recently implicated as a risk factor for Crohn's disease. We analyzed the detailed structure of this gene family among primates and showed that most of the IRG gene cluster was deleted early in primate evolution, after the divergence of the anthropoids from prosimians (about 50 million years ago). Comparative sequence analysis of New World and Old World monkey species shows that the single-copy IRGM gene became pseudogenized as a result of an Alu retrotransposition event in the anthropoid common ancestor that disrupted the open reading frame (ORF). We find that the ORF was reestablished as a part of a polymorphic stop codon in the common ancestor of humans and great apes. Expression analysis suggests that this change occurred in conjunction with the insertion of an endogenous retrovirus, which altered the transcription initiation, splicing, and expression profile of IRGM. These data argue that the gene became pseudogenized and was then resurrected through a series of complex structural events and suggest remarkable functional plasticity where alleles experience diverse evolutionary pressures over time. Such dynamism in structure and evolution may be critical for a gene family locked in an arms race with an ever-changing repertoire of intracellular parasites.

  18. FGF: A web tool for Fishing Gene Family in a whole genome database

    DEFF Research Database (Denmark)

    Zheng, Hongkun; Shi, Junjie; Fang, Xiaodong

    2007-01-01

    to efficiently search for and identify gene families. The FGF output displays the results as visual phylogenetic trees including information on gene structure, chromosome position, duplication fate and selective pressure. It is particularly useful to identify pseudogenes and detect changes in gene structure. FGF...

  19. Association of nad7a Gene with Cytoplasmic Male Sterility in Pigeonpea

    Directory of Open Access Journals (Sweden)

    Pallavi Sinha

    2015-07-01

    Full Text Available Cytoplasmic male sterility (CMS) has been exploited in the commercial pigeonpea [Cajanus cajan (L.) Millsp.] hybrid breeding system; however, the molecular mechanism behind this system is unknown. To understand the underlying molecular mechanism involved in A CMS system derived from (Haines) Maesen, 34 mitochondrial genes were analyzed for expression profiling and structural variation analysis between CMS line (ICRISAT Pigeonpea A line, ICPA 2039) and its cognate maintainer (ICPB 2039). Expression profiling of 34 mitochondrial genes revealed nine genes with significant fold differential gene expression at P ≤ 0.01, including one gene, , with 1366-fold higher expression in CMS line as compared with the maintainer. Structural variation analysis of these mitochondrial genes identified length variation between ICPA 2039 and ICPB 2039 for (subunit of gene). Sanger sequencing of and genes in the CMS and the maintainer lines identified two single nucleotide polymorphisms (SNPs) in upstream region of and a deletion of 10 bp in in the CMS line. Protein structure evaluation showed conformational changes in predicted protein structures for between ICPA 2039 and ICPB 2039 lines. All above analyses indicate association of the nad7a gene with the CMS for A cytoplasm in pigeonpea. Additionally, one polymerase chain reaction (PCR) based Indel marker ( has been developed and validated for testing genetic purity of A derived CMS lines to strengthen the commercial hybrid breeding program in pigeonpea.

  20. Gene structure and mutations of glutaryl-coenzyme A dehydrogenase: impaired association of enzyme subunits that is due to an A421V substitution causes glutaric acidemia type I in the Amish.

    OpenAIRE

    Biery, B. J.; Stein, D. E.; Morton, D. H.; Goodman, S. I.

    1996-01-01

    The structure of the human glutaryl coenzyme A dehydrogenase (GCD) gene was determined to contain 11 exons and to span approximately 7 kb. Fibroblast DNA from 64 unrelated glutaric acidemia type I (GA1) patients was screened for mutations by PCR amplification and analysis of SSCP. Fragments with altered electrophoretic mobility were subcloned and sequenced to detect mutations that caused GA1. This report describes the structure of the GCD gene, as well as point mutations and polymorphisms fou...

  1. New Gene Evolution: Little Did We Know

    Science.gov (United States)

    Long, Manyuan; VanKuren, Nicholas W.; Chen, Sidi; Vibranovski, Maria D.

    2014-01-01

    Genes are perpetually added to and deleted from genomes during evolution. Thus, it is important to understand how new genes are formed and evolve as critical components of the genetic systems determining the biological diversity of life. Two decades of effort have shed light on the process of new gene origination, and have contributed to an emerging comprehensive picture of how new genes are added to genomes, ranging from the mechanisms that generate new gene structures to the presence of new genes in different organisms to the rates and patterns of new gene origination and the roles of new genes in phenotypic evolution. We review each of these aspects of new gene evolution, summarizing the main evidence for the origination and importance of new genes in evolution. We highlight findings showing that new genes rapidly change existing genetic systems that govern various molecular, cellular and phenotypic functions. PMID:24050177

  2. Aux/IAA Gene Family in Plants: Molecular Structure, Regulation, and Function

    Directory of Open Access Journals (Sweden)

    Jie Luo

    2018-01-01

    Full Text Available Auxin plays a crucial role in the diverse cellular and developmental responses of plants across their lifespan. Plants can quickly sense and respond to changes in auxin levels, and these responses involve several major classes of auxin-responsive genes, including the Auxin/Indole-3-Acetic Acid (Aux/IAA) family, the auxin response factor (ARF) family, small auxin upregulated RNA (SAUR), and the auxin-responsive Gretchen Hagen3 (GH3) family. Aux/IAA proteins are short-lived nuclear proteins comprising several highly conserved domains that are encoded by the auxin early response gene family. These proteins have specific domains that interact with ARFs and inhibit the transcription of genes activated by ARFs. Molecular studies have revealed that Aux/IAA family members can form diverse dimers with ARFs to regulate genes in various ways. Functional analyses of Aux/IAA family members have indicated that they have various roles in plant development, such as root development, shoot growth, and fruit ripening. In this review, recently discovered details regarding the molecular characteristics, regulation, and protein–protein interactions of the Aux/IAA proteins are discussed. These details provide new insights into the molecular basis of the Aux/IAA protein functions in plant developmental processes.

  3. Efficient Reverse-Engineering of a Developmental Gene Regulatory Network

    Science.gov (United States)

    Cicin-Sain, Damjan; Ashyraliyev, Maksat; Jaeger, Johannes

    2012-01-01

    Understanding the complex regulatory networks underlying development and evolution of multi-cellular organisms is a major problem in biology. Computational models can be used as tools to extract the regulatory structure and dynamics of such networks from gene expression data. This approach is called reverse engineering. It has been successfully applied to many gene networks in various biological systems. However, to reconstitute the structure and non-linear dynamics of a developmental gene network in its spatial context remains a considerable challenge. Here, we address this challenge using a case study: the gap gene network involved in segment determination during early development of Drosophila melanogaster. A major problem for reverse-engineering pattern-forming networks is the significant amount of time and effort required to acquire and quantify spatial gene expression data. We have developed a simplified data processing pipeline that considerably increases the throughput of the method, but results in data of reduced accuracy compared to those previously used for gap gene network inference. We demonstrate that we can infer the correct network structure using our reduced data set, and investigate minimal data requirements for successful reverse engineering. Our results show that timing and position of expression domain boundaries are the crucial features for determining regulatory network structure from data, while it is less important to precisely measure expression levels. Based on this, we define minimal data requirements for gap gene network inference. Our results demonstrate the feasibility of reverse-engineering with much reduced experimental effort. This enables more widespread use of the method in different developmental contexts and organisms. Such systematic application of data-driven models to real-world networks has enormous potential. Only the quantitative investigation of a large number of developmental gene regulatory networks will allow us to

  4. Comparative structural and functional analysis of genes encoding pectin methylesterases in Phytophthora spp.

    Science.gov (United States)

    Mingora, Christina; Ewer, Jason; Ospina-Giraldo, Manuel

    2014-03-15

    We have scanned the Phytophthora infestans, P. ramorum, and P. sojae genomes for the presence of putative pectin methylesterase genes and conducted a sequence analysis of all gene models found. We also searched for potential regulatory motifs in the promoter region of the proposed P. infestans models, and investigated the gene expression levels throughout the course of P. infestans infection on potato plants, using in planta and detached leaf assays. We found that genes located on contiguous chromosomal regions contain similar motifs in the promoter region, indicating the possibility of a shared regulatory mechanism. Results of our investigations also suggest that, during the pathogenicity process, the expression levels of some of the analyzed genes vary considerably when compared to basal expression observed in in vitro cultures of non-sporulating mycelium. These results were observed both in planta and in detached leaf assays. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. Differentially expressed genes in embryonic cardiac tissues of mice lacking Folr1 gene activity

    Directory of Open Access Journals (Sweden)

    Schwartz Robert J

    2007-11-01

    Full Text Available Abstract Background: Heart anomalies are the most frequently observed among all human congenital defects. As with the situation for neural tube defects (NTDs), it has been demonstrated that women who use multivitamins containing folic acid peri-conceptionally have a reduced risk for delivering offspring with conotruncal heart defects [1-3]. Cellular folate transport is mediated by a receptor or binding protein and by an anionic transporter protein system. Defective function of the Folr1 (also known as Folbp1; homologue of human FRα) gene in mice results in inadequate transport, accumulation, or metabolism of folate during cardiovascular morphogenesis. Results: We have observed cardiovascular abnormalities including outflow tract and aortic arch arterial defects in genetically compromised Folr1 knockout mice. In order to investigate the molecular mechanisms underlying the failure to complete development of outflow tract and aortic arch arteries in the Folr1 knockout mouse model, we examined tissue-specific gene expression differences between Folr1 nullizygous embryos and morphologically normal heterozygous embryos during early cardiac development (14-somite stage), heart tube looping (28-somite stage), and outflow tract septation (38-somite stage). Microarray analysis was performed as a primary screening, followed by investigation using quantitative real-time PCR assays. Gene ontology analysis highlighted the following ontology groups: cell migration, cell motility and localization of cells, structural constituent of cytoskeleton, cell-cell adhesion, oxidoreductase, protein folding and mRNA processing. This study provided preliminary data and suggested potential candidate genes for further description and investigation. Conclusion: The results suggested that Folr1 gene ablation and abnormal folate homeostasis altered gene expression in developing heart and conotruncal tissues. These changes affected normal cytoskeleton structures, cell migration and

  6. Ferritin gene organization: differences between plants and animals suggest possible kingdom-specific selective constraints.

    Science.gov (United States)

    Proudhon, D; Wei, J; Briat, J; Theil, E C

    1996-03-01

    Ferritin, a protein widespread in nature, concentrates iron approximately 10^11-10^12-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may

  7. Structural and functional organization of the HF.10 human zinc finger gene (ZNF35) located on chromosome 3p21-p22

    DEFF Research Database (Denmark)

    Lanfrancone, L; Pengue, G; Pandolfi, P P

    1992-01-01

    We report the structural and functional characterization of the HF.10 zinc finger gene (ZNF35) in normal human cells, as well as a processed pseudogene. The HF.10 gene spans about 13 kb and it is interrupted by three introns. All 11 zinc finger DNA-binding domains are contiguously encoded within...... and partial nucleotide sequencing of the HF.10 pseudogene indicated that it has arisen by retroposition of spliced HF.10 mRNA. In situ hybridization experiments revealed that both the functional locus and the pseudogene map to chromosome 3p21p22, a region that is frequently deleted in small cell lung...... and renal carcinomas. Hybridization of the HF.10 gene and the HF.10 pseudogene DNA probes to metaphases from a small cell lung carcinoma cell line with the 3p deletion revealed that both loci are part of the deleted chromosome region....

  8. [From gene cloning to expressional analysis--practice and experience from educational reform of experimental gene engineering].

    Science.gov (United States)

    Wu, Yan-Hua; Guo, Bin; Lou, Hui-Ling; Cui, Yu-Liang; Gu, Hui-Juan; Qiao, Shou-Yi

    2012-02-01

    Experimental gene engineering is a laboratory course focusing on the molecular structure, expression pattern and biological function of genes. Providing our students with a solid knowledge base and correct ways to conduct research is very important for high-quality education in genetic engineering. Inspired by recent progress in this field, we improved the experimental gene engineering course by adding more up-to-date knowledge and technologies and by emphasizing the combination of teaching and research, with the aim of offering our students a good start in their scientific careers.

  9. Gene Structures, Evolution, Classification and Expression Profiles of the Aquaporin Gene Family in Castor Bean (Ricinus communis L.).

    Directory of Open Access Journals (Sweden)

    Zhi Zou

    Full Text Available Aquaporins (AQPs) are a class of integral membrane proteins that facilitate the passive transport of water and other small solutes across biological membranes. Castor bean (Ricinus communis L., Euphorbiaceae), an important non-edible oilseed crop, is widely cultivated for industrial, medicinal and cosmetic purposes. Its recently available genome provides an opportunity to analyze specific gene families. In this study, a total of 37 full-length AQP genes were identified from the castor bean genome, which were assigned to five subfamilies, including 10 plasma membrane intrinsic proteins (PIPs), 9 tonoplast intrinsic proteins (TIPs), 8 NOD26-like intrinsic proteins (NIPs), 6 X intrinsic proteins (XIPs) and 4 small basic intrinsic proteins (SIPs), on the basis of sequence similarities. Functional prediction based on the analysis of the aromatic/arginine (ar/R) selectivity filter, Froger's positions and specificity-determining positions (SDPs) showed a remarkable difference in substrate specificity among subfamilies. Homology analysis supported the expression of all 37 RcAQP genes in at least one of the examined tissues, e.g., root, leaf, flower, seed and endosperm. Furthermore, global expression profiles with deep transcriptome sequencing data revealed diverse expression patterns among various tissues. The current study presents the first genome-wide analysis of the AQP gene family in castor bean. Results obtained from this study provide valuable information for future functional analysis and utilization.

  10. An extended Kalman filtering approach to modeling nonlinear dynamic gene regulatory networks via short gene expression time series.

    Science.gov (United States)

    Wang, Zidong; Liu, Xiaohui; Liu, Yurong; Liang, Jinling; Vinciotti, Veronica

    2009-01-01

    In this paper, the extended Kalman filter (EKF) algorithm is applied to model the gene regulatory network from gene time series data. The gene regulatory network is considered as a nonlinear dynamic stochastic model that consists of the gene measurement equation and the gene regulation equation. After specifying the model structure, we apply the EKF algorithm for identifying both the model parameters and the actual value of gene expression levels. It is shown that the EKF algorithm is an online estimation algorithm that can identify a large number of parameters (including parameters of nonlinear functions) through iterative procedure by using a small number of observations. Four real-world gene expression data sets are employed to demonstrate the effectiveness of the EKF algorithm, and the obtained models are evaluated from the viewpoint of bioinformatics.
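
A minimal sketch of one EKF predict/update cycle may help make the abstract's description concrete. It is not the authors' model: the scalar regulation equation x[k+1] = a*x[k] + b*sigmoid(x[k]) + w, the direct measurement y[k] = x[k] + v, and all names and parameter values are assumptions chosen for illustration. The parameters a and b are treated as known here, whereas the paper also estimates the parameters.

```python
import math


def sigmoid(x):
    """Logistic function, used here as a hypothetical nonlinear regulation term."""
    return 1.0 / (1.0 + math.exp(-x))


def ekf_step(x_est, p_est, y, a, b, q, r):
    """One scalar extended Kalman filter cycle (illustrative assumptions only).

    Assumed regulation equation:  x[k+1] = a*x[k] + b*sigmoid(x[k]) + w, Var(w) = q
    Assumed measurement equation: y[k]   = x[k] + v,                    Var(v) = r
    """
    # Predict: propagate the estimate through the nonlinear model.
    s = sigmoid(x_est)
    x_pred = a * x_est + b * s
    # Linearize: derivative of the regulation equation at the current estimate.
    f_jac = a + b * s * (1.0 - s)
    p_pred = f_jac * p_est * f_jac + q
    # Update: the measurement observes the state directly, so H = 1.
    k_gain = p_pred / (p_pred + r)
    x_new = x_pred + k_gain * (y - x_pred)
    p_new = (1.0 - k_gain) * p_pred
    return x_new, p_new, x_pred, p_pred


# Filter a short, made-up expression time series.
x, p = 0.0, 1.0
for y in [1.0, 1.2, 1.3, 1.25]:
    x, p, _, _ = ekf_step(x, p, y, a=0.5, b=1.0, q=0.01, r=0.1)
```

To estimate parameters as well, one common approach is to append them to the state vector and run the same cycle on the augmented state; that extension is omitted to keep the sketch short.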

  11. From the genome to the phenome and back: linking genes with human brain function and structure using genetically informed neuroimaging

    DEFF Research Database (Denmark)

    Siebner, H R; Callicott, J H; Sommer, T

    2009-01-01

    In recent years, an array of brain mapping techniques has been successfully employed to link individual differences in circuit function or structure in the living human brain with individual variations in the human genome. Several proof-of-principle studies provided converging evidence that brain...... imaging can establish important links between genes and behaviour. The overarching goal is to use genetically informed brain imaging to pinpoint neurobiological mechanisms that contribute to behavioural intermediate phenotypes or disease states. This special issue on "Linking Genes to Brain Function...... in Health and Disease" provides an overview of how the "imaging genetics" approach is currently applied in the various fields of systems neuroscience to reveal the genetic underpinnings of complex behaviours and brain diseases. While the rapidly emerging field of imaging genetics holds great promise...

  12. iHAP – integrated haplotype analysis pipeline for characterizing the haplotype structure of genes

    Directory of Open Access Journals (Sweden)

    Lim Yun Ping

    2006-12-01

    Full Text Available Abstract Background The advent of genotype data from large-scale efforts that catalog the genetic variants of different populations has given rise to new avenues for multifactorial disease association studies. Recent work shows that genotype data from the International HapMap Project have a high degree of transferability to the wider population. This implies that the design of genotyping studies on local populations may be facilitated through inferences drawn from information contained in HapMap populations. Results To facilitate analysis of HapMap data for characterizing the haplotype structure of genes or any chromosomal regions, we have developed an integrated web-based resource, iHAP. In addition to incorporating genotype and haplotype data from the International HapMap Project and gene information from the UCSC Genome Browser Database, iHAP also provides capabilities for inferring haplotype blocks and selecting tag SNPs that are representative of haplotype patterns. These include block partitioning algorithms, block definitions, tag SNP definitions, as well as SNPs to be "force included" as tags. Based on the parameters defined at the input stage, iHAP performs on-the-fly analysis and displays the result graphically as a webpage. To facilitate analysis, intermediate and final result files can be downloaded. Conclusion The iHAP resource, available at http://ihap.bii.a-star.edu.sg, provides a convenient yet flexible approach for the user community to analyze HapMap data and identify candidate targets for genotyping studies.

  13. Gene study within the 5' flanking regions of growth hormone gene of ...

    African Journals Online (AJOL)

    user

    2011-01-17

    Jan 17, 2011 ... Expression of more than one gene for GH has been reported, indicating ..... hormone levels of palsmáticos IGF-1 and carcass traits in beef cattle. Dissertation ... Structure-function relation of somatotropin with reference to ...

  14. Gene finding with a hidden Markov model of genome structure and evolution

    DEFF Research Database (Denmark)

    Pedersen, Jakob Skou; Hein, Jotun

    2003-01-01

    the model are linear in alignment length and genome number. The model is applied to the problem of gene finding. The benefit of modelling sequence evolution is demonstrated both in a range of simulations and on a set of orthologous human/mouse gene pairs. AVAILABILITY: Free availability over the Internet...

  15. Secondary structure and feature of mitochondrial tRNA genes of the Ussurian tube-nosed bat Murina ussuriensis (Chiroptera: Vespertilionidae)

    Directory of Open Access Journals (Sweden)

    Kwang Bae Yoon

    2015-09-01

    Full Text Available The complete mitogenome (NC_021119) of the Ussurian tube-nosed bat Murina ussuriensis (Chiroptera: Vespertilionidae) was annotated and characterized in our recent publication (http://www.ncbi.nlm.nih.gov/nuccore/NC_021119). Here we provide additional information on methods in detail for obtaining the complete sequence of M. ussuriensis mitogenome. In addition, we describe characteristics of 22 tRNA genes and secondary structure and feature of 22 tRNAs of M. ussuriensis mitogenome.

  16. Organization and evolution of the rat tyrosine hydroxylase gene

    International Nuclear Information System (INIS)

    Brown, E.R.; Coker, G.T. III; O'Malley, K.L.

    1987-01-01

    This report describes the organization of the rat tyrosine hydroxylase (TH) gene and compares its structure with the human phenylalanine hydroxylase gene. Both genes are single copy and contain 13 exons separated by 12 introns. Remarkably, the positions of 10 out of 12 intron/exon boundaries are identical for the two genes. These results support the idea that these hydroxylase genes are members of a gene family which has a common evolutionary origin. The authors predict that this ancestral gene would have encoded exons similar to those of TH prior to evolutionary drift to other members of this gene family.

  17. Gene structure of CYP3A4, an adult-specific form of cytochrome P450 in human livers, and its transcriptional control.

    Science.gov (United States)

    Hashimoto, H; Toide, K; Kitamura, R; Fujita, M; Tagawa, S; Itoh, S; Kamataki, T

    1993-12-01

    CYP3A4 is the adult-specific form of cytochrome P450 in human livers [Komori, M., Nishio, K., Kitada, M., Shiramatsu, K., Muroya, K., Soma, M., Nagashima, K. & Kamataki, T. (1990) Biochemistry 29, 4430-4433]. The sequences of three genomic clones for CYP3A4 were analyzed for all exons, exon-intron junctions and the 5'-flanking region from the major transcription site to nucleotide position -1105, and compared with those of the CYP3A7 gene, a fetal-specific form of cytochrome P450 in humans. The results showed that the identity of 5'-flanking sequences between CYP3A4 and CYP3A7 genes was 91%, and that each 5'-flanking region had characteristic sequences termed as NFSE (P450NF-specific element) and HFLaSE (P450HFLa specific element), respectively. A basic transcription element (BTE) also lay in the 5'-flanking region of the CYP3A4 gene as seen in many CYP genes [Yanagida, A., Sogawa, K., Yasumoto, K. & Fujii-Kuriyama, Y. (1990) Mol. Cell. Biol. 10, 1470-1475]. The BTE binding factor (BTEB) was present in both adult and fetal human livers. To examine the transcriptional activity of the CYP3A4 gene, DNA fragments in the 5'-flanking region of the gene were inserted in front of the simian virus 40 promoter and the chloramphenicol acetyltransferase structural gene, and the constructs were transfected in HepG2 cells. The analysis of the chloramphenicol acetyltransferase activity indicated that (a) specific element(s) which could bind with a factor(s) in livers was present in the 5'-flanking region of the CYP3A4 gene to show the transcriptional activity.

  19. EcoGene 3.0

    Science.gov (United States)

    Zhou, Jindan; Rudd, Kenneth E.

    2013-01-01

    EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySql relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to Genbank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection. PMID:23197660

  20. Genes with stable DNA methylation levels show higher evolutionary conservation than genes with fluctuant DNA methylation levels.

    Science.gov (United States)

    Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai

    2015-11-24

    Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.

  1. GeneBreak: detection of recurrent DNA copy number aberration-associated chromosomal breakpoints within genes [version 2; referees: 2 approved]

    Directory of Open Access Journals (Sweden)

    Evert van den Broek

    2017-07-01

    Full Text Available Development of cancer is driven by somatic alterations, including numerical and structural chromosomal aberrations. Currently, several computational methods are available and are widely applied to detect numerical copy number aberrations (CNAs) of chromosomal segments in tumor genomes. However, there is a lack of computational methods that systematically detect structural chromosomal aberrations by virtue of the genomic location of CNA-associated chromosomal breaks and identify genes that appear non-randomly affected by chromosomal breakpoints across (large) series of tumor samples. ‘GeneBreak’ is developed to systematically identify genes recurrently affected by the genomic location of chromosomal CNA-associated breaks by a genome-wide approach, which can be applied to DNA copy number data obtained by array-Comparative Genomic Hybridization (CGH) or by (low-pass) whole genome sequencing (WGS). First, ‘GeneBreak’ collects the genomic locations of chromosomal CNA-associated breaks that were previously pinpointed by the segmentation algorithm that was applied to obtain CNA profiles. Next, a tailored annotation approach for breakpoint-to-gene mapping is implemented. Finally, dedicated cohort-based statistics is incorporated with correction for covariates that influence the probability to be a breakpoint gene. In addition, multiple testing correction is integrated to reveal recurrent breakpoint events. This easy-to-use algorithm, ‘GeneBreak’, is implemented in R (www.cran.r-project.org) and is available from Bioconductor (www.bioconductor.org/packages/release/bioc/html/GeneBreak.html).
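
The first two steps described above (collecting breakpoints from segmented copy number profiles, then mapping them to genes) can be sketched roughly as follows. This is an illustration in Python, not the GeneBreak R package or its API: the function names, the tuple layout of segments, and the toy coordinates are all hypothetical, and the covariate-corrected statistics and multiple testing correction are left out entirely.

```python
def breakpoints_from_segments(segments):
    """Return (chrom, position) for each copy number change between adjacent segments.

    segments: list of (chrom, start, end, state) tuples sorted by chromosome
    and position (a hypothetical layout; real segmentation output differs).
    """
    breaks = []
    for prev, cur in zip(segments, segments[1:]):
        same_chrom = prev[0] == cur[0]
        state_changed = prev[3] != cur[3]
        if same_chrom and state_changed:
            breaks.append((cur[0], cur[1]))
    return breaks


def genes_hit(breaks, genes):
    """Return the names of genes whose interval contains at least one breakpoint.

    genes: dict mapping gene name -> (chrom, start, end).
    """
    hit = set()
    for chrom, pos in breaks:
        for name, (g_chrom, g_start, g_end) in genes.items():
            if chrom == g_chrom and g_start <= pos <= g_end:
                hit.add(name)
    return hit


def recurrence(samples, genes):
    """Count, per gene, the number of samples with a breakpoint inside the gene."""
    counts = {name: 0 for name in genes}
    for segments in samples:
        for name in genes_hit(breakpoints_from_segments(segments), genes):
            counts[name] += 1
    return counts


# Toy cohort: three samples, two genes on chromosome 1.
genes = {"G1": ("1", 100, 200), "G2": ("1", 500, 600)}
samples = [
    [("1", 1, 150, 0), ("1", 151, 1000, 1)],
    [("1", 1, 550, 0), ("1", 551, 1000, -1)],
    [("1", 1, 160, 1), ("1", 161, 400, 0), ("2", 1, 500, 1)],
]
counts = recurrence(samples, genes)
```

A raw count like this only shows which genes are hit most often; deciding whether a gene is hit more often than expected requires the cohort-based statistics the abstract mentions.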

  2. Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster

    Science.gov (United States)

    Wang, Wen; Brunet, Frédéric G.; Nevo, Eviatar; Long, Manyuan

    2002-01-01

    Non-protein-coding RNA genes play an important role in various biological processes. How new RNA genes originated and whether this process is controlled by similar evolutionary mechanisms for the origin of protein-coding genes remains unclear. A young chimeric RNA gene that we term sphinx (spx) provides the first insight into the early stage of evolution of RNA genes. spx originated as an insertion of a retroposed sequence of the ATP synthase chain F gene at the cytological region 60DB since the divergence of Drosophila melanogaster from its sibling species 2–3 million years ago. This retrosequence, which is located at 102F on the fourth chromosome, recruited a nearby exon and intron, thereby evolving a chimeric gene structure. This molecular process suggests that the mechanism of exon shuffling, which can generate protein-coding genes, also plays a role in the origin of RNA genes. The subsequent evolutionary process of spx has been associated with a high nucleotide substitution rate, possibly driven by a continuous positive Darwinian selection for a novel function, as is shown in its sex- and development-specific alternative splicing. To test whether spx has adapted to different environments, we investigated its population genetic structure in the unique “Evolution Canyon” in Israel, revealing a similar haplotype structure in spx, and thus similar evolutionary forces operating on spx between environments. PMID:11904380

  3. CAR gene cluster and transcript levels of carotenogenic genes in Rhodotorula mucilaginosa.

    Science.gov (United States)

    Landolfo, Sara; Ianiri, Giuseppe; Camiolo, Salvatore; Porceddu, Andrea; Mulas, Giuliana; Chessa, Rossella; Zara, Giacomo; Mannazzu, Ilaria

    2018-01-01

    A molecular approach was applied to the study of the carotenoid biosynthetic pathway of Rhodotorula mucilaginosa. At first, functional annotation of the genome of R. mucilaginosa C2.5t1 was carried out and gene ontology categories were assigned to 4033 predicted proteins. Then, a set of genes involved in different steps of carotenogenesis was identified and those coding for phytoene desaturase, phytoene synthase/lycopene cyclase and carotenoid dioxygenase (CAR genes) proved to be clustered within a region of ~10 kb. Quantitative PCR of the genes involved in carotenoid biosynthesis showed that genes coding for 3-hydroxy-3-methylglutaryl-CoA reductase and mevalonate kinase are induced during exponential phase while no clear trend of induction was observed for phytoene synthase/lycopene cyclase and phytoene dehydrogenase encoding genes. Thus, in R. mucilaginosa the induction of genes involved in the early steps of carotenoid biosynthesis is transient and accompanies the onset of carotenoid production, while that of CAR genes does not correlate with the amount of carotenoids produced. The transcript levels of genes coding for carotenoid dioxygenase, superoxide dismutase and catalase A increased during the accumulation of carotenoids, thus suggesting the activation of a mechanism aimed at the protection of cell structures from oxidative stress during carotenoid biosynthesis. The data presented herein, besides being suitable for the elucidation of the mechanisms that underlie carotenoid biosynthesis, will contribute to boosting the biotechnological potential of this yeast by improving the outcome of further research efforts aimed at also exploring other features of interest.

  4. Organization of Genes Required for the Oxidation of Methanol to Formaldehyde in Three Type II Methylotrophs

    Science.gov (United States)

    Bastien, C.; Machlin, S.; Zhang, Y.; Donaldson, K.; Hanson, R. S.

    1989-01-01

    Restriction maps of genes required for the synthesis of active methanol dehydrogenase in Methylobacterium organophilum XX and Methylobacterium sp. strain AM1 have been completed and compared. In these two species of pink-pigmented, type II methylotrophs, 15 genes were identified that were required for the expression of methanol dehydrogenase activity. None of these genes were required for the synthesis of the prosthetic group of methanol dehydrogenase, pyrroloquinoline quinone. The structural gene required for the synthesis of cytochrome cL, an electron acceptor uniquely required for methanol dehydrogenase, and the genes encoding small basic peptides that copurified with methanol dehydrogenases were closely linked to the methanol dehydrogenase structural genes. A cloned 22-kilobase DNA insert from Methylsporovibrio methanica 81Z, an obligate type II methanotroph, complemented mutants that contained lesions in four genes closely linked to the methanol dehydrogenase structural genes. The methanol dehydrogenase and cytochrome cL structural genes were found to be transcribed independently in M. organophilum XX. Only two of the genes required for methanol dehydrogenase synthesis in this bacterium were found to be cotranscribed. PMID:16348074

  5. ALGEBRAIC STRUCTURES AND SYNTHESIS PROTEIN: A DESCRIPTION OF GENE CAPN10

    Directory of Open Access Journals (Sweden)

    Obidio Rubio

    2016-06-01

    Full Text Available In this mainly informative article, some approaches to the algebraic modeling of human genome sequences are presented, with special emphasis on describing mutations in genes which, by modifying protein synthesis, are involved in genetic diseases such as Diabetes Mellitus. Mutations are described as endomorphisms on an R-module, which consists of a direct sum of groups of sequences of the 2q37.3 gene, over the rings Z64 and Z125, where the haplotype composed of the polymorphisms SNP43, SNP19 and SNP63 occurs.

  6. The BDGP gene disruption project: Single transposon insertions associated with 40 percent of Drosophila genes

    Energy Technology Data Exchange (ETDEWEB)

    Bellen, Hugo J.; Levis, Robert W.; Liao, Guochun; He, Yuchun; Carlson, Joseph W.; Tsang, Garson; Evans-Holm, Martha; Hiesinger, P. Robin; Schulze, Karen L.; Rubin, Gerald M.; Hoskins, Roger A.; Spradling, Allan C.

    2004-01-13

    The Berkeley Drosophila Genome Project (BDGP) strives to disrupt each Drosophila gene by the insertion of a single transposable element. As part of this effort, transposons in more than 30,000 fly strains were localized and analyzed relative to predicted Drosophila gene structures. Approximately 6,300 lines that maximize genomic coverage were selected to be sent to the Bloomington Stock Center for public distribution, bringing the size of the BDGP gene disruption collection to 7,140 lines. It now includes individual lines predicted to disrupt 5,362 of the 13,666 currently annotated Drosophila genes (39 percent). Other lines contain an insertion at least 2 kb from others in the collection and likely mutate additional incompletely annotated or uncharacterized genes and chromosomal regulatory elements. The remaining strains contain insertions likely to disrupt alternative gene promoters or to allow gene mis-expression. The expanded BDGP gene disruption collection provides a public resource that will facilitate the application of Drosophila genetics to diverse biological problems. Finally, the project reveals new insight into how transposons interact with a eukaryotic genome and helps define optimal strategies for using insertional mutagenesis as a genomic tool.
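
The coverage figure quoted in the abstract can be checked with a short calculation. The two counts are taken from the abstract; the variable names are illustrative.

```python
# Counts quoted in the abstract above.
annotated_genes = 13666
disrupted_genes = 5362

# Share of annotated genes with a predicted disrupting insertion, in percent.
percent = round(100 * disrupted_genes / annotated_genes, 1)
print(f"{percent}% of annotated genes")
```

The result, about 39.2%, agrees with the "39 percent" in the abstract and the rounded "40 percent" in the title.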

  7. Physical and genetic map of the major nif gene cluster from Azotobacter vinelandii.

    OpenAIRE

    Jacobson, M R; Brigle, K E; Bennett, L T; Setterquist, R A; Wilson, M S; Cash, V L; Beynon, J; Newton, W E; Dean, D R

    1989-01-01

    Determination of a 28,793-base-pair DNA sequence of a region from the Azotobacter vinelandii genome that includes and flanks the nitrogenase structural gene region was completed. This information was used to revise the previously proposed organization of the major nif cluster. The major nif cluster from A. vinelandii encodes 15 nif-specific genes whose products bear significant structural identity to the corresponding nif-specific gene products from Klebsiella pneumoniae. These genes include ...

  8. Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

    Science.gov (United States)

    Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

    2010-10-07

    The PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse functions of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried out to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out a comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that the PHB genes form a relatively conserved gene family. The PHB genes can be classified into 5 classes, and each class has a very deep evolutionary origin. The PHB genes within each class maintained the same motif patterns during evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under evolutionary pressure to derive new functions. The PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out

  9. Crystal Structure of Borrelia turicatae protein, BTA121, a differentially regulated gene in the tick-mammalian transmission cycle of relapsing fever spirochetes

    Energy Technology Data Exchange (ETDEWEB)

    Luo, Zhipu; Kelleher, Alan J.; Darwiche, Rabih; Hudspeth, Elissa M.; Shittu, Oluwatosin K.; Krishnavajhala, Aparna; Schneiter, Roger; Lopez, Job E.; Asojo, Oluwatoyin A. (Baylor); (Fribourg); (NCI)

    2017-11-10

    Tick-borne relapsing fever (RF) borreliosis is a neglected disease that is often misdiagnosed. RF species circulating in the United States include Borrelia turicatae, which is transmitted by argasid ticks. Environmental adaptation by RF Borrelia is poorly understood, however our previous studies indicated differential regulation of B. turicatae genes localized on the 150 kb linear megaplasmid during the tick-mammalian transmission cycle, including bta121. This gene is up-regulated by B. turicatae in the tick versus the mammal, and the encoded protein (BTA121) is predicted to be surface localized. The structure of BTA121 was solved by single-wavelength anomalous dispersion (SAD) using selenomethionine-derivative protein. The topology of BTA121 is unique with four helical domains organized into two helical bundles. Due to the sequence similarity of several genes on the megaplasmid, BTA121 can serve as a model for their tertiary structures. BTA121 has large interconnected tunnels and cavities that can accommodate ligands, notably long parallel helices, which have a large hydrophobic central pocket. Preliminary in-vitro studies suggest that BTA121 binds lipids, notably palmitate with a similar order of binding affinity as tablysin-15, a known palmitate-binding protein. The reported data will guide mechanistic studies to determine the role of BTA121 in the tick-mammalian transmission cycle of B. turicatae.

  10. Genome-Wide Identification and Structural Analysis of bZIP Transcription Factor Genes in Brassica napus.

    Science.gov (United States)

    Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin

    2017-10-24

    The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus.

  11. In silico identification and analysis of phytoene synthase genes in plants.

    Science.gov (United States)

    Han, Y; Zheng, Q S; Wei, Y P; Chen, J; Liu, R; Wan, H J

    2015-08-14

    In this study, we examined phytoene synthetase (PSY), the first key limiting enzyme in the synthesis of carotenoids and catalyzing the formation of geranylgeranyl pyrophosphate in terpenoid biosynthesis. We used known amino acid sequences of the PSY gene in tomato plants to conduct a genome-wide search and identify putative candidates in 34 sequenced plants. A total of 101 homologous genes were identified. Phylogenetic analysis revealed that PSY evolved independently in algae as well as monocotyledonous and dicotyledonous plants. Our results showed that the amino acid structures exhibited 5 motifs (motifs 1 to 5) in algae and those in higher plants were highly conserved. The PSY gene structures showed that the number of introns in algae varied widely, while the number of introns in higher plants was 4 to 5. Identification of PSY genes in plants and the analysis of the gene structure may provide a theoretical basis for studying evolutionary relationships in future analyses.

  12. A Computational Approach From Gene to Structure Analysis of the Human ABCA4 Transporter Involved in Genetic Retinal Diseases.

    Science.gov (United States)

    Trezza, Alfonso; Bernini, Andrea; Langella, Andrea; Ascher, David B; Pires, Douglas E V; Sodi, Andrea; Passerini, Ilaria; Pelo, Elisabetta; Rizzo, Stanislao; Niccolai, Neri; Spiga, Ottavia

    2017-10-01

    The aim of this article is to report the investigation of the structural features of ABCA4, a protein associated with a genetic retinal disease. A new database collecting knowledge of ABCA4 structure may facilitate predictions about the possible functional consequences of gene mutations observed in clinical practice. In order to correlate structural and functional effects of the observed mutations, the structure of mouse P-glycoprotein was used as a template for homology modeling. The obtained structural information and genetic data are the basis of our relational database (ABCA4Database). Sequence variability among all ABCA4-deposited entries was calculated and reported as Shannon entropy score at the residue level. The three-dimensional model of ABCA4 structure was used to locate the spatial distribution of the observed variable regions. Our predictions from structural in silico tools were able to accurately link the functional effects of mutations to phenotype. The development of the ABCA4Database gathers all the available genetic and structural information, yielding a global view of the molecular basis of some retinal diseases. ABCA4 modeled structure provides a molecular basis on which to analyze protein sequence mutations related to genetic retinal disease in order to predict the risk of retinal disease across all possible ABCA4 mutations. Additionally, our ABCA4 predicted structure is a good starting point for the creation of a new data analysis model, appropriate for precision medicine, in order to develop a deeper knowledge network of the disease and to improve the management of patients.
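
    The abstract above reports sequence variability as a per-residue Shannon entropy score. The following is a minimal sketch of that kind of calculation, not the authors' pipeline: it assumes the sequences are already aligned to equal length, treats every character (including a gap, if present) as a symbol, and uses a toy alignment that is hypothetical and not ABCA4 data. The function names are illustrative.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the symbols in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def alignment_entropy(sequences):
    """Per-position entropy for a list of equal-length aligned sequences."""
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    # zip(*sequences) walks the alignment column by column
    return [column_entropy(col) for col in zip(*sequences)]


# Hypothetical toy alignment: position 0 is invariant, positions 1-3 vary
toy_alignment = ["MKVL", "MKIL", "MRVL", "MKVF"]
scores = alignment_entropy(toy_alignment)
for position, score in enumerate(scores, start=1):
    print(f"position {position}: {score:.3f} bits")
```

    An entropy of zero marks a fully conserved position; higher values mark more variable positions, which could then be mapped onto a structural model as the abstract describes.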

  13. Effects of polycyclic aromatic hydrocarbons on microbial community structure and PAH ring hydroxylating dioxygenase gene abundance in soil.

    Science.gov (United States)

    Sawulski, Przemyslaw; Clipson, Nicholas; Doyle, Evelyn

    2014-11-01

    Development of successful bioremediation strategies for environments contaminated with recalcitrant pollutants requires in-depth knowledge of the microorganisms and microbial processes involved in degradation. The response of soil microbial communities to three polycyclic aromatic hydrocarbons, phenanthrene (3-ring), fluoranthene (4-ring) and benzo(a)pyrene (5-ring), was examined. Profiles of bacterial, archaeal and fungal communities were generated using molecular fingerprinting techniques (TRFLP, ARISA) and multivariate statistical tools were employed to interpret the effect of PAHs on community dynamics and composition. The extent and rate of PAH removal was directly related to the chemical structure, with the 5-ring PAH benzo(a)pyrene degraded more slowly than phenanthrene or fluoranthene. Bacterial, archaeal and fungal communities were all significantly affected by PAH amendment, time and their interaction. Based on analysis of clone libraries, Actinobacteria appeared to dominate in fluoranthene amended soil, although they also represented a significant portion of the diversity in phenanthrene amended and unamended soils. In addition there appeared to be more γ-Proteobacteria and fewer Bacteroidetes in soil amended with either PAH compared to the control. The soil bacterial community clearly possessed the potential to degrade PAHs as evidenced by the abundance of PAH ring hydroxylating (PAH-RHDα) genes from both gram negative (GN) and gram positive (GP) bacteria in PAH-amended and control soils. Although the dioxygenase gene from GP bacteria was less abundant in soil than the gene associated with GN bacteria, significant (p PAH-RHDα gene were observed during phenanthrene and fluoranthene degradation, whereas there was no significant difference in the abundance of the GN PAH-RHDα gene during the course of the experiment. Few studies to-date have examined the effect of pollutants on more than one microbial community in soil. The current study provides

  14. Structural divergence of Plant TCTPs

    Directory of Open Access Journals (Sweden)

    Diego Gutiérrez-Galeano

    2014-07-01

    The Translationally Controlled Tumor Protein (TCTP) is a highly conserved protein at the level of sequence, considered to play an essential role in the regulation of growth and development in eukaryotes. However, this function has been inferred from studies in a few model systems, such as mice and mammalian cell lines, Drosophila and Arabidopsis. Thus, the knowledge regarding this protein is far from complete. In the present study bioinformatic analysis showed the presence of one or more TCTP genes per genome in plants with highly conserved signatures and subtle variations at the level of primary structure but with more noticeable differences at the level of predicted three-dimensional structures. These structures show differences in the pocket region close to the center of the protein and in its flexible loop domain. In fact, all predicted TCTP structures can be divided into two groups, (1) AtTCTP1-like and (2) CmTCTP-like, based on the predicted structures of an Arabidopsis TCTP and a Cucurbita maxima TCTP; according to this classification we propose that their probable function in plants may be inferred in principle. Thus different TCTP genes in a single organism may have different functions; additionally, in those species harboring a single TCTP gene this could carry multiple functions. On the other hand, in silico analysis of AtTCTP1-like and CmTCTP-like promoters suggests that these share common motifs but with different abundance, which may underscore differences in their gene expression patterns. Finally, the absence of TCTP genes in most chlorophytes, with the exception of Coccomyxa subellipsoidea, indicates that other proteins perform the roles played by TCTP or the pathways regulated by TCTP occur through alternative routes. These findings provide insight into the evolution of this gene family in plants.

  15. tRNA gene diversity in the three domains of life

    Directory of Open Access Journals (Sweden)

    Kosuke Fujishima

    2014-05-01

    Transfer RNA (tRNA) is widely known for its key role in decoding mRNA into protein. Despite their necessity and relatively short nucleotide sequences, a large diversity of gene structures and RNA secondary structures of pre-tRNAs and mature tRNAs have recently been discovered in the three domains of life. Growing evidence of disrupted tRNA genes in the genomes of Archaea reveals unique gene structures such as intron-containing tRNA, split tRNA, and permuted tRNA. Coding sequences for these tRNAs are either separated with introns, fragmented, or permuted at the genome level. Although the evolutionary scenario behind the tRNA gene disruption is still unclear, the diversity of tRNA structure seems to have co-evolved with their processing enzyme, the so-called RNA splicing endonuclease. Metazoan mitochondrial tRNAs (mtRNAs) are known for their unique lack of either one or two arms from the typical tRNA cloverleaf structure, while still maintaining functionality. Recently identified nematode-specific V-arm containing tRNAs (nev-tRNAs) possess long variable arms that are specific to eukaryotic class II tRNASer and tRNALeu but also decode class I tRNA codons. Moreover, many tRNA-like sequences have been found in the genomes of different organisms and viruses. This review therefore aims to cover the latest knowledge on tRNA gene diversity and further recapitulate the evolutionary and biological aspects that caused such uniqueness.

  16. Partial characterization of nif genes from the bacterium Azospirillum amazonense

    Directory of Open Access Journals (Sweden)

    D.P. Potrich

    2001-09-01

    Azospirillum amazonense revealed genomic organization patterns of the nitrogen fixation genes similar to those of the distantly related species A. brasilense. Our work suggests that A. brasilense nifHDK, nifENX, fixABC operons and nifA and glnB genes may be structurally homologous to the counterpart genes of A. amazonense. This is the first analysis revealing homology between A. brasilense nif genes and the A. amazonense genome. Sequence analysis of PCR amplification products revealed similarities between the amino acid sequences of the highly conserved nifD and glnB genes of A. amazonense and related genes of A. brasilense and other bacteria. However, the A. amazonense non-coding regions (the upstream activator sequence region and the region between the nifH and nifD genes) differed from related regions of A. brasilense even in nitrogenase structural genes which are highly conserved among diazotrophic bacteria. The feasibility of the 16S ribosomal RNA gene-based PCR system for specific detection of A. amazonense was shown. Our results indicate that the PCR primers for 16S rDNA defined in this article are highly specific to A. amazonense and can distinguish this species from A. brasilense.

  17. The WRKY Transcription Factor Genes in Lotus japonicus

    OpenAIRE

    Song, Hui; Wang, Pengfei; Nan, Zhibiao; Wang, Xingjun

    2014-01-01

    WRKY transcription factor genes play critical roles in plant growth and development, as well as stress responses. WRKY genes have been examined in various higher plants, but they have not been characterized in Lotus japonicus. The recent release of the L. japonicus whole genome sequence provides an opportunity for a genome wide analysis of WRKY genes in this species. In this study, we identified 61 WRKY genes in the L. japonicus genome. Based on the WRKY protein structure, L. japonicus WRKY (...

  18. The Crc global regulator inhibits the Pseudomonas putida pWW0 toluene/xylene assimilation pathway by repressing the translation of regulatory and structural genes.

    Science.gov (United States)

    Moreno, Renata; Fonseca, Pilar; Rojo, Fernando

    2010-08-06

    In Pseudomonas putida, the expression of the pWW0 plasmid genes for the toluene/xylene assimilation pathway (the TOL pathway) is subject to complex regulation in response to environmental and physiological signals. This includes strong inhibition via catabolite repression, elicited by the carbon sources that the cells prefer to hydrocarbons. The Crc protein, a global regulator that controls carbon flow in pseudomonads, has an important role in this inhibition. Crc is a translational repressor that regulates the TOL genes, but how it does this has remained unknown. This study reports that Crc binds to sites located at the translation initiation regions of the mRNAs coding for XylR and XylS, two specific transcription activators of the TOL genes. Unexpectedly, eight additional Crc binding sites were found overlapping the translation initiation sites of genes coding for several enzymes of the pathway, all encoded within two polycistronic mRNAs. Evidence is provided supporting the idea that these sites are functional. This implies that Crc can differentially modulate the expression of particular genes within polycistronic mRNAs. It is proposed that Crc controls TOL genes in two ways. First, Crc inhibits the translation of the XylR and XylS regulators, thereby reducing the transcription of all TOL pathway genes. Second, Crc inhibits the translation of specific structural genes of the pathway, acting mainly on proteins involved in the first steps of toluene assimilation. This ensures a rapid inhibitory response that reduces the expression of the toluene/xylene degradation proteins when preferred carbon sources become available.

  19. Microbial functional genes enriched in the Xiangjiang River sediments with heavy metal contamination.

    Science.gov (United States)

    Jie, Shiqi; Li, Mingming; Gan, Min; Zhu, Jianyu; Yin, Huaqun; Liu, Xueduan

    2016-08-08

    Xiangjiang River (Hunan, China) has been contaminated with heavy metals for several decades by surrounding factories. However, little is known about the influence of a gradient of heavy metal contamination on the diversity and structure of microbial functional genes in sediment. To better understand the impact of heavy metal contamination on the microbial community, a comprehensive functional gene array (GeoChip 5.0) was used to study the functional gene structure, composition, diversity and metabolic potential of microbial communities from three heavy metal polluted sites of Xiangjiang River. A total of 25595 functional genes involved in different biogeochemical processes were detected in the three sites, and different diversities and structures of microbial functional genes were observed. The analysis of gene overlapping, unique genes, and various diversity indices indicated a significant correlation between the level of heavy metal contamination and the functional diversity. Plentiful resistance genes related to various metals were detected, such as copper, arsenic, chromium and mercury. The results indicated a significantly higher abundance of genes involved in metal resistance, including sulfate reduction genes (dsr), in the studied site with the most serious heavy metal contamination, such as the cueo, mer, metc, merb, tehb and terc genes. With regard to the relationship between the environmental variables and microbial functional structure, S, Cu, Cd, Hg and Cr were the dominating factors shaping the microbial distribution pattern in the three sites. This study suggests that high levels of heavy metal contamination resulted in higher functional diversity and abundance of metal resistance genes. These variations therefore significantly contribute to the resistance, resilience and stability of the microbial community subjected to the gradient of heavy metal contaminants in Xiangjiang River.

  20. Methylation of miRNA genes and oncogenesis.

    Science.gov (United States)

    Loginov, V I; Rykov, S V; Fridman, M V; Braga, E A

    2015-02-01

    Interaction between microRNA (miRNA) and messenger RNA of target genes at the posttranscriptional level provides fine-tuned dynamic regulation of cell signaling pathways. Each miRNA can be involved in regulating hundreds of protein-coding genes, and, conversely, a number of different miRNAs usually target a structural gene. Epigenetic gene inactivation associated with methylation of promoter CpG-islands is common to both protein-coding genes and miRNA genes. Here, data on functions of miRNAs in development of tumor-cell phenotype are reviewed. Genomic organization of promoter CpG-islands of the miRNA genes located in inter- and intragenic areas is discussed. The literature and our own results on frequency of CpG-island methylation in miRNA genes from tumors are summarized, and data regarding a link between such modification and changed activity of miRNA genes and, consequently, protein-coding target genes are presented. Moreover, the impact of miRNA gene methylation on key oncogenetic processes as well as affected signaling pathways is discussed.

  1. Structural organization and classification of cytochrome P450 genes in flax (Linum usitatissimum L.).

    Science.gov (United States)

    Babu, Peram Ravindra; Rao, Khareedu Venkateswara; Reddy, Vudem Dashavantha

    2013-01-15

    Flax CYPome analysis resulted in the identification of 334 putative cytochrome P450 (CYP450) genes in the cultivated flax genome. Classification of flax CYP450 genes based on the sequence similarity with Arabidopsis orthologs and CYP450 nomenclature, revealed 10 clans representing 44 families and 98 subfamilies. CYP80, CYP83, CYP92, CYP702, CYP705, CYP708, CYP728, CYP729, CYP733 and CYP736 families are absent in the flax genome. The subfamily members exhibited conserved sequences, length of exons and phasing of introns. Similarity search of the genomic resources of wild flax species Linum bienne with CYP450 coding sequences of the cultivated flax, revealed the presence of 127 CYP450 gene orthologs, indicating amplification of novel CYP450 genes in the cultivated flax. Seven families CYP73, 74, 75, 76, 77, 84 and 709, coding for enzymes associated with phenylpropanoid/fatty acid metabolism, showed extensive gene amplification in the flax. About 59% of the flax CYP450 genes were present in the EST libraries. Copyright © 2012 Elsevier B.V. All rights reserved.

  2. Diversification of Root Hair Development Genes in Vascular Plants.

    Science.gov (United States)

    Huang, Ling; Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui; Schiefelbein, John

    2017-07-01

    The molecular genetic program for root hair development has been studied intensively in Arabidopsis (Arabidopsis thaliana). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. © 2017 American Society of Plant Biologists. All Rights Reserved.

  3. Determining Semantically Related Significant Genes.

    Science.gov (United States)

    Taha, Kamal

    2014-01-01

    GO relation embodies some aspects of existence dependency. If GO term x is existence-dependent on GO term y, the presence of y implies the presence of x. Therefore, the genes annotated with the function of the GO term y are usually functionally and semantically related to the genes annotated with the function of the GO term x. A large number of gene set enrichment analysis methods have been developed in recent years for analyzing gene set enrichment. However, most of these methods overlook the structural dependencies between GO terms in the GO graph by not considering the concept of existence dependency. We propose in this paper a biological search engine called RSGSearch that identifies enriched sets of genes annotated with different functions using the concept of existence dependency. We observe that GO term x cannot be existence-dependent on GO term y if x and y have the same specificity (biological characteristics). After encoding into a numeric format the contributions of GO terms annotating target genes to the semantics of their lowest common ancestors (LCAs), RSGSearch uses microarray experiment to identify the most significant LCA that annotates the result genes. We evaluated RSGSearch experimentally and compared it with five gene set enrichment systems. Results showed marked improvement.
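
    The abstract above relies on lowest common ancestors (LCAs) of GO terms in the GO graph. As a rough illustration of that one concept only, and not of RSGSearch's scoring or its existence-dependency encoding, the sketch below finds LCAs in a small directed acyclic graph. The term names and the parent map are hypothetical placeholders, not real GO identifiers.

```python
def ancestors(term, parents):
    """Return the term together with every term reachable via parent links."""
    seen = {term}
    stack = [term]
    while stack:
        node = stack.pop()
        for parent in parents.get(node, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def lowest_common_ancestors(a, b, parents):
    """Common ancestors of a and b with no other common ancestor below them."""
    common = ancestors(a, parents) & ancestors(b, parents)
    # Drop any common ancestor that is itself an ancestor of another one
    return {
        c for c in common
        if not any(c != d and c in ancestors(d, parents) for d in common)
    }


# Hypothetical ontology fragment: child term -> list of parent terms
toy_parents = {
    "root": [],
    "metabolism": ["root"],
    "transport": ["root"],
    "lipid_metabolism": ["metabolism"],
    "lipid_transport": ["transport", "lipid_metabolism"],
    "fatty_acid_metabolism": ["lipid_metabolism"],
}

print(lowest_common_ancestors("lipid_transport", "fatty_acid_metabolism", toy_parents))
```

    Because a term in a DAG can have several parents, the result is a set: two terms may share more than one lowest common ancestor.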

  4. Gene Fusion Markup Language: a prototype for exchanging gene fusion data.

    Science.gov (United States)

    Kalyana-Sundaram, Shanker; Shanmugam, Achiraman; Chinnaiyan, Arul M

    2012-10-16

    An avalanche of next generation sequencing (NGS) studies has generated an unprecedented amount of genomic structural variation data. These studies have also identified many novel gene fusion candidates with more detailed resolution than previously achieved. However, in the excitement and necessity of publishing the observations from this recently developed cutting-edge technology, no community standardization approach has arisen to organize and represent the data with the essential attributes in an interchangeable manner. As transcriptome studies have been widely used for gene fusion discoveries, the current non-standard mode of data representation could potentially impede data accessibility, critical analyses, and further discoveries in the near future. Here we propose a prototype, Gene Fusion Markup Language (GFML) as an initiative to provide a standard format for organizing and representing the significant features of gene fusion data. GFML will offer the advantage of representing the data in a machine-readable format to enable data exchange, automated analysis interpretation, and independent verification. As this database-independent exchange initiative evolves it will further facilitate the formation of related databases, repositories, and analysis tools. The GFML prototype is made available at http://code.google.com/p/gfml-prototype/. The Gene Fusion Markup Language (GFML) presented here could facilitate the development of a standard format for organizing, integrating and representing the significant features of gene fusion data in an inter-operable and query-able fashion that will enable biologically intuitive access to gene fusion findings and expedite functional characterization. A similar model is envisaged for other NGS data analyses.

  5. Invasion fitness for gene-culture co-evolution in family-structured populations and an application to cumulative culture under vertical transmission.

    Science.gov (United States)

    Mullon, Charles; Lehmann, Laurent

    2017-08-01

    Human evolution depends on the co-evolution between genetically determined behaviors and socially transmitted information. Although vertical transmission of cultural information from parent to offspring is common in hominins, its effects on cumulative cultural evolution are not fully understood. Here, we investigate gene-culture co-evolution in a family-structured population by studying the invasion fitness of a mutant allele that influences a deterministic level of cultural information (e.g., amount of knowledge or skill) to which diploid carriers of the mutant are exposed in subsequent generations. We show that the selection gradient on such a mutant, and the concomitant level of cultural information it generates, can be evaluated analytically under the assumption that the cultural dynamic has a single attractor point, thereby making gene-culture co-evolution in family-structured populations with multigenerational effects mathematically tractable. We apply our result to study how genetically determined phenotypes of individual and social learning co-evolve with the level of adaptive information they generate under vertical transmission. We find that vertical transmission increases adaptive information due to kin selection effects, but when information is transmitted as efficiently between family members as between unrelated individuals, this increase is moderate in diploids. By contrast, we show that the way resource allocation into learning trades off with allocation into reproduction (the "learning-reproduction trade-off") significantly influences levels of adaptive information. We also show that vertical transmission prevents evolutionary branching and may therefore play a qualitative role in gene-culture co-evolutionary dynamics. More generally, our analysis of selection suggests that vertical transmission can significantly increase levels of adaptive information under the biologically plausible condition that information transmission between relatives is

  6. Evolutionary conservation and network structure characterize genes of phenotypic relevance for mitosis in human.

    Directory of Open Access Journals (Sweden)

    Marek Ostaszewski

    The impact of gene silencing on cellular phenotypes is difficult to establish due to the complexity of interactions in the associated biological processes and pathways. A recent genome-wide RNA knock-down study both identified and phenotypically characterized a set of important genes for the cell cycle in HeLa cells. Here, we combine a molecular interaction network analysis, based on physical and functional protein interactions, in conjunction with evolutionary information, to elucidate the common biological and topological properties of these key genes. Our results show that these genes tend to be conserved with their corresponding protein interactions across several species and are key constituents of the evolutionary conserved molecular interaction network. Moreover, a group of bistable network motifs is found to be conserved within this network, which are likely to influence the network stability and therefore the robustness of cellular functioning. They form a cluster, which displays functional homogeneity and is significantly enriched in genes phenotypically relevant for mitosis. Additional results reveal a relationship between specific cellular processes and the phenotypic outcomes induced by gene silencing. This study introduces new ideas regarding the relationship between genotype and phenotype in the context of the cell cycle. We show that the analysis of molecular interaction networks can result in the identification of genes relevant to cellular processes, which is a promising avenue for future research.

  7. Identifying the genes of unconventional high temperature superconductors.

    Science.gov (United States)

    Hu, Jiangping

    We elucidate a recently emergent framework in unifying the two families of high temperature (high Tc) superconductors, cuprates and iron-based superconductors. The unification suggests that the latter is simply the counterpart of the former to realize robust extended s-wave pairing symmetries in a square lattice. The unification identifies that the key ingredient (gene) of high Tc superconductors is a quasi two dimensional electronic environment in which the d-orbitals of cations that participate in strong in-plane couplings to the p-orbitals of anions are isolated near Fermi energy. With this gene, the superexchange magnetic interactions mediated by anions could maximize their contributions to superconductivity. Creating the gene requires special arrangements between local electronic structures and crystal lattice structures. The speciality explains why high Tc superconductors are so rare. An explicit prediction is made to realize high Tc superconductivity in Co/Ni-based materials with a quasi two dimensional hexagonal lattice structure formed by trigonal bipyramidal complexes.

  8. Chromosome 15 structural abnormalities: effect on IGF1R gene expression and function

    Directory of Open Access Journals (Sweden)

    Rossella Cannarella

    2017-09-01

    Insulin-like growth factor 1 receptor (IGF1R), mapping on the 15q26.3 chromosome, is required for normal embryonic and postnatal growth. The aim of the present study was to evaluate the IGF1R gene expression and function in three unrelated patients with chromosome 15 structural abnormalities. We report two male patients with the smallest 15q26.3 chromosome duplication described so far, and a female patient with ring chromosome 15 syndrome. Patient one, with a 568 kb pure duplication, had overgrowth, developmental delay, mental and psychomotor retardation, obesity, cryptorchidism, borderline low testis volume, severe oligoasthenoteratozoospermia and gynecomastia. We found a 1.8-fold increase in the IGF1R mRNA and a 1.3-fold increase in the IGF1R protein expression (P < 0.05). Patient two, with a 650 kb impure duplication, showed overgrowth, developmental delay, mild mental retardation, precocious puberty, low testicular volume and severe oligoasthenoteratozoospermia. The IGF1R mRNA and protein expression was similar to that of the control. Patient three, with a 46,XX r(15)(p10q26.2) karyotype, displayed intrauterine growth retardation, developmental delay, mental and psychomotor retardation. We found a <0.5-fold decrease in the IGF1R mRNA expression and an undetectable IGF1R activity. After reviewing the 96 previously published cases of chromosome 15q duplication, we found that neurological disorders, congenital cardiac defects, typical facial traits and gonadal abnormalities are the prominent features in patients with chromosome 15q duplication. Interestingly, patients with 15q deletion syndrome display similar features. We speculate that both the increased and decreased IGF1R gene expression may play a role in the etiology of neurological and gonadal disorders.

  9. Role of genes in oro-dental diseases

    Directory of Open Access Journals (Sweden)

    Kavitha B

    2010-01-01

    In the oral cavity, the spectrum of diseases due to genetic alterations ranges from developmental disturbances of teeth to pre-cancerous and cancerous lesions. Of late, significant progress has been made in the molecular analysis of tumors. With molecular genetic testing emerging as a diagnostic, prognostic, and therapeutic approach, genetic alterations ranging from the development of oro-facial structures to tumors in the head and neck region are reviewed in this article. The functional regulatory aspects of genes in relation to oro-facial structures are discussed separately, i.e., in relation to tooth genesis, tooth agenesis (non-syndromic, syndromic), tooth structural alterations, syndromic oro-facial defects, bone diseases, skin diseases (genodermatoses), and malignant tumors. Various genes involved in the development of the oro-facial structures, and of teeth in particular, are discussed. The genetic basis of disorders in tooth development (agenesis, hypodontia), tooth structural defects like amelogenesis imperfecta (AI) and dentinogenesis imperfecta (DI), and oro-facial structural alterations (various syndromes) is explained.

  10. Enhanced gene ranking approaches using modified trace ratio algorithm for gene expression data

    Directory of Open Access Journals (Sweden)

    Shruti Mishra

    Full Text Available Microarray technology enables the understanding and investigation of gene expression levels by analyzing high dimensional datasets that contain few samples. Over time, microarray expression data have been collected for studying the underlying biological mechanisms of disease. One such application for understanding the mechanism is by constructing a gene regulatory network (GRN). One of the foremost key criteria for GRN discovery is gene selection. Choosing a generous set of genes for the structure of the network is highly desirable. For this role, two suitable methods were proposed for selection of appropriate genes. The first approach comprises a gene selection method called Information gain, where the dataset is reformed and fused with another distinct algorithm called Trace Ratio (TR). Our second method is the implementation of our projected modified TR algorithm, where the scoring base for finding weight matrices has been re-designed. Both the methods' efficiency was shown with different classifiers that include variants of the Artificial Neural Network classifier, such as Resilient Propagation, Quick Propagation, Back Propagation, Manhattan Propagation and Radial Basis Function Neural Network and also the Support Vector Machine (SVM) classifier. In the study, it was confirmed that both of the proposed methods worked well and offered high accuracy with a lesser number of iterations as compared to the original Trace Ratio algorithm. Keywords: Gene regulatory network, Gene selection, Information gain, Trace ratio, Canonical correlation analysis, Classification
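
    The information-gain step mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' method: the fusion with Trace Ratio and the modified scoring are not reproduced, the single-threshold split is an assumption made for illustration, and the gene names and expression values are hypothetical toy data.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(values, labels):
    """Best information gain over all single-threshold splits of one gene."""
    base = entropy(labels)
    pairs = sorted(zip(values, labels))
    best = 0.0
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate equal expression values
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        split = (len(left) * entropy(left) + len(right) * entropy(right)) / len(pairs)
        best = max(best, base - split)
    return best


def rank_genes(expression, labels):
    """Return gene names sorted by decreasing information gain."""
    scores = {gene: information_gain(vals, labels) for gene, vals in expression.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Toy data (hypothetical): three genes measured in six samples.
labels = ["tumor", "tumor", "tumor", "normal", "normal", "normal"]
expression = {
    "geneA": [5.1, 4.8, 5.5, 1.2, 0.9, 1.4],  # separates the classes cleanly
    "geneB": [2.0, 3.1, 1.0, 2.9, 1.1, 2.1],  # largely uninformative
    "geneC": [4.0, 4.2, 1.0, 1.1, 0.8, 1.3],  # partially informative
}
ranking = rank_genes(expression, labels)
```

    In this toy setting the gene that separates the two classes perfectly ranks first; a real pipeline would pass the top-ranked genes on to a classifier.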

  11. Evolutionary origin of Rosaceae-specific active non-autonomous hAT elements and their contribution to gene regulation and genomic structural variation.

    Science.gov (United States)

    Wang, Lu; Peng, Qian; Zhao, Jianbo; Ren, Fei; Zhou, Hui; Wang, Wei; Liao, Liao; Owiti, Albert; Jiang, Quan; Han, Yuepeng

    2016-05-01

    Transposable elements account for approximately 30% of the Prunus genome; however, their evolutionary origin and functionality remain largely unclear. In this study, we identified a hAT transposon family, termed Moshan, in Prunus. The Moshan elements consist of three types, aMoshan, tMoshan, and mMoshan. The aMoshan and tMoshan types contain intact or truncated transposase genes, respectively, while the mMoshan type is a miniature inverted-repeat transposable element (MITE). The Moshan transposons are unique to Rosaceae, and the copy numbers of different Moshan types are significantly correlated. Sequence homology analysis reveals that the mMoshan MITEs are direct deletion derivatives of the tMoshan progenitors, and one kind of mMoshan containing a MuDR-derived fragment was amplified predominantly in the peach genome. The mMoshan sequences contain cis-regulatory elements that can enhance gene expression up to 100-fold. The mMoshan MITEs can serve as potential sources of micro and long noncoding RNAs. Whole-genome re-sequencing analysis indicates that mMoshan elements are highly active, and an insertion into the S-haplotype-specific F-box gene was reported to cause the breakdown of self-incompatibility in sour cherry. Taken together, all these results suggest that the mMoshan elements play important roles in regulating gene expression and driving genomic structural variation in Prunus.

  12. Glycosyltransferase Gene Expression Profiles Classify Cancer Types and Propose Prognostic Subtypes

    Science.gov (United States)

    Ashkani, Jahanshah; Naidoo, Kevin J.

    2016-05-01

    Aberrant glycosylation in tumours stems from altered glycosyltransferase (GT) gene expression, but can the expression profiles of these signature genes be used to classify cancer types and lead to cancer subtype discovery? The differential structural changes to cellular glycan structures are predominantly regulated by the expression patterns of GT genes and are a hallmark of neoplastic cell metamorphoses. We found that the expression of 210 GT genes taken from 1893 cancer patient samples in The Cancer Genome Atlas (TCGA) microarray data is able to classify six cancers: breast, ovarian, glioblastoma, kidney, colon and lung. The GT gene expression profiles are used to develop cancer classifiers and propose subtypes. The subclassification of breast cancer solid tumour samples illustrates the discovery of subgroups from GT genes that match well against basal-like and HER2-enriched subtypes and correlate with clinical, mutation and survival data. This cancer type glycosyltransferase gene signature finding provides foundational evidence for the centrality of glycosylation in cancer.

  13. Recent Advancements in Gene Therapy for Hereditary Retinal Dystrophies

    Directory of Open Access Journals (Sweden)

    Ayşe Öner

    2017-12-01

    Full Text Available Hereditary retinal dystrophies (HRDs) are degenerative diseases of the retina which have marked clinical and genetic heterogeneity. Common presentations among these disorders include night or colour blindness, tunnel vision, and subsequent progression to complete blindness. The known causative disease genes have a variety of developmental and functional roles, with mutations in more than 120 genes shown to be responsible for the phenotypes. In addition, mutations within the same gene have been shown to cause different disease phenotypes, even amongst affected individuals within the same family, highlighting further levels of complexity. The known disease genes encode proteins involved in retinal cellular structures, phototransduction, the visual cycle, and photoreceptor structure or gene regulation. Significant advancements have been made in understanding the genetic pathogenesis of ocular diseases, and gene replacement and gene silencing have been proposed as potentially efficacious therapies. Because of its favorable anatomical and immunological characteristics, the eye has been at the forefront of translational gene therapy. Recent improvements have been made in the safety and specificity of vector-based ocular gene transfer methods. Dozens of promising proofs of concept have been obtained in animal models of HRDs and some of them have been relayed to the clinic. The results from the first clinical trials for a congenital form of blindness have generated great interest and have demonstrated the safety and efficacy of intraocular administrations of viral vectors in humans. This review summarizes the clinical development of retinal gene therapy.

  14. Partial structure of the phylloxin gene from the giant monkey frog, Phyllomedusa bicolor: parallel cloning of precursor cDNA and genomic DNA from lyophilized skin secretion.

    Science.gov (United States)

    Chen, Tianbao; Gagliardo, Ron; Walker, Brian; Zhou, Mei; Shaw, Chris

    2005-12-01

    Phylloxin is a novel prototype antimicrobial peptide from the skin of Phyllomedusa bicolor. Here, we describe parallel identification and sequencing of phylloxin precursor transcript (mRNA) and partial gene structure (genomic DNA) from the same sample of lyophilized skin secretion using our recently-described cloning technique. The open-reading frame of the phylloxin precursor was identical in nucleotide sequence to that previously reported and alignment with the nucleotide sequence derived from genomic DNA indicated the presence of a 175 bp intron located in a near identical position to that found in the dermaseptins. The highly-conserved structural organization of skin secretion peptide genes in P. bicolor can thus be extended to include that encoding phylloxin (plx). These data further reinforce our assertion that application of the described methodology can provide robust genomic/transcriptomic/peptidomic data without the need for specimen sacrifice.

  15. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of the transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available in public data resources. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is more closely related to physiological condition than to tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we first apply an improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (HK genes specific to the cancer group but not to the normal group) are expressed at higher and more variable levels in the cancer condition than in the normal condition. Cancer-associated HK genes tend to be AT-rich, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in the cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867
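
    The notion of expression breadth described in the abstract above (on/off status across tissues) can be sketched as follows. This is only an illustration under stated assumptions: the on/off threshold of 1.0, the gene and tissue names, and the expression values are all hypothetical, and the authors' improved K-means and variance model are not reproduced.

```python
def expression_breadth(profile, threshold=1.0):
    """Number of tissues in which a gene is 'on' (expression >= threshold).

    The threshold is an assumed cut-off chosen for illustration only.
    """
    return sum(1 for level in profile.values() if level >= threshold)


def housekeeping_candidates(expression, threshold=1.0):
    """Genes expressed at or above the threshold in every tissue."""
    return sorted(
        gene
        for gene, profile in expression.items()
        if expression_breadth(profile, threshold) == len(profile)
    )


# Hypothetical expression values (arbitrary units) per gene and tissue.
expression = {
    "GENE1": {"liver": 35.0, "brain": 28.4, "lung": 40.1},  # ubiquitous
    "GENE2": {"liver": 12.3, "brain": 0.0, "lung": 0.2},    # tissue-specific
    "GENE3": {"liver": 2.2, "brain": 1.5, "lung": 0.7},     # misses one tissue
}
candidates = housekeeping_candidates(expression)
```

    Only the ubiquitously expressed gene survives the filter; comparing candidate sets computed separately for tumor and normal samples would be one way to look for cancer-associated HK genes.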

  16. Structure, expression profile and phylogenetic inference of chalcone isomerase-like genes from the narrow-leafed lupin (Lupinus angustifolius L.) genome

    Directory of Open Access Journals (Sweden)

    Łucja Przysiecka

    2015-04-01

    Full Text Available Lupins, like other legumes, have a unique biosynthesis scheme of 5-deoxy-type flavonoids and isoflavonoids. A key enzyme in this pathway is chalcone isomerase (CHI), a member of the CHI-fold protein family, encompassing subfamilies of CHI1, CHI2, CHI-like (CHIL), and fatty acid-binding (FAP) proteins. Here, two Lupinus angustifolius (narrow-leafed lupin) CHILs, LangCHIL1 and LangCHIL2, were identified and characterized using DNA fingerprinting, cytogenetic and linkage mapping, sequencing and expression profiling. Clones carrying CHIL sequences were assembled into two contigs. Full gene sequences were obtained from these contigs, and mapped in two L. angustifolius linkage groups by gene-specific markers. Bacterial artificial chromosome fluorescence in situ hybridization approach confirmed the localization of two LangCHIL genes in distinct chromosomes. The expression profiles of both LangCHIL isoforms were very similar. The highest level of transcription was in the roots of the third week of plant growth; thereafter, expression declined. The expression of both LangCHIL genes in leaves and stems was similar and low. Comparative mapping to reference legume genome sequences revealed strong syntenic links; however, LangCHIL2 contig had a much more conserved structure than LangCHIL1. LangCHIL2 is assumed to be an ancestor gene, whereas LangCHIL1 probably appeared as a result of duplication. As both copies are transcriptionally active, questions arise concerning their hypothetical functional divergence. Screening of the narrow-leafed lupin genome and transcriptome with CHI-fold protein sequences, followed by Bayesian inference of phylogeny and cross-genera synteny survey, identified representatives of all but one (CHI1) of the main subfamilies. They are as follows: two copies of CHI2, FAPa2 and CHIL, and single copies of FAPb and FAPa1. Duplicated genes are remnants of whole genome duplication which is assumed to have occurred after the divergence of Lupinus, Arachis

  17. Deriving Trading Rules Using Gene Expression Programming

    Directory of Open Access Journals (Sweden)

    Adrian VISOIU

    2011-01-01

    Full Text Available This paper presents how buy and sell trading rules are generated using gene expression programming with special setup. Market concepts are presented and market analysis is discussed with emphasis on technical analysis and quantitative methods. The use of genetic algorithms in deriving trading rules is presented. Gene expression programming is applied in a form where multiple types of operators and operands are used. This gives birth to multiple gene contexts and references between genes in order to keep the linear structure of the gene expression programming chromosome. The setup of multiple gene contexts is presented. The case study shows how to use the proposed gene setup to derive trading rules encoded by Boolean expressions, using a dataset with the reference exchange rates between the Euro and the Romanian leu. The conclusions highlight the positive results obtained in deriving useful trading rules.
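
    To make the idea of a linear chromosome encoding a Boolean trading rule concrete, here is a minimal sketch of standard gene expression programming decoding, in which a gene string is read left to right and expanded breadth-first into an expression tree. It does not reproduce the multiple-gene-context setup of the paper above; the symbol set (A = AND, O = OR, N = NOT) and the signals a, b and c are hypothetical.

```python
ARITY = {"A": 2, "O": 2, "N": 1}  # AND, OR, NOT; every other symbol is a terminal


def decode(gene):
    """Decode a linear gene breadth-first into a nested-list expression tree.

    Symbols left over after the tree is complete are simply ignored,
    which is how the non-coding tail of a gene behaves.
    """
    symbols = iter(gene)
    root = [next(symbols)]
    level = [root]
    while level:
        next_level = []
        for node in level:
            for _ in range(ARITY.get(node[0], 0)):
                child = [next(symbols)]
                node.append(child)
                next_level.append(child)
        level = next_level
    return root


def evaluate(node, signals):
    """Evaluate a decoded tree against a dict of Boolean market signals."""
    symbol, children = node[0], node[1:]
    if symbol == "A":
        return evaluate(children[0], signals) and evaluate(children[1], signals)
    if symbol == "O":
        return evaluate(children[0], signals) or evaluate(children[1], signals)
    if symbol == "N":
        return not evaluate(children[0], signals)
    return signals[symbol]


# Hypothetical gene: decodes to (a OR b) AND (NOT c); the trailing "ab" is unused.
tree = decode("AONabcab")
buy = evaluate(tree, {"a": True, "b": False, "c": False})
hold = evaluate(tree, {"a": True, "b": False, "c": True})
```

    Here a, b and c could stand for indicator conditions such as "rate above its moving average"; an evolutionary loop would mutate and recombine the gene strings and keep those whose decoded rules perform best on historical data.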

  18. Kissing loops hide premature termination codons in pre-mRNA of selenoprotein genes and in genes containing programmed ribosomal frameshifts

    DEFF Research Database (Denmark)

    Knudsen, Steen; Brunak, Søren

    1997-01-01

    A novel RNA secondary structure that places the selenocysteine codon UGA in one hairpin and a donor splice site in another, has been discovered in selenoprotein genes. The presence of the structure resolves the discrepancy that the selenocysteine triplet, UGA, should block splicing. Without a spe...

  19. Development of gene diagnosis for diabetes and cholecystitis based on gene analysis of CCK-A receptor

    International Nuclear Information System (INIS)

    Kono, Akira

    1999-01-01

    Base sequence analysis of the CCKAR gene (the gene for the A-type cholecystokinin receptor) from the OLETF rat, a model rat for insulin-independent diabetes, was performed based on the base sequence of the wild-type CCKAR gene, which had been clarified in the previous year. DNA was extracted from the pancreas of the OLETF rat, fragmented, and transduced into λ phage to construct the OLETF gene library. Then, the λ phage DNA clone that bound labelled cDNA of the CCKAR gene was analyzed and the gene structure was compared with that of the wild-type gene. It was demonstrated that the CCKAR gene of OLETF had a deletion (6800 b.p.) ranging from the promoter region to Exon 2, suggesting that the CCKAR gene is not functional in the OLETF rat. The whole sequence of this mutant gene was registered in the Japan DNA Bank (D 50610). Then, F2 offspring rats were obtained by crossing OLETF (female) and F344 (male), and the time-course changes in the blood glucose level after glucose loading were compared among them. The blood glucose level after glucose loading was significantly higher in the homo-mutant F2 (CCKAR,-/-) as well as the parent OLETF rat than in the hetero-mutant F2 (CCKAR,-/+) or the wild-type rat (CCKAR,+/+). This suggests that the CCKAR gene might be involved in the control of blood glucose level and that an alteration of the expression level or the functions of the CCKAR gene might affect the blood glucose level. (M.N.)

  20. Regulatory structures for gene therapy medicinal products in the European Union.

    Science.gov (United States)

    Klug, Bettina; Celis, Patrick; Carr, Melanie; Reinhardt, Jens

    2012-01-01

    Taking into account the complexity and technical specificity of advanced therapy medicinal products: (gene and cell therapy medicinal products and tissue engineered products), a dedicated European regulatory framework was needed. Regulation (EC) No. 1394/2007, the "ATMP Regulation" provides tailored regulatory principles for the evaluation and authorization of these innovative medicines. The majority of gene or cell therapy product development is carried out by academia, hospitals, and small- and medium-sized enterprises (SMEs). Thus, acknowledging the particular needs of these types of sponsors, the legislation also provides incentives for product development tailored to them. The European Medicines Agency (EMA) and, in particular, its Committee for Advanced Therapies (CAT) provide a variety of opportunities for early interaction with developers of ATMPs to enable them to have early regulatory and scientific input. An important tool to promote innovation and the development of new medicinal products by micro-, small-, and medium-sized enterprises is the EMA's SME initiative launched in December 2005 to offer financial and administrative assistance to smaller companies. The European legislation also foresees the involvement of stakeholders, such as patient organizations, in the development of new medicines. Considering that gene therapy medicinal products are developed in many cases for treatment of rare diseases often of monogenic origin, the involvement of patient organizations, which focus on rare diseases and genetic and congenital disorders, is fruitful. Two such organizations are represented in the CAT. Research networks play another important role in the development of gene therapy medicinal products. The European Commission is funding such networks through the EU Sixth Framework Program. Copyright © 2012 Elsevier Inc. All rights reserved.

  1. Reverse engineering model structures for soil and ecosystem respiration: the potential of gene expression programming

    Directory of Open Access Journals (Sweden)

    I. Ilie

    2017-09-01

    Full Text Available Accurate model representation of land–atmosphere carbon fluxes is essential for climate projections. However, the exact responses of carbon cycle processes to climatic drivers often remain uncertain. Presently, knowledge derived from experiments, complemented by a steadily evolving body of mechanistic theory, provides the main basis for developing such models. The strongly increasing availability of measurements may facilitate new ways of identifying suitable model structures using machine learning. Here, we explore the potential of gene expression programming (GEP) to derive relevant model formulations based solely on the signals present in data by automatically applying various mathematical transformations to potential predictors and repeatedly evolving the resulting model structures. In contrast to most other machine learning regression techniques, the GEP approach generates readable models that allow for prediction and possibly for interpretation. Our study is based on two cases: artificially generated data and real observations. Simulations based on artificial data show that GEP is successful in identifying prescribed functions, with the prediction capacity of the models comparable to four state-of-the-art machine learning methods (random forests, support vector machines, artificial neural networks, and kernel ridge regressions). Based on real observations we explore the responses of the different components of terrestrial respiration at an oak forest in south-eastern England. We find that the GEP-retrieved models are often better in prediction than some established respiration models. Based on their structures, we find previously unconsidered exponential dependencies of respiration on seasonal ecosystem carbon assimilation and water dynamics. We noticed that the GEP models are only partly portable across respiration components, the identification of a general terrestrial respiration model possibly prevented by equifinality issues. Overall

  2. Reverse engineering model structures for soil and ecosystem respiration: the potential of gene expression programming

    Science.gov (United States)

    Ilie, Iulia; Dittrich, Peter; Carvalhais, Nuno; Jung, Martin; Heinemeyer, Andreas; Migliavacca, Mirco; Morison, James I. L.; Sippel, Sebastian; Subke, Jens-Arne; Wilkinson, Matthew; Mahecha, Miguel D.

    2017-09-01

    Accurate model representation of land-atmosphere carbon fluxes is essential for climate projections. However, the exact responses of carbon cycle processes to climatic drivers often remain uncertain. Presently, knowledge derived from experiments, complemented by a steadily evolving body of mechanistic theory, provides the main basis for developing such models. The strongly increasing availability of measurements may facilitate new ways of identifying suitable model structures using machine learning. Here, we explore the potential of gene expression programming (GEP) to derive relevant model formulations based solely on the signals present in data by automatically applying various mathematical transformations to potential predictors and repeatedly evolving the resulting model structures. In contrast to most other machine learning regression techniques, the GEP approach generates readable models that allow for prediction and possibly for interpretation. Our study is based on two cases: artificially generated data and real observations. Simulations based on artificial data show that GEP is successful in identifying prescribed functions, with the prediction capacity of the models comparable to four state-of-the-art machine learning methods (random forests, support vector machines, artificial neural networks, and kernel ridge regressions). Based on real observations we explore the responses of the different components of terrestrial respiration at an oak forest in south-eastern England. We find that the GEP-retrieved models are often better in prediction than some established respiration models. Based on their structures, we find previously unconsidered exponential dependencies of respiration on seasonal ecosystem carbon assimilation and water dynamics. We noticed that the GEP models are only partly portable across respiration components, the identification of a general terrestrial respiration model possibly prevented by equifinality issues. 
Overall, GEP is a promising

  3. Effects of the nanotopographic surface structure of commercially pure titanium following anodization-hydrothermal treatment on gene expression and adhesion in gingival epithelial cells.

    Science.gov (United States)

    Takebe, J; Miyata, K; Miura, S; Ito, S

    2014-09-01

    The long-term stability and maintenance of endosseous implants with anodized-hydrothermally treated commercially pure titanium surfaces and a nanotopographic structure (SA-treated c.p.Ti) depend on the barrier function provided by the interface between the transmucosal portion of the implant surface and the peri-implant epithelium. This study investigated the effects of extracellular and intracellular gene expression in adherent gingival epithelial cells cultured for 1-7 days on SA-treated c.p.Ti implant surfaces compared to anodic oxide (AO) c.p.Ti and c.p.Ti disks. Scanning electron microscopy (SEM) showed filopodium-like extensions bound closely to the nanotopographic structure of SA-treated c.p.Ti at day 7 of culture. Gene expressions of focal adhesion kinase, integrin-α6β4, and laminin-5 (α3, β3, γ2) were significantly higher on SA-treated c.p.Ti than on c.p.Ti or AO c.p.Ti after 7 days (Pcells adhere to SA-treated c.p.Ti as the transmucosal portion of an implant, and that this interaction markedly improves expression of focal adhesion molecules and enhances the epithelial cell phenotype. The cellular gene expression responses driving extracellular and intracellular molecular interactions thus play an important role in maintenance at the interface between SA-treated c.p.Ti implant surfaces and the gingival epithelial cells. Copyright © 2014 Elsevier B.V. All rights reserved.

  4. Canine candidate genes for dilated cardiomyopathy: annotation of and polymorphic markers for 14 genes.

    Science.gov (United States)

    Wiersma, Anje C; Leegwater, Peter Aj; van Oost, Bernard A; Ollier, William E; Dukes-McEwan, Joanna

    2007-10-19

    Dilated cardiomyopathy is a myocardial disease occurring in humans and domestic animals and is characterized by dilatation of the left ventricle, reduced systolic function and increased sphericity of the left ventricle. Dilated cardiomyopathy has been observed in several, mostly large and giant, dog breeds, such as the Dobermann and the Great Dane. A number of genes have been identified, which are associated with dilated cardiomyopathy in the human, mouse and hamster. These genes mainly encode structural proteins of the cardiac myocyte. We present the annotation of, and marker development for, 14 of these genes of the dog genome, i.e. alpha-cardiac actin, caveolin 1, cysteine-rich protein 3, desmin, lamin A/C, LIM-domain binding factor 3, myosin heavy polypeptide 7, phospholamban, sarcoglycan delta, titin cap, alpha-tropomyosin, troponin I, troponin T and vinculin. A total of 33 Single Nucleotide Polymorphisms were identified for these canine genes and 11 polymorphic microsatellite repeats were developed. The presented polymorphisms provide a tool to investigate the role of the corresponding genes in canine Dilated Cardiomyopathy by linkage analysis or association studies.

  5. Molecular evolution and diversification of snake toxin genes, revealed by analysis of intron sequences.

    Science.gov (United States)

    Fujimi, T J; Nakajyo, T; Nishimura, E; Ogura, E; Tsuchiya, T; Tamiya, T

    2003-08-14

    The genes encoding erabutoxin (short chain neurotoxin) isoforms (Ea, Eb, and Ec), LsIII (long chain neurotoxin) and a novel long chain neurotoxin pseudogene were cloned from a Laticauda semifasciata genomic library. Short and long chain neurotoxin genes were also cloned from the genome of Laticauda laticaudata, a closely related species of L. semifasciata, by PCR. A putative matrix attached region (MAR) sequence was found in the intron I of the LsIII gene. Comparative analysis of 11 structurally relevant snake toxin genes (three-finger-structure toxins) revealed the molecular evolution of these toxins. Three-finger-structure toxin genes diverged from a common ancestor through two types of evolutionary pathways (long and short types), early in the course of evolution. At a later stage of evolution in each gene, the accumulation of mutations in the exons, especially exon II, by accelerated evolution may have caused the increased diversification in their functions. It was also revealed that the putative MAR sequence found in the LsIII gene was integrated into the gene after the species-level divergence.

  6. Identification of rat genes by TWINSCAN gene prediction, RT-PCR, and direct sequencing

    DEFF Research Database (Denmark)

    Wu, Jia Qian; Shteynberg, David; Arumugam, Manimozhiyan

    2004-01-01

    The publication of a draft sequence of a third mammalian genome--that of the rat--suggests a need to rethink genome annotation. New mammalian sequences will not receive the kind of labor-intensive annotation efforts that are currently being devoted to human. In this paper, we demonstrate an alternative approach: reverse transcription-polymerase chain reaction (RT-PCR) and direct sequencing based on dual-genome de novo predictions from TWINSCAN. We tested 444 TWINSCAN-predicted rat genes that showed significant homology to known human genes implicated in disease but that were partially... in the single-intron experiment. Spliced sequences were amplified in 46 cases (34%). We conclude that this procedure for elucidating gene structures with native cDNA sequences is cost-effective and will become even more so as it is further optimized.

  7. Empirical study of supervised gene screening

    Directory of Open Access Journals (Sweden)

    Ma Shuangge

    2006-12-01

    Full Text Available Background: Microarray studies provide a way of linking variations of phenotypes with their genetic causations. Constructing predictive models using high dimensional microarray measurements usually consists of three steps: (1) unsupervised gene screening; (2) supervised gene screening; and (3) statistical model building. Supervised gene screening based on marginal gene ranking is commonly used to reduce the number of genes in the model building. Various simple statistics, such as t-statistic or signal to noise ratio, have been used to rank genes in the supervised screening. Despite its extensive use, statistical study of supervised gene screening remains scarce. Our study is partly motivated by the differences in gene discovery results caused by using different supervised gene screening methods. Results: We investigate concordance and reproducibility of supervised gene screening based on eight commonly used marginal statistics. Concordance is assessed by the relative fractions of overlaps between top ranked genes screened using different marginal statistics. We propose a Bootstrap Reproducibility Index, which measures reproducibility of individual genes under the supervised screening. Empirical studies are based on four public microarray datasets. We consider the cases where the top 20%, 40% and 60% genes are screened. Conclusion: From a gene discovery point of view, the effect of supervised gene screening based on different marginal statistics cannot be ignored. Empirical studies show that (1) genes that pass different supervised screenings may be considerably different; (2) concordance may vary, depending on the underlying data structure and percentage of selected genes; (3) evaluated with the Bootstrap Reproducibility Index, genes that pass supervised screenings are only moderately reproducible; and (4) concordance cannot be improved by supervised screening based on reproducibility.
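
    The concordance measure described in the abstract above (overlap between the top-ranked genes under different marginal statistics) can be sketched as below. This is an illustrative reading rather than the authors' exact definition: it uses only two statistics (a Welch t-statistic and a signal-to-noise ratio), the data are hypothetical, and the Bootstrap Reproducibility Index is not implemented because its formula is not given here.

```python
import math
from statistics import mean, stdev


def t_statistic(x, y):
    """Welch two-sample t-statistic."""
    return (mean(x) - mean(y)) / math.sqrt(stdev(x) ** 2 / len(x) + stdev(y) ** 2 / len(y))


def signal_to_noise(x, y):
    """Difference of group means divided by the sum of group standard deviations."""
    return (mean(x) - mean(y)) / (stdev(x) + stdev(y))


def top_genes(data, statistic, fraction):
    """Set of genes with the largest absolute marginal statistic."""
    ranked = sorted(data, key=lambda g: abs(statistic(*data[g])), reverse=True)
    keep = max(1, round(fraction * len(ranked)))
    return set(ranked[:keep])


def concordance(data, stat_a, stat_b, fraction):
    """Fraction of screened genes shared by two marginal statistics."""
    a = top_genes(data, stat_a, fraction)
    b = top_genes(data, stat_b, fraction)
    return len(a & b) / len(a)


# Hypothetical data: gene -> (expression in group 1, expression in group 2).
data = {
    "g1": ([10, 11, 12], [1, 2, 3]),
    "g2": ([5, 6, 7], [5, 7, 6]),
    "g3": ([3, 4, 8], [2, 3, 4]),
    "g4": ([1, 2, 3], [2, 3, 5]),
    "g5": ([4, 5, 6], [4, 5, 7]),
}
overlap = concordance(data, t_statistic, signal_to_noise, fraction=0.4)
```

    With the top 40% of genes kept, the two statistics agree on the strongest gene but disagree on the second, which is the kind of discrepancy the study quantifies.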

  8. DISC1 gene and affective psychopathology: a combined structural and functional MRI study.

    Science.gov (United States)

    Opmeer, Esther M; van Tol, Marie-José; Kortekaas, Rudie; van der Wee, Nic J A; Woudstra, Saskia; van Buchem, Mark A; Penninx, Brenda W; Veltman, Dick J; Aleman, André

    2015-02-01

    The gene Disrupted-In-Schizophrenia-1 (DISC1) has been indicated as a determinant of psychopathology, including affective disorders, and shown to influence prefrontal cortex (PFC) and hippocampus functioning, regions of major interest for affective disorders. We aimed to investigate whether DISC1 differentially modulates brain function during executive and memory processing, and morphology in regions relevant for depression and anxiety disorders (affective disorders). 128 participants, with (n = 103) and without (controls; n = 25) affective disorders underwent genotyping for Ser704Cys (with Cys-allele considered as risk-allele) and structural and functional (f) Magnetic Resonance Imaging (MRI) during visuospatial planning and emotional episodic memory tasks. For both voxel-based morphometry and fMRI analyses, we investigated the effect of genotype in controls and explored genotype × diagnosis interactions. Results are reported at p < 0.05 FWE small volume corrected. In controls, Cys-carriers showed smaller bilateral (para)hippocampal volumes compared with Ser-homozygotes, and lower activation in the anterior cingulate cortex (ACC) and dorsolateral PFC during visuospatial planning. In anxiety patients, Cys-carriers showed larger (para)hippocampal volumes and more ACC activation during visuospatial planning. In depressive patients, no effect of genotype was observed and overall, no effect of genotype on episodic memory processing was detected. We demonstrated that Ser704Cys-genotype influences (para)hippocampal structure and functioning of the dorsal PFC during executive planning, most prominently in unaffected controls. Results suggest that presence of psychopathology moderates Ser704Cys effects. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Deep convolutional neural networks for annotating gene expression patterns in the mouse brain.

    Science.gov (United States)

    Zeng, Tao; Li, Rongjian; Mukkamala, Ravi; Ye, Jieping; Ji, Shuiwang

    2015-05-07

    Profiling gene expression in brain structures at various spatial and temporal scales is essential to understanding how genes regulate the development of brain structures. The Allen Developing Mouse Brain Atlas provides high-resolution 3-D in situ hybridization (ISH) gene expression patterns in multiple developing stages of the mouse brain. Currently, the ISH images are annotated with anatomical terms manually. In this paper, we propose a computational approach to annotate gene expression pattern images in the mouse brain at various structural levels over the course of development. We applied a deep convolutional neural network that was trained on a large set of natural images to extract features from the ISH images of the developing mouse brain. As a baseline representation, we applied invariant image feature descriptors to capture local statistics from ISH images and used the bag-of-words approach to build image-level representations. Both types of features from multiple ISH image sections of the entire brain were then combined to build 3-D, brain-wide gene expression representations. We employed regularized learning methods for discriminating gene expression patterns in different brain structures. Results show that our approach of using a convolutional model as a feature extractor achieved superior performance in annotating gene expression patterns at multiple levels of brain structures throughout four developing ages. Overall, we achieved an average AUC of 0.894 ± 0.014, as compared with 0.820 ± 0.046 yielded by the bag-of-words approach. A deep convolutional neural network model trained on natural image sets and applied to gene expression pattern annotation tasks yielded superior performance, demonstrating that its transfer learning property is applicable to such biological image sets.
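
    The AUC values quoted in this abstract measure how often a classifier scores a positive example above a negative one. As a rough illustration only (the labels and scores below are invented, and this is not the authors' evaluation code), AUC can be computed as the fraction of positive/negative pairs ranked correctly, with ties counted as half:

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form).

    labels: iterable of 0/1 ground-truth annotations (hypothetical data).
    scores: iterable of classifier scores, higher means "more likely positive".
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Invented toy example: 3 of the 4 positive/negative pairs are ordered correctly.
    print(auc([1, 1, 0, 0], [0.9, 0.4, 0.6, 0.1]))
```

    The quadratic pairwise loop is chosen for readability; a rank-based formulation gives the same value more efficiently on large sets.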

  10. Classification and expression analyses of homeobox genes from ...

    Indian Academy of Sciences (India)

    We present here the first genome-wide classification and comparative genomic analysis of the 14 homeobox genes present in D. discoideum. Based on the structural alignment of the homeodomains, they can be broadly divided into TALE and non-TALE classes. When individual homeobox genes were compared with ...

  11. Gene profile analysis of osteoblast genes differentially regulated by histone deacetylase inhibitors

    Directory of Open Access Journals (Sweden)

    Lamblin Anne-Francoise

    2007-10-01

    Full Text Available Abstract. Background: Osteoblast differentiation requires the coordinated stepwise expression of multiple genes. Histone deacetylase inhibitors (HDIs) accelerate the osteoblast differentiation process by blocking the activity of histone deacetylases (HDACs), which alter gene expression by modifying chromatin structure. We previously demonstrated that HDIs and HDAC3 shRNAs accelerate matrix mineralization and the expression of osteoblast maturation genes (e.g. alkaline phosphatase, osteocalcin). Identifying other genes that are differentially regulated by HDIs might identify new pathways that contribute to osteoblast differentiation. Results: To identify other osteoblast genes that are altered early by HDIs, we incubated MC3T3-E1 preosteoblasts with HDIs (trichostatin A, MS-275, or valproic acid) for 18 hours in osteogenic conditions. The promotion of osteoblast differentiation by HDIs in this experiment was confirmed by osteogenic assays. Gene expression profiles relative to vehicle-treated cells were assessed by microarray analysis with Affymetrix GeneChip 430 2.0 arrays. The regulation of several genes by HDIs in MC3T3-E1 cells and primary osteoblasts was verified by quantitative real-time PCR. Nine genes were differentially regulated by at least two-fold after exposure to each of the three HDIs and six were verified by PCR in osteoblasts. Four of the verified genes (solute carrier family 9 isoform 3 regulator 1 (Slc9a3r1), sorbitol dehydrogenase 1, a kinase anchor protein, and glutathione S-transferase alpha 4) were induced. Two genes (proteasome subunit, beta type 10 and adaptor-related protein complex AP-4 sigma 1) were suppressed. We also identified eight growth factors and growth factor receptor genes that are significantly altered by each of the HDIs, including Frizzled related proteins 1 and 4, which modulate the Wnt signaling pathway. Conclusion: This study identifies osteoblast genes that are regulated early by HDIs and indicates pathways that ...

  12. Comparative GO: a web application for comparative gene ontology and gene ontology-based gene selection in bacteria.

    Directory of Open Access Journals (Sweden)

    Mario Fruzangohar

    Full Text Available The primary means of classifying new functions for genes and proteins relies on Gene Ontology (GO), which defines genes/proteins using a controlled vocabulary in terms of their Molecular Function, Biological Process and Cellular Component. The challenge is to present this information to researchers to compare and discover patterns in multiple datasets using visually comprehensible and user-friendly statistical reports. Importantly, while there are many GO resources available for eukaryotes, there are none suitable for simultaneous, graphical and statistical comparison between multiple datasets. In addition, none of them supports comprehensive resources for bacteria. By using Streptococcus pneumoniae as a model, we identified and collected GO resources including genes, proteins, taxonomy and GO relationships from NCBI, UniProt and GO organisations. Then, we designed database tables in a PostgreSQL database server and developed a Java application to extract data from source files and load it into the database automatically. We developed a PHP web application based on Model-View-Control architecture, and used a specific data structure as well as current and novel algorithms to estimate GO graph parameters. We designed different navigation and visualization methods on the graphs and integrated these into graphical reports. This tool is particularly significant when comparing GO groups between multiple samples (including those of pathogenic bacteria) from different sources simultaneously. Comparing GO protein distribution among up- or down-regulated genes from different samples can improve understanding of biological pathways and mechanism(s) of infection. It can also aid in the discovery of genes associated with specific function(s) for investigation as novel vaccine or therapeutic targets. http://turing.ersa.edu.au/BacteriaGO

  13. Mosaic origins of a complex chimeric mitochondrial gene in Silene vulgaris.

    Directory of Open Access Journals (Sweden)

    Helena Storchova

    Full Text Available Chimeric genes are significant sources of evolutionary innovation that are normally created when portions of two or more protein coding regions fuse to form a new open reading frame. In plant mitochondria astonishingly high numbers of different novel chimeric genes have been reported, where they are generated through processes of rearrangement and recombination. Nonetheless, because most studies do not find or report nucleotide variation within the same chimeric gene, evolution after the origination of these chimeric genes remains unstudied. Here we identify two alleles of a complex chimera in Silene vulgaris that are divergent in nucleotide sequence, genomic position relative to other mitochondrial genes, and expression patterns. Structural patterns suggest a history partially influenced by gene conversion between the chimeric gene and functional copies of subunit 1 of the mitochondrial ATP synthase gene (atp1). We identified small repeat structures within the chimeras that are likely recombination sites allowing generation of the chimera. These results establish the potential for chimeric gene divergence in different plant mitochondrial lineages within the same species. This result contrasts with the absence of diversity within mitochondrial chimeras found in crop species.

  14. Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene evolution in the Viannia subgenus.

    Science.gov (United States)

    Coughlan, Simone; Taylor, Ali Shirley; Feane, Eoghan; Sanders, Mandy; Schonian, Gabriele; Cotton, James A; Downing, Tim

    2018-04-01

    The unicellular protozoan parasite Leishmania causes the neglected tropical disease leishmaniasis, affecting 12 million people in 98 countries. In South America, where the Viannia subgenus predominates, so far only L. (Viannia) braziliensis and L. (V.) panamensis have been sequenced, assembled and annotated as reference genomes. Addressing this deficit in molecular information can inform species typing, epidemiological monitoring and clinical treatment. Here, L. (V.) naiffi and L. (V.) guyanensis genomic DNA was sequenced to assemble these two genomes as draft references from short sequence reads. The methods used were tested using short sequence reads for L. braziliensis M2904 against its published reference as a comparison. This assembly and annotation pipeline identified 70 additional genes not annotated on the original M2904 reference. Phylogenetic and evolutionary comparisons of L. guyanensis and L. naiffi with 10 other Viannia genomes revealed four traits common to all Viannia: aneuploidy, 22 orthologous groups of genes absent in other Leishmania subgenera, elevated TATE transposon copies and a high NADH-dependent fumarate reductase gene copy number. Within the Viannia, there were limited structural changes in genome architecture specific to individual species: a 45 Kb amplification on chromosome 34 was present in all bar L. lainsoni, L. naiffi had a higher copy number of the virulence factor leishmanolysin, and laboratory isolate L. shawi M8408 had a possible minichromosome derived from the 3' end of chromosome 34. This combination of genome assembly, phylogenetics and comparative analysis across an extended panel of diverse Viannia has uncovered new insights into the origin and evolution of this subgenus and can help improve diagnostics for leishmaniasis surveillance.

  15. Of mice and men: divergence of gene expression patterns in kidney.

    Directory of Open Access Journals (Sweden)

    Lydie Cheval

    Full Text Available Since the development of methods for homologous gene recombination, mouse models have played a central role in research in renal pathophysiology. However, many published and unpublished results show that mice with genetic changes mimicking human pathogenic mutations do not display the human phenotype. These functional differences may stem from differences in gene expression between mouse and human kidneys. However, large scale comparison of gene expression networks revealed conservation of gene expression among a large panel of human and mouse tissues including kidneys. Because renal functions result from the spatial integration of elementary processes originating in the glomerulus and the successive segments constituting the nephron, we hypothesized that differences in gene expression profiles along the human and mouse nephron might account for different behaviors. Analysis of SAGE libraries generated from the glomerulus and seven anatomically defined nephron segments from human and mouse kidneys allowed us to identify 4644 pairs of gene orthologs expressed in either one or both species. Quantitative analysis shows that many transcripts are present at different levels in the two species. It also shows poor conservation of gene expression profiles, with less than 10% of the 4644 gene orthologs displaying a higher conservation of expression profiles than the neutral expectation (p<0.05). Accordingly, hierarchical clustering reveals a higher degree of conservation of gene expression patterns between functionally unrelated kidney structures within a given species than between cognate structures from the two species. Similar findings were obtained for sub-groups of genes with either kidney-specific or housekeeping functions. Conservation of gene expression at the scale of the whole organ and divergence at the level of its constituting sub-structures likely account for the fact that although kidneys assume the same global function in the two species ...

  16. Cis-regulatory timers for developmental gene expression.

    Directory of Open Access Journals (Sweden)

    Lionel Christiaen

    2013-10-01

    Full Text Available How does a fertilized egg decode its own genome to eventually develop into a mature animal? Each developing cell must activate a battery of genes in a timely manner and according to the function it will ultimately perform, but how? During development of the notochord (a structure akin to the vertebrate spine) in a simple marine invertebrate, an essential protein called Brachyury binds to specific sites in its target genes. A study just published in PLOS Biology reports that if the target gene contains multiple Brachyury-binding sites it will be activated early in development, but if it contains only one site it will be activated later. Genes that contain no binding site can still be activated by Brachyury, but only indirectly by an earlier Brachyury-dependent gene product, and therefore later than the directly activated genes. Thus, this study shows how several genes can interpret the presence of a single factor differently to become active at distinct times in development.

  17. Origins of gene, genetic code, protein and life

    Indian Academy of Sciences (India)

    Unknown

    have concluded that newly-born genes are products of nonstop frames (NSF) ... research to determine tertiary structures of proteins such ... the present earth, is favourable for new genes to arise, if ..... NGG) in the universal genetic code table, cannot satisfy ..... which has been proposed to explain the development of life on.

  18. Identification and detection of a novel human endogenous retrovirus-related gene, and structural characterization of its related elements

    Directory of Open Access Journals (Sweden)

    Qiaoyi Liang

    2009-01-01

    Full Text Available Up-regulation of human endogenous retroviruses (HERVs) is associated with many diseases, including cancer. In this study, an H family HERV (HERV-H)-related gene was identified and characterized. Its spliced transcript lacks protein-coding capacity and may belong to the emerging class of noncoding RNAs (ncRNAs). The 1.3-kb RNA consisting of four exons is transcribed from an Alu element upstream of a 5.0-kb structurally incomplete HERV-H element. RT-PCR and quantitative RT-PCR results indicated that expression of this HERV-related transcript was negatively associated with colon, stomach, and kidney cancers. Its expression was induced upon treatment with DNA methylation and histone deacetylation inhibitors. A BLAT search using long terminal repeats (LTRs) identified 50 other LTR homogenous HERV-H elements. Further analysis of these elements revealed that all are structurally incomplete and only five exert transcriptional activity. The results presented here recommend further investigation into a potentially functional HERV-H-related ncRNA.

  19. Hox gene regulation in the central nervous system of Drosophila

    Directory of Open Access Journals (Sweden)

    Maheshwar Gummalla

    2014-04-01

    Full Text Available Hox genes specify the structures that form along the anteroposterior (AP) axis of bilateria. Within the genome, they often form clusters where, remarkably enough, their position within the clusters reflects the relative positions of the structures they specify along the AP axis. This correspondence between genomic organization and gene expression pattern has been conserved through evolution and provides a unique opportunity to study how chromosomal context affects gene regulation. In Drosophila, a general rule, often called posterior dominance, states that Hox genes specifying more posterior structures repress the expression of more anterior Hox genes. This rule explains the apparent spatial complementarity of Hox gene expression patterns in Drosophila. Here we review a noticeable exception to this rule where the more-posteriorly expressed Abd-B Hox gene fails to repress the more-anterior abd-A gene in cells of the central nervous system (CNS). While Abd-B is required to repress ectopic expression of abd-A in the posterior epidermis, abd-A repression in the posterior CNS is accomplished by a different mechanism that involves a large 92 kb long non-coding RNA (lncRNA) encoded by the intergenic region separating abd-A and Abd-B (the iab-8 ncRNA). Dissection of this lncRNA revealed that abd-A is repressed by the lncRNA using two redundant mechanisms. The first mechanism is mediated by a microRNA (mir-iab-8) encoded by intronic sequence within the large iab-8 ncRNA. Meanwhile, the second mechanism seems to involve transcriptional interference by the long iab-8 ncRNA on the abd-A promoter. Recent work demonstrating CNS-specific regulation of genes by ncRNAs in Drosophila seems to highlight a potential role for the iab-8 ncRNA in the evolution of the Drosophila Hox complexes.

  20. Multi-label literature classification based on the Gene Ontology graph

    Directory of Open Access Journals (Sweden)

    Lu Xinghua

    2008-12-01

    Full Text Available Abstract. Background: The Gene Ontology is a controlled vocabulary for representing knowledge related to genes and proteins in a computable form. The current effort of manually annotating proteins with the Gene Ontology is outpaced by the rate of accumulation of biomedical knowledge in literature, which urges the development of text mining approaches to facilitate the process by automatically extracting the Gene Ontology annotation from literature. The task is usually cast as a text classification problem, and contemporary methods are confronted with unbalanced training data and the difficulties associated with multi-label classification. Results: In this research, we investigated the methods of enhancing automatic multi-label classification of biomedical literature by utilizing the structure of the Gene Ontology graph. We have studied three graph-based multi-label classification algorithms, including a novel stochastic algorithm and two top-down hierarchical classification methods for multi-label literature classification. We systematically evaluated and compared these graph-based classification algorithms to a conventional flat multi-label algorithm. The results indicate that, through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods can significantly improve predictions of the Gene Ontology terms implied by the analyzed text. Furthermore, the graph-based multi-label classifiers are capable of suggesting Gene Ontology annotations (to curators) that are closely related to the true annotations even if they fail to predict the true ones directly. A software package implementing the studied algorithms is available for the research community. Conclusion: Through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods have better potential than the conventional flat multi-label classification approach to facilitate ...
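
    One simple way a classifier can exploit the Gene Ontology graph is to treat an annotation to a term as implying annotation to all of its ancestors, so that a prediction landing on an ancestor of the true term counts as closely related to it. The sketch below illustrates only that idea on an invented toy hierarchy; the term names, the `parents` mapping and the helper functions are hypothetical and are not the algorithms evaluated in this record.

```python
def ancestors(term, parents):
    """Return every ancestor of `term` in a DAG given as {child: [parent, ...]}."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        for parent in parents.get(current, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def propagate(labels, parents):
    """Close a label set upward: each term also implies all of its ancestors."""
    closed = set(labels)
    for term in labels:
        closed |= ancestors(term, parents)
    return closed


def is_related(predicted, true_term, parents):
    """True if the prediction is the true term or one of its ancestors."""
    return predicted == true_term or predicted in ancestors(true_term, parents)


# Invented toy hierarchy (not real GO identifiers); "kinase" has two parents.
TOY_PARENTS = {
    "binding": ["root"],
    "catalysis": ["root"],
    "dna_binding": ["binding"],
    "kinase": ["catalysis", "binding"],
}

if __name__ == "__main__":
    print(sorted(propagate({"dna_binding"}, TOY_PARENTS)))
    print(is_related("binding", "dna_binding", TOY_PARENTS))
```

    Because the hierarchy is a directed acyclic graph rather than a tree, the traversal tracks visited terms so that shared ancestors are added only once.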

  1. Phylogenetics and evolution of Trx SET genes in fully sequenced land plants.

    Science.gov (United States)

    Zhu, Xinyu; Chen, Caoyi; Wang, Baohua

    2012-04-01

    Plant Trx SET proteins are involved in H3K4 methylation and play a key role in plant floral development. Genes encoding Trx SET proteins constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. To investigate the evolutionary history of the Trx SET gene family, we performed a comprehensive evolutionary analysis of this gene family from 13 major representatives of green plants. A novel cluster (here named the cpTrx clade), which included the previously resolved III-1, III-2, and III-4 orthologous groups, was identified. Our analysis showed that plant Trx proteins possessed a variety of domain organizations and gene structures among paralogs. Additional domains such as PHD, PWWP, and FYR were integrated early into the primordial SET-PostSET domain organization of the cpTrx clade. We suggest that the PostSET domain was lost in some members of the III-4 orthologous group during the evolution of land plants. At least four classes of gene structures had been formed at the early evolutionary stage of land plants. Three intronless orphan Trx SET genes from Physcomitrella patens (moss) were identified, and supposedly, their parental genes have been eliminated from the genome. The structural differences among evolutionary groups of plant Trx SET genes with different functions were described, contributing to the design of further experimental studies.

  2. A dual origin of the Xist gene from a protein-coding gene and a set of transposable elements.

    Directory of Open Access Journals (Sweden)

    Eugeny A Elisaphenko

    2008-06-01

    Full Text Available X-chromosome inactivation, which occurs in female eutherian mammals, is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA.

  3. The R2R3-MYB-like regulatory factor EOBI, acting downstream of EOBII, regulates scent production by activating ODO1 and structural scent-related genes in petunia.

    Science.gov (United States)

    Spitzer-Rimon, Ben; Farhi, Moran; Albo, Boaz; Cna'ani, Alon; Ben Zvi, Michal Moyal; Masci, Tania; Edelbaum, Orit; Yu, Yixun; Shklarman, Elena; Ovadis, Marianna; Vainstein, Alexander

    2012-12-01

    Flower scent is a highly dynamic trait, under developmental, spatial, and diurnal regulation. The mechanism governing scent production is only beginning to be unraveled. In petunia (Petunia hybrida), EMISSION OF BENZENOIDS II (EOBII) controls transcription of both the shikimate pathway-regulating MYB factor ODORANT1 (ODO1) and phenylpropanoid scent-related structural genes. A promoter-activation screen identified an R2R3-MYB-like regulatory factor of phenylpropanoid volatile biosynthesis acting downstream of EOBII, designated EOBI. EOBI silencing led to downregulation of ODO1 and numerous structural scent-related genes from both the shikimate and phenylpropanoid pathways. The ability of EOBI to directly activate ODO1, as revealed by electrophoretic mobility shift assay and yeast one-hybrid analysis, places EOBI upstream of ODO1 in regulating substrate availability for volatile biosynthesis. Interestingly, ODO1-silenced transgenic petunia flowers accumulated higher EOBI transcript levels than controls, suggesting a complex feedback loop between these regulatory factors. The accumulation pattern of EOBI transcript relative to EOBII and ODO1, and the effect of up/downregulation of EOBII on transcript levels of EOBI and ODO1, further support these factors' hierarchical relationships. The dependence of scent production on EOBI expression and its direct interaction with both regulatory and structural genes provide evidence for EOBI's wide-ranging involvement in the production of floral volatiles.

  4. Non-virulence of a recombinant shrimp nidovirus is associated with its non structural gene sequence and not a large structural gene deletion

    International Nuclear Information System (INIS)

    Gangnonngiw, Warachin; Anantasomboon, Gun; Sang-oum, Wiwat; Sriurairatana, Siriporn; Sritunyalucksana, Kallaya; Flegel, Timothy W.

    2009-01-01

    RT-PCR using a commercial kit for yellow head virus (YHV) detection in growth-retarded shrimp yielded an unusual 777 bp amplicon instead of expected amplicons of 277 bp for YHV type-1 (YHV-1) or 406 bp for YHV type-2 (YHV-2). Cloning and sequencing (GenBank EU170438) revealed approximately 80% identity to non-structural (NS) ORF1b sequences of both YHV-1 (GenBank AA083987) and YHV-2 (GenBank AF227196), indicating an atypical YHV type (A-YHV) phylogenetically equidistant from both types. An RT-PCR test specifically designed for A-YHV revealed that it was uncommon and that its occurrence in shrimp culture ponds did not correlate with growth retardation or mortality. By immunohistochemistry with YHV-specific monoclonal antibodies, the A-YHV gave positive reactions for envelope protein gp64 and capsid protein p20, but not for envelope protein gp116, even though gp116 and gp64 originate from a polyprotein of ORF3. Lack of gp116 immunoreactivity correlated with a large ORF3 deletion (GenBank EU123854) in the region of the protein targeted by an MAb against gp116. Transmission electron microscopy of A-YHV-infected shrimp revealed only unenveloped pre-virions. During manuscript revision, information received revealed that typing of YHV isolates based on sequences of ORF1b and ORF3 had yielded several geographical types, including one virulent type (YHV-1b) with an ORF3 deletion sequence that matched the sequence of A-YHV. Using these sequences and an additional A-YHV sequence (EU853170) from the ORF1b typing region, A-YHV potentially represents a recombinant between type 1b and type 5. SDS-PAGE and Western blot analysis revealed that type 1b produced a gp116 deletion protein that did not bind with the MAb or polyclonal Ab to normal gp116. Overall, the information suggested that lack of A-YHV virulence was associated with the NS gene sequence linked to ORF1b rather than the deletion in ORF3.

  5. Mutation of the mouse Syce1 gene disrupts synapsis and suggests a link between synaptonemal complex structural components and DNA repair.

    Directory of Open Access Journals (Sweden)

    Ewelina Bolcun-Filas

    2009-02-01

    Full Text Available In mammals, the synaptonemal complex is a structure required to complete crossover recombination. Although suggested by cytological work, in vivo links between the structural proteins of the synaptonemal complex and the proteins of the recombination process have not previously been made. The central element of the synaptonemal complex is traversed by DNA at sites of recombination and presents a logical place to look for interactions between these components. There are four known central element proteins, three of which have previously been mutated. Here, we complete the set by creating a null mutation in the Syce1 gene in mouse. The resulting disruption of synapsis in these animals has allowed us to demonstrate a biochemical interaction between the structural protein SYCE2 and the repair protein RAD51. In normal meiosis, this interaction may be responsible for promoting homologous synapsis from sites of recombination.

  6. CCDB: a curated database of genes involved in cervix cancer.

    Science.gov (United States)

    Agarwal, Subhash M; Raghav, Dhwani; Singh, Harinder; Raghava, G P S

    2011-01-01

    The Cervical Cancer gene DataBase (CCDB, http://crdd.osdd.net/raghava/ccdb) is a manually curated catalog of experimentally validated genes that are thought or known to be involved in the different stages of cervical carcinogenesis. Despite the large number of women presently affected by this malignancy, no database exists that catalogs information on genes associated with cervical cancer. Therefore, we have compiled 537 genes in CCDB that are linked with cervical cancer causation processes such as methylation, gene amplification, mutation, polymorphism and change in expression level, as evident from published literature. Each record contains gene details such as architecture (exon-intron structure), location, function, sequences (mRNA/CDS/protein), ontology, interacting partners, homology to other eukaryotic genomes, structure and links to other public databases, thus augmenting CCDB with external data. Also, manually curated literature references have been provided to support the inclusion of the gene in the database and establish its association with cervix cancer. In addition, CCDB provides information on microRNA altered in cervical cancer as well as a search facility for querying, several browse options and an online tool for sequence similarity search, thereby providing researchers with easy access to the latest information on genes involved in cervix cancer.

  7. Aldehyde Dehydrogenase Gene Superfamily in Populus: Organization and Expression Divergence between Paralogous Gene Pairs.

    Science.gov (United States)

    Tian, Feng-Xia; Zang, Jian-Lei; Wang, Tan; Xie, Yu-Li; Zhang, Jin; Hu, Jian-Jun

    2015-01-01

    Aldehyde dehydrogenases (ALDHs) constitute a superfamily of NAD(P)+-dependent enzymes that catalyze the irreversible oxidation of a wide range of reactive aldehydes to their corresponding nontoxic carboxylic acids. ALDHs have been studied in many organisms from bacteria to mammals; however, no systematic analyses incorporating genome organization, gene structure, expression profiles, and cis-acting elements have been conducted in the model tree species Populus trichocarpa thus far. In this study, a comprehensive analysis of the Populus ALDH gene superfamily was performed. A total of 26 Populus ALDH genes were found to be distributed across 12 chromosomes. Genomic organization analysis indicated that purifying selection may have played a pivotal role in the retention and maintenance of PtALDH gene families. The exon-intron organizations of PtALDHs were highly conserved within the same family, suggesting that the members of the same family also may have conserved functionalities. Microarray data and qRT-PCR analysis indicated that most PtALDHs had distinct tissue-specific expression patterns. The specificity of cis-acting elements in the promoter regions of the PtALDHs and the divergence of expression patterns between nine paralogous PtALDH gene pairs suggested that gene duplications may have freed the duplicate genes from the functional constraints. The expression levels of some ALDHs were up- or down-regulated by various abiotic stresses, implying that the products of these genes may be involved in the adaptation of Populus to abiotic stresses. Overall, the data obtained from our investigation contribute to a better understanding of the complexity of the Populus ALDH gene superfamily and provide insights into the function and evolution of ALDH gene families in vascular plants.

  8. Aldehyde Dehydrogenase Gene Superfamily in Populus: Organization and Expression Divergence between Paralogous Gene Pairs.

    Directory of Open Access Journals (Sweden)

    Feng-Xia Tian

    Full Text Available Aldehyde dehydrogenases (ALDHs) constitute a superfamily of NAD(P)+-dependent enzymes that catalyze the irreversible oxidation of a wide range of reactive aldehydes to their corresponding nontoxic carboxylic acids. ALDHs have been studied in many organisms from bacteria to mammals; however, no systematic analyses incorporating genome organization, gene structure, expression profiles, and cis-acting elements have been conducted in the model tree species Populus trichocarpa thus far. In this study, a comprehensive analysis of the Populus ALDH gene superfamily was performed. A total of 26 Populus ALDH genes were found to be distributed across 12 chromosomes. Genomic organization analysis indicated that purifying selection may have played a pivotal role in the retention and maintenance of PtALDH gene families. The exon-intron organizations of PtALDHs were highly conserved within the same family, suggesting that the members of the same family also may have conserved functionalities. Microarray data and qRT-PCR analysis indicated that most PtALDHs had distinct tissue-specific expression patterns. The specificity of cis-acting elements in the promoter regions of the PtALDHs and the divergence of expression patterns between nine paralogous PtALDH gene pairs suggested that gene duplications may have freed the duplicate genes from the functional constraints. The expression levels of some ALDHs were up- or down-regulated by various abiotic stresses, implying that the products of these genes may be involved in the adaptation of Populus to abiotic stresses. Overall, the data obtained from our investigation contribute to a better understanding of the complexity of the Populus ALDH gene superfamily and provide insights into the function and evolution of ALDH gene families in vascular plants.

  9. Long SAGE analysis of genes differentially expressed in the midgut ...

    African Journals Online (AJOL)

    USER

    identification of genes related to sexual disparity in silk protein production efficiency. ... Ysh, a yellow cocoon color sex-limited strain of the silkworm B. mori, ...... alternative splicing of human genes. ... Structure, function and evolution of.

  10. Partial genomic structure, mutation analysis and mapping of the porcine inhibitor of DNA binding genes ID1, ID2, ID3 and ID4

    Czech Academy of Sciences Publication Activity Database

    Stratil, Antonín; Horák, Pavel; Filkuková, Jitka; Van Poucke, M.; Bartenschlager, H.; Peelman, L. J.; Geldermann, H.

    2010-01-01

    Vol. 41 (2010), pp. 558-559 ISSN 0268-9146 R&D Projects: GA ČR(CZ) GA523/06/1302; GA ČR GA523/09/0844 Institutional research plan: CEZ:AV0Z50450515 Keywords: genomic structure * muscle-specific genes * porcine Subject RIV: GI - Animal Husbandry; Breeding Impact factor: 2.203, year: 2010

  11. Evolution and Expression Patterns of CYC/TB1 Genes in Anacyclus: Phylogenetic Insights for Floral Symmetry Genes in Asteraceae

    Science.gov (United States)

    Bello, María A.; Cubas, Pilar; Álvarez, Inés; Sanjuanbenito, Guillermo; Fuertes-Aguilar, Javier

    2017-01-01

    Homologs of the CYC/TB1 gene family have been independently recruited many times across the eudicots to control aspects of floral symmetry. The family Asteraceae exhibits the largest known diversification in this gene paralog family accompanied by a parallel morphological floral richness in its specialized head-like inflorescence. In Asteraceae, whether or not CYC/TB1 gene floral symmetry function is preserved along organismic and gene lineages is unknown. In this study, we used phylogenetic, structural and expression analyses focused on the highly derived genus Anacyclus (tribe Anthemideae) to address this question. Phylogenetic reconstruction recovered eight main gene lineages present in Asteraceae: two from CYC1, four from CYC2 and two from CYC3-like genes. The species phylogeny was recovered in most of the gene lineages, allowing the delimitation of orthologous sets of CYC/TB1 genes in Asteraceae. Quantitative real-time PCR analysis indicated that in Anacyclus three of the four isolated CYC2 genes are more highly expressed in ray flowers. The expression of the four AcCYC2 genes overlaps in several organs including the ligule of ray flowers, as well as in anthers and ovules throughout development. PMID:28487706

  12. Visualizing conserved gene location across microbe genomes

    Science.gov (United States)

    Shaw, Chris D.

    2009-01-01

    This paper introduces an analysis-based zoomable visualization technique for displaying the location of genes across many related species of microbes. The purpose of this visualization is to enable a biologist to examine the layout of genes in the organism of interest with respect to the gene organization of related organisms. During the genomic annotation process, the ability to observe gene organization in common with previously annotated genomes can help a biologist better confirm the structure and function of newly analyzed microbe DNA sequences. We have developed a visualization and analysis tool that enables the biologist to observe and examine gene organization among genomes, in the context of the primary sequence of interest. This paper describes the visualization and analysis steps, and presents a case study using a number of Rickettsia genomes.

  13. Cloning and characterization of a Candida albicans maltase gene involved in sucrose utilization.

    Science.gov (United States)

    Geber, A; Williamson, P R; Rex, J H; Sweeney, E C; Bennett, J E

    1992-01-01

    In order to isolate the structural gene involved in sucrose utilization, we screened a sucrose-induced Candida albicans cDNA library for clones expressing alpha-glucosidase activity. The C. albicans maltase structural gene (CAMAL2) was isolated. No other clones expressing alpha-glucosidase activity were detected. A genomic CAMAL2 clone was obtained by screening a size-selected genomic library with the cDNA clone. DNA sequence analysis reveals that CAMAL2 encodes a 570-amino-acid protein which shares 50% identity with the maltase structural gene (MAL62) of Saccharomyces carlsbergensis. The substrate specificity of the recombinant protein purified from Escherichia coli identifies the enzyme as a maltase. Northern (RNA) analysis reveals that transcription of CAMAL2 is induced by maltose and sucrose and repressed by glucose. These results suggest that assimilation of sucrose in C. albicans relies on an inducible maltase enzyme. The family of genes controlling sucrose utilization in C. albicans shares similarities with the MAL gene family of Saccharomyces cerevisiae and provides a model system for studying gene regulation in this pathogenic yeast. PMID:1400249

  14. The structural and functional connectivity of the grassland plant Lychnis flos-cuculi

    Science.gov (United States)

    Aavik, T; Holderegger, R; Bolliger, J

    2014-01-01

    Understanding the relationship between structural and functional connectivity is essential for successful restoration and conservation management, particularly in intensely managed agricultural landscapes. We evaluated the relationship between structural and functional connectivity of the wetland plant Lychnis flos-cuculi in a fragmented agricultural landscape using landscape genetic and network approaches. First, we studied the effect of structural connectivity, such as geographic distance and various landscape elements (forest, agricultural land, settlements and ditch verges), on gene flow among populations as a measurement of functional connectivity. Second, we examined the effect of structural graph-theoretic connectivity measures on gene flow among populations and on genetic diversity within populations of L. flos-cuculi. Among landscape elements, forests hindered gene flow in L. flos-cuculi, whereas gene flow was independent of geographic distance. Among the structural graph-theoretic connectivity variables, only intrapopulation connectivity, which was based on population size, had a significant positive effect on gene flow, that is, more gene flow took place among larger populations. Unexpectedly, interpopulation connectivity of populations, which takes into account the spatial location and distance among populations, did not influence gene flow in L. flos-cuculi. However, higher observed heterozygosity and lower inbreeding were observed in populations characterised by higher structural interpopulation connectivity. This finding shows that a spatially coherent network of populations is significant for maintaining the genetic diversity of populations. Nevertheless, lack of significant relationships between gene flow and most of the structural connectivity measures suggests that structural connectivity does not necessarily correspond to functional connectivity. PMID:24253937

  15. Fragmentation of the large subunit ribosomal RNA gene in oyster mitochondrial genomes

    Directory of Open Access Journals (Sweden)

    Milbury Coren A

    2010-09-01

    Full Text Available Abstract Background: Discontinuous genes have been observed in bacteria, archaea, and eukaryotic nuclei, mitochondria and chloroplasts. Gene discontinuity occurs in multiple forms: the two most frequent forms result from introns that are spliced out of the RNA and the resulting exons are spliced together to form a single transcript, and fragmented gene transcripts that are not covalently attached post-transcriptionally. Within the past few years, fragmented ribosomal RNA (rRNA) genes have been discovered in bilateral metazoan mitochondria, all within a group of related oysters. Results: In this study, we have characterized this fragmentation with comparative analysis and experimentation. We present secondary structures, modeled using comparative sequence analysis of the discontinuous mitochondrial large subunit rRNA genes of the cupped oysters C. virginica, C. gigas, and C. hongkongensis. Comparative structure models for the large subunit rRNA in each of the three oyster species are generally similar to those for other bilateral metazoans. We also used RT-PCR and analyzed ESTs to determine if the two fragmented LSU rRNAs are spliced together. The two segments are transcribed separately, and not spliced together although they still form functional rRNAs and ribosomes. Conclusions: Although many examples of discontinuous ribosomal genes have been documented in bacteria and archaea, as well as the nuclei, chloroplasts, and mitochondria of eukaryotes, oysters are some of the first characterized examples of fragmented bilateral animal mitochondrial rRNA genes. The secondary structures of the oyster LSU rRNA fragments have been predicted on the basis of previous comparative metazoan mitochondrial LSU rRNA structure models.

  16. From gene to structure: Lactobacillus bulgaricus D-lactate dehydrogenase from yogurt as an integrated curriculum model for undergraduate molecular biology and biochemistry laboratory courses.

    Science.gov (United States)

    Lawton, Jeffrey A; Prescott, Noelle A; Lawton, Ping X

    2018-05-01

    We have developed an integrated, project-oriented curriculum for undergraduate molecular biology and biochemistry laboratory courses spanning two semesters that is organized around the ldhA gene from the yogurt-fermenting bacterium Lactobacillus bulgaricus, which encodes the enzyme D-lactate dehydrogenase. The molecular biology module, which consists of nine experiments carried out over eleven sessions, begins with the isolation of genomic DNA from L. bulgaricus in yogurt and guides students through the process of cloning the ldhA gene into a prokaryotic expression vector, followed by mRNA isolation and characterization of recombinant gene expression levels using RT-PCR. The biochemistry module, which consists of nine experiments carried out over eight sessions, begins with overexpression of the cloned ldhA gene and guides students through the process of affinity purification, biochemical characterization of the purified LdhA protein, and analysis of enzyme kinetics using various substrates and an inhibitor, concluding with a guided inquiry investigation of structure-function relationships in the three-dimensional structure of LdhA using molecular visualization software. Students conclude by writing a paper describing their work on the project, formatted as a manuscript to be submitted for publication in a scientific journal. Overall, this curriculum, with its emphasis on experiential learning, provides hands-on training with a variety of common laboratory techniques in molecular biology and biochemistry and builds experience with the process of scientific reasoning, along with reinforcement of essential transferrable skills such as critical thinking, information literacy, and written communication, all within the framework of an extended project having the look and feel of a research experience. © 2018 by The International Union of Biochemistry and Molecular Biology, 46(3):270-278, 2018.

  17. Somatostatin, substance P and calcitonin gene-related peptide-positive intramural nerve structures of the human large intestine affected by carcinoma.

    Directory of Open Access Journals (Sweden)

    Jerzy Kaleczyc

    2010-11-01

    Full Text Available The aim of this study was to investigate the arrangement and chemical coding of enteric nerve structures in the human large intestine affected by cancer. Tissue samples comprising all layers of the intestinal wall were collected during surgery from both morphologically unchanged and pathologically altered segments of the intestine (n=15), and fixed by immersion in buffered paraformaldehyde solution. The cryostat sections were processed for double-labelling immunofluorescence to study the distribution of the intramural nerve structures (visualized with antibodies against protein gene-product 9.5) and their chemical coding using antibodies against somatostatin (SOM), substance P (SP) and calcitonin gene-related peptide (CGRP). The microscopic observations revealed distinct morphological differences in the enteric nerve system structure between the region adjacent to the cancer invaded area and the intact part of the intestine. In general, infiltration of the cancer tissue resulted in the gradual (depending on the grade of invasion) first decomposition and reduction to final partial or complete destruction and absence of the neuronal elements. A comparative analysis of immunohistochemically labeled sections (from the unchanged and pathologically altered areas) revealed a statistically significant decrease in the number of CGRP-positive neurons and nerve fibres in both submucous and myenteric plexuses in the transitional zone between morphologically unchanged and cancer-invaded areas. In this zone, a decrease was also observed in the density of SP-positive nerve fibres in all intramural plexuses. Conversely, the investigations demonstrated statistically insignificant differences in number of SP- and SOM-positive neurons and a similar density of SOM-positive nerve fibres in the plexuses of the intact and pathologically changed areas.
The differentiation between the potential adaptive changes in ENS or destruction of its elements by cancer invasion should be

  18. Using co-occurrence network structure to extract synonymous gene and protein names from MEDLINE abstracts

    Directory of Open Access Journals (Sweden)

    Spackman K

    2005-04-01

    Full Text Available Abstract Background: Text-mining can assist biomedical researchers in reducing information overload by extracting useful knowledge from large collections of text. We developed a novel text-mining method based on analyzing the network structure created by symbol co-occurrences as a way to extend the capabilities of knowledge extraction. The method was applied to the task of automatic gene and protein name synonym extraction. Results: Performance was measured on a test set consisting of about 50,000 abstracts from one year of MEDLINE. Synonyms retrieved from curated genomics databases were used as a gold standard. The system obtained a maximum F-score of 22.21% (23.18% precision and 21.36% recall), with high efficiency in the use of seed pairs. Conclusion: The method performs comparably with other studied methods, does not rely on sophisticated named-entity recognition, and requires little initial seed knowledge.
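
    The F-score quoted in this record combines precision and recall into one number. The abstract does not say which F-measure was used, so the sketch below assumes the common balanced F1 (the harmonic mean); the function name f_score and its beta parameter are illustrative and not taken from the paper.

```python
def f_score(precision, recall, beta=1.0):
    """Weighted harmonic mean of precision and recall (F-beta).

    beta=1.0 gives the balanced F1 score. Assumption: the record does not
    state which F-measure the authors used.
    """
    if precision <= 0 or recall <= 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as reported in the abstract, in percent.
reported_precision = 23.18
reported_recall = 21.36

f1 = f_score(reported_precision, reported_recall)
print(f"F1 from reported precision and recall: {f1:.2f}%")
```

    Under this assumption the result is roughly 22.2%, in line with the reported maximum of 22.21%; the small remaining difference may come from rounding of the published figures.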

  19. Targeted gene insertion for molecular medicine.

    Science.gov (United States)

    Voigt, Katrin; Izsvák, Zsuzsanna; Ivics, Zoltán

    2008-11-01

    Genomic insertion of a functional gene together with suitable transcriptional regulatory elements is often required for long-term therapeutic benefit in gene therapy for several genetic diseases. A variety of integrating vectors for gene delivery exist. Some of them exhibit random genomic integration, whereas others have integration preferences based on attributes of the targeted site, such as primary DNA sequence and physical structure of the DNA, or through tethering to certain DNA sequences by host-encoded cellular factors. Uncontrolled genomic insertion bears the risk of the transgene being silenced due to chromosomal position effects, and can lead to genotoxic effects due to mutagenesis of cellular genes. None of the vector systems currently used in either preclinical experiments or clinical trials displays sufficient preferences for target DNA sequences that would ensure appropriate and reliable expression of the transgene and simultaneously prevent hazardous side effects. We review in this paper the advantages and disadvantages of both viral and non-viral gene delivery technologies, discuss mechanisms of target site selection of integrating genetic elements (viruses and transposons), and suggest distinct molecular strategies for targeted gene delivery.

  20. Mutational analysis of two structural genes of the temperate lactococcal bacteriophage TP901-1 involved in tail length determination and baseplate assembly

    DEFF Research Database (Denmark)

    Pedersen, Margit; Østergaard, Solvej; Bresciani, José

    2000-01-01

    Two putative structural genes, orf tmp (tape measure protein) and orf bpp (baseplate protein), of the temperate lactococcal phage TP901-1 were examined by introduction of specific mutations in the prophage strain Lactococcus lactis ssp. cremoris 901-1. The adsorption efficiencies of the mutated...... or duplication of 29% in orf tmp was shown to shorten or lengthen the phage tail by approximately 30%, respectively. The orf tmp is proposed to function as a tape measure protein, TMP, important for assembly of the TP901-1 phage tail and involved in tail length determination. Specific mutations in orf bpp...... produced phages which were unable to adsorb to the indicator strain and electron microscopy revealed particles lacking the baseplate structure. The orf bpp is proposed to encode a highly immunogenic structural baseplate protein, BPP, important for assembly of the baseplate. Finally, an assembly pathway...

  1. Identification of Structural and Immunity Genes of a Class IIb Bacteriocin Encoded in the Enterocin A Operon of Enterococcus faecium Strain MXVK29.

    Science.gov (United States)

    Escamilla-Martínez, E E; Cisneros, Y M Álvarez; Fernández, F J; Quirasco-Baruch, M; Ponce-Alquicira, E

    2017-10-09

    The Enterococcus faecium strain MXVK29, isolated from fermented sausages, produces a bacteriocin with a molecular mass of 3.5 kDa that belongs to the class of enterocins II.1, according to the terminal amino acid sequence, and has been identified as enterocin A. This bacteriocin is active against selected strains of Listeria, Staphylococcus, Pediococcus, and Enterococcus. In this study, we identified the genes adjacent to the structural gene for this bacteriocin, such as the immunity gene (entI) and the inducer gene (entF). Accessory genes for this bacteriocin, such as entK, entR, and entT, were identified as well, in addition to orf2 and orf3, which show high identity with class IIb peptide bacteriocins. The orf2 shows the consensus motif GxxxG, similar to those shown by bacteriocins such as PlnNC8α, EntCα, and Ent1071A, whereas orf3 shows a consensus motif SxxxS similar to that present in PlnNC8β (AxxxA). PlnNC8 is expressed only in bacterial cocultures, so there is the possibility that the expression of this two-peptide bacteriocin can be induced by a similar mechanism. So far, only the expression of enterocin A has been found in this strain; however, the presence of the genes ent29α and ent29β opens the possibility for further research on its induction, functionality, and origin. Although there are reports on this type of bacteriocin (EntX, EntC, and Ent1071) in other strains of E. faecium, no report exists yet on an Enterococcus strain producing two different classes of bacteriocin.

  2. Genome-wide analysis of WRKY gene family in the sesame genome and identification of the WRKY genes involved in responses to abiotic stresses.

    Science.gov (United States)

    Li, Donghua; Liu, Pan; Yu, Jingyin; Wang, Linhai; Dossa, Komivi; Zhang, Yanxin; Zhou, Rong; Wei, Xin; Zhang, Xiurong

    2017-09-11

    Sesame (Sesamum indicum L.) is one of the world's most important oil crops. However, it is susceptible to abiotic stresses in general, and to waterlogging and drought stresses in particular. The molecular mechanisms of abiotic stress tolerance in sesame have not yet been elucidated. The WRKY domain transcription factors play significant roles in plant growth, development, and responses to stresses. However, little is known about the number, location, structure, molecular phylogenetics, and expression of the WRKY genes in sesame. We performed a comprehensive study of the WRKY gene family in sesame and identified 71 SiWRKYs. In total, 65 of these genes were mapped to 15 linkage groups within the sesame genome. A phylogenetic analysis was performed using a related species (Arabidopsis thaliana) to investigate the evolution of the sesame WRKY genes. Tissue expression profiles of the WRKY genes demonstrated that six SiWRKY genes were highly expressed in all organs, suggesting that these genes may be important for plant growth and organ development in sesame. Analysis of the SiWRKY gene expression patterns revealed that 33 and 26 SiWRKYs respond strongly to waterlogging and drought stresses, respectively. Changes in the expression of 12 SiWRKY genes were observed at different times after the waterlogging and drought treatments had begun, demonstrating that sesame gene expression patterns vary in response to abiotic stresses. In this study, we analyzed the WRKY family of transcription factors encoded by the sesame genome. Insight was gained into the classification, evolution, and function of the SiWRKY genes, revealing their putative roles in a variety of tissues. Responses to abiotic stresses in different sesame cultivars were also investigated. The results of our study provide a better understanding of the structures and functions of sesame WRKY genes and suggest that manipulating these WRKYs could enhance resistance to waterlogging and drought.

  3. Canine candidate genes for dilated cardiomyopathy: annotation of and polymorphic markers for 14 genes

    Directory of Open Access Journals (Sweden)

    van Oost Bernard A

    2007-10-01

    Full Text Available Abstract Background: Dilated cardiomyopathy is a myocardial disease occurring in humans and domestic animals and is characterized by dilatation of the left ventricle, reduced systolic function and increased sphericity of the left ventricle. Dilated cardiomyopathy has been observed in several, mostly large and giant, dog breeds, such as the Dobermann and the Great Dane. A number of genes have been identified, which are associated with dilated cardiomyopathy in the human, mouse and hamster. These genes mainly encode structural proteins of the cardiac myocyte. Results: We present the annotation of, and marker development for, 14 of these genes of the dog genome, i.e. α-cardiac actin, caveolin 1, cysteine-rich protein 3, desmin, lamin A/C, LIM-domain binding factor 3, myosin heavy polypeptide 7, phospholamban, sarcoglycan δ, titin cap, α-tropomyosin, troponin I, troponin T and vinculin. A total of 33 Single Nucleotide Polymorphisms were identified for these canine genes and 11 polymorphic microsatellite repeats were developed. Conclusion: The presented polymorphisms provide a tool to investigate the role of the corresponding genes in canine Dilated Cardiomyopathy by linkage analysis or association studies.

  4. Integrative annotation of 21,037 human genes validated by full-length cDNA clones.

    Directory of Open Access Journals (Sweden)

    Tadashi Imanishi

    2004-06-01

    Full Text Available The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions.
We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA

  5. Gene function analysis by artificial microRNAs in Physcomitrella patens.

    KAUST Repository

    Khraiwesh, Basel; Fattash, Isam; Arif, Muhammad Asif; Frank, Wolfgang

    2011-01-01

    MicroRNAs (miRNAs) are ~21 nt long small RNAs transcribed from endogenous MIR genes which form precursor RNAs with a characteristic hairpin structure. miRNAs control the expression of cognate target genes by binding to reverse complementary

  6. Spectral biclustering of microarray data: coclustering genes and conditions.

    Science.gov (United States)

    Kluger, Yuval; Basri, Ronen; Chang, Joseph T; Gerstein, Mark

    2003-04-01

    Global analyses of RNA expression levels are useful for classifying genes and overall phenotypes. Often these classification problems are linked, and one wants to find "marker genes" that are differentially expressed in particular sets of "conditions." We have developed a method that simultaneously clusters genes and conditions, finding distinctive "checkerboard" patterns in matrices of gene expression data, if they exist. In a cancer context, these checkerboards correspond to genes that are markedly up- or downregulated in patients with particular types of tumors. Our method, spectral biclustering, is based on the observation that checkerboard structures in matrices of expression data can be found in eigenvectors corresponding to characteristic expression patterns across genes or conditions. In addition, these eigenvectors can be readily identified by commonly used linear algebra approaches, in particular the singular value decomposition (SVD), coupled with closely integrated normalization steps. We present a number of variants of the approach, depending on whether the normalization over genes and conditions is done independently or in a coupled fashion. We then apply spectral biclustering to a selection of publicly available cancer expression data sets, and examine the degree to which the approach is able to identify checkerboard structures. Furthermore, we compare the performance of our biclustering methods against a number of reasonable benchmarks (e.g., direct application of SVD or normalized cuts to raw data).
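
    The idea in this record, that checkerboard structure shows up in the singular vectors of a normalized expression matrix, can be illustrated on a tiny synthetic example. The sketch below is not the authors' implementation: it assumes the independent row/column scaling variant, replaces a library SVD with plain power iteration so that it needs only the standard library, and splits the second singular vectors by sign where a real analysis would more likely cluster them. All names and the planted 6 x 8 data set are made up for illustration.

```python
import math
import random


def normalize(matrix):
    """Scale each entry a_ij by 1 / sqrt(row_sum_i * col_sum_j)."""
    row_sums = [sum(row) for row in matrix]
    col_sums = [sum(col) for col in zip(*matrix)]
    scaled = [[a / math.sqrt(r * c) for a, c in zip(row, col_sums)]
              for row, r in zip(matrix, row_sums)]
    return scaled, col_sums


def unit_orthogonal(v, v1):
    """Remove the component of v along unit vector v1, then rescale to length 1."""
    overlap = sum(a * b for a, b in zip(v, v1))
    v = [a - overlap * b for a, b in zip(v, v1)]
    norm = math.sqrt(sum(a * a for a in v))
    return [a / norm for a in v]


def second_singular_pair(scaled, col_sums, iterations=100):
    """Power iteration for the second singular vectors of the scaled matrix."""
    total = sum(col_sums)
    # For a positive matrix scaled as above, the leading right singular
    # vector is sqrt(column sums) with singular value 1; it carries no
    # cluster information, so it is projected out at every step.
    v1 = [math.sqrt(c / total) for c in col_sums]
    rng = random.Random(0)
    v = unit_orthogonal([rng.uniform(-1.0, 1.0) for _ in col_sums], v1)
    for _ in range(iterations):
        u = [sum(a * b for a, b in zip(row, v)) for row in scaled]
        w = [sum(row[j] * u[i] for i, row in enumerate(scaled))
             for j in range(len(v))]
        v = unit_orthogonal(w, v1)
    u = [sum(a * b for a, b in zip(row, v)) for row in scaled]
    return u, v


# Synthetic "expression" matrix with a planted 2 x 2 checkerboard:
# high values where the row group matches the column group, low otherwise.
row_truth = [0, 0, 0, 1, 1, 1]
col_truth = [0, 0, 0, 0, 1, 1, 1, 1]
noise = random.Random(1)
data = [[(5.0 if r == c else 1.0) + noise.uniform(-0.2, 0.2) for c in col_truth]
        for r in row_truth]

scaled, col_sums = normalize(data)
u, v = second_singular_pair(scaled, col_sums)

# The sign of each entry assigns a row ("gene") or column ("condition") to a group.
row_labels = [int(x > 0) for x in u]
col_labels = [int(x > 0) for x in v]
print("row groups:", row_labels)
print("column groups:", col_labels)
```

    Because singular vectors are defined only up to sign, the recovered groups may come out with the labels 0 and 1 swapped relative to the planted ones.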

  7. Ultrasound-mediated structural changes in cells revealed by FTIR spectroscopy: A contribution to the optimization of gene and drug delivery

    Science.gov (United States)

    Grimaldi, Paola; Di Giambattista, Lucia; Giordani, Serena; Udroiu, Ion; Pozzi, Deleana; Gaudenzi, Silvia; Bedini, Angelico; Giliberti, Claudia; Palomba, Raffaele; Congiu Castellano, Agostina

    2011-12-01

    Ultrasound effects on biological samples are attracting growing interest, particularly for the safe and efficient intracellular delivery of drugs and genes. Future progress in this field will require a better understanding of how ultrasound and acoustic cavitation affect the properties of biological systems. The morphological changes of cells due to ultrasound (US) exposure have been extensively studied, while little attention has been given to the cells' structural changes. We exposed two different cell lines, Jurkat T-lymphocytes and NIH-3T3 fibroblasts, used as models in apoptosis and gene therapy studies, respectively, to 1 MHz ultrasound of the kind currently used in therapy. Fourier Transform Infrared (FTIR) spectroscopy was used as a probe to reveal structural changes in particular molecular groups belonging to the main biological systems. The genotoxic damage of cells exposed to ultrasound was ascertained by the Cytokinesis-Block Micronucleus (CBMN) assay. The FTIR spectroscopy results, combined with multivariate statistical analysis of all cellular components (lipids, proteins, nucleic acids) of the two cell lines, show that Jurkat cells are more sensitive to therapeutic ultrasound in the lipid and protein regions, whereas NIH-3T3 cells are more sensitive in the nucleic acid region; a meaningful genotoxic effect is present in both cell lines only for long sonication times, while in Jurkat cells a significant cytotoxic effect is also revealed for long exposure times.

  8. In silico Analysis of the Functional and Structural Impacts of Non-synonymous Single Nucleotide Polymorphisms in the Human Paraxonase 1 Gene

    Directory of Open Access Journals (Sweden)

    Sudip Paul

    2015-09-01

    Full Text Available Computational approaches can help identify deleterious non-synonymous single nucleotide polymorphisms (nsSNPs) in a disease-related gene, a task that is difficult and laborious through laboratory experiments. In the present study, we analyzed the impacts of nsSNPs on the structure and function of Paraoxonase 1 (PON1) using different bioinformatics tools. The human PON1 protein sequence and its corresponding gene's SNP information were collected from the UniProt and dbSNP databases, respectively. We utilized the SIFT, Polyphen, I-Mutant 2.0, MutPred, SNPs&GO, PhD-SNP and PANTHER tools to examine the total of 39 nsSNPs occurring in the PON1 coding region. We filtered the most pathological mutations by combining the scores of the aforementioned servers and found 8 SNPs (G344C, S302L, W281C, D279Y, H134R, F120S, L90P, C42R) to be deleterious and disease causing. The structure of the PON1 protein was obtained from the RCSB Protein Data Bank (PDB ID: 1V04). The deleterious SNPs were introduced into native PON1 using the Swiss-PDB Viewer package, and changes in free energy were observed for six out of eight mutant structures. Two SNPs, S302L (substitution of serine with leucine at position 302 of the amino acid sequence) and L90P (substitution of leucine with proline at position 90), caused the highest energy increase of all. The findings suggest that these nsSNPs should be analyzed in further detail to assess their possible association with protein deterioration and disease.
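
    The substitution codes in this record (for example S302L) follow the usual pattern of wild-type residue, sequence position, mutant residue. The short sketch below decodes that notation for the eight variants listed in the abstract; the function name describe_substitution and the lookup table are illustrative helpers, not part of any tool named in the record.

```python
import re

# Standard one-letter codes for the 20 common amino acids.
AMINO_ACIDS = {
    "A": "alanine", "R": "arginine", "N": "asparagine", "D": "aspartic acid",
    "C": "cysteine", "Q": "glutamine", "E": "glutamic acid", "G": "glycine",
    "H": "histidine", "I": "isoleucine", "L": "leucine", "K": "lysine",
    "M": "methionine", "F": "phenylalanine", "P": "proline", "S": "serine",
    "T": "threonine", "W": "tryptophan", "Y": "tyrosine", "V": "valine",
}

# Wild-type letter, position, mutant letter, e.g. "S302L".
SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def describe_substitution(code):
    """Turn a code such as 'S302L' into a plain-language description."""
    match = SUBSTITUTION.match(code)
    if match is None:
        raise ValueError(f"not a single-residue substitution: {code!r}")
    wild, position, mutant = match.groups()
    if wild not in AMINO_ACIDS or mutant not in AMINO_ACIDS:
        raise ValueError(f"unknown amino acid letter in {code!r}")
    return f"{AMINO_ACIDS[wild]} to {AMINO_ACIDS[mutant]} at position {position}"


# The eight substitutions the abstract reports as deleterious.
deleterious = ["G344C", "S302L", "W281C", "D279Y", "H134R", "F120S", "L90P", "C42R"]
for code in deleterious:
    print(f"{code}: {describe_substitution(code)}")
```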

  9. Genes and Gene Therapy

    Science.gov (United States)

    ... correctly, a child can have a genetic disorder. Gene therapy is an experimental technique that uses genes to ... or prevent disease. The most common form of gene therapy involves inserting a normal gene to replace an ...

  10. Bacterial community structure in High-Arctic snow and freshwater as revealed by pyrosequencing of 16S rRNA genes and cultivation

    Directory of Open Access Journals (Sweden)

    Annette K. Møller

    2013-04-01

    Full Text Available The bacterial community structures in High-Arctic snow over sea ice and an ice-covered freshwater lake were examined by pyrosequencing of 16S rRNA genes and 16S rRNA gene sequencing of cultivated isolates. Both the pyrosequence and cultivation data indicated that the phylogenetic composition of the microbial assemblages was different within the snow layers and between snow and freshwater. The highest diversity was seen in snow. In the middle and top snow layers, Proteobacteria, Bacteroidetes and Cyanobacteria dominated, although Actinobacteria and Firmicutes were relatively abundant also. High numbers of chloroplasts were also observed. In the deepest snow layer, large percentages of Firmicutes and Fusobacteria were seen. In freshwater, Bacteroidetes, Actinobacteria and Verrucomicrobia were the most abundant phyla while relatively few Proteobacteria and Cyanobacteria were present. Possibly, light intensity controlled the distribution of the Cyanobacteria and algae in the snow while carbon and nitrogen fixed by these autotrophs in turn fed the heterotrophic bacteria. In the lake, a probable lower light input relative to snow resulted in low numbers of Cyanobacteria and chloroplasts and, hence, limited input of organic carbon and nitrogen to the heterotrophic bacteria. Thus, differences in the physicochemical conditions may play an important role in the processes leading to distinctive bacterial community structures in High-Arctic snow and freshwater.

  11. Effects of heterocyclic-based head group modifications on the structure-activity relationship of tocopherol-based lipids for non-viral gene delivery.

    Science.gov (United States)

    Gosangi, Mallikarjun; Mujahid, Thasneem Yoosuf; Gopal, Vijaya; Patri, Srilakshmi V

    2016-07-12

    Gene therapy, a promising strategy for the delivery of therapeutic nucleic acids, is greatly dependent on the development of efficient vectors. In this study, we designed and synthesized several tocopherol-based lipids varying in the head group region. Here, we present the structure-activity relationship of stable aqueous suspensions of lipids that were synthetically prepared and formulated with 1,2-dioleoyl phosphatidyl ethanolamine (DOPE) as the co-lipid. The physicochemical properties such as the hydrodynamic size, zeta potential, stability and morphology of these formulations were investigated. Interaction with plasmid DNA was clearly demonstrated through gel binding and EtBr displacement assays. Further, the transfection potential was examined in mouse neuroblastoma Neuro-2a, hepatocarcinoma HepG2, human embryonic kidney and Chinese hamster ovarian cell lines, all of different origins. Cell-uptake assays with N-methylpiperidinium, N-methylmorpholinium, N-methylimidazolium and N,N-dimethylaminopyridinium head group containing formulations evidently depicted efficient cell uptake as observed by particulate cytoplasmic fluorescence. Trafficking of lipoplexes using an endocytic marker and rhodamine-labeled phospholipid DHPE indicated that the lipoplexes were not sequestered in the lysosomes. Importantly, lipoplexes were non-toxic and mediated good transfection efficiency as analyzed by β-Gal and GFP reporter gene expression assays which established the superior activity of lipids whose structures correlate strongly with the transfection efficiency.

  12. Structural analysis of the RH-like blood group gene products in nonhuman primates

    Energy Technology Data Exchange (ETDEWEB)

    Salvignol, I. [Centre Regional de Transfusion Sanguine, Toulouse (France)]; Calvas, P.; Blancher, A. [Universitaire d'Immunogenetique moleculaire, Toulouse (France)]; Socha, W.W. [University Medical Center, New York, NY (United States)]; Colin, Y.; Le Van Kim, C.; Bailly, P.; Cartron, J.P. [Institut National de la Transfusion Sanguine, Paris (France)]; Ruffie, J.; Blancher, A. [College de France, Paris (France)]

    1995-03-01

    Rh-related transcripts present in bone marrow samples from several species of nonhuman primates (chimpanzee, gorilla, gibbon, crab-eating macaque) have been amplified by RT-polymerase chain reaction using primers deduced from the sequence of human RH genes. Nucleotide sequence analysis of the nonhuman transcripts revealed a high degree of similarity to human blood group Rh sequences, suggesting a great conservation of the RH genes throughout evolution. Full-length transcripts, potentially encoding 417 amino acid long proteins homologous to Rh polypeptides, were characterized, as well as mRNA isoforms which harbored nucleotide deletions or insertions and potentially encode truncated proteins. Proteins of 30,000-40,000 M_r, immunologically related to human Rh proteins, were detected by western blot analysis with antipeptide antibodies, indicating that Rh-like transcripts are translated into membrane proteins. Comparison of human and nonhuman protein sequences was pivotal in clarifying the molecular basis of the blood group C/c polymorphism, showing that only the Pro103Ser substitution was correlated with C/c polymorphism. In addition, it was shown that a proline residue at position 102 was critical in the expression of C and c epitopes, most likely by providing an appropriate conformation of Rh polypeptides. From these data a phylogenetic reconstruction of the RH locus evolution has been calculated from which an unrooted phylogenetic tree could be proposed, indicating that African ape Rh-like genes would be closer to the human RhD gene than to the human RhCE gene. 55 refs., 4 figs., 1 tab.

  13. A genome-wide characterization of microRNA genes in maize.

    Directory of Open Access Journals (Sweden)

    Lifang Zhang

    2009-11-01

    Full Text Available MicroRNAs (miRNAs) are small, non-coding RNAs that play essential roles in plant growth, development, and stress response. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling identified 150 high-confidence genes within 26 miRNA families. For 25 families, expression was verified by deep-sequencing of small RNA libraries that were prepared from an assortment of maize tissues. PCR-RACE amplification of 68 miRNA transcript precursors, representing 18 families conserved across several plant species, showed that splice variation and the use of alternative transcriptional start and stop sites are common within this class of genes. Comparison of sequence variation data from diverse maize inbred lines versus teosinte accessions suggests that the mature miRNAs are under strong purifying selection while the flanking sequences evolve equivalently to other genes. Since maize is derived from an ancient tetraploid, the effect of whole-genome duplication on miRNA evolution was examined. We found that, like protein-coding genes, duplicated miRNA genes underwent extensive gene-loss, with approximately 35% of ancestral sites retained as duplicate homoeologous miRNA genes. This number is higher than that observed with protein-coding genes. A search for putative miRNA targets indicated bias towards genes in regulatory and metabolic pathways. As maize is one of the principal models for plant growth and development, this study will serve as a foundation for future research into the functional roles of miRNA genes.

  14. The impact of gene expression variation on the robustness and evolvability of a developmental gene regulatory network.

    Directory of Open Access Journals (Sweden)

    David A Garfield

    2013-10-01

    Full Text Available Regulatory interactions buffer development against genetic and environmental perturbations, but adaptation requires phenotypes to change. We investigated the relationship between robustness and evolvability within the gene regulatory network underlying development of the larval skeleton in the sea urchin Strongylocentrotus purpuratus. We find extensive variation in gene expression in this network throughout development in a natural population, some of which has a heritable genetic basis. Switch-like regulatory interactions predominate during early development, buffer expression variation, and may promote the accumulation of cryptic genetic variation affecting early stages. Regulatory interactions during later development are typically more sensitive (linear), allowing variation in expression to affect downstream target genes. Variation in skeletal morphology is associated primarily with expression variation of a few, primarily structural, genes at terminal positions within the network. These results indicate that the position and properties of gene interactions within a network can have important evolutionary consequences independent of their immediate regulatory role.

  15. First Comprehensive In Silico Analysis of the Functional and Structural Consequences of SNPs in Human GalNAc-T1 Gene

    Directory of Open Access Journals (Sweden)

    Hussein Sheikh Ali Mohamoud

    2014-01-01

    Full Text Available GalNAc-T1, a key candidate of the GalNAc-transferase gene family that is involved in the mucin-type O-linked glycosylation pathway, is expressed in most biological tissues and cell types. Despite the reported association of GalNAc-T1 gene mutations with human disease susceptibility, a comprehensive computational analysis of coding, noncoding and regulatory SNPs, and of their functional impacts at the protein level, is still lacking. Therefore, sequence- and structure-based computational tools were employed to screen all listed coding SNPs of the GalNAc-T1 gene in order to identify and characterize them. Our concordant in silico analysis by the SIFT, PolyPhen-2, PANTHER-cSNP, and SNPeffect tools identified the potential nsSNPs (S143P, G258V, and Y414D variants) from 18 nsSNPs of GalNAc-T1. Additionally, 2 regulatory SNPs (rs72964406 and rs34304568) were also identified in GalNAc-T1 by using the FastSNP tool. Using multiple computational approaches, we have systematically classified the functional mutations in regulatory and coding regions that can modify expression and function of the GalNAc-T1 enzyme. These genetic variants can further assist in better understanding the wide range of disease susceptibility associated with the mucin-based cell signalling and pathogenic binding, and may help to develop novel therapeutic elements for associated diseases.

  16. Recombination Rate Heterogeneity within Arabidopsis Disease Resistance Genes.

    Science.gov (United States)

    Choi, Kyuha; Reinhard, Carsten; Serra, Heïdi; Ziolkowski, Piotr A; Underwood, Charles J; Zhao, Xiaohui; Hardcastle, Thomas J; Yelina, Nataliya E; Griffin, Catherine; Jackson, Matthew; Mézard, Christine; McVean, Gil; Copenhaver, Gregory P; Henderson, Ian R

    2016-07-01

    Meiotic crossover frequency varies extensively along chromosomes and is typically concentrated in hotspots. As recombination increases genetic diversity, hotspots are predicted to occur at immunity genes, where variation may be beneficial. A major component of plant immunity is recognition of pathogen Avirulence (Avr) effectors by resistance (R) genes that encode NBS-LRR domain proteins. Therefore, we sought to test whether NBS-LRR genes would overlap with meiotic crossover hotspots using experimental genetics in Arabidopsis thaliana. NBS-LRR genes tend to physically cluster in plant genomes; for example, in Arabidopsis most are located in large clusters on the south arms of chromosomes 1 and 5. We experimentally mapped 1,439 crossovers within these clusters and observed NBS-LRR gene associated hotspots, which were also detected as historical hotspots via analysis of linkage disequilibrium. However, we also observed NBS-LRR gene coldspots, which in some cases correlate with structural heterozygosity. To study recombination at the fine-scale we used high-throughput sequencing to analyze ~1,000 crossovers within the RESISTANCE TO ALBUGO CANDIDA1 (RAC1) R gene hotspot. This revealed elevated intragenic crossovers, overlapping nucleosome-occupied exons that encode the TIR, NBS and LRR domains. The highest RAC1 recombination frequency was promoter-proximal and overlapped CTT-repeat DNA sequence motifs, which have previously been associated with plant crossover hotspots. Additionally, we show a significant influence of natural genetic variation on NBS-LRR cluster recombination rates, using crosses between Arabidopsis ecotypes. In conclusion, we show that a subset of NBS-LRR genes are strong hotspots, whereas others are coldspots. This reveals a complex recombination landscape in Arabidopsis NBS-LRR genes, which we propose results from varying coevolutionary pressures exerted by host-pathogen relationships, and is influenced by structural heterozygosity.

  17. Associating transcription factors and conserved RNA structures with gene regulation in the human brain

    DEFF Research Database (Denmark)

    Hecker, Nikolai; Seemann, Stefan E.; Silahtaroglu, Asli

    2017-01-01

    Anatomical subdivisions of the human brain can be associated with different neuronal functions. This functional diversification is reflected by differences in gene expression. By analyzing post-mortem gene expression data from the Allen Brain Atlas, we investigated the impact of transcription fac...

  18. Gene analogue finder: a GRID solution for finding functionally analogous gene products

    Directory of Open Access Journals (Sweden)

    Licciulli Flavio

    2007-09-01

    Full Text Available Background: To date more than 2.1 million gene products from more than 100,000 different species have been described, specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO). Such vast, well defined knowledge opens the possibility of comparing gene products at the level of functionality, finding gene products which have a similar function or are involved in similar biological processes without relying on the conventional sequence similarity approach. Comparisons within such a large space of knowledge are highly data and computing intensive. For this reason this project was based upon the use of the computational GRID, a technology offering large computing and storage resources. Results: We have developed a tool, GENe AnaloGue FINdEr (ENGINE), that parallelizes the search process and distributes the calculation and data over the computational GRID, splitting the process into many sub-processes and joining the calculation and the data on the same machine, and therefore completing the whole search in about 3 days instead of occupying one single machine for more than 5 CPU years. The results of the functional comparison contain potential functional analogues for more than 79,000 gene products from the most important species. 46% of the analyzed gene products are well enough described for such an analysis to identify functional analogues, such as well-known members of the same gene family, or gene products with similar functions which would never have been associated by standard methods. Conclusion: ENGINE has produced a list of potential functionally analogous relations between gene products within and between species using, in place of the sequence, the gene description of the GO, thus demonstrating the potential of the GO. However, the current limiting factor is the quality of the associations of many gene products from non

  19. Gene diversity, agroecological structure and introgression patterns among village chicken populations across North, West and Central Africa

    Directory of Open Access Journals (Sweden)

    Leroy Grégoire

    2012-05-01

    Full Text Available Background: Chickens represent an important animal genetic resource for improving farmers’ income in Africa. The present study provides a comparative analysis of the genetic diversity of village chickens across a subset of African countries. Four hundred seventy-two chickens were sampled in 23 administrative provinces across Cameroon, Benin, Ghana, Côte d’Ivoire, and Morocco. Geographical coordinates were recorded to analyze the relationships between geographic distribution and genetic diversity. Molecular characterization was performed with a set of 22 microsatellite markers. Five commercial lines, broilers and layers, were also genotyped to investigate potential gene flow. A genetic diversity analysis was conducted both within and between populations. Results: High heterozygosity levels, ranging from 0.51 to 0.67, were reported for all local populations, corresponding to the values usually found in scavenging populations worldwide. Allelic richness varied from 2.04 for a commercial line to 4.84 for one population from Côte d’Ivoire. Evidence of gene flow between commercial and local populations was observed in Morocco and in Cameroon, which could be related to long-term improvement programs with the distribution of crossbred chicks. The impact of such introgressions seemed rather limited, probably because of poor adaptation of exotic birds to village conditions, and because of the consumers’ preference for local chickens. No such gene flow was observed in Benin, Ghana, and Côte d’Ivoire, where improvement programs are also less developed. The clustering approach revealed an interesting similarity between local populations found in regions sharing high levels of precipitation, from Cameroon to Côte d’Ivoire. Restricting the study to Benin, Ghana, and Côte d’Ivoire did not result in a typical breed structure but a south-west to north-east gradient was observed. Three genetically differentiated areas (P  Conclusions

  20. Microbial community structure of Arctic multiyear sea ice and surface seawater by 454 sequencing of the 16S RNA gene

    DEFF Research Database (Denmark)

    Bowman, Jeff S.; Rasmussen, Simon; Blom, Nikolaj

    2011-01-01

    community in MYI at two sites near the geographic North Pole using parallel tag sequencing of the 16S rRNA gene. Although the composition of the MYI microbial community has been characterized by previous studies, microbial community structure has not been. Although richness was lower in MYI than....... In addition, several low-abundance clades not previously reported in sea ice were present, including the phylum TM7 and the classes Spartobacteria and Opitutae. Members of Coraliomargarita, a recently described genus of the class Opitutae, were present in sufficient numbers to suggest niche occupation within...