WorldWideScience

Sample records for gene cluster involved

  1. Sphingolipids regulate telomere clustering by affecting the transcription of genes involved in telomere homeostasis.

    Science.gov (United States)

    Ikeda, Atsuko; Muneoka, Tetsuya; Murakami, Suguru; Hirota, Ayaka; Yabuki, Yukari; Karashima, Takefumi; Nakazono, Kota; Tsuruno, Masahiro; Pichler, Harald; Shirahige, Katsuhiko; Kodama, Yukiko; Shimamoto, Toshi; Mizuta, Keiko; Funato, Kouichi

    2015-07-15

    In eukaryotic organisms, including mammals, nematodes and yeasts, the ends of chromosomes, telomeres are clustered at the nuclear periphery. Telomere clustering is assumed to be functionally important because proper organization of chromosomes is necessary for proper genome function and stability. However, the mechanisms and physiological roles of telomere clustering remain poorly understood. In this study, we demonstrate a role for sphingolipids in telomere clustering in the budding yeast Saccharomyces cerevisiae. Because abnormal sphingolipid metabolism causes downregulation of expression levels of genes involved in telomere organization, sphingolipids appear to control telomere clustering at the transcriptional level. In addition, the data presented here provide evidence that telomere clustering is required to protect chromosome ends from DNA-damage checkpoint signaling. As sphingolipids are found in all eukaryotes, we speculate that sphingolipid-based regulation of telomere clustering and the protective role of telomere clusters in maintaining genome stability might be conserved in eukaryotes.

  2. Identification and functional analysis of gene cluster involvement in biosynthesis of the cyclic lipopeptide antibiotic pelgipeptin produced by Paenibacillus elgii

    Directory of Open Access Journals (Sweden)

    Qian Chao-Dong

    2012-09-01

    Full Text Available Abstract Background Pelgipeptin, a potent antibacterial and antifungal agent, is a non-ribosomally synthesised lipopeptide antibiotic. This compound consists of a β-hydroxy fatty acid and nine amino acids. To date, there is no information about its biosynthetic pathway. Results A potential pelgipeptin synthetase gene cluster (plp was identified from Paenibacillus elgii B69 through genome analysis. The gene cluster spans 40.8 kb with eight open reading frames. Among the genes in this cluster, three large genes, plpD, plpE, and plpF, were shown to encode non-ribosomal peptide synthetases (NRPSs, with one, seven, and one module(s, respectively. Bioinformatic analysis of the substrate specificity of all nine adenylation domains indicated that the sequence of the NRPS modules is well collinear with the order of amino acids in pelgipeptin. Additional biochemical analysis of four recombinant adenylation domains (PlpD A1, PlpE A1, PlpE A3, and PlpF A1 provided further evidence that the plp gene cluster involved in pelgipeptin biosynthesis. Conclusions In this study, a gene cluster (plp responsible for the biosynthesis of pelgipeptin was identified from the genome sequence of Paenibacillus elgii B69. The identification of the plp gene cluster provides an opportunity to develop novel lipopeptide antibiotics by genetic engineering.

  3. Characterization of the Gene Cluster Involved in Isoprene Metabolism in Rhodococcus sp. Strain AD45

    NARCIS (Netherlands)

    van Hylckama Vlieg, Johan E.T.; Leemhuis, Hans; Lutje Spelberg, Jeffrey H.; Janssen, Dick B.

    2000-01-01

    The genes involved in isoprene (2-methyl-1,3-butadiene) utilization in Rhodococcus sp. strain AD45 were cloned and characterized. Sequence analysis of an 8.5-kb DNA fragment showed the presence of 10 genes of which 2 encoded enzymes which were previously found to be involved in isoprene degradation:

  4. Regulation of a novel gene cluster involved in secondary metabolite production in Streptomyces coelicolor.

    Science.gov (United States)

    Hindra; Pak, Patricia; Elliot, Marie A

    2010-10-01

    Antibiotic biosynthesis in the streptomycetes is a complex and highly regulated process. Here, we provide evidence for the contribution of a novel genetic locus to antibiotic production in Streptomyces coelicolor. The overexpression of a gene cluster comprising four protein-encoding genes (abeABCD) and an antisense RNA-encoding gene (α-abeA) stimulated the production of the blue-pigmented metabolite actinorhodin on solid medium. Actinorhodin production also was enhanced by the overexpression of an adjacent gene (abeR) encoding a predicted Streptomyces antibiotic regulatory protein (SARP), while the deletion of this gene impaired actinorhodin production. We found the abe genes to be differentially regulated and controlled at multiple levels. Upstream of abeA was a promoter that directed the transcription of abeABCD at a low but constitutive level. The expression of abeBCD was, however, significantly upregulated at a time that coincided with the initiation of aerial development and the onset of secondary metabolism; this expression was activated by the binding of AbeR to four heptameric repeats upstream of a promoter within abeA. Expressed divergently to the abeBCD promoter was α-abeA, whose expression mirrored that of abeBCD but did not require activation by AbeR. Instead, α-abeA transcript levels were subject to negative control by the double-strand-specific RNase, RNase III.

  5. Transgene-induced silencing of the zoosporogenesis-specific NIFC gene cluster of Phytophthora infestans involves chromatin alterations.

    Science.gov (United States)

    Judelson, Howard S; Tani, Shuji

    2007-07-01

    Clustered within the genome of the oomycete phytopathogen Phytophthora infestans are four genes encoding spore-specific nuclear LIM interactor-interacting factors (NIF proteins, a type of transcriptional regulator) that are moderately conserved in DNA sequence. NIFC1, NIFC2, and NIFC3 are zoosporogenesis-induced and grouped within 4 kb, and 20 kb away resides a sporulation-induced form, NIFS. To test the function of the NIFC family, plasmids expressing full-length hairpin constructs of NIFC1 or NIFC2 were stably transformed into P. infestans. This triggered silencing of the cognate gene in about one-third of transformants, and all three NIFC genes were usually cosilenced. However, NIFS escaped silencing despite its high sequence similarity to the NIFC genes. Silencing of the three NIFC genes impaired zoospore cyst germination by 60% but did not affect other aspects of the life cycle. Silencing was transcriptional based on nuclear run-on assays and associated with tighter chromatin packing based on nuclease accessibility experiments. The chromatin alterations extended a few hundred nucleotides beyond the boundaries of the transcribed region of the NIFC cluster and were not associated with increased DNA methylation. A plasmid expressing a short hairpin RNA having sequence similarity only to NIFC1 silenced both that gene and an adjacent member of the gene cluster, likely due to the expansion of a heterochromatic domain from the targeted locus. These data help illuminate the mechanism of silencing in Phytophthora and suggest that caution should be used when interpreting silencing experiments involving closely spaced genes.

  6. An Ipomoea batatas iron-sulfur cluster scaffold protein gene, IbNFU1, is involved in salt tolerance.

    Directory of Open Access Journals (Sweden)

    Degao Liu

    Full Text Available Iron-sulfur cluster biosynthesis involving the nitrogen fixation (Nif proteins has been proposed as a general mechanism acting in various organisms. NifU-like protein may play an important role in protecting plants against abiotic and biotic stresses. An iron-sulfur cluster scaffold protein gene, IbNFU1, was isolated from a salt-tolerant sweetpotato (Ipomoea batatas (L. Lam. line LM79 in our previous study, but its role in sweetpotato stress tolerance was not investigated. In the present study, the IbNFU1 gene was introduced into a salt-sensitive sweetpotato cv. Lizixiang to characterize its function in salt tolerance. The IbNFU1-overexpressing sweetpotato plants exhibited significantly higher salt tolerance compared with the wild-type. Proline and reduced ascorbate content were significantly increased, whereas malonaldehyde (MDA content was significantly decreased in the transgenic plants. The activities of superoxide dismutase (SOD and photosynthesis were significantly enhanced in the transgenic plants. H2O2 was also found to be significantly less accumulated in the transgenic plants than in the wild-type. Overexpression of IbNFU1 up-regulated pyrroline-5-carboxylate synthase (P5CS and pyrroline-5-carboxylate reductase (P5CR genes under salt stress. The systemic up-regulation of reactive oxygen species (ROS scavenging genes was found in the transgenic plants under salt stress. These findings suggest that IbNFU1gene is involved in sweetpotato salt tolerance and enhances salt tolerance of the transgenic sweetpotato plants by regulating osmotic balance, protecting membrane integrity and photosynthesis and activating ROS scavenging system.

  7. Clustering of two genes putatively involved in cyanate detoxification evolved recently and independently in multiple fungal lineages

    Science.gov (United States)

    Fungi that have the enzymes cyanase and carbonic anhydrase show a limited capacity to detoxify cyanate, a fungicide employed by both plants and humans. Here, we describe a novel two-gene cluster that comprises duplicated cyanase and carbonic anhydrase copies, which we name the CCA gene cluster, trac...

  8. Two transcription factors, CabA and CabR, are independently involved in multilevel regulation of the biosynthetic gene cluster encoding the novel aminocoumarin, cacibiocin.

    Science.gov (United States)

    Wolański, Marcin; Łebkowski, Tomasz; Kois-Ostrowska, Agnieszka; Zettler, Judith; Apel, Alexander K; Jakimowicz, Dagmara; Zakrzewska-Czerwińska, Jolanta

    2016-04-01

    Aminocoumarins are potent antibiotics belonging to a relatively small group of secondary metabolites produced by actinomycetes. Genome mining of Catenulispora acidiphila has recently led to the discovery of a gene cluster responsible for biosynthesis of novel aminocoumarins, cacibiocins. However, regulation of the expression of this novel gene cluster has not yet been analyzed. In this study, we identify transcriptional regulators of the cacibiocin gene cluster. Using a heterologous expression system, we show that the CabA and CabR proteins encoded by cabA and cabR genes in the cacibiocin gene cluster control the expression of genes involved in the biosynthesis, modification, regulation, and potentially, efflux/resistance of cacibiocins. CabA positively regulates the expression of cabH (the first gene in the cabHIYJKL operon) and cabhal genes encoding key enzymes responsible for the biosynthesis and halogenation of the aminocoumarin moiety, respectively. We provide evidence that CabA is a direct inducer of cacibiocin production, whereas the second transcriptional factor, CabR, is involved in the negative regulation of its own gene and cabT-the latter of which encodes a putative cacibiocin transporter. We also demonstrate that CabR activity is negatively regulated in vitro by aminocoumarin compounds, suggesting the existence of analogous regulation in vivo. Finally, we propose a model of multilevel regulation of gene transcription in the cacibiocin gene cluster by CabA and CabR.

  9. Gene clusters involved in isethionate degradation by terrestrial and marine bacteria.

    KAUST Repository

    Weinitschke, Sonja

    2010-01-01

    Ubiquitous isethionate (2-hydroxyethanesulfonate) is dissimilated by diverse bacteria. Growth of Cupriavidus necator H16 with isethionate was observed, as was inducible membrane-bound isethionate dehydrogenase (IseJ) and inducible transcription of the genes predicted to encode IseJ and a transporter (IseU). Biodiversity in isethionate transport genes was observed and investigated by transcription experiments.

  10. Clustering of two genes putatively involved in cyanate detoxification evolved recently and independently in multiple fungal lineages.

    Science.gov (United States)

    Elmore, M Holly; McGary, Kriston L; Wisecaver, Jennifer H; Slot, Jason C; Geiser, David M; Sink, Stacy; O'Donnell, Kerry; Rokas, Antonis

    2015-03-01

    Fungi that have the enzymes cyanase and carbonic anhydrase show a limited capacity to detoxify cyanate, a fungicide employed by both plants and humans. Here, we describe a novel two-gene cluster that comprises duplicated cyanase and carbonic anhydrase copies, which we name the CCA gene cluster, trace its evolution across Ascomycetes, and examine the evolutionary dynamics of its spread among lineages of the Fusarium oxysporum species complex (hereafter referred to as the FOSC), a cosmopolitan clade of purportedly clonal vascular wilt plant pathogens. Phylogenetic analysis of fungal cyanase and carbonic anhydrase genes reveals that the CCA gene cluster arose independently at least twice and is now present in three lineages, namely Cochliobolus lunatus, Oidiodendron maius, and the FOSC. Genome-wide surveys within the FOSC indicate that the CCA gene cluster varies in copy number across isolates, is always located on accessory chromosomes, and is absent in FOSC's closest relatives. Phylogenetic reconstruction of the CCA gene cluster in 163 FOSC strains from a wide variety of hosts suggests a recent history of rampant transfers between isolates. We hypothesize that the independent formation of the CCA gene cluster in different fungal lineages and its spread across FOSC strains may be associated with resistance to plant-produced cyanates or to use of cyanate fungicides in agriculture.

  11. Genes involved in degradation of para-nitrophenol are differentially arranged in form of non-contiguous gene clusters in Burkholderia sp. strain SJ98.

    Directory of Open Access Journals (Sweden)

    Surendra Vikram

    Full Text Available Biodegradation of para-Nitrophenol (PNP proceeds via two distinct pathways, having 1,2,3-benzenetriol (BT and hydroquinone (HQ as their respective terminal aromatic intermediates. Genes involved in these pathways have already been studied in different PNP degrading bacteria. Burkholderia sp. strain SJ98 degrades PNP via both the pathways. Earlier, we have sequenced and analyzed a ~41 kb fragment from the genomic library of strain SJ98. This DNA fragment was found to harbor all the lower pathway genes; however, genes responsible for the initial transformation of PNP could not be identified within this fragment. Now, we have sequenced and annotated the whole genome of strain SJ98 and found two ORFs (viz., pnpA and pnpB showing maximum identity at amino acid level with p-nitrophenol 4-monooxygenase (PnpM and p-benzoquinone reductase (BqR. Unlike the other PNP gene clusters reported earlier in different bacteria, these two ORFs in SJ98 genome are physically separated from the other genes of PNP degradation pathway. In order to ascertain the identity of ORFs pnpA and pnpB, we have performed in-vitro assays using recombinant proteins heterologously expressed and purified to homogeneity. Purified PnpA was found to be a functional PnpM and transformed PNP into benzoquinone (BQ, while PnpB was found to be a functional BqR which catalyzed the transformation of BQ into hydroquinone (HQ. Noticeably, PnpM from strain SJ98 could also transform a number of PNP analogues. Based on the above observations, we propose that the genes for PNP degradation in strain SJ98 are arranged differentially in form of non-contiguous gene clusters. This is the first report for such arrangement for gene clusters involved in PNP degradation. Therefore, we propose that PNP degradation in strain SJ98 could be an important model system for further studies on differential evolution of PNP degradation functions.

  12. Genome mining of the hitachimycin biosynthetic gene cluster: involvement of a phenylalanine-2,3-aminomutase in biosynthesis.

    Science.gov (United States)

    Kudo, Fumitaka; Kawamura, Koichi; Uchino, Asuka; Miyanaga, Akimasa; Numakura, Mario; Takayanagi, Ryuichi; Eguchi, Tadashi

    2015-04-13

    Hitachimycin is a macrolactam antibiotic with (S)-β-phenylalanine (β-Phe) at the starter position of its polyketide skeleton. To understand the incorporation mechanism of β-Phe and the modification mechanism of the unique polyketide skeleton, the biosynthetic gene cluster for hitachimycin in Streptomyces scabrisporus was identified by genome mining. The identified gene cluster contains a putative phenylalanine-2,3-aminomutase (PAM), five polyketide synthases, four β-amino-acid-carrying enzymes, and a characteristic amidohydrolase. A hitA knockout mutant showed no hitachimycin production, but antibiotic production was restored by feeding with (S)-β-Phe. We also confirmed the enzymatic activity of the HitA PAM. The results suggest that the identified gene cluster is responsible for the biosynthesis of hitachimycin. A plausible biosynthetic pathway for hitachimycin, including a unique polyketide skeletal transformation mechanism, is proposed.

  13. A novel and complete gene cluster involved in the degradation of aniline by Delftia sp.AN3

    Institute of Scientific and Technical Information of China (English)

    ZHANG Tao; ZHANG Jinglei; LIU Shuangjiang; LIU Zhipei

    2008-01-01

    A recombinant strain, Escherichia coli JM109-AN1,was obtained by constructing of a genomic library of the total DNA of Delftia sp.AN3 in E. coli JM109 and screening for catechol 2,3-dioxygenase activity.This recombinant strain could grow on aniline as sole carbou,nitrogen and energy source.Enzymatic assays revealed that the exogenous genes including aniline dioxygenase (AD) and catechol 2,3-dioxygenase (C23O) genes could well express in the recombinant strain with the activities of AD and C23O up to O.31 U/mg wet cell and 1.92 U/mg crude proteins.respectively.The AD or C23O of strain AN3 could only catalyze aniline or catechol but not any other substituted substrates.This recombinant strain contained a recombinant plasmid,pKC505-AN1,in which a 29.7-kb DNA fragment from Delftia sp.AN3 was inserted.Sequencing and open reading frame (orfs) analysis of this 29.7 kb fragment revealed that it contained at least 27 orfs,among them a gene cluster (consisting of at least 16 genes,named danQTAlA2BRDCEFGlHIJKG2) was responsible for the complete metabolism of aniline to TCA-cycle intermediates.This gene cluster could be divided into two main parts,the upper sequences consisted of 7 genes (danQTAlA2BRD) were predicted to encode a multi-component aniline dioxygenase and a LysR-type regulator, and the central genes (danCEFGIHIJKG2) were expected to encode meta-cleavage pathway enzymes for catechol degradation to TCA-cycle intermediates.Unlike clusters tad from Delftia tsuruhatensis AD9 and tdn from Pseudomonas put/da UCC22,in this gene cluster,all the genes were in the Same transcriptional direction.There was only one set of C23O gene (danC) and ferredoxin-like protein gene fdanD).The presence of only one set of these two genes and specificity of AD and C23O might be the reason for strain AN3 could only degrade aniline.The products ofdanQTA1A2BRDC showed 99%-100% identity to those from Delflia acidovorans 7N.and 50%-85% identity to those of tad cluster from D.tsuruhatensis AD9 in

  14. Involvement of gene methylation changes in the differentiation of human amniotic epithelial cells into islet-like cell clusters.

    Science.gov (United States)

    Peng, Lin; Wang, Jian; Lu, Guangxiu

    2014-09-01

    Insulin-dependent diabetes results from destruction of the insulin-producing β-cells of the pancreas. Islet cell transplantation is a promising cure for diabetes. Here, we induced human amniotic epithelial cells (hAECs) to differentiate into islet-like cell clusters by nicotinamide plus betacellulin in vitro, and further investigated the DNA methylation status by a Nimble MeDIP microarray before and after cell differentiation to shed light on the molecular mechanisms of this differentiation. In addition, 5-Aza-2'-deoxycytidine was used to investigate whether the differentiation of hAECs into islet-like cells occurred through demethylation. Purified hAECs (CK18(+)/E-cadherin(+)/CD29(+)/CD90(-)/CD34(-)/CD45(-)) were isolated from human amnia. After induction, hAECs were found to be insulin positive and sensitive to glucose, indicating successful induction to islet-like cells. The methylation status of cell cytoskeleton-related genes was down-regulated and that of negative regulation of cell adhesion-related genes was up-regulated. The methylation status of pancreas development-related genes such as HNF1α and DGAT1 was decreased in hAECs after induction. After brief demethylation, INS gene expression was up-regulated in islet-like cell clusters, suggesting that DNA methylation changes were associated with the differentiation of hAECs into islet-like cell clusters.

  15. Functional analysis of the gene cluster involved in production of the bacteriocin circularin A by Clostridium beijerinckii ATCC 25752

    NARCIS (Netherlands)

    Kemperman, R; Jonker, M; Nauta, A; Kuipers, OP; Kok, J

    2003-01-01

    A region of 12 kb flanking the structural gene of the cyclic antibacterial peptide circularin A of Clostridium beijerinckii ATCC 25752 was sequenced, and the putative proteins involved in the production and secretion of circularin A were identified. The genes are tightly organized in overlapping ope

  16. The exopolysaccharide gene cluster Bcam1330-Bcam1341 is involved in Burkholderia cenocepacia biofilm formation, and its expression is regulated by c-di-GMP and Bcam1349

    DEFF Research Database (Denmark)

    Fazli, Mustafa; McCarthy, Yvonne; Givskov, Michael

    2013-01-01

    In Burkholderia cenocepacia, the second messenger cyclic diguanosine monophosphate (c-di-GMP) has previously been shown to positively regulate biofilm formation and the expression of cellulose and type-I fimbriae genes through binding to the transcriptional regulator Bcam1349. Here, we provide...... evidence that cellulose and type-I fimbriae are not involved in B. cenocepacia biofilm formation in flow chambers, and we identify a novel Bcam1349/c-di-GMP-regulated exopolysaccharide gene cluster which is essential for B. cenocepacia biofilm formation. Overproduction of Bcam1349 in trans promotes wrinkly...... matrix exopolysaccharide and to be essential for flow-chamber biofilm formation. We demonstrate that Bcam1349 binds to the promoter region of genes in the Bcam1330-Bcam1341 cluster and that this binding is enhanced by the presence of c-di-GMP. Furthermore, we demonstrate that overproduction of both c...

  17. The nit1C gene cluster of Pseudomonas pseudoalcaligenes CECT5344 involved in assimilation of nitriles is essential for growth on cyanide.

    Science.gov (United States)

    Estepa, Jessica; Luque-Almagro, Victor M; Manso, Isabel; Escribano, M Paz; Martínez-Luque, Manuel; Castillo, Francisco; Moreno-Vivián, Conrado; Roldán, M Dolores

    2012-06-01

    A proteomic approach was used to identify several proteins induced by cyanide in the alkaliphilic bacterium Pseudomonas pseudoalcaligenes CECT5344, two of them, NitB and NitG, encoded by genes that belong to the nit1C gene cluster. The predicted products of the nit1C gene cluster are a Fis-like σ(54) -dependent transcriptional activator (NitA), a nitrilase (NitC), an S-adenosylmethionine superfamily member (NitD), an N-acyltransferase superfamily member (NitE), a trifunctional polypeptide of the AIRS/GARS family (NitF), an NADH-dependent oxidoreductase (NitH) and two hypothetical proteins of unknown function (NitB and NitG). RT-PCR analysis suggested that nitBCDEFGH genes were co-transcribed, whereas the regulatory nitA gene was divergently transcribed. Real-time RT-PCR revealed that expression of the nitBCDEFGH genes was induced by cyanide and repressed by ammonium. The P. pseudoalcaligenes CECT5344 nit1C gene cluster was found to be involved in assimilation of free and organic cyanides (nitriles) as deduced for the inability to grow with cyanides showed by the NitA, NitB and NitC mutant strains. The wild-type strain CECT5344 showed a nitrilase activity that allows growth on cyanide or hydroxynitriles. The NitB and NitC mutants only presented low basal levels of nitrilase activity that were not enough to support growth on either free cyanide or aliphatic nitriles, suggesting that nitrilase NitC is specific and essential for cyanide and aliphatic nitriles assimilation.

  18. Gene Cluster Statistics with Gene Families

    Science.gov (United States)

    Durand, Dannie

    2009-01-01

    Identifying genomic regions that descended from a common ancestor is important for understanding the function and evolution of genomes. In distantly related genomes, clusters of homologous gene pairs are evidence of candidate homologous regions. Demonstrating the statistical significance of such “gene clusters” is an essential component of comparative genomic analyses. However, currently there are no practical statistical tests for gene clusters that model the influence of the number of homologs in each gene family on cluster significance. In this work, we demonstrate empirically that failure to incorporate gene family size in gene cluster statistics results in overestimation of significance, leading to incorrect conclusions. We further present novel analytical methods for estimating gene cluster significance that take gene family size into account. Our methods do not require complete genome data and are suitable for testing individual clusters found in local regions, such as contigs in an unfinished assembly. We consider pairs of regions drawn from the same genome (paralogous clusters), as well as regions drawn from two different genomes (orthologous clusters). Determining cluster significance under general models of gene family size is computationally intractable. By assuming that all gene families are of equal size, we obtain analytical expressions that allow fast approximation of cluster probabilities. We evaluate the accuracy of this approximation by comparing the resulting gene cluster probabilities with cluster probabilities obtained by simulating a realistic, power-law distributed model of gene family size, with parameters inferred from genomic data. Surprisingly, despite the simplicity of the underlying assumption, our method accurately approximates the true cluster probabilities. It slightly overestimates these probabilities, yielding a conservative test. We present additional simulation results indicating the best choice of parameter values for data

  19. Fe2+ chelator proferrorosamine A: a gene cluster of Erwinia rhapontici P45 involved in its synthesis and its impact on growth of Erwinia amylovora CFBP1430.

    Science.gov (United States)

    Born, Yannick; Remus-Emsermann, Mitja N P; Bieri, Marco; Kamber, Tim; Piel, Jörn; Pelludat, Cosima

    2016-02-01

    Proferrorosamine A (proFRA) is an iron (Fe2+) chelator produced by the opportunistic plant pathogen Erwinia rhapontici P45. To identify genes involved in proFRA synthesis, transposon mutagenesis was performed. The identified 9.3 kb gene cluster, comprising seven genes, designated rosA-rosG, encodes proteins that are involved in proFRA synthesis. Based on gene homologies, a biosynthetic pathway model for proFRA is proposed. To obtain a better understanding of the effect of proFRA on non-proFRA producing bacteria, E. rhapontici P45 was co-cultured with Erwinia amylovora CFBP1430, a fire-blight-causing plant pathogen. E. rhapontici P45, but not corresponding proFRA-negative mutants, led to a pink coloration of E. amylovora CFBP1430 colonies on King's B agar, indicating accumulation of the proFRA-iron complex ferrorosamine, and growth inhibition in vitro. By saturating proFRA-containing extracts with Fe2+, the inhibitory effect was neutralized, suggesting that the iron-chelating capability of proFRA is responsible for the growth inhibition of E. amylovora CFBP1430.

  20. Analysis of Two Gene Clusters Involved in the Degradation of 4-Fluorophenol by Arthrobacter sp Strain IF1

    NARCIS (Netherlands)

    Ferreira, Maria Isabel M.; Iida, Toshiya; Hasan, Syed A.; Nakamura, Kaoru; Fraaije, Marco W.; Janssen, Dick B.; Kudo, Toshiaki

    2009-01-01

    Arthrobacter sp. strain IF1 is able to grow on 4-fluorophenol (4-FP) as a sole source of carbon and energy. To clone the 4-FP degradation genes, DNA libraries were constructed and screened with a probe obtained by PCR using primers designed on the basis of conserved regions of aromatic two-component

  1. Characterization of the cysK2-ctl1-cysE2 gene cluster involved in sulfur metabolism in Lactobacillus casei.

    Science.gov (United States)

    Bogicevic, Biljana; Irmler, Stefan; Portmann, Reto; Meile, Leo; Berthoud, Hélène

    2012-01-16

    The up- and downstream regions of ctl1 and ctl2 that encode a cystathionine lyase were analyzed in various Lactobacillus casei strains. ctl1 and ctl2 were found to be part of a gene cluster encoding two other open reading frames. One of the two open reading frames precedes ctl1 and encodes a putative cysteine synthase. The other open reading frame lies downstream of ctl1 and encodes a putative serine acetyltransferase. The gene cluster is not present in the publicly available genome sequences of L. casei ATCC 334, BL23 and Zhang. Apparently, the gene cluster was acquired by a horizontal gene transfer event and can also be found in other lactic acid bacteria such as Lactobacillus helveticus, Lactobacillus delbrueckii subsp. bulgaricus and Streptococcus thermophilus. RT-PCR was used to analyze the expression of the gene cluster. Additionally, an mass spectrometry-based selected reaction monitoring method was developed for quantifying Ctl1 in a cell-free extract of lactic acid bacteria. The gene cluster cysK2-ctl1-cysE2 was expressed as single transcript, and expression was down-regulated by cysteine. In addition, cystathionine lyase activity present in cell-free extracts disappeared when L. casei was grown in the presence of cysteine. Whereas the transcript and the gene product of ctl1 protein were found in all studied ctl1(+)L. casei strains, only the transcript but not the protein or cystathionine lyase activity was detected in L. helveticus FAM2888, L. delbrueckii subsp. bulgaricus ATCC 11842 and S. thermophilus FAM17014, which actually possess a homolog of the cysK2-ctl1-cysE2 gene cluster.

  2. Clustering of gene ontology terms in genomes.

    Science.gov (United States)

    Tiirikka, Timo; Siermala, Markku; Vihinen, Mauno

    2014-10-25

    Although protein coding genes occupy only a small fraction of genomes in higher species, they are not randomly distributed within or between chromosomes. Clustering of genes with related function(s) and/or characteristics has been evident at several different levels. To study how common the clustering of functionally related genes is and what kind of functions the end products of these genes are involved, we collected gene ontology (GO) terms for complete genomes and developed a method to detect previously undefined gene clustering. Exhaustive analysis was performed for seven widely studied species ranging from human to Escherichia coli. To overcome problems related to varying gene lengths and densities, a novel method was developed and a fixed number of genes were analyzed irrespective of the genome span covered. Statistically very significant GO term clustering was apparent in all the investigated genomes. The analysis window, which ranged from 5 to 50 consecutive genes, revealed extensive GO term clusters for genes with widely varying functions. Here, the most interesting and significant results are discussed and the complete dataset for each analyzed species is available at the GOme database at http://bioinf.uta.fi/GOme. The results indicated that clusters of genes with related functions are very common, not only in bacteria, in which operons are frequent, but also in all the studied species irrespective of how complex they are. There are some differences between species but in all of them GO term clusters are common and of widely differing sizes. The presented method can be applied to analyze any genome or part of a genome for which descriptive features are available, and thus is not restricted to ontology terms. This method can also be applied to investigate gene and protein expression patterns. The results pave a way for further studies of mechanisms that shape genome structure and evolutionary forces related to them. Copyright © 2014 Elsevier B.V. All

  3. FunGeneClusterS

    DEFF Research Database (Denmark)

    Vesth, Tammi Camilla; Brandl, Julian; Andersen, Mikael Rørdam

    2016-01-01

    and industrial biotechnology applications. We have previously published a method for accurate prediction of clusters from genome and transcriptome data, which could also suggest cross-chemistry, however, this method was limited both in the number of parameters which could be adjusted as well as in user......Secondary metabolites of fungi are receiving an increasing amount of interest due to their prolific bioactivities and the fact that fungal biosynthesis of secondary metabolites often occurs from co-regulated and co-located gene clusters. This makes the gene clusters attractive for synthetic biology...

  4. Identification of an ancestral resistance gene cluster involved in the coevolution process between Phaseolus vulgaris and its fungal pathogen Colletotrichum lindemuthianum.

    Science.gov (United States)

    Geffroy, V; Sicard, D; de Oliveira, J C; Sévignac, M; Cohen, S; Gepts, P; Neema, C; Langin, T; Dron, M

    1999-09-01

    The recent cloning of plant resistance (R) genes and the sequencing of resistance gene clusters have shed light on the molecular evolution of R genes. However, up to now, no attempt has been made to correlate this molecular evolution with the host-pathogen coevolution process at the population level. Cross-inoculations were carried out between 26 strains of the fungal pathogen Colletotrichum lindemuthianum and 48 Phaseolus vulgaris plants collected in the three centers of diversity of the host species. A high level of diversity for resistance against the pathogen was revealed. Most of the resistance specificities were overcome in sympatric situations, indicating an adaptation of the pathogen to the local host. In contrast, plants were generally resistant to allopatric strains, suggesting that R genes that were efficient against exotic strains but had been overcome locally were maintained in the plant genome. These results indicated that coevolution processes between the two protagonists led to a differentiation for resistance in the three centers of diversity of the host. To improve our understanding of the molecular evolution of these different specificities, a recombinant inbred (RI) population derived from two representative genotypes of the Andean (JaloEEP558) and Mesoamerican (BAT93) gene pools was used to map anthracnose specificities. A gene cluster comprising both Andean (Co-y; Co-z) and Mesoamerican (Co-9) host resistance specificities was identified, suggesting that this locus existed prior to the separation of the two major gene pools of P. vulgaris. Molecular analysis revealed a high level of complexity at this locus. It harbors 11 restriction fragment length polymorphisms when R gene analog (RGA) clones are used. The relationship between the coevolution process and diversification of resistance specificities at resistance gene clusters is discussed.

  5. The rise of operon-like gene clusters in plants.

    Science.gov (United States)

    Boycheva, Svetlana; Daviet, Laurent; Wolfender, Jean-Luc; Fitzpatrick, Teresa B

    2014-07-01

    Gene clusters are common features of prokaryotic genomes also present in eukaryotes. Most clustered genes known are involved in the biosynthesis of secondary metabolites. Although horizontal gene transfer is a primary source of prokaryotic gene cluster (operon) formation and has been reported to occur in eukaryotes, the predominant source of cluster formation in eukaryotes appears to arise de novo or through gene duplication followed by neo- and sub-functionalization or translocation. Here we aim to provide an overview of the current knowledge and open questions related to plant gene cluster functioning, assembly, and regulation. We also present potential research approaches and point out the benefits of a better understanding of gene clusters in plants for both fundamental and applied plant science.

  6. Transcription of the Escherichia coli dcw cluster: evidence for distal upstream transcripts being involved in the expression of the downstream ftsZ gene.

    Science.gov (United States)

    de la Fuente, A; Palacios, P; Vicente, M

    2001-01-01

    Escherichia coli strains VIP596 and VIP597 have been constructed to compare the amount of transcription of the ftsZ gene derived from proximal promoters in the ddlB-ftsZ region with that originating in the upstream regions of the dcw cluster. Both strains have in common a beta-galactosidase reporter fusion located at the ddlB locus, but differ in that VIP597 has a transcription terminator Omega interposon located downstream from lacZ. In addition, these strains have the ddlB, ftsQ, ftsA and ftsZ genes under the control of the IPTG-inducible promoter P(tac), allowing to control artificially ftsZ expression for normal cell division to take place. When beta-galactosidase activity was measured in VIP596 and VIP597 and compared to the levels measured in strain VIP407, in which the lacZ reporter fusion is located in the ftsZ gene, they were found to account for nearly 66% of the total transcription entering into ftsZ. This result indicates that the reduction in ftsZ transcription observed when the promoters in the ddlB-ftsA region are disconnected from the upstream sequences of the dcw cluster (as observed by Flärdh et al., Mol. Microbiol. 30 (1998) 305-316) in strain VIP490) is the direct consequence of the interruption in the transcription originated upstream and not due to the effect of such sequences on the promoters proximal to ftsZ.

  7. The Myxococcus xanthus Two-Component System CorSR Regulates Expression of a Gene Cluster Involved in Maintaining Copper Tolerance during Growth and Development

    Science.gov (United States)

    Sánchez-Sutil, María Celestina; Pérez, Juana; Gómez-Santos, Nuria; Shimkets, Lawrence J.; Moraleda-Muñoz, Aurelio; Muñoz-Dorado, José

    2013-01-01

    Myxococcus xanthus is a soil-dwelling member of the δ–Proteobacteria that exhibits a complex developmental cycle upon starvation. Development comprises aggregation and differentiation into environmentally resistant myxospores in an environment that includes fluctuations in metal ion concentrations. While copper is essential for M. xanthus cells because several housekeeping enzymes use it as a cofactor, high copper concentrations are toxic. These opposing effects force cells to maintain a tight copper homeostasis. A plethora of paralogous genes involved in copper detoxification, all of which are differentially regulated, have been reported in M. xanthus. The use of in-frame deletion mutants and fusions with the reporter gene lacZ has allowed the identification of a two-component system, CorSR, that modulates the expression of an operon termed curA consisting of nine genes whose expression slowly increases after metal addition, reaching a plateau. Transcriptional regulation of this operon is complex because transcription can be initiated at different promoters and by different types of regulators. These genes confer copper tolerance during growth and development. Copper induces carotenoid production in a ΔcorSR mutant at lower concentrations than with the wild-type strain due to lack of expression of a gene product resembling subunit III of cbb3-type cytochrome c oxidase. This data may explain why copper induces carotenoid biosynthesis at suboptimal rather than optimal growth conditions in wild-type strains. PMID:23874560

  8. The Myxococcus xanthus two-component system CorSR regulates expression of a gene cluster involved in maintaining copper tolerance during growth and development.

    Directory of Open Access Journals (Sweden)

    María Celestina Sánchez-Sutil

    Full Text Available Myxococcus xanthus is a soil-dwelling member of the δ-Proteobacteria that exhibits a complex developmental cycle upon starvation. Development comprises aggregation and differentiation into environmentally resistant myxospores in an environment that includes fluctuations in metal ion concentrations. While copper is essential for M. xanthus cells because several housekeeping enzymes use it as a cofactor, high copper concentrations are toxic. These opposing effects force cells to maintain a tight copper homeostasis. A plethora of paralogous genes involved in copper detoxification, all of which are differentially regulated, have been reported in M. xanthus. The use of in-frame deletion mutants and fusions with the reporter gene lacZ has allowed the identification of a two-component system, CorSR, that modulates the expression of an operon termed curA consisting of nine genes whose expression slowly increases after metal addition, reaching a plateau. Transcriptional regulation of this operon is complex because transcription can be initiated at different promoters and by different types of regulators. These genes confer copper tolerance during growth and development. Copper induces carotenoid production in a ΔcorSR mutant at lower concentrations than with the wild-type strain due to lack of expression of a gene product resembling subunit III of cbb3-type cytochrome c oxidase. This data may explain why copper induces carotenoid biosynthesis at suboptimal rather than optimal growth conditions in wild-type strains.

  9. Diversity of capsular polysaccharide gene clusters in Kpc-producing Klebsiella pneumoniae clinical isolates of sequence type 258 involved in the Italian epidemic.

    Science.gov (United States)

    D'Andrea, Marco Maria; Amisano, Francesco; Giani, Tommaso; Conte, Viola; Ciacci, Nagaia; Ambretti, Simone; Santoriello, Luisa; Rossolini, Gian Maria

    2014-01-01

    Strains of Klebsiella pneumoniae producing KPC-type beta-lactamases (KPC-Kp) are broadly disseminating worldwide and constitute a major healthcare threat given their extensively drug resistant phenotypes and ability to rapidly disseminate in healthcare settings. In this work we report on the characterization of two different capsular polysaccharide (CPS) gene clusters, named cpsBO-4 and cps207-2, from two KPC-Kp clinical strains from Italy belonging in sequence type (ST) 258, which is one of the most successful ST of KPC-Kp spreading worldwide. While cpsBO-4 was different from known 78 K-types according to the recently proposed typing schemes based on the wzi or wzc gene sequences, cps207-2 was classified as K41 by one of these methods. Bioinformatic analysis revealed that they were represented in the genomic sequences of KPC-Kp from strains of ST258 from different countries, and cpsBO-4 was also detected in a KPC-Kp strain of ST442 from Brazil. Investigation of a collection of 46 ST258 and ST512 (a single locus variant of ST258) clinical strains representative of the recent Italian epidemic of KPC-Kp by means of a multiplex PCR typing approach revealed that cpsBO-4 was the most prevalent type, being detected both in ST258 and ST512 strains with a countrywide distribution, while cps207-2 was only detected in ST258 strains with a more restricted distribution.

  10. Diversity of Capsular Polysaccharide Gene Clusters in Kpc-Producing Klebsiella pneumoniae Clinical Isolates of Sequence Type 258 Involved in the Italian Epidemic

    Science.gov (United States)

    D’Andrea, Marco Maria; Amisano, Francesco; Giani, Tommaso; Conte, Viola; Ciacci, Nagaia; Ambretti, Simone; Santoriello, Luisa; Rossolini, Gian Maria

    2014-01-01

    Strains of Klebsiella pneumoniae producing KPC-type beta-lactamases (KPC-Kp) are broadly disseminating worldwide and constitute a major healthcare threat given their extensively drug resistant phenotypes and ability to rapidly disseminate in healthcare settings. In this work we report on the characterization of two different capsular polysaccharide (CPS) gene clusters, named cpsBO-4 and cps207-2, from two KPC-Kp clinical strains from Italy belonging in sequence type (ST) 258, which is one of the most successful ST of KPC-Kp spreading worldwide. While cpsBO-4 was different from known 78 K-types according to the recently proposed typing schemes based on the wzi or wzc gene sequences, cps207-2 was classified as K41 by one of these methods. Bioinformatic analysis revealed that they were represented in the genomic sequences of KPC-Kp from strains of ST258 from different countries, and cpsBO-4 was also detected in a KPC-Kp strain of ST442 from Brazil. Investigation of a collection of 46 ST258 and ST512 (a single locus variant of ST258) clinical strains representative of the recent Italian epidemic of KPC-Kp by means of a multiplex PCR typing approach revealed that cpsBO-4 was the most prevalent type, being detected both in ST258 and ST512 strains with a countrywide distribution, while cps207-2 was only detected in ST258 strains with a more restricted distribution. PMID:24823690

  11. Diversity of capsular polysaccharide gene clusters in Kpc-producing Klebsiella pneumoniae clinical isolates of sequence type 258 involved in the Italian epidemic.

    Directory of Open Access Journals (Sweden)

    Marco Maria D'Andrea

    Full Text Available Strains of Klebsiella pneumoniae producing KPC-type beta-lactamases (KPC-Kp are broadly disseminating worldwide and constitute a major healthcare threat given their extensively drug resistant phenotypes and ability to rapidly disseminate in healthcare settings. In this work we report on the characterization of two different capsular polysaccharide (CPS gene clusters, named cpsBO-4 and cps207-2, from two KPC-Kp clinical strains from Italy belonging in sequence type (ST 258, which is one of the most successful ST of KPC-Kp spreading worldwide. While cpsBO-4 was different from known 78 K-types according to the recently proposed typing schemes based on the wzi or wzc gene sequences, cps207-2 was classified as K41 by one of these methods. Bioinformatic analysis revealed that they were represented in the genomic sequences of KPC-Kp from strains of ST258 from different countries, and cpsBO-4 was also detected in a KPC-Kp strain of ST442 from Brazil. Investigation of a collection of 46 ST258 and ST512 (a single locus variant of ST258 clinical strains representative of the recent Italian epidemic of KPC-Kp by means of a multiplex PCR typing approach revealed that cpsBO-4 was the most prevalent type, being detected both in ST258 and ST512 strains with a countrywide distribution, while cps207-2 was only detected in ST258 strains with a more restricted distribution.

  12. Mining Bacterial Genomes for Secondary Metabolite Gene Clusters.

    Science.gov (United States)

    Adamek, Martina; Spohn, Marius; Stegmann, Evi; Ziemert, Nadine

    2017-01-01

    With the emergence of bacterial resistance against frequently used antibiotics, novel antibacterial compounds are urgently needed. Traditional bioactivity-guided drug discovery strategies involve laborious screening efforts and display high rediscovery rates. With the progress in next generation sequencing methods and the knowledge that the majority of antibiotics in clinical use are produced as secondary metabolites by bacteria, mining bacterial genomes for secondary metabolites with antimicrobial activity is a promising approach, which can guide a more time and cost-effective identification of novel compounds. However, what sounds easy to accomplish, comes with several challenges. To date, several tools for the prediction of secondary metabolite gene clusters are available, some of which are based on the detection of signature genes, while others are searching for specific patterns in gene content or regulation.Apart from the mere identification of gene clusters, several other factors such as determining cluster boundaries and assessing the novelty of the detected cluster are important. For this purpose, comparison of the predicted secondary metabolite genes with different cluster and compound databases is necessary. Furthermore, it is advisable to classify detected clusters into gene cluster families. So far, there is no standardized procedure for genome mining; however, different approaches to overcome all of these challenges exist and are addressed in this chapter. We give practical guidance on the workflow for secondary metabolite gene cluster identification, which includes the determination of gene cluster boundaries, addresses problems occurring with the use of draft genomes, and gives an outlook on the different methods for gene cluster classification. Based on comprehensible examples a protocol is set, which should enable the readers to mine their own genome data for interesting secondary metabolites.

  13. Genomic Analyses of Bacterial Porin-Cytochrome Gene Clusters

    Directory of Open Access Journals (Sweden)

    Liang eShi

    2014-11-01

    Full Text Available The porin-cytochrome (Pcc protein complex is responsible for trans-outer membrane electron transfer during extracellular reduction of Fe(III by the dissimilatory metal-reducing bacterium Geobacter sulfurreducens PCA. The identified and characterized Pcc complex of G. sulfurreducens PCA consists of a porin-like outer-membrane protein, a periplasmic 8-heme c-type cytochrome (c-Cyt and an outer-membrane 12-heme c-Cyt, and the genes encoding the Pcc proteins are clustered in the same regions of genome (i.e., the pcc gene clusters of G. sulfurreducens PCA. A survey of additionally microbial genomes has identified the pcc gene clusters in all sequenced Geobacter spp. and other bacteria from six different phyla, including Anaeromyxobacter dehalogenans 2CP-1, A. dehalogenans 2CP-C, Anaeromyxobacter sp. K, Candidatus Kuenenia stuttgartiensis, Denitrovibrio acetiphilus DSM 12809, Desulfurispirillum indicum S5, Desulfurivibrio alkaliphilus AHT2, Desulfurobacterium thermolithotrophum DSM 11699, Desulfuromonas acetoxidans DSM 684, Ignavibacterium album JCM 16511, and Thermovibrio ammonificans HB-1. The numbers of genes in the pcc gene clusters vary, ranging from two to nine. Similar to the metal-reducing (Mtr gene clusters of other Fe(III-reducing bacteria, such as Shewanella spp., additional genes that encode putative c-Cyts with predicted cellular localizations at the cytoplasmic membrane, periplasm and outer membrane often associate with the pcc gene clusters. This suggests that the Pcc-associated c-Cyts may be part of the pathways for extracellular electron transfer reactions. The presence of pcc gene clusters in the microorganisms that do not reduce solid-phase Fe(III and Mn(IV oxides, such as D. alkaliphilus AHT2 and I. album JCM 16511, also suggests that some of the pcc gene clusters may be involved in extracellular electron transfer reactions with the substrates other than Fe(III and Mn(IV oxides.

  14. Minimum Information about a Biosynthetic Gene cluster

    NARCIS (Netherlands)

    Medema, M.H.; Kottmann, Renzo; Yilmaz, Pelin; Cummings, Matthew; Biggins, J.B.; Blin, Kai; Bruijn, De Irene; Chooi, Yit Heng; Claesen, Jan; Coates, R.C.; Cruz-Morales, Pablo; Duddela, Srikanth; Düsterhus, Stephanie; Edwards, Daniel J.; Fewer, David P.; Garg, Neha; Geiger, Christoph; Gomez-Escribano, Juan Pablo; Greule, Anja; Hadjithomas, Michalis; Haines, Anthony S.; Helfrich, Eric J.N.; Hillwig, Matthew L.; Ishida, Keishi; Jones, Adam C.; Jones, Carla S.; Jungmann, Katrin; Kegler, Carsten; Kim, Hyun Uk; Kötter, Peter; Krug, Daniel; Masschelein, Joleen; Melnik, Alexey V.; Mantovani, Simone M.; Monroe, Emily A.; Moore, Marcus; Moss, Nathan; Nützmann, Hans Wilhelm; Pan, Guohui; Pati, Amrita; Petras, Daniel; Reen, F.J.; Rosconi, Federico; Rui, Zhe; Tian, Zhenhua; Tobias, Nicholas J.; Tsunematsu, Yuta; Wiemann, Philipp; Wyckoff, Elizabeth; Yan, Xiaohui; Yim, Grace; Yu, Fengan; Xie, Yunchang; Aigle, Bertrand; Apel, Alexander K.; Balibar, Carl J.; Balskus, Emily P.; Barona-Gómez, Francisco; Bechthold, Andreas; Bode, Helge B.; Borriss, Rainer; Brady, Sean F.; Brakhage, Axel A.; Caffrey, Patrick; Cheng, Yi Qiang; Clardy, Jon; Cox, Russell J.; Mot, De René; Donadio, Stefano; Donia, Mohamed S.; Donk, Van Der Wilfred A.; Dorrestein, Pieter C.; Doyle, Sean; Driessen, Arnold J.M.; Ehling-Schulz, Monika; Entian, Karl Dieter; Fischbach, Michael A.; Gerwick, Lena; Gerwick, William H.; Gross, Harald; Gust, Bertolt; Hertweck, Christian; Höfte, Monica; Jensen, Susan E.; Ju, Jianhua; Katz, Leonard; Kaysser, Leonard; Klassen, Jonathan L.; Keller, Nancy P.; Kormanec, Jan; Kuipers, Oscar P.; Kuzuyama, Tomohisa; Kyrpides, Nikos C.; Kwon, Hyung Jin; Lautru, Sylvie; Lavigne, Rob; Lee, Chia Y.; Linquan, Bai; Liu, Xinyu; Liu, Wen; Luzhetskyy, Andriy; Mahmud, Taifo; Mast, Yvonne; Méndez, Carmen; Metsä-Ketelä, Mikko; Micklefield, Jason; Mitchell, Douglas A.; Moore, Bradley S.; Moreira, Leonilde M.; Müller, Rolf; Neilan, Brett A.; Nett, Markus; Nielsen, Jens; O'Gara, Fergal; Oikawa, Hideaki; Osbourn, Anne; Osburne, Marcia S.; Ostash, Bohdan; Payne, Shelley M.; Pernodet, Jean Luc; Petricek, Miroslav; Piel, Jörn; Ploux, Olivier; Raaijmakers, Jos M.; Salas, José A.; Schmitt, Esther K.; Scott, Barry; Seipke, Ryan F.; Shen, Ben; Sherman, David H.; Sivonen, Kaarina; Smanski, Michael J.; Sosio, Margherita; Stegmann, Evi; Süssmuth, Roderich D.; Tahlan, Kapil; Thomas, Christopher M.; Tang, Yi; Truman, Andrew W.; Viaud, Muriel; Walton, Jonathan D.; Walsh, Christopher T.; Weber, Tilmann; Wezel, Van Gilles P.; Wilkinson, Barrie; Willey, Joanne M.; Wohlleben, Wolfgang; Wright, Gerard D.; Ziemert, Nadine; Zhang, Changsheng; Zotchev, Sergey B.; Breitling, Rainer; Takano, Eriko; Glöckner, Frank Oliver

    2015-01-01

    A wide variety of enzymatic pathways that produce specialized metabolites in bacteria, fungi and plants are known to be encoded in biosynthetic gene clusters. Information about these clusters, pathways and metabolites is currently dispersed throughout the literature, making it difficult to exploi

  15. Recurring cluster and operon assembly for Phenylacetate degradation genes

    Directory of Open Access Journals (Sweden)

    McInerney James O

    2009-02-01

    Full Text Available Abstract Background A large number of theories have been advanced to explain why genes involved in the same biochemical processes are often co-located in genomes. Most of these theories have been dismissed because empirical data do not match the expectations of the models. In this work we test the hypothesis that cluster formation is most likely due to a selective pressure to gradually co-localise protein products and that operon formation is not an inevitable conclusion of the process. Results We have selected an exemplar well-characterised biochemical pathway, the phenylacetate degradation pathway, and we show that its complex history is only compatible with a model where a selective advantage accrues from moving genes closer together. This selective pressure is likely to be reasonably weak and only twice in our dataset of 102 genomes do we see independent formation of a complete cluster containing all the catabolic genes in the pathway. Additionally, de novo clustering of genes clearly occurs repeatedly, even though recombination should result in the random dispersal of such genes in their respective genomes. Interspecies gene transfer has frequently replaced in situ copies of genes resulting in clusters that have similar content but very different evolutionary histories. Conclusion Our model for cluster formation in prokaryotes, therefore, consists of a two-stage selection process. The first stage is selection to move genes closer together, either because of macromolecular crowding, chromatin relaxation or transcriptional regulation pressure. This proximity opportunity sets up a separate selection for co-transcription.

  16. Analysis of the pmsCEAB Gene Cluster Involved in Biosynthesis of Salicylic Acid and the Siderophore Pseudomonine in the Biocontrol Strain Pseudomonas fluorescens WCS374

    NARCIS (Netherlands)

    Mercado-Blanco, J.; Drift, K.M.G.M. van der; Olsson, P.E.; Thomas-Oates, J.E.; Loon, L.C. van; Bakker, P.A.H.M.

    2001-01-01

    Mutants of Pseudomonas fluorescens WCS374 defective in biosynthesis of the fluorescent siderophore pseudobactin still display siderophore activity, indicating the production of a second siderophore. A recombinant cosmid clone (pMB374-07) of a WCS374 gene library harboring loci necessary for the bios

  17. Genes involved in cell division in mycoplasmas

    Directory of Open Access Journals (Sweden)

    Frank Alarcón

    2007-01-01

    Full Text Available Bacterial cell division has been studied mainly in model systems such as Escherichia coli and Bacillus subtilis, where it is described as a complex process with the participation of a group of proteins which assemble into a multiprotein complex called the septal ring. Mycoplasmas are cell wall-less bacteria presenting a reduced genome. Thus, it was important to compare their genomes to analyze putative genes involved in cell division processes. The division and cell wall (dcw cluster, which in E. coli and B. subtilis is composed of 16 and 17 genes, respectively, is represented by only three to four genes in mycoplasmas. Even the most conserved protein, FtsZ, is not present in all mycoplasma genomes analyzed so far. A model for the FtsZ protein from Mycoplasma hyopneumoniae and Mycoplasma synoviae has been constructed. The conserved residues, essential for GTP/GDP binding, are present in FtsZ from both species. A strong conservation of hydrophobic amino acid patterns is observed, and is probably necessary for the structural stability of the protein when active. M. synoviae FtsZ presents an extended amino acid sequence at the C-terminal portion of the protein, which may participate in interactions with other still unknown proteins crucial for the cell division process.

  18. Xylan degradation by the human gut Bacteroides xylanisolvens XB1A(T) involves two distinct gene clusters that are linked at the transcriptional level.

    Science.gov (United States)

    Despres, Jordane; Forano, Evelyne; Lepercq, Pascale; Comtet-Marre, Sophie; Jubelin, Gregory; Chambon, Christophe; Yeoman, Carl J; Berg Miller, Margaret E; Fields, Christopher J; Martens, Eric; Terrapon, Nicolas; Henrissat, Bernard; White, Bryan A; Mosoni, Pascale

    2016-05-04

    Plant cell wall (PCW) polysaccharides and especially xylans constitute an important part of human diet. Xylans are not degraded by human digestive enzymes in the upper digestive tract and therefore reach the colon where they are subjected to extensive degradation by some members of the symbiotic microbiota. Xylanolytic bacteria are the first degraders of these complex polysaccharides and they release breakdown products that can have beneficial effects on human health. In order to understand better how these bacteria metabolize xylans in the colon, this study was undertaken to investigate xylan breakdown by the prominent human gut symbiont Bacteroides xylanisolvens XB1A(T). Transcriptomic analyses of B. xylanisolvens XB1A(T) grown on insoluble oat-spelt xylan (OSX) at mid- and late-log phases highlighted genes in a polysaccharide utilization locus (PUL), hereafter called PUL 43, and genes in a fragmentary remnant of another PUL, hereafter referred to as rPUL 70, which were highly overexpressed on OSX relative to glucose. Proteomic analyses supported the up-regulation of several genes belonging to PUL 43 and showed the important over-production of a CBM4-containing GH10 endo-xylanase. We also show that PUL 43 is organized in two operons and that the knockout of the PUL 43 sensor/regulator HTCS gene blocked the growth of the mutant on insoluble OSX and soluble wheat arabinoxylan (WAX). The mutation not only repressed gene expression in the PUL 43 operons but also repressed gene expression in rPUL 70. This study shows that xylan degradation by B. xylanisolvens XB1A(T) is orchestrated by one PUL and one PUL remnant that are linked at the transcriptional level. Coupled to studies on other xylanolytic Bacteroides species, our data emphasize the importance of one peculiar CBM4-containing GH10 endo-xylanase in xylan breakdown and that this modular enzyme may be used as a functional marker of xylan degradation in the human gut. Our results also suggest that B. xylanisolvens

  19. Evolution of homeobox gene clusters in animals: the Giga-cluster and primary versus secondary clustering.

    Directory of Open Access Journals (Sweden)

    David Ellard Keith Ferrier

    2016-04-01

    Full Text Available The Hox gene cluster has been a major focus in evolutionary developmental biology. This is because of its key role in patterning animal development and widespread examples of changes in Hox genes being linked to the evolution of animal body plans and morphologies. Also, the distinctive organisation of the Hox genes into genomic clusters in which the order of the genes along the chromosome corresponds to the order of their activity along the embryo, or during a developmental process, has been a further source of great interest. This is known as Colinearity, and it provides a clear link between genome organisation and the regulation of genes during development, with distinctive changes marking evolutionary transitions. The Hox genes are not alone, however. The homeobox genes are a large super-class, of which the Hox genes are only a small subset, and an ever-increasing number of further gene clusters besides the Hox are being discovered. This is of great interest because of the potential for such gene clusters to help understand major evolutionary transitions, both in terms of changes to development and morphology as well as evolution of genome organisation. However, there is uncertainty in our understanding of homeobox gene cluster evolution at present. This relates to our still rudimentary understanding of the dynamics of genome rearrangements and evolution over the evolutionary timescales being considered when we compare lineages from across the animal kingdom. A major goal is to deduce whether particular instances of clustering are primary (conserved from ancient ancestral clusters or secondary (reassortment of genes into clusters in lineage-specific fashion. The following summary of the various instances of homeobox gene clusters in animals, and the hypotheses about their evolution, provides a framework for the future resolution of this uncertainty.

  20. Evolution of orthologous tandemly arrayed gene clusters

    Directory of Open Access Journals (Sweden)

    Bertrand Denis

    2011-10-01

    Full Text Available Abstract Background Tandemly Arrayed Gene (TAG clusters are groups of paralogous genes that are found adjacent on a chromosome. TAGs represent an important repertoire of genes in eukaryotes. In addition to tandem duplication events, TAG clusters are affected during their evolution by other mechanisms, such as inversion and deletion events, that affect the order and orientation of genes. The DILTAG algorithm developed in 1 makes it possible to infer a set of optimal evolutionary histories explaining the evolution of a single TAG cluster, from an ancestral single gene, through tandem duplications (simple or multiple, direct or inverted, deletions and inversion events. Results We present a general methodology, which is an extension of DILTAG, for the study of the evolutionary history of a set of orthologous TAG clusters in multiple species. In addition to the speciation events reflected by the phylogenetic tree of the considered species, the evolutionary events that are taken into account are simple or multiple tandem duplications, direct or inverted, simple or multiple deletions, and inversions. We analysed the performance of our algorithm on simulated data sets and we applied it to the protocadherin gene clusters of human, chimpanzee, mouse and rat. Conclusions Our results obtained on simulated data sets showed a good performance in inferring the total number and size distribution of duplication events. A limitation of the algorithm is however in dealing with multiple gene deletions, as the algorithm is highly exponential in this case, and becomes quickly intractable.

  1. Chicken rRNA Gene Cluster Structure.

    Directory of Open Access Journals (Sweden)

    Alexander G Dyomin

    Full Text Available Ribosomal RNA (rRNA genes, whose activity results in nucleolus formation, constitute an extremely important part of genome. Despite the extensive exploration into avian genomes, no complete description of avian rRNA gene primary structure has been offered so far. We publish a complete chicken rRNA gene cluster sequence here, including 5'ETS (1836 bp, 18S rRNA gene (1823 bp, ITS1 (2530 bp, 5.8S rRNA gene (157 bp, ITS2 (733 bp, 28S rRNA gene (4441 bp and 3'ETS (343 bp. The rRNA gene cluster sequence of 11863 bp was assembled from raw reads and deposited to GenBank under KT445934 accession number. The assembly was validated through in situ fluorescent hybridization analysis on chicken metaphase chromosomes using computed and synthesized specific probes, as well as through the reference assembly against de novo assembled rRNA gene cluster sequence using sequenced fragments of BAC-clone containing chicken NOR (nucleolus organizer region. The results have confirmed the chicken rRNA gene cluster validity.

  2. Pichia stipitis genomics, transcriptomics, and gene clusters

    Science.gov (United States)

    Thomas W. Jeffries; Jennifer R. Headman Van Vleet

    2009-01-01

    Genome sequencing and subsequent global gene expression studies have advanced our understanding of the lignocellulose-fermenting yeast Pichia stipitis. These studies have provided an insight into its central carbon metabolism, and analysis of its genome has revealed numerous functional gene clusters and tandem repeats. Specialized physiological traits are often the...

  3. Filtering Genes for Cluster and Network Analysis

    Directory of Open Access Journals (Sweden)

    Parkhomenko Elena

    2009-06-01

    Full Text Available Abstract Background Prior to cluster analysis or genetic network analysis it is customary to filter, or remove genes considered to be irrelevant from the set of genes to be analyzed. Often genes whose variation across samples is less than an arbitrary threshold value are deleted. This can improve interpretability and reduce bias. Results This paper introduces modular models for representing network structure in order to study the relative effects of different filtering methods. We show that cluster analysis and principal components are strongly affected by filtering. Filtering methods intended specifically for cluster and network analysis are introduced and compared by simulating modular networks with known statistical properties. To study more realistic situations, we analyze simulated "real" data based on well-characterized E. coli and S. cerevisiae regulatory networks. Conclusion The methods introduced apply very generally, to any similarity matrix describing gene expression. One of the proposed methods, SUMCOV, performed well for all models simulated.

  4. The Biosynthetic Gene Cluster for Andrastin A in Penicillium roqueforti

    Directory of Open Access Journals (Sweden)

    Juan F. Rojas-Aedo

    2017-05-01

    Full Text Available Penicillium roqueforti is a filamentous fungus involved in the ripening of several kinds of blue cheeses. In addition, this fungus produces several secondary metabolites, including the meroterpenoid compound andrastin A, a promising antitumoral compound. However, to date the genomic cluster responsible for the biosynthesis of this compound in P. roqueforti has not been described. In this work, we have sequenced and annotated a genomic region of approximately 29.4 kbp (named the adr gene cluster that is involved in the biosynthesis of andrastin A in P. roqueforti. This region contains ten genes, named adrA, adrC, adrD, adrE, adrF, adrG, adrH, adrI, adrJ and adrK. Interestingly, the adrB gene previously found in the adr cluster from P. chrysogenum, was found as a residual pseudogene in the adr cluster from P. roqueforti. RNA-mediated gene silencing of each of the ten genes resulted in significant reductions in andrastin A production, confirming that all of them are involved in the biosynthesis of this compound. Of particular interest was the adrC gene, encoding for a major facilitator superfamily transporter. According to our results, this gene is required for the production of andrastin A but does not have any role in its secretion to the extracellular medium. The identification of the adr cluster in P. roqueforti will be important to understand the molecular basis of the production of andrastin A, and for the obtainment of strains of P. roqueforti overproducing andrastin A that might be of interest for the cheese industry.

  5. Lateral transfer of the lux gene cluster.

    Science.gov (United States)

    Kasai, Sabu; Okada, Kazuhisa; Hoshino, Akinori; Iida, Tetsuya; Honda, Takeshi

    2007-02-01

    The lux operon is an uncommon gene cluster. To find the pathway through which the operon has been transferred, we sequenced the operon and both flanking regions in four typical luminous species. In Vibrio cholerae NCIMB 41, a five-gene cluster, most genes of which were highly similar to orthologues present in Gram-positive bacteria, along with the lux operon, is inserted between VC1560 and VC1563, on chromosome 1. Because this entire five-gene cluster is present in Photorhabdus luminescens TT01, about 1.5 Mbp upstream of the operon, we deduced that the operon and the gene cluster were transferred from V. cholerae to an ancestor of Pr. luminescens. Because in both V. fischeri and Shewanella hanedai, luxR and luxI were found just upstream of the operon, we concluded that the operon was transferred from either species to the other. Because most of the genes flanking the operon were highly similar to orthologues present on chromosome 2 of vibrios, we speculated that the operon of most species is located on this chromosome. The undigested genomic DNAs of five luminous species were analysed by pulsed-field gel electrophoresis and Southern hybridization. In all the species except V. cholerae, the operons are located on chromosome 2.

  6. Interpolation based consensus clustering for gene expression time series.

    Science.gov (United States)

    Chiu, Tai-Yu; Hsu, Ting-Chieh; Yen, Chia-Cheng; Wang, Jia-Shung

    2015-04-16

    Unsupervised analyses such as clustering are the essential tools required to interpret time-series expression data from microarrays. Several clustering algorithms have been developed to analyze gene expression data. Early methods such as k-means, hierarchical clustering, and self-organizing maps are popular for their simplicity. However, because of noise and uncertainty of measurement, these common algorithms have low accuracy. Moreover, because gene expression is a temporal process, the relationship between successive time points should be considered in the analyses. In addition, biological processes are generally continuous; therefore, the datasets collected from time series experiments are often found to have an insufficient number of data points and, as a result, compensation for missing data can also be an issue. An affinity propagation-based clustering algorithm for time-series gene expression data is proposed. The algorithm explores the relationship between genes using a sliding-window mechanism to extract a large number of features. In addition, the time-course datasets are resampled with spline interpolation to predict the unobserved values. Finally, a consensus process is applied to enhance the robustness of the method. Some real gene expression datasets were analyzed to demonstrate the accuracy and efficiency of the algorithm. The proposed algorithm has benefitted from the use of cubic B-splines interpolation, sliding-window, affinity propagation, gene relativity graph, and a consensus process, and, as a result, provides both appropriate and effective clustering of time-series gene expression data. The proposed method was tested with gene expression data from the Yeast galactose dataset, the Yeast cell-cycle dataset (Y5), and the Yeast sporulation dataset, and the results illustrated the relationships between the expressed genes, which may give some insights into the biological processes involved.

  7. Genome classification by gene distribution: An overlapping subspace clustering approach

    Directory of Open Access Journals (Sweden)

    Halgamuge Saman K

    2008-04-01

    Full Text Available Abstract Background Genomes of lower organisms have been observed with a large amount of horizontal gene transfers, which cause difficulties in their evolutionary study. Bacteriophage genomes are a typical example. One recent approach that addresses this problem is the unsupervised clustering of genomes based on gene order and genome position, which helps to reveal species relationships that may not be apparent from traditional phylogenetic methods. Results We propose the use of an overlapping subspace clustering algorithm for such genome classification problems. The advantage of subspace clustering over traditional clustering is that it can associate clusters with gene arrangement patterns, preserving genomic information in the clusters produced. Additionally, overlapping capability is desirable for the discovery of multiple conserved patterns within a single genome, such as those acquired from different species via horizontal gene transfers. The proposed method involves a novel strategy to vectorize genomes based on their gene distribution. A number of existing subspace clustering and biclustering algorithms were evaluated to identify the best framework upon which to develop our algorithm; we extended a generic subspace clustering algorithm called HARP to incorporate overlapping capability. The proposed algorithm was assessed and applied on bacteriophage genomes. The phage grouping results are consistent overall with the Phage Proteomic Tree and showed common genomic characteristics among the TP901-like, Sfi21-like and sk1-like phage groups. Among 441 phage genomes, we identified four significantly conserved distribution patterns structured by the terminase, portal, integrase, holin and lysin genes. We also observed a subgroup of Sfi21-like phages comprising a distinctive divergent genome organization and identified nine new phage members to the Sfi21-like genus: Staphylococcus 71, phiPVL108, Listeria A118, 2389, Lactobacillus phi AT3, A2

  8. Cluster Analysis of Gene Expression Data

    CERN Document Server

    Domany, E

    2002-01-01

    The expression levels of many thousands of genes can be measured simultaneously by DNA microarrays (chips). This novel experimental tool has revolutionized research in molecular biology and generated considerable excitement. A typical experiment uses a few tens of such chips, each dedicated to a single sample - such as tissue extracted from a particular tumor. The results of such an experiment contain several hundred thousand numbers, that come in the form of a table, of several thousand rows (one for each gene) and 50 - 100 columns (one for each sample). We developed a clustering methodology to mine such data. In this review I provide a very basic introduction to the subject, aimed at a physics audience with no prior knowledge of either gene expression or clustering methods. I explain what genes are, what is gene expression and how it is measured by DNA chips. Next I explain what is meant by "clustering" and how we analyze the massive amounts of data from such experiments, and present results obtained from a...

  9. An alanine tRNA gene cluster from Nephila clavipes.

    Science.gov (United States)

    Luciano, E; Candelas, G C

    1996-06-01

    We report the sequence of a 2.3-kb genomic DNA fragment from the orb-web spider, Nephila clavipes (Nc). The fragment contains four regions of high homology to tRNA(Ala). The members of this irregularly spaced cluster of genes are oriented in the same direction and have the same anticodon (GCA), but their sequence differs at several positions. Initiation and termination signals, as well as consensus intragenic promoter sequences characteristic of tRNA genes, have been identified in all genes. tRNA(Ala) are involved in the regulation of the fibroin synthesis in the large ampullate Nc glands.

  10. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    Energy Technology Data Exchange (ETDEWEB)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  11. Semi-supervised consensus clustering for gene expression data analysis

    OpenAIRE

    Wang, Yunli; Pan, Youlian

    2014-01-01

    Background Simple clustering methods such as hierarchical clustering and k-means are widely used for gene expression data analysis; but they are unable to deal with noise and high dimensionality associated with the microarray gene expression data. Consensus clustering appears to improve the robustness and quality of clustering results. Incorporating prior knowledge in clustering process (semi-supervised clustering) has been shown to improve the consistency between the data partitioning and do...

  12. Apolipoprotein gene involved in lipid metabolism

    Science.gov (United States)

    Rubin, Edward; Pennacchio, Len A.

    2007-07-03

    Methods and materials for studying the effects of a newly identified human gene, APOAV, and the corresponding mouse gene apoAV. The sequences of the genes are given, and transgenic animals which either contain the gene or have the endogenous gene knocked out are described. In addition, single nucleotide polymorphisms (SNPs) in the gene are described and characterized. It is demonstrated that certain SNPs are associated with diseases involving lipids and triglycerides and other metabolic diseases. These SNPs may be used alone or with SNPs from other genes to study individual risk factors. Methods for intervention in lipid diseases, including the screening of drugs to treat lipid-related or diabetic diseases are also disclosed.

  13. Gene Expression Data Knowledge Discovery using Global and Local Clustering

    CERN Document Server

    H, Swathi

    2010-01-01

    To understand complex biological systems, the research community has produced huge corpus of gene expression data. A large number of clustering approaches have been proposed for the analysis of gene expression data. However, extracting important biological knowledge is still harder. To address this task, clustering techniques are used. In this paper, hybrid Hierarchical k-Means algorithm is used for clustering and biclustering gene expression data is used. To discover both local and global clustering structure biclustering and clustering algorithms are utilized. A validation technique, Figure of Merit is used to determine the quality of clustering results. Appropriate knowledge is mined from the clusters by embedding a BLAST similarity search program into the clustering and biclustering process. To discover both local and global clustering structure biclustering and clustering algorithms are utilized. To determine the quality of clustering results, a validation technique, Figure of Merit is used. Appropriate ...

  14. Application of Multi-SOM clustering approach to macrophage gene expression analysis.

    Science.gov (United States)

    Ghouila, Amel; Yahia, Sadok Ben; Malouche, Dhafer; Jmel, Haifa; Laouini, Dhafer; Guerfali, Fatma Z; Abdelhak, Sonia

    2009-05-01

    The production of increasingly reliable and accessible gene expression data has stimulated the development of computational tools to interpret such data and to organize them efficiently. The clustering techniques are largely recognized as useful exploratory tools for gene expression data analysis. Genes that show similar expression patterns over a wide range of experimental conditions can be clustered together. This relies on the hypothesis that genes that belong to the same cluster are coregulated and involved in related functions. Nevertheless, clustering algorithms still show limits, particularly for the estimation of the number of clusters and the interpretation of hierarchical dendrogram, which may significantly influence the outputs of the analysis process. We propose here a multi level SOM based clustering algorithm named Multi-SOM. Through the use of clustering validity indices, Multi-SOM overcomes the problem of the estimation of clusters number. To test the validity of the proposed clustering algorithm, we first tested it on supervised training data sets. Results were evaluated by computing the number of misclassified samples. We have then used Multi-SOM for the analysis of macrophage gene expression data generated in vitro from the same individual blood infected with 5 different pathogens. This analysis led to the identification of sets of tightly coregulated genes across different pathogens. Gene Ontology tools were then used to estimate the biological significance of the clustering, which showed that the obtained clusters are coherent and biologically significant.

  15. Genes involved in translation of Mycoplasma hyopneumoniae and Mycoplasma synoviae

    Directory of Open Access Journals (Sweden)

    Mônica de Oliveira Santos

    2007-01-01

    Full Text Available This is a report on the analysis of genes involved in translation of the complete genomes of Mycoplasma hyopneumoniae strain J and 7448 and Mycoplasma synoviae. In both genomes 31 ORFs encoding large ribosomal subunit proteins and 19 ORFs encoding small ribosomal subunit proteins were found. Ten ribosomal protein gene clusters encoding 42 ribosomal proteins were found in M. synoviae, while 8 clusters encoding 39 ribosomal proteins were found in both M. hyopneumoniae strains. The L33 gene of the M. hyopneumoniae strain 7448 presented two copies in different locations. The genes encoding initiation factors (IF-1, IF-2 and IF-3, elongation factors (EF-G, EF-Tu, EF-Ts and EF-P, and the genes encoding the ribosome recycling factor (frr and one polypeptide release factor (prfA were present in the genomes of M. hyopneumoniae and M. synoviae. Nineteen aminoacyl-tRNA synthases had been previously identified in both mycoplasmas. In the two strains of M. hyopneumoniae, J and 7448, only one set of 5S, 16S and 23S rRNAs had been identified. Two sets of 16S and 23S rRNA genes and three sets of 5S rRNA genes had been identified in the M. synoviae genome.

  16. Arrangement of the Clostridium baratii F7 toxin gene cluster with identification of a σ factor that recognizes the botulinum toxin gene cluster promoters.

    Science.gov (United States)

    Dover, Nir; Barash, Jason R; Burke, Julianne N; Hill, Karen K; Detter, John C; Arnon, Stephen S

    2014-01-01

    Botulinum neurotoxin (BoNT) is the most poisonous substances known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bont gene that is part of a toxin gene cluster that includes several accessory genes. We sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. This TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.

  17. Genes involved in Drosophila glutamate receptor expression and localization

    Directory of Open Access Journals (Sweden)

    Featherstone David E

    2005-06-01

    Full Text Available Abstract Background A clear picture of the mechanisms controlling glutamate receptor expression, localization, and stability remains elusive, possibly due to an incomplete understanding of the proteins involved. We screened transposon mutants generated by the ongoing Drosophila Gene Disruption Project in an effort to identify the different types of genes required for glutamate receptor cluster development. Results To enrich for non-silent insertions with severe disruptions in glutamate receptor clustering, we identified and focused on homozygous lethal mutants in a collection of 2185 BG and KG transposon mutants generated by the BDGP Gene Disruption Project. 202 lethal mutant lines were individually dissected to expose glutamatergic neuromuscular junctions, stained using antibodies that recognize neuronal membrane and the glutamate receptor subunit GluRIIA, and viewed using laser-scanning confocal microscopy. We identified 57 mutants with qualitative differences in GluRIIA expression and/or localization. 84% of mutants showed loss of receptors and/or clusters; 16% of mutants showed an increase in receptors. Insertion loci encode a variety of protein types, including cytoskeleton proteins and regulators, kinases, phosphatases, ubiquitin ligases, mucins, cell adhesion proteins, transporters, proteins controlling gene expression and protein translation, and proteins of unknown/novel function. Expression pattern analyses and complementation tests, however, suggest that any single mutant – even if a mutant gene is uniquely tagged – must be interpreted with caution until the mutation is validated genetically and phenotypically. Conclusion Our study identified 57 transposon mutants with qualitative differences in glutamate receptor expression and localization. Despite transposon tagging of every insertion locus, extensive validation is needed before one can have confidence in the role of any individual gene. Alternatively, one can focus on the

  18. Comparative Analysis of Cluster Validity Indices in Identifying Some Possible Genes Mediating Certain Cancers.

    Science.gov (United States)

    Ghosh, Anupam; Dhara, Bibhas Chandra; De, Rajat K

    2013-04-01

    In this article, we compare the performance of 19 cluster validity indices, in identifying some possible genes mediating certain cancers, based on gene expression data. For the purpose of this comparison, we have developed a method. The proposed method involves cluster generation, selection of the best k-value or c-values, cluster identification, identifying the altered gene cluster, scoring an altered gene cluster and determining the best k-value or c-value exploring through biological repositories. The effectiveness of the method has been demonstrated on three gene expression data sets dealing with human lung cancer, colon cancer, and leukemia. Here, we have used three clustering algorithms, i.e., k-means, PAM and fuzzy c-means. We have used biochemical pathways related to these cancers and p-value statistics for validating the study. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. Gene duplication, modularity and adaptation in the evolution of the aflatoxin gene cluster

    Directory of Open Access Journals (Sweden)

    Jakobek Judy L

    2007-07-01

    Full Text Available Abstract Background The biosynthesis of aflatoxin (AF involves over 20 enzymatic reactions in a complex polyketide pathway that converts acetate and malonate to the intermediates sterigmatocystin (ST and O-methylsterigmatocystin (OMST, the respective penultimate and ultimate precursors of AF. Although these precursors are chemically and structurally very similar, their accumulation differs at the species level for Aspergilli. Notable examples are A. nidulans that synthesizes only ST, A. flavus that makes predominantly AF, and A. parasiticus that generally produces either AF or OMST. Whether these differences are important in the evolutionary/ecological processes of species adaptation and diversification is unknown. Equally unknown are the specific genomic mechanisms responsible for ordering and clustering of genes in the AF pathway of Aspergillus. Results To elucidate the mechanisms that have driven formation of these clusters, we performed systematic searches of aflatoxin cluster homologs across five Aspergillus genomes. We found a high level of gene duplication and identified seven modules consisting of highly correlated gene pairs (aflA/aflB, aflR/aflS, aflX/aflY, aflF/aflE, aflT/aflQ, aflC/aflW, and aflG/aflL. With the exception of A. nomius, contrasts of mean Ka/Ks values across all cluster genes showed significant differences in selective pressure between section Flavi and non-section Flavi species. A. nomius mean Ka/Ks values were more similar to partial clusters in A. fumigatus and A. terreus. Overall, mean Ka/Ks values were significantly higher for section Flavi than for non-section Flavi species. Conclusion Our results implicate several genomic mechanisms in the evolution of ST, OMST and AF cluster genes. Gene modules may arise from duplications of a single gene, whereby the function of the pre-duplication gene is retained in the copy (aflF/aflE or the copies may partition the ancestral function (aflA/aflB. In some gene modules, the

  20. Gene ordering in partitive clustering using microarray expressions.

    Science.gov (United States)

    Ray, Shubhra Sankar; Bandyopadhyay, Sanghamitra; Pal, Sankar K

    2007-08-01

    A central step in the analysis of gene expression data is the identification of groups of genes that exhibit similar expression patterns. Clustering and ordering the genes using gene expression data into homogeneous groups was shown to be useful in functional annotation, tissue classification, regulatory motif identification, and other applications. Although there is a rich literature on gene ordering in hierarchical clustering framework for gene expression analysis, there is no work addressing and evaluating the importance of gene ordering in partitive clustering framework, to the best knowledge of the authors. Outside the framework of hierarchical clustering, different gene ordering algorithms are applied on the whole data set, and the domain of partitive clustering is still unexplored with gene ordering approaches. A new hybrid method is proposed for ordering genes in each of the clusters obtained from partitive clustering solution, using microarray gene expressions.Two existing algorithms for optimally ordering cities in travelling salesman problem (TSP), namely, FRAG_GALK and Concorde, are hybridized individually with self organizing MAP to show the importance of gene ordering in partitive clustering framework. We validated our hybrid approach using yeast and fibroblast data and showed that our approach improves the result quality of partitive clustering solution, by identifying subclusters within big clusters, grouping functionally correlated genes within clusters, minimization of summation of gene expression distances, and the maximization of biological gene ordering using MIPS categorization. Moreover, the new hybrid approach, finds comparable or sometimes superior biological gene order in less computation time than those obtained by optimal leaf ordering in hierarchical clustering solution.

  1. Gene ordering in partitive clustering using microarray expressions

    Indian Academy of Sciences (India)

    Shubhra Sankar Ray; Sanghamitra Bandyopadhyay; Sankar K Pal

    2007-08-01

    A central step in the analysis of gene expression data is the identification of groups of genes that exhibit similar expression patterns. Clustering and ordering the genes using gene expression data into homogeneous groups was shown to be useful in functional annotation, tissue classification, regulatory motif identification, and other applications. Although there is a rich literature on gene ordering in hierarchical clustering framework for gene expression analysis, there is no work addressing and evaluating the importance of gene ordering in partitive clustering framework, to the best knowledge of the authors. Outside the framework of hierarchical clustering, different gene ordering algorithms are applied on the whole data set, and the domain of partitive clustering is still unexplored with gene ordering approaches. A new hybrid method is proposed for ordering genes in each of the clusters obtained from partitive clustering solution, using microarray gene expressions. Two existing algorithms for optimally ordering cities in travelling salesman problem (TSP), namely, FRAG_GALK and Concorde, are hybridized individually with self organizing MAP to show the importance of gene ordering in partitive clustering framework. We validated our hybrid approach using yeast and fibroblast data and showed that our approach improves the result quality of partitive clustering solution, by identifying subclusters within big clusters, grouping functionally correlated genes within clusters, minimization of summation of gene expression distances, and the maximization of biological gene ordering using MIPS categorization. Moreover, the new hybrid approach, finds comparable or sometimes superior biological gene order in less computation time than those obtained by optimal leaf ordering in hierarchical clustering solution.

  2. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    Science.gov (United States)

    2010-01-01

    Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involve decisions on how to; handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieves the highest adjusted Rand. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that background correction is

  3. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    Directory of Open Access Journals (Sweden)

    Landfors Mattias

    2010-10-01

    Full Text Available Abstract Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involve decisions on how to; handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered, missing value imputation (2, standardization of data (2, gene selection (19 or clustering method (11. The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieves the highest adjusted Rand. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that

  4. Transcriptional analysis of exopolysaccharides biosynthesis gene clusters in Lactobacillus plantarum.

    Science.gov (United States)

    Vastano, Valeria; Perrone, Filomena; Marasco, Rosangela; Sacco, Margherita; Muscariello, Lidia

    2016-04-01

    Exopolysaccharides (EPS) from lactic acid bacteria contribute to specific rheology and texture of fermented milk products and find applications also in non-dairy foods and in therapeutics. Recently, four clusters of genes (cps) associated with surface polysaccharide production have been identified in Lactobacillus plantarum WCFS1, a probiotic and food-associated lactobacillus. These clusters are involved in cell surface architecture and probably in release and/or exposure of immunomodulating bacterial molecules. Here we show a transcriptional analysis of these clusters. Indeed, RT-PCR experiments revealed that the cps loci are organized in five operons. Moreover, by reverse transcription-qPCR analysis performed on L. plantarum WCFS1 (wild type) and WCFS1-2 (ΔccpA), we demonstrated that expression of three cps clusters is under the control of the global regulator CcpA. These results, together with the identification of putative CcpA target sequences (catabolite responsive element CRE) in the regulatory region of four out of five transcriptional units, strongly suggest for the first time a role of the master regulator CcpA in EPS gene transcription among lactobacilli.

  5. The biosynthetic gene cluster for the beta-lactam carbapenem thienamycin in Streptomyces cattleya.

    Science.gov (United States)

    Núñez, Luz Elena; Méndez, Carmen; Braña, Alfredo F; Blanco, Gloria; Salas, José A

    2003-04-01

    beta-lactam ring formation in carbapenem and clavam biosynthesis proceeds through an alternative mechanism to the biosynthetic pathway of classic beta-lactam antibiotics. This involves the participation of a beta-lactam synthetase. Using available information from beta-lactam synthetases, we generated a probe for the isolation of the thienamycin cluster from Streptomyces cattleya. Genes homologous to carbapenem and clavulanic acid biosynthetic genes have been identified. They would participate in early steps of thienamycin biosynthesis leading to the formation of the beta-lactam ring. Other genes necessary for the biosynthesis of thienamycin have also been identified in the cluster (methyltransferases, cysteinyl transferases, oxidoreductases, hydroxylase, etc.) together with two regulatory genes, genes involved in exportation and/or resistance, and a quorum sensing system. Involvement of the cluster in thienamycin biosynthesis was demonstrated by insertional inactivation of several genes generating thienamycin nonproducing mutants.

  6. Loss of Bloom syndrome protein destabilizes human gene cluster architecture.

    Science.gov (United States)

    Killen, Michael W; Stults, Dawn M; Adachi, Noritaka; Hanakahi, Les; Pierce, Andrew J

    2009-09-15

    Bloom syndrome confers strong predisposition to malignancy in multiple tissue types. The Bloom syndrome patient (BLM) protein defective in the disease biochemically functions as a Holliday junction dissolvase and human cells lacking functional BLM show 10-fold elevated rates of sister chromatid exchange. Collectively, these phenomena suggest that dysregulated mitotic recombination drives the genomic instability underpinning the development of cancer in these individuals. Here we use physical analysis of the highly repeated, highly self-similar human ribosomal RNA gene clusters as sentinel biomarkers for dysregulated homologous recombination to demonstrate that loss of BLM protein function causes a striking increase in spontaneous molecular level genomic restructuring. Analysis of single-cell derived sub-clonal populations from wild-type human cell lines shows that gene cluster architecture is ordinarily very faithfully preserved under mitosis, but is so unstable in cell lines derived from BLMs as to make gene cluster architecture in different sub-clonal populations essentially unrecognizable one from another. Human cells defective in a different RecQ helicase, the WRN protein involved in the premature aging Werner syndrome, do not exhibit the gene cluster instability (GCI) phenotype, indicating that the BLM protein specifically, rather than RecQ helicases generally, holds back this recombination-mediated genomic instability. An ataxia-telangiectasia defective cell line also shows elevated rDNA GCI, although not to the extent of BLM defective cells. Genomic restructuring mediated by dysregulated recombination between the abundant low-copy repeats in the human genome may prove to be an important additional mechanism of genomic instability driving the initiation and progression of human cancer.

  7. The urease gene cluster of Vibrio parahaemolyticus does not influence the expression of the thermostable direct hemolysin (TDH) gene or the TDH-related hemolysin gene.

    Science.gov (United States)

    Nakaguchi, Yoshitsugu; Okuda, Jun; Iida, Tetsuya; Nishibuchi, Mitsuaki

    2003-01-01

    In order to investigate why the thermostable direct hemolysin (TDH) and the TDH-related hemolysin (TRH) of Vibrio parahaemolyticus are produced at low levels from urease-positive strains, the effect of the functional urease gene cluster of V. parahaemolyticus on the expression of the tdh and trh genes was examined. Transcriptional lacZ fusions with the tdh1, tdh2, trh1 and trh2 genes representing variants of the tdh and trh genes were integrated into the chromosome of an Escherichia coli strain and a urease-negative V. parahaemolyticus strain. The plasmid-borne urease gene cluster introduced and expressed in these constructs did not affect expression of any of the fusion genes. The amount of TDH produced from a Kanagawa phenomenon-positive V. parahaemolyticus did not change by introduction of the urease gene cluster either. It was concluded therefore that the urease gene cluster is not involved in the regulation of tdh and trh expression.

  8. Bioinformatics Prediction of Polyketide Synthase Gene Clusters from Mycosphaerella fijiensis

    OpenAIRE

    Noar, Roslyn D.; Daub, Margaret E.

    2016-01-01

    Mycosphaerella fijiensis, causal agent of black Sigatoka disease of banana, is a Dothideomycete fungus closely related to fungi that produce polyketides important for plant pathogenicity. We utilized the M. fijiensis genome sequence to predict PKS genes and their gene clusters and make bioinformatics predictions about the types of compounds produced by these clusters. Eight PKS gene clusters were identified in the M. fijiensis genome, placing M. fijiensis into the 23rd percentile for the numb...

  9. Genetic characteristics of vancomycin resistance gene cluster in Enterococcus spp.

    Science.gov (United States)

    Chunhui, Chen; Xiaogang, Xu

    2015-05-01

    Vancomycin resistant enterococci has become an important nosocomial pathogen since it is discovered in late 1980s. The products, encoded by vancomycin resistant gene cluster in enterococci, catalyze the synthesis of peptidoglycan precursors with low affinity with glycopeptide antibiotics including vancomycin and teicoplanin and lead to resistance. These vancomycin resistant gene clusters are classified into nine types according to their gene sequences and organization, or D-Ala:D-Lac (VanA, VanB, VanD and VanM) and D-Ala:D-Ser (VanC, VanE, VanG, VanL and VanN) ligase gene clusters based on the differences of their encoded ligases. Moreover, these gene clusters are characterized by their different resistance levels and infection models. In this review, we summarize the classification, gene organization and infection model of vancomycin resistant gene cluster in Enterococcus spp.

  10. ROUGH SET BASED CLUSTERING OF GENE EXPRESSION DATA: A SURVEY

    Directory of Open Access Journals (Sweden)

    J.JEBA EMILYN

    2010-12-01

    Full Text Available Microarray technology has now made it possible to simultaneously monitor the expression levels of thousands of genes during important biological processes and across collections of related samples. But the high dimensionality property of gene expression data makes it difficult to be analyzed. Lot of clustering algorithms are available for clustering. In this paper we first briefly introduce the concepts of microarray technology and discuss the basic elements of clustering on gene expression data. Then we introduce rough clustering and itsadvantage over strict and fuzzy clustering is explored. We also explain why rough clustering is preferred over other conventional methods by presenting a survey on few clustering algorithms based on rough set theory for gene expression data. We conclude by stating that this area proves to be potential research field for the researchcommunity.

  11. Diversity and evolution of MicroRNA gene clusters

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    microRNA(miRNA) gene clusters are a group of miRNA genes clustered within a proximal distance on a chromosome.Although a large number of miRNA clusters have been uncovered in animal and plant genomes,the functional consequences of this arrangement are still poorly understood.Located in a polycistron,the coexpressed miRNA clusters are pivotal in coordinately regulating multiple processes,including embryonic development,cell cycles and cell differentiation.In this review,based on recent progress,we discuss the genomic diversity of miRNA gene clusters,the coordination of expression and function of the clustered miRNAs,and the evolutionarily adaptive processes with gain and loss of the clustering miRNA genes mediated by duplication and transposition events.

  12. Diversity and evolution of MicroRNA gene clusters

    Institute of Scientific and Technical Information of China (English)

    ZHANG YanFeng; ZHANG Rui; SU Bing

    2009-01-01

    microRNA (miRNA) gene clusters are a group of miRNA genes clustered within a proximal distance on a chromosome. Although a large number of miRNA clusters have been uncovered in animal and plant genomes, the functional consequences of this arrangement are still poorly understood. Located in a polycistron, the coexpressed miRNA clusters are pivotal in coordinately regulating multiple processes, including embryonic development, cell cycles and cell differentiation. In this review, based on recent progress, we discuss the genomic diversity of miRNA gene clusters, the coordination of expression and function of the clustered miRNAs, and the evolutionarily adaptive processes with gain and loss of the clustering miRNA genes mediated by duplication and transposition events.

  13. Genes for iron-sulphur cluster assembly are targets of abiotic stress in rice, Oryza sativa.

    Science.gov (United States)

    Liang, Xuejiao; Qin, Lu; Liu, Peiwei; Wang, Meihuan; Ye, Hong

    2014-03-01

    Iron-sulphur (Fe-S) cluster assembly occurs in chloroplasts, mitochondria and cytosol, involving dozens of genes in higher plants. In this study, we have identified 41 putative Fe-S cluster assembly genes in rice (Oryza sativa) genome, and the expression of all genes was verified. To investigate the role of Fe-S cluster assembly as a metabolic pathway, we applied abiotic stresses to rice seedlings and analysed Fe-S cluster assembly gene expression by qRT-PCR. Our data showed that genes for Fe-S cluster assembly in chloroplasts of leaves are particularly sensitive to heavy metal treatments, and that Fe-S cluster assembly genes in roots were up-regulated in response to iron toxicity, oxidative stress and some heavy metal assault. The effect of each stress treatment on the Fe-S cluster assembly machinery demonstrated an unexpected tissue or organelle specificity, suggesting that the physiological relevance of the Fe-S cluster assembly is more complex than thought. Furthermore, our results may reveal potential candidate genes for molecular breeding of rice.

  14. Computing gene expression data with a knowledge-based gene clustering approach.

    Science.gov (United States)

    Rosa, Bruce A; Oh, Sookyung; Montgomery, Beronda L; Chen, Jin; Qin, Wensheng

    2010-01-01

    Computational analysis methods for gene expression data gathered in microarray experiments can be used to identify the functions of previously unstudied genes. While obtaining the expression data is not a difficult task, interpreting and extracting the information from the datasets is challenging. In this study, a knowledge-based approach which identifies and saves important functional genes before filtering based on variability and fold change differences was utilized to study light regulation. Two clustering methods were used to cluster the filtered datasets, and clusters containing a key light regulatory gene were located. The common genes to both of these clusters were identified, and the genes in the common cluster were ranked based on their coexpression to the key gene. This process was repeated for 11 key genes in 3 treatment combinations. The initial filtering method reduced the dataset size from 22,814 probes to an average of 1134 genes, and the resulting common cluster lists contained an average of only 14 genes. These common cluster lists scored higher gene enrichment scores than two individual clustering methods. In addition, the filtering method increased the proportion of light responsive genes in the dataset from 1.8% to 15.2%, and the cluster lists increased this proportion to 18.4%. The relatively short length of these common cluster lists compared to gene groups generated through typical clustering methods or coexpression networks narrows the search for novel functional genes while increasing the likelihood that they are biologically relevant.

  15. Detecting Sequence Homology at the Gene Cluster Level with MultiGeneBlast

    NARCIS (Netherlands)

    Medema, Marnix H.; Takano, Eriko; Breitling, Rainer; Nowick, Katja

    2013-01-01

    The genes encoding many biomolecular systems and pathways are genomically organized in operons or gene clusters. With MultiGeneBlast, we provide a user-friendly and effective tool to perform homology searches with operons or gene clusters as basic units, instead of single genes. The contextualizatio

  16. Detecting Sequence Homology at the Gene Cluster Level with MultiGeneBlast

    NARCIS (Netherlands)

    Medema, Marnix H.; Takano, Eriko; Breitling, Rainer; Nowick, Katja

    The genes encoding many biomolecular systems and pathways are genomically organized in operons or gene clusters. With MultiGeneBlast, we provide a user-friendly and effective tool to perform homology searches with operons or gene clusters as basic units, instead of single genes. The

  17. Sequencing, characterization, and gene expression analysis of the histidine decarboxylase gene cluster of Morganella morganii.

    Science.gov (United States)

    Ferrario, Chiara; Borgo, Francesca; de Las Rivas, Blanca; Muñoz, Rosario; Ricci, Giovanni; Fortina, Maria Grazia

    2014-03-01

    The histidine decarboxylase gene cluster of Morganella morganii DSM30146(T) was sequenced, and four open reading frames, named hdcT1, hdc, hdcT2, and hisRS were identified. Two putative histidine/histamine antiporters (hdcT1 and hdcT2) were located upstream and downstream the hdc gene, codifying a pyridoxal-P dependent histidine decarboxylase, and followed by hisRS gene encoding a histidyl-tRNA synthetase. This organization was comparable with the gene cluster of other known Gram negative bacteria, particularly with that of Klebsiella oxytoca. Recombinant Escherichia coli strains harboring plasmids carrying the M. morganii hdc gene were shown to overproduce histidine decarboxylase, after IPTG induction at 37 °C for 4 h. Quantitative RT-PCR experiments revealed the hdc and hisRS genes were highly induced under acidic and histidine-rich conditions. This work represents the first description and identification of the hdc-related genes in M. morganii. Results support the hypothesis that the histidine decarboxylation reaction in this prolific histamine producing species may play a role in acid survival. The knowledge of the role and the regulation of genes involved in histidine decarboxylation should improve the design of rational strategies to avoid toxic histamine production in foods.

  18. A Nomadic Subtelomeric Disease Resistance Gene Cluster in Common Bean

    Science.gov (United States)

    The B4 resistance (R)-gene cluster, located in subtelomeric region of chromosome 4, is one of the largest clusters known in common bean (Phaseolus vulgaris, Pv). We sequenced 650 kb spanning this locus and annotated 97 genes, 26 of which correspond to Coiled-coil-Nucleotide-Binding-Site-Leucine-Rich...

  19. Simultaneous clustering of multiple gene expression and physical interaction datasets.

    Directory of Open Access Journals (Sweden)

    Manikandan Narayanan

    2010-04-01

    Full Text Available Many genome-wide datasets are routinely generated to study different aspects of biological systems, but integrating them to obtain a coherent view of the underlying biology remains a challenge. We propose simultaneous clustering of multiple networks as a framework to integrate large-scale datasets on the interactions among and activities of cellular components. Specifically, we develop an algorithm JointCluster that finds sets of genes that cluster well in multiple networks of interest, such as coexpression networks summarizing correlations among the expression profiles of genes and physical networks describing protein-protein and protein-DNA interactions among genes or gene-products. Our algorithm provides an efficient solution to a well-defined problem of jointly clustering networks, using techniques that permit certain theoretical guarantees on the quality of the detected clustering relative to the optimal clustering. These guarantees coupled with an effective scaling heuristic and the flexibility to handle multiple heterogeneous networks make our method JointCluster an advance over earlier approaches. Simulation results showed JointCluster to be more robust than alternate methods in recovering clusters implanted in networks with high false positive rates. In systematic evaluation of JointCluster and some earlier approaches for combined analysis of the yeast physical network and two gene expression datasets under glucose and ethanol growth conditions, JointCluster discovers clusters that are more consistently enriched for various reference classes capturing different aspects of yeast biology or yield better coverage of the analysed genes. These robust clusters, which are supported across multiple genomic datasets and diverse reference classes, agree with known biology of yeast under these growth conditions, elucidate the genetic control of coordinated transcription, and enable functional predictions for a number of uncharacterized genes.

  20. Super-paramagnetic clustering of yeast gene expression profiles

    CERN Document Server

    Getz, G; Domany, E; Zhang, M Q

    2000-01-01

    High-density DNA arrays, used to monitor gene expression at a genomic scale, have produced vast amounts of information which require the development of efficient computational methods to analyze them. The important first step is to extract the fundamental patterns of gene expression inherent in the data. This paper describes the application of a novel clustering algorithm, Super-Paramagnetic Clustering (SPC) to analysis of gene expression profiles that were generated recently during a study of the yeast cell cycle. SPC was used to organize genes into biologically relevant clusters that are suggestive for their co-regulation. Some of the advantages of SPC are its robustness against noise and initialization, a clear signature of cluster formation and splitting, and an unsupervised self-organized determination of the number of clusters at each resolution. Our analysis revealed interesting correlated behavior of several groups of genes which has not been previously identified.

  1. Super-paramagnetic clustering of yeast gene expression profiles

    Science.gov (United States)

    Getz, G.; Levine, E.; Domany, E.; Zhang, M. Q.

    2000-04-01

    High-density DNA arrays, used to monitor gene expression at a genomic scale, have produced vast amounts of information which require the development of efficient computational methods to analyze them. The important first step is to extract the fundamental patterns of gene expression inherent in the data. This paper describes the application of a novel clustering algorithm, super-paramagnetic clustering (SPC) to analysis of gene expression profiles that were generated recently during a study of the yeast cell cycle. SPC was used to organize genes into biologically relevant clusters that are suggestive for their co-regulation. Some of the advantages of SPC are its robustness against noise and initialization, a clear signature of cluster formation and splitting, and an unsupervised self-organized determination of the number of clusters at each resolution. Our analysis revealed interesting correlated behavior of several groups of genes which has not been previously identified.

  2. Improvement of gougerotin and nikkomycin production by engineering their biosynthetic gene clusters.

    Science.gov (United States)

    Du, Deyao; Zhu, Yu; Wei, Junhong; Tian, Yuqing; Niu, Guoqing; Tan, Huarong

    2013-07-01

    Nikkomycins and gougerotin are peptidyl nucleoside antibiotics with broad biological activities. The nikkomycin biosynthetic gene cluster comprises one pathway-specific regulatory gene (sanG) and 21 structural genes, whereas the gene cluster for gougerotin biosynthesis includes one putative regulatory gene, one major facilitator superfamily transporter gene, and 13 structural genes. In the present study, we introduced sanG driven by six different promoters into Streptomyces ansochromogenes TH322. Nikkomycin production was increased significantly with the highest increase in engineered strain harboring hrdB promoter-driven sanG. In the meantime, we replaced the native promoter of key structural genes in the gougerotin (gou) gene cluster with the hrdB promoters. The heterologous producer Streptomyces coelicolor M1146 harboring the modified gene cluster produced gougerotin up to 10-fold more than strains carrying the unmodified cluster. Therefore, genetic manipulations of genes involved in antibiotics biosynthesis with the constitutive hrdB promoter present a robust, easy-to-use system generally useful for the improvement of antibiotics production in Streptomyces.

  3. Many nonuniversal archaeal ribosomal proteins are found in conserved gene clusters

    Directory of Open Access Journals (Sweden)

    Jiachen Wang

    2009-01-01

    Full Text Available The genomic associations of the archaeal ribosomal proteins, (r-proteins, were examined in detail. The archaeal versions of the universal r-protein genes are typically in clusters similar or identical and to those found in bacteria. Of the 35 nonuniversal archaeal r-protein genes examined, the gene encoding L18e was found to be associated with the conserved L13 cluster, whereas the genes for S4e, L32e and L19e were found in the archaeal version of the spc operon. Eleven nonuniversal protein genes were not associated with any common genomic context. Of the remaining 19 protein genes, 17 were convincingly assigned to one of 10 previously unrecognized gene clusters. Examination of the gene content of these clusters revealed multiple associations with genes involved in the initiation of protein synthesis, transcription or other cellular processes. The lack of such associations in the universal clusters suggests that initially the ribosome evolved largely independently of other processes. More recently it likely has evolved in concert with other cellular systems. It was also verified that a second copy of the gene encoding L7ae found in some bacteria is actually a homolog of the gene encoding L30e and should be annotated as such.

  4. Degeneration of aflatoxin gene cluster in Aspergillus flavus from Africa and North America

    Science.gov (United States)

    Aspergillus flavus is the primary causal agent of food and feed contamination with the toxic fungal metabolites aflatoxins. Aflatoxin-producing potential of A. flavus is known to vary among isolates. The genes involved in aflatoxin biosynthesis are clustered together and the order of genes within th...

  5. Regulation of transcription of cell division genes in the Escherichia coli dcw cluster.

    Science.gov (United States)

    Vicente, M; Gomez, M J; Ayala, J A

    1998-04-01

    The Escherichia coli dcw cluster contains cell division genes, such as the phylogenetically ubiquitous ftsZ, and genes involved in peptidoglycan synthesis. Transcription in the cluster proceeds in the same direction as the progress of the replication fork along the chromosome. Regulation is exerted at the transcriptional and post-transcriptional levels. The absence of transcriptional termination signals may, in principle, allow extension of the transcripts initiated at the up-stream promoter (mraZ1p) even to the furthest down-stream gene (envA). Complementation tests suggest that they extend into ftsW in the central part of the cluster. In addition, the cluster contains other promoters individually regulated by cis- and trans-acting signals. Dissociation of the expression of the ftsZ gene, located after ftsQ and A near the 3' end of the cluster, from its natural regulatory signals leads to an alteration in the physiology of cell division. The complexities observed in the regulation of gene expression in the cluster may then have an important biological role. Among them, LexA-binding SOS boxes have been found at the 5' end of the cluster, preceding promoters which direct the expression of ftsI (coding for PBP3, the penicillin-binding protein involved in septum formation). A gearbox promoter, ftsQ1p, forms part of the signals regulating the transcription of ftsQ, A and Z. It is an inversely growth-dependent mechanism driven by RNA polymerase containing sigma s, the factor involved in the expression of stationary phase-specific genes. Although the dcw cluster is conserved to a different extent in a variety of bacteria, the regulation of gene expression, the presence or absence of individual genes, and even the essentiality of some of them, show variations in the phylogenetic scale which may reflect adaptation to specific life cycles.

  6. Prospecting sugarcane genes involved in aluminum tolerance

    Directory of Open Access Journals (Sweden)

    Rodrigo D. Drummond

    2001-12-01

    Full Text Available Aluminum is one of the major factors that affect plant development in acid soils, causing a substantial reduction in yield in many crops. In South America, about 66% of the land surface is made up of acid soils where high aluminum saturation is one of the main limiting factors for agriculture. The biochemical and molecular basis of aluminum tolerance in plants is far from being completely understood despite a growing number of studies, and in the specific case of sugarcane there are virtually no reports on the effects of gene regulation on aluminum stress. The objective of the work presented in this paper was to prospect the sugarcane expressed sequence tag (SUCEST data bank for sugarcane genes related to several biochemical pathways known to be involved in the responses to aluminum toxicity in other plant species and yeast. Sugarcane genes similar to most of these genes were found, including those coding for enzymes that alleviate oxidative stress or combat infection by pathogens and those which code for proteins responsible for the release of organic acids and signal transducers. The role of these genes in aluminum tolerance mechanisms is reviewed. Due to the high level of genomic conservation in related grasses such as maize, barley, sorghum and sugarcane, these genes may be valuable tools which will help us to better understand and to manipulate aluminum tolerance in these species.Alumínio (Al é um dos principais fatores que afetam o desenvolvimento de plantas em solos ácidos, reduzindo substancialmente a produtividade agrícola. Na América do Sul, cerca de 66% da superfície do solo apresenta acidez, onde a alta saturação de alumínio é uma das maiores limitações à prática agrícola. Apesar do crescente número de estudos, uma compreensão completa das bases bioquímicas e moleculares da tolerância ao alumínio em plantas está longe de ser alcançada. No caso da cana-de-açúcar, não há nada publicado sobre a regulação g

  7. Bioinformatics Prediction of Polyketide Synthase Gene Clusters from Mycosphaerella fijiensis.

    Directory of Open Access Journals (Sweden)

    Roslyn D Noar

    Full Text Available Mycosphaerella fijiensis, causal agent of black Sigatoka disease of banana, is a Dothideomycete fungus closely related to fungi that produce polyketides important for plant pathogenicity. We utilized the M. fijiensis genome sequence to predict PKS genes and their gene clusters and make bioinformatics predictions about the types of compounds produced by these clusters. Eight PKS gene clusters were identified in the M. fijiensis genome, placing M. fijiensis into the 23rd percentile for the number of PKS genes compared to other Dothideomycetes. Analysis of the PKS domains identified three of the PKS enzymes as non-reducing and two as highly reducing. Gene clusters contained types of genes frequently found in PKS clusters including genes encoding transporters, oxidoreductases, methyltransferases, and non-ribosomal peptide synthases. Phylogenetic analysis identified a putative PKS cluster encoding melanin biosynthesis. None of the other clusters were closely aligned with genes encoding known polyketides, however three of the PKS genes fell into clades with clusters encoding alternapyrone, fumonisin, and solanapyrone produced by Alternaria and Fusarium species. A search for homologs among available genomic sequences from 103 Dothideomycetes identified close homologs (>80% similarity for six of the PKS sequences. One of the PKS sequences was not similar (< 60% similarity to sequences in any of the 103 genomes, suggesting that it encodes a unique compound. Comparison of the M. fijiensis PKS sequences with those of two other banana pathogens, M. musicola and M. eumusae, showed that these two species have close homologs to five of the M. fijiensis PKS sequences, but three others were not found in either species. RT-PCR and RNA-Seq analysis showed that the melanin PKS cluster was down-regulated in infected banana as compared to growth in culture. Three other clusters, however were strongly upregulated during disease development in banana, suggesting that

  8. Bioinformatics Prediction of Polyketide Synthase Gene Clusters from Mycosphaerella fijiensis.

    Science.gov (United States)

    Noar, Roslyn D; Daub, Margaret E

    2016-01-01

    Mycosphaerella fijiensis, causal agent of black Sigatoka disease of banana, is a Dothideomycete fungus closely related to fungi that produce polyketides important for plant pathogenicity. We utilized the M. fijiensis genome sequence to predict PKS genes and their gene clusters and make bioinformatics predictions about the types of compounds produced by these clusters. Eight PKS gene clusters were identified in the M. fijiensis genome, placing M. fijiensis into the 23rd percentile for the number of PKS genes compared to other Dothideomycetes. Analysis of the PKS domains identified three of the PKS enzymes as non-reducing and two as highly reducing. Gene clusters contained types of genes frequently found in PKS clusters including genes encoding transporters, oxidoreductases, methyltransferases, and non-ribosomal peptide synthases. Phylogenetic analysis identified a putative PKS cluster encoding melanin biosynthesis. None of the other clusters were closely aligned with genes encoding known polyketides, however three of the PKS genes fell into clades with clusters encoding alternapyrone, fumonisin, and solanapyrone produced by Alternaria and Fusarium species. A search for homologs among available genomic sequences from 103 Dothideomycetes identified close homologs (>80% similarity) for six of the PKS sequences. One of the PKS sequences was not similar (< 60% similarity) to sequences in any of the 103 genomes, suggesting that it encodes a unique compound. Comparison of the M. fijiensis PKS sequences with those of two other banana pathogens, M. musicola and M. eumusae, showed that these two species have close homologs to five of the M. fijiensis PKS sequences, but three others were not found in either species. RT-PCR and RNA-Seq analysis showed that the melanin PKS cluster was down-regulated in infected banana as compared to growth in culture. Three other clusters, however were strongly upregulated during disease development in banana, suggesting that they may encode

  9. Genes and translocations involved in POF.

    Science.gov (United States)

    Schlessinger, David; Herrera, Luisa; Crisponi, Laura; Mumm, Steven; Percesepe, Antonio; Pellegrini, Massimo; Pilia, Giuseppe; Forabosco, Antonino

    2002-08-15

    Changes at a single autosomal locus and many X-linked loci have been implicated in women with gonadal dysgenesis [premature ovarian failure (POF) with deficits in ovarian follicles]. For the chromosome 3 locus, a forkhead transcription factor gene (FOXL2) has been identified, in which lesions result in decreased follicles by haploinsufficiency. In contrast, sporadic X; autosomal translocations are distributed at many points on the X, but concentrate in a critical region on Xq. The association of the breakpoints with genes involved in ovarian function is thus far weak (in four analyzed cases) and has not been related to pathology in other POF patients. While many more translocations can be analyzed in detail as the human genome sequence is refined, it remains possible that translocations like X monosomy (Turner syndrome) lead to POF not by interrupting specific genes important in ovarian development, but by causing aberrations in pairing or X-inactivation during folliculogenesis. It is noted that the critical region has unusual features, neighboring the X-inactivation center and including an 18 Mb region of very low recombination. These suggest that chromosome dynamics in the region may be sensitive to structural changes, and when modified by translocations might provoke apoptosis at meiotic checkpoints. Choices among models for the etiology of POF should be feasible based on studies of ovarian follicle development and attrition in mouse models. Studies would prominently include gene expression profiling of developmental-specific pathways in nascent ovaries with controlled levels of Foxl2 and interacting proteins, or with defined changes in the X chromosome.

  10. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    Science.gov (United States)

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of EOperon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system.

  11. The B-type lamin is required for somatic repression of testis-specific gene clusters

    Science.gov (United States)

    Shevelyov, Y. Y.; Lavrov, S. A.; Mikhaylova, L. M.; Nurminsky, I. D.; Kulathinal, R. J.; Egorova, K. S.; Rozovsky, Y. M.; Nurminsky, D. I.

    2009-01-01

    Large clusters of coexpressed tissue-specific genes are abundant on chromosomes of diverse species. The genes coordinately misexpressed in diverse diseases are also found in similar clusters, suggesting that evolutionarily conserved mechanisms regulate expression of large multigenic regions both in normal development and in its pathological disruptions. Studies on individual loci suggest that silent clusters of coregulated genes are embedded in repressed chromatin domains, often localized to the nuclear periphery. To test this model at the genome-wide scale, we studied transcriptional regulation of large testis-specific gene clusters in somatic tissues of Drosophila. These gene clusters showed a drastic paucity of known expressed transgene insertions, indicating that they indeed are embedded in repressed chromatin. Bioinformatics analysis suggested the major role for the B-type lamin, LamDmo, in repression of large testis-specific gene clusters, showing that in somatic cells as many as three-quarters of these clusters interact with LamDmo. Ablation of LamDmo by using mutants and RNAi led to detachment of testis-specific clusters from nuclear envelope and to their selective transcriptional up-regulation in somatic cells, thus providing the first direct evidence for involvement of the B-type lamin in tissue-specific gene repression. Finally, we found that transcriptional activation of the lamina-bound testis-specific gene cluster in male germ line is coupled with its translocation away from the nuclear envelope. Our studies, which directly link nuclear architecture with coordinated regulation of tissue-specific genes, advance understanding of the mechanisms underlying both normal cell differentiation and developmental disorders caused by lesions in the B-type lamins and interacting proteins. PMID:19218438

  12. clusterProfiler: an R package for comparing biological themes among gene clusters.

    Science.gov (United States)

    Yu, Guangchuang; Wang, Li-Gen; Han, Yanyan; He, Qing-Yu

    2012-05-01

    Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently, clusterProfiler supports three species, including humans, mice, and yeast. Methods provided in this package can be easily extended to other species and ontologies. The clusterProfiler package is released under Artistic-2.0 License within Bioconductor project. The source code and vignette are freely available at http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html.

  13. Some statistical properties of gene expression clustering for array data

    DEFF Research Database (Denmark)

    Abreu, G C G; Pinheiro, A; Drummond, R D;

    2010-01-01

    DNA array data without a corresponding statistical error measure. We propose an easy-to-implement and simple-to-use technique that uses bootstrap re-sampling to evaluate the statistical error of the nodes provided by SOM-based clustering. Comparisons between SOM and parametric clustering are presented......DNA arrays have been a rich source of data for the study of genomic expression of a wide variety of biological systems. Gene clustering is one of the paradigms quite used to assess the significance of a gene (or group of genes). However, most of the gene clustering techniques are applied to c...... for simulated as well as for two real data sets. We also implement a bootstrap-based pre-processing procedure for SOM, that improves the false discovery ratio of differentially expressed genes. Code in Matlab is freely available, as well as some supplementary material, at the following address: https...

  14. Clustering Algorithms: Their Application to Gene Expression Data

    Science.gov (United States)

    Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

    2016-01-01

    Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data proffers a prodigious preference to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpretation of the resulting mass of data, which consists of millions of measurements; these data also inhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure. PMID:27932867

  15. Identification of transcriptional activators for thienamycin and cephamycin C biosynthetic genes within the thienamycin gene cluster from Streptomyces cattleya.

    Science.gov (United States)

    Rodríguez, Miriam; Núñez, Luz Elena; Braña, Alfredo F; Méndez, Carmen; Salas, José A; Blanco, Gloria

    2008-08-01

    Two regulatory genes, thnI and thnU, were identified in the thienamycin (thn) gene cluster from Streptomyces cattleya. ThnI resembles LysR-type transcriptional activators and ThnU belongs to the SARP family of transcriptional activators. Their functional role was established after independent inactivation by gene replacement together with transcriptional analysis involving reverse transcription polymerase chain reaction (RT-PCR). Deletion of thnI abolished thienamycin production showing its involvement in thienamycin biosynthesis. Gene expression analysis applied to the thn gene cluster demonstrated that ThnI is a transcriptional activator essential for thienamycin biosynthesis that regulates the expression of nine genes involved in thienamycin assembly and export (thnH, thnJ, thnK, thnL, thnM, thnN, thnO, thnP and thnQ). Unexpectedly, the thnU disrupted mutant was not affected in thienamycin production but turned out to be essential for cephamycin C biosynthesis. Transcript analysis applied to early and late structural genes for cephamycin C biosynthesis (pcbAB and cmcI), revealed that ThnU is the transcriptional activator of these cephamycin C genes although they are not physically linked to the thn cluster. In addition, it was shown that deletion of thnI has an upregulatory effect on pcbAB and cmcI transcription consistent with a significant increase in cephamycin C biosynthesis in this mutant.

  16. Phylogeographic support for horizontal gene transfer involving sympatric bruchid species

    Directory of Open Access Journals (Sweden)

    Grill Andrea

    2006-07-01

    Full Text Available Abstract Background We report on the probable horizontal transfer of a mitochondrial gene, cytb, between species of Neotropical bruchid beetles, in a zone where these species are sympatric. The bruchid beetles Acanthoscelides obtectus, A. obvelatus, A. argillaceus and Zabrotes subfasciatus develop on various bean species in Mexico. Whereas A. obtectus and A. obvelatus develop on Phaseolus vulgaris in the Mexican Altiplano, A. argillaceus feeds on P. lunatus in the Pacific coast. The generalist Z. subfasciatus feeds on both bean species, and is sympatric with A. obtectus and A. obvelatus in the Mexican Altiplano, and with A. argillaceus in the Pacific coast. In order to assess the phylogenetic position of these four species, we amplified and sequenced one nuclear (28S rRNA and two mitochondrial (cytb, COI genes. Results Whereas species were well segregated in topologies obtained for COI and 28S rRNA, an unexpected pattern was obtained in the cytb phylogenetic tree. In this tree, individuals from A. obtectus and A. obvelatus, as well as Z. subfasciatus individuals from the Mexican Altiplano, clustered together in a unique little variable monophyletic unit. In contrast, A. argillaceus and Z. subfasciatus individuals from the Pacific coast clustered in two separated clades, identically to the pattern obtained for COI and 28S rRNA. An additional analysis showed that Z. subfasciatus individuals from the Mexican Altiplano also possessed the cytb gene present in individuals of this species from the Pacific coast. Zabrotes subfasciatus individuals from the Mexican Altiplano thus demonstrated two cytb genes, an "original" one and an "infectious" one, showing 25% of nucleotide divergence. The "infectious" cytb gene seems to be under purifying selection and to be expressed in mitochondria. Conclusion The high degree of incongruence of the cytb tree with patterns for other genes is discussed in the light of three hypotheses: experimental contamination

  17. Variations in CCL3L gene cluster sequence and non-specific gene copy numbers

    Directory of Open Access Journals (Sweden)

    Edberg Jeffrey C

    2010-03-01

    Full Text Available Abstract Background Copy number variations (CNVs of the gene CC chemokine ligand 3-like1 (CCL3L1 have been implicated in HIV-1 susceptibility, but the association has been inconsistent. CCL3L1 shares homology with a cluster of genes localized to chromosome 17q12, namely CCL3, CCL3L2, and, CCL3L3. These genes are involved in host defense and inflammatory processes. Several CNV assays have been developed for the CCL3L1 gene. Findings Through pairwise and multiple alignments of these genes, we have shown that the homology between these genes ranges from 50% to 99% in complete gene sequences and from 70-100% in the exonic regions, with CCL3L1 and CCL3L3 being identical. By use of MEGA 4 and BioEdit, we aligned sense primers, anti-sense primers, and probes used in several previously described assays against pre-multiple alignments of all four chemokine genes. Each set of probes and primers aligned and matched with overlapping sequences in at least two of the four genes, indicating that previously utilized RT-PCR based CNV assays are not specific for only CCL3L1. The four available assays measured median copies of 2 and 3-4 in European and African American, respectively. The concordance between the assays ranged from 0.44-0.83 suggesting individual discordant calls and inconsistencies with the assays from the expected gene coverage from the known sequence. Conclusions This indicates that some of the inconsistencies in the association studies could be due to assays that provide heterogenous results. Sequence information to determine CNV of the three genes separately would allow to test whether their association with the pathogenesis of a human disease or phenotype is affected by an individual gene or by a combination of these genes.

  18. A gene cluster for amylovoran synthesis in Erwinia amylovora: characterization and relationship to cps genes in Erwinia stewartii.

    Science.gov (United States)

    Bernhard, F; Coplin, D L; Geider, K

    1993-05-01

    A large ams gene cluster required for production of the acidic extracellular polysaccharide (EPS) amylovoran by the fire blight pathogen Erwinia amylovora was cloned. Tn5 mutagenesis and gene replacement were used to construct chromosomal ams mutants. Five complementation groups, essential for amylovoran synthesis and virulence in E. amylovora, were identified and designated ams A-E. The ams gene cluster is about 7 kb in size and functionally equivalent to the cps gene cluster involved in EPS synthesis by the related pathogen Erwinia stewartii. Mucoidy and virulence were restored to E. stewartii mutants in four cps complementation groups by the cloned E. amylovora ams genes. Conversely, the E. stewartii cps gene cluster was able to complement mutations in E. amylovora ams genes. Correspondence was found between the amsA-E complementation groups and the cpsB-D region, but the arrangement of the genes appears to be different. EPS production and virulence were also restored to E. amylovora amsE and E. stewartii cpsD mutants by clones containing the Rhizobium meliloti exo A gene.

  19. Minimum Information about a Biosynthetic Gene cluster : commentary

    NARCIS (Netherlands)

    Medema, Marnix H; Kottmann, Renzo; Yilmaz, Pelin; Cummings, Matthew; Biggins, John B; Blin, Kai; de Bruijn, Irene; Chooi, Yit Heng; Claesen, Jan; Coates, R Cameron; Cruz-Morales, Pablo; Duddela, Srikanth; Dusterhus, Stephanie; Edwards, Daniel J; Fewer, David P; Garg, Neha; Geiger, Christoph; Gomez-Escribano, Juan Pablo; Greule, Anja; Hadjithomas, Michalis; Haines, Anthony S; Helfrich, Eric J N; Hillwig, Matthew L; Ishida, Keishi; Jones, Adam C; Jones, Carla S; Jungmann, Katrin; Kegler, Carsten; Kim, Hyun Uk; Kotter, Peter; Krug, Daniel; Masschelein, Joleen; Melnik, Alexey V; Mantovani, Simone M; Monroe, Emily A; Moore, Marcus; Moss, Nathan; Nutzmann, Hans-Wilhelm; Pan, Guohui; Pati, Amrita; Petras, Daniel; Reen, F Jerry; Rosconi, Federico; Rui, Zhe; Tian, Zhenhua; Tobias, Nicholas J; Tsunematsu, Yuta; Wiemann, Philipp; Wyckoff, Elizabeth; Yan, Xiaohui; Yim, Grace; Yu, Fengan; Xie, Yunchang; Aigle, Bertrand; Apel, Alexander K; Balibar, Carl J; Balskus, Emily P; Barona-Gomez, Francisco; Bechthold, Andreas; Bode, Helge B; Borriss, Rainer; Brady, Sean F; Brakhage, Axel A; Caffrey, Patrick; Cheng, Yi-Qiang; Clardy, Jon; Cox, Russell J; De Mot, Rene; Donadio, Stefano; Donia, Mohamed S; van der Donk, Wilfred A; Dorrestein, Pieter C; Doyle, Sean; Driessen, Arnold J M; Ehling-Schulz, Monika; Entian, Karl-Dieter; Fischbach, Michael A; Gerwick, Lena; Gerwick, William H; Gross, Harald; Gust, Bertolt; Hertweck, Christian; Hofte, Monica; Jensen, Susan E; Ju, Jianhua; Katz, Leonard; Kaysser, Leonard; Klassen, Jonathan L; Keller, Nancy P; Kormanec, Jan; Kuipers, Oscar P; Kuzuyama, Tomohisa; Kyrpides, Nikos C; Kwon, Hyung-Jin; Lautru, Sylvie; Lavigne, Rob; Lee, Chia Y; Linquan, Bai; Liu, Xinyu; Liu, Wen; Luzhetskyy, Andriy; Mahmud, Taifo; Mast, Yvonne; Mendez, Carmen; Metsa-Ketela, Mikko; Micklefield, Jason; Mitchell, Douglas A; Moore, Bradley S; Moreira, Leonilde M; Muller, Rolf; Neilan, Brett A; Nett, Markus; Nielsen, Jens; O'Gara, Fergal; Oikawa, Hideaki; Osbourn, Anne; Osburne, Marcia S; Ostash, Bohdan; Payne, Shelley M; Pernodet, Jean-Luc; Petricek, Miroslav; Piel, Jorn; Ploux, Olivier; Raaijmakers, Jos M; Salas, Jose A; Schmitt, Esther K; Scott, Barry; Seipke, Ryan F; Shen, Ben; Sherman, David H; Sivonen, Kaarina; Smanski, Michael J; Sosio, Margherita; Stegmann, Evi; Sussmuth, Roderich D; Tahlan, Kapil; Thomas, Christopher M; Tang, Yi; Truman, Andrew W; Viaud, Muriel; Walton, Jonathan D; Walsh, Christopher T; Weber, Tilmann; van Wezel, Gilles P; Wilkinson, Barrie; Willey, Joanne M; Wohlleben, Wolfgang; Wright, Gerard D; Ziemert, Nadine; Zhang, Changsheng; Zotchev, Sergey B; Breitling, Rainer; Takano, Eriko; Glockner, Frank Oliver

    2015-01-01

    A wide variety of enzymatic pathways that produce specialized metabolites in bacteria, fungi and plants are known to be encoded in biosynthetic gene clusters. Information about these clusters, pathways and metabolites is currently dispersed throughout the literature, making it difficult to exploit.

  20. Hox gene clusters in the Indonesian coelacanth, Latimeria menadoensis.

    Science.gov (United States)

    Koh, Esther G L; Lam, Kevin; Christoffels, Alan; Erdmann, Mark V; Brenner, Sydney; Venkatesh, Byrappa

    2003-02-01

    The Hox genes encode transcription factors that play a key role in specifying body plans of metazoans. They are organized into clusters that contain up to 13 paralogue group members. The complex morphology of vertebrates has been attributed to the duplication of Hox clusters during vertebrate evolution. In contrast to the single Hox cluster in the amphioxus (Branchiostoma floridae), an invertebrate-chordate, mammals have four clusters containing 39 Hox genes. Ray-finned fishes (Actinopterygii) such as zebrafish and fugu possess more than four Hox clusters. The coelacanth occupies a basal phylogenetic position among lobe-finned fishes (Sarcopterygii), which gave rise to the tetrapod lineage. The lobe fins of sarcopterygians are considered to be the evolutionary precursors of tetrapod limbs. Thus, the characterization of Hox genes in the coelacanth should provide insights into the origin of tetrapod limbs. We have cloned the complete second exon of 33 Hox genes from the Indonesian coelacanth, Latimeria menadoensis, by extensive PCR survey and genome walking. Phylogenetic analysis shows that 32 of these genes have orthologs in the four mammalian HOX clusters, including three genes (HoxA6, D1, and D8) that are absent in ray-finned fishes. The remaining coelacanth gene is an ortholog of hoxc1 found in zebrafish but absent in mammals. Our results suggest that coelacanths have four Hox clusters bearing a gene complement more similar to mammals than to ray-finned fishes, but with an additional gene, HoxC1, which has been lost during the evolution of mammals from lobe-finned fishes.

  1. Close encounters involving free-floating planets in star clusters

    CERN Document Server

    Wang, Long; Zheng, Xiaochen; Church, Ross P; Davies, Melvyn B

    2015-01-01

    Instabilities in planetary systems can result in the ejection of planets from their host system, resulting in free-floating planets (FFPs). If this occurs in a star cluster, the FFP may remain bound to the star cluster for some time and interact with the other cluster members until it is ejected. Here, we use $N$-body simulations to characterise close star-planet and planet-planet encounters and the dynamical fate of the FFP population in star clusters containing $500-2000$ single or binary star members. We find that FFPs ejected from their planetary system at low velocities typically leave the star cluster 40% earlier than their host stars, and experience tens of close ($<1000$ AU) encounters with other stars and planets before they escape. The fraction of FFPs that experiences a close encounter depends on both the stellar density and the initial velocity distribution of the FFPs. Approximately half of the close encounters occur within the first 30 Myr, and only 10% occur after 100 Myr. The periastron vel...

  2. A Rough Set based Gene Expression Clustering Algorithm

    Directory of Open Access Journals (Sweden)

    J. J. Emilyn

    2011-01-01

    Full Text Available Problem statement: Microarray technology helps in monitoring the expression levels of thousands of genes across collections of related samples. Approach: The main goal in the analysis of large and heterogeneous gene expression datasets was to identify groups of genes that get expressed in a set of experimental conditions. Results: Several clustering techniques have been proposed for identifying gene signatures and to understand their role and many of them have been applied to gene expression data, but with partial success. The main aim of this work was to develop a clustering algorithm that would successfully indentify gene patterns. The proposed novel clustering technique (RCGED provides an efficient way of finding the hidden and unique gene expression patterns. It overcomes the restriction of one object being placed in only one cluster. Conclusion/Recommendations: The proposed algorithm is termed intelligent because it automatically determines the optimum number of clusters. The proposed algorithm was experimented with colon cancer dataset and the results were compared with Rough Fuzzy K Means algorithm.

  3. Phylogeny of the Insect Homeobox Gene (Hox) Cluster

    Institute of Scientific and Technical Information of China (English)

    Sangeeta Dhawan; K. P. Gopinathan

    2005-01-01

    The homeobox (Hox) genes form an evolutionarily conserved family encoding transcription factors that play major roles in segmental identity and organ specification across species. The canonical grouping of Hox genes present in the HOM-C cluster of Drosophila or related clusters in other organisms includes eight "typical" genes,which are localized in the order labial (lab), proboscipedia (pb), Deformed (Dfd),Sex combs reduced ( Scr), Antennapedia (Antp), Ultrabithorax (Ubx), abdominalA (abdA), and AbdominalB (AbdB). The members of Hox cluster are expressed in a distinct anterior to posterior order in the embryo. Analysis of the relatedness of different members of the Hox gene cluster to each other in four evolutionarily diverse insect taxa revealed that the loci pb/Dfd and AbdB, which are farthest apart in linkage, had a high degree of evolutionary relatedness, indicating that pb/Dfd type anterior genes and AbdB are closest to the ancestral anterior and posterior Hox genes, respectively. The greater relatedness of other posterior genes Ubx and abdA to the more anterior genes such as Antp and Scr suggested that they arose by gene duplications in the more anterior members rather than the posterior AbdB.

  4. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster

    Directory of Open Access Journals (Sweden)

    Dutartre Leslie

    2012-05-01

    Full Text Available Abstract Background The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA, are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(MBOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8 form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5 belonging to the same CYP71C subfamily. The origin of this cluster is unknown. Results We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. Conclusions These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2 at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.

  5. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster.

    Science.gov (United States)

    Dutartre, Leslie; Hilliou, Frédérique; Feyereisen, René

    2012-05-11

    The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA), are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.

  6. Secondary metabolic gene clusters: evolutionary toolkits for chemical innovation.

    Science.gov (United States)

    Osbourn, Anne

    2010-10-01

    Microbes and plants produce a huge array of secondary metabolites that have important ecological functions. These molecules have long been exploited in medicine as antibiotics, anticancer and anti-infective agents and for a wide range of other applications. Gene clusters for secondary metabolic pathways are common in bacteria and filamentous fungi, and examples have now been discovered in plants. Here, current knowledge of gene clusters across the kingdoms is evaluated with the aim of trying to understand the rules behind cluster existence and evolution. Such knowledge will be crucial in learning how to activate the enormous number of 'silent' gene clusters being revealed by whole-genome sequencing and hence in making available a wealth of novel compounds for evaluation as drug leads and other bioactives. It could also facilitate the development of crop plants with enhanced pest or disease resistance, improved nutritional qualities and/or elevated levels of high-value products.

  7. Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5

    Directory of Open Access Journals (Sweden)

    Neilan Brett A

    2009-03-01

    Full Text Available Abstract Background Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. Results We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. Conclusion The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved

  8. Identification of sugarcane genes involved in the purine synthesis pathway

    Directory of Open Access Journals (Sweden)

    Mario A. Jancso

    2001-12-01

    Full Text Available Nucleotide synthesis is of central importance to all cells. In most organisms, the purine nucleotides are synthesized de novo from non-nucleotide precursors such as amino acids, ammonia and carbon dioxide. An understanding of the enzymes involved in sugarcane purine synthesis opens the possibility of using these enzymes as targets for chemicals which may be effective in combating phytopathogen. Such an approach has already been applied to several parasites and types of cancer. The strategy described in this paper was applied to identify sugarcane clusters for each step of the de novo purine synthesis pathway. Representative sequences of this pathway were chosen from the National Center for Biotechnology Information (NCBI database and used to search the translated sugarcane expressed sequence tag (SUCEST database using the available basic local alignment search tool (BLAST facility. Retrieved clusters were further tested for the statistical significance of the alignment by an implementation (PRSS3 of the Monte Carlo shuffling algorithm calibrated using known protein sequences of divergent taxa along the phylogenetic tree. The sequences were compared to each other and to the sugarcane clusters selected using BLAST analysis, with the resulting table of p-values indicating the degree of divergence of each enzyme within different taxa and in relation to the sugarcane clusters. The results obtained by this strategy allowed us to identify the sugarcane proteins participating in the purine synthesis pathway.A via de síntese de purino nucleotídeos é considerada uma via de central importância para todas as células. Na maioria dos organismos, os purino nucleotídeos são sintetizados ''de novo'' a partir de precursores não-nucleotídicos como amino ácidos, amônia e dióxido de carbono. O conhecimento das enzimas envolvidas na via de síntese de purinas da cana-de-açúcar vai abrir a possibilidade do uso dessas enzimas como alvos no desenho

  9. Planning a cluster randomized trial with unequal cluster sizes: practical issues involving continuous outcomes

    Directory of Open Access Journals (Sweden)

    Ravaud Philippe

    2006-04-01

    Full Text Available Abstract Background Cluster randomization design is increasingly used for the evaluation of health-care, screeening or educational interventions. At the planning stage, sample size calculations usually consider an average cluster size without taking into account any potential imbalance in cluster size. However, there may exist high discrepancies in cluster sizes. Methods We performed simulations to study the impact of an imbalance in cluster size on power. We determined by simulations to which extent four methods proposed to adapt the sample size calculations to a pre-specified imbalance in cluster size could lead to adequately powered trials. Results We showed that an imbalance in cluster size can be of high influence on the power in the case of severe imbalance, particularly if the number of clusters is low and/or the intraclass correlation coefficient is high. In the case of a severe imbalance, our simulations confirmed that the minimum variance weights correction of the variation inflaction factor (VIF used in the sample size calculations has the best properties. Conclusion Publication of cluster sizes is important to assess the real power of the trial which was conducted and to help designing future trials. We derived an adaptation of the VIF from the minimum variance weights correction to be used in case the imbalance can be a priori formulated such as "a proportion (γ of clusters actually recruit a proportion (τ of subjects to be included (γ ≤ τ".

  10. Characterization of the largest effector gene cluster of Ustilago maydis.

    Directory of Open Access Journals (Sweden)

    Thomas Brefort

    2014-07-01

    Full Text Available In the genome of the biotrophic plant pathogen Ustilago maydis, many of the genes coding for secreted protein effectors modulating virulence are arranged in gene clusters. The vast majority of these genes encode novel proteins whose expression is coupled to plant colonization. The largest of these gene clusters, cluster 19A, encodes 24 secreted effectors. Deletion of the entire cluster results in severe attenuation of virulence. Here we present the functional analysis of this genomic region. We show that a 19A deletion mutant behaves like an endophyte, i.e. is still able to colonize plants and complete the infection cycle. However, tumors, the most conspicuous symptoms of maize smut disease, are only rarely formed and fungal biomass in infected tissue is significantly reduced. The generation and analysis of strains carrying sub-deletions identified several genes significantly contributing to tumor formation after seedling infection. Another of the effectors could be linked specifically to anthocyanin induction in the infected tissue. As the individual contributions of these genes to tumor formation were small, we studied the response of maize plants to the whole cluster mutant as well as to several individual mutants by array analysis. This revealed distinct plant responses, demonstrating that the respective effectors have discrete plant targets. We propose that the analysis of plant responses to effector mutant strains that lack a strong virulence phenotype may be a general way to visualize differences in effector function.

  11. Characterization of the largest effector gene cluster of Ustilago maydis.

    Science.gov (United States)

    Brefort, Thomas; Tanaka, Shigeyuki; Neidig, Nina; Doehlemann, Gunther; Vincon, Volker; Kahmann, Regine

    2014-07-01

    In the genome of the biotrophic plant pathogen Ustilago maydis, many of the genes coding for secreted protein effectors modulating virulence are arranged in gene clusters. The vast majority of these genes encode novel proteins whose expression is coupled to plant colonization. The largest of these gene clusters, cluster 19A, encodes 24 secreted effectors. Deletion of the entire cluster results in severe attenuation of virulence. Here we present the functional analysis of this genomic region. We show that a 19A deletion mutant behaves like an endophyte, i.e. is still able to colonize plants and complete the infection cycle. However, tumors, the most conspicuous symptoms of maize smut disease, are only rarely formed and fungal biomass in infected tissue is significantly reduced. The generation and analysis of strains carrying sub-deletions identified several genes significantly contributing to tumor formation after seedling infection. Another of the effectors could be linked specifically to anthocyanin induction in the infected tissue. As the individual contributions of these genes to tumor formation were small, we studied the response of maize plants to the whole cluster mutant as well as to several individual mutants by array analysis. This revealed distinct plant responses, demonstrating that the respective effectors have discrete plant targets. We propose that the analysis of plant responses to effector mutant strains that lack a strong virulence phenotype may be a general way to visualize differences in effector function.

  12. A genome-wide analysis of nonribosomal peptide synthetase gene clusters and their peptides in a Planktothrix rubescens strain

    Directory of Open Access Journals (Sweden)

    Nederbragt Alexander J

    2009-08-01

    Full Text Available Abstract Background Cyanobacteria often produce several different oligopeptides, with unknown biological functions, by nonribosomal peptide synthetases (NRPS. Although some cyanobacterial NRPS gene cluster types are well described, the entire NRPS genomic content within a single cyanobacterial strain has never been investigated. Here we have combined a genome-wide analysis using massive parallel pyrosequencing ("454" and mass spectrometry screening of oligopeptides produced in the strain Planktothrix rubescens NIVA CYA 98 in order to identify all putative gene clusters for oligopeptides. Results Thirteen types of oligopeptides were uncovered by mass spectrometry (MS analyses. Microcystin, cyanopeptolin and aeruginosin synthetases, highly similar to already characterized NRPS, were present in the genome. Two novel NRPS gene clusters were associated with production of anabaenopeptins and microginins, respectively. Sequence-depth of the genome and real-time PCR data revealed three copies of the microginin gene cluster. Since NRPS gene cluster candidates for microviridin and oscillatorin synthesis could not be found, putative (gene encoded precursor peptide sequences to microviridin and oscillatorin were found in the genes mdnA and oscA, respectively. The genes flanking the microviridin and oscillatorin precursor genes encode putative modifying enzymes of the precursor oligopeptides. We therefore propose ribosomal pathways involving modifications and cyclisation for microviridin and oscillatorin. The microviridin, anabaenopeptin and cyanopeptolin gene clusters are situated in close proximity to each other, constituting an oligopeptide island. Conclusion Altogether seven nonribosomal peptide synthetase (NRPS gene clusters and two gene clusters putatively encoding ribosomal oligopeptide biosynthetic pathways were revealed. Our results demonstrate that whole genome shotgun sequencing combined with MS-directed determination of oligopeptides successfully

  13. Synteny in toxigenic Fusarium species: the fumonisin gene cluster and the mating type region as examples

    NARCIS (Netherlands)

    Waalwijk, C.; Lee, van der T.A.J.; Vries, de P.M.; Hesselink, T.; Arts, J.; Kema, G.H.J.

    2004-01-01

    A comparative genomic approach was used to study the mating type locus and the gene cluster involved in toxin production ( fumonisin) in Fusarium proliferatum, a pathogen with a wide host range and a complex toxin profile. A BAC library, generated from F. proliferatum isolate ITEM 2287, was used to

  14. Differential expression of TIR-like genes embedded in the M1-1 gene cluster in nematode-resistant and -susceptible tomato roots

    NARCIS (Netherlands)

    Seifi Abdolabad, A.R.; Visser, R.G.F.; Bai, Y.

    2011-01-01

    Transport inhibitor 1 (TIR1) is an auxin receptor that plays a pivotal role in auxin signaling. It has been reported that TIR-like genes are present in a gene cluster carrying the Mi-1 gene which confers resistance to nematodes, aphids and whiteflies. Since auxin is involved in the pathogenicity of

  15. Differential expression of TIR-like genes embedded in the M1-1 gene cluster in nematode-resistant and -susceptible tomato roots

    NARCIS (Netherlands)

    Seifi Abdolabad, A.R.; Visser, R.G.F.; Bai, Y.

    2011-01-01

    Transport inhibitor 1 (TIR1) is an auxin receptor that plays a pivotal role in auxin signaling. It has been reported that TIR-like genes are present in a gene cluster carrying the Mi-1 gene which confers resistance to nematodes, aphids and whiteflies. Since auxin is involved in the pathogenicity of

  16. An improved algorithm for clustering gene expression data.

    Science.gov (United States)

    Bandyopadhyay, Sanghamitra; Mukhopadhyay, Anirban; Maulik, Ujjwal

    2007-11-01

    Recent advancements in microarray technology allows simultaneous monitoring of the expression levels of a large number of genes over different time points. Clustering is an important tool for analyzing such microarray data, typical properties of which are its inherent uncertainty, noise and imprecision. In this article, a two-stage clustering algorithm, which employs a recently proposed variable string length genetic scheme and a multiobjective genetic clustering algorithm, is proposed. It is based on the novel concept of points having significant membership to multiple classes. An iterated version of the well-known Fuzzy C-Means is also utilized for clustering. The significant superiority of the proposed two-stage clustering algorithm as compared to the average linkage method, Self Organizing Map (SOM) and a recently developed weighted Chinese restaurant-based clustering method (CRC), widely used methods for clustering gene expression data, is established on a variety of artificial and publicly available real life data sets. The biological relevance of the clustering solutions are also analyzed.

  17. Identification and analysis of the paulomycin biosynthetic gene cluster and titer improvement of the paulomycins in Streptomyces paulus NRRL 8115.

    Directory of Open Access Journals (Sweden)

    Jine Li

    Full Text Available The paulomycins are a group of glycosylated compounds featuring a unique paulic acid moiety. To locate their biosynthetic gene clusters, the genomes of two paulomycin producers, Streptomyces paulus NRRL 8115 and Streptomyces sp. YN86, were sequenced. The paulomycin biosynthetic gene clusters were defined by comparative analyses of the two genomes together with the genome of the third paulomycin producer Streptomyces albus J1074. Subsequently, the identity of the paulomycin biosynthetic gene cluster was confirmed by inactivation of two genes involved in biosynthesis of the paulomycose branched chain (pau11 and the ring A moiety (pau18 in Streptomyces paulus NRRL 8115. After determining the gene cluster boundaries, a convergent biosynthetic model was proposed for paulomycin based on the deduced functions of the pau genes. Finally, a paulomycin high-producing strain was constructed by expressing an activator-encoding gene (pau13 in S. paulus, setting the stage for future investigations.

  18. A maize-specifically expressed gene cluster in Ustilago maydis.

    Science.gov (United States)

    Basse, Christoph W; Kolb, Sebastian; Kahmann, Regine

    2002-01-01

    The corn pathogen Ustilago maydis requires its host plant maize for development and completion of its sexual cycle. We have identified the fungal mig2-1 gene as being specifically expressed during this biotrophic stage. Intriguingly, mig2-1 is part of a gene cluster comprising five highly homologous and similarly regulated genes designated mig2-1 to mig2-5. Deletion analysis of the mig2-1 promoter provides evidence for negative and positive regulation. The predicted polypeptides of all five genes lack significant homologies to known genes but have characteristic N-terminal secretion sequences. The secretion signals of mig2-1 and mig2-5 were shown to be functional, and secretion of a full length Mig2-1-eGFP fusion protein to the extracellular space was demonstrated. The central domains of the Mig2 proteins are highly variable whereas the C-termini are strongly conserved and share a characteristic pattern of eight cysteine residues. The mig2 gene cluster was conserved in a wide collection of U. maydis strains. Interestingly, some U. maydis isolates from South America had lost the mig2-4 gene as a result of a homologous recombination event. Furthermore, the related Ustilago scitaminea strain, which is pathogenic on sugar cane, appears to lack the mig2 cluster. We describe a model of how the mig2 cluster might have evolved and discuss its possible role in governing host interaction.

  19. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    Science.gov (United States)

    Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  20. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    Directory of Open Access Journals (Sweden)

    Zhimin Dai

    Full Text Available Biological nitrogen fixation is an essential function of acid mine drainage (AMD microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  1. Hormonal Involvement in Breast Cancer Gene Amplification

    Science.gov (United States)

    2010-10-01

    and s ubsequently amp lified at the Yale University sequenc ing facility for Illumina sequencing. However, it required a lot of effort to obtain this...and Polyak K. (2008). Genome-wide functi onal synergy between amp lified and mutated genes in human breast cancer. Cancer Res. 68: 9532-9540...east cancer patient samples. Other co-amp lified genes, within the HER2 amplicon and/or at other regions, could serve as additional novel target s for

  2. Unique nucleotide polymorphism of ankyrin gene cluster in Arabidopsis

    Indian Academy of Sciences (India)

    Jianchang Du; Xingna Wang; Mingsheng Zhang; Dacheng Tian; Yong-Hua Yang

    2007-01-01

    The ankyrin (ANK) gene cluster is a part of a multigene family encoding ANK transmembrane proteins in Arabidopsis thaliana, and plays an important role in protein–protein interactions and in signal pathways. In contrast to other regions of a genome, the ANK gene cluster exhibits an extremely high level of DNA polymorphism in an ∼5-kb region, without apparent decay. Phylogenetic analysis detects two clear, deeply differentiated haplotypes (dimorphism). The divergence between haplotypes of accession Col-0 and Ler-0 (Hap-C and Hap-L) is estimated to be 10.7%, approximately equal to the 10.5% average divergence between A. thaliana and A. lyrata. Sequence comparisons for the ANK gene cluster homologues in Col-0 indicate that the members evolve independently, and that the similarity among paralogues is lower than between alleles. Very little intralocus recombination or gene conversion is detected in ANK regions. All these characteristics of the ANK gene cluster are consistent with a tandem gene duplication and birth-and-death process. The possible mechanisms for and implications of this elevated nucleotide variation are also discussed, including the suggestion of balancing selection.

  3. Accurate prediction of secondary metabolite gene clusters in filamentous fungi.

    Science.gov (United States)

    Andersen, Mikael R; Nielsen, Jakob B; Klitgaard, Andreas; Petersen, Lene M; Zachariasen, Mia; Hansen, Tilde J; Blicher, Lene H; Gotfredsen, Charlotte H; Larsen, Thomas O; Nielsen, Kristian F; Mortensen, Uffe H

    2013-01-02

    Biosynthetic pathways of secondary metabolites from fungi are currently subject to an intense effort to elucidate the genetic basis for these compounds due to their large potential within pharmaceutics and synthetic biochemistry. The preferred method is methodical gene deletions to identify supporting enzymes for key synthases one cluster at a time. In this study, we design and apply a DNA expression array for Aspergillus nidulans in combination with legacy data to form a comprehensive gene expression compendium. We apply a guilt-by-association-based analysis to predict the extent of the biosynthetic clusters for the 58 synthases active in our set of experimental conditions. A comparison with legacy data shows the method to be accurate in 13 of 16 known clusters and nearly accurate for the remaining 3 clusters. Furthermore, we apply a data clustering approach, which identifies cross-chemistry between physically separate gene clusters (superclusters), and validate this both with legacy data and experimentally by prediction and verification of a supercluster consisting of the synthase AN1242 and the prenyltransferase AN11080, as well as identification of the product compound nidulanin A. We have used A. nidulans for our method development and validation due to the wealth of available biochemical data, but the method can be applied to any fungus with a sequenced and assembled genome, thus supporting further secondary metabolite pathway elucidation in the fungal kingdom.

  4. Identification of the Scopularide Biosynthetic Gene Cluster in Scopulariopsis brevicaulis

    Directory of Open Access Journals (Sweden)

    Mie Bech Lukassen

    2015-07-01

    Full Text Available Scopularide A is a promising potent anticancer lipopeptide isolated from a marine derived Scopulariopsis brevicaulis strain. The compound consists of a reduced carbon chain (3-hydroxy-methyldecanoyl attached to five amino acids (glycine, l-valine, d-leucine, l-alanine, and l-phenylalanine. Using the newly sequenced S. brevicaulis genome we were able to identify the putative biosynthetic gene cluster using genetic information from the structurally related emericellamide A from Aspergillus nidulans and W493-B from Fusarium pseudograminearum. The scopularide A gene cluster includes a nonribosomal peptide synthetase (NRPS1, a polyketide synthase (PKS2, a CoA ligase, an acyltransferase, and a transcription factor. Homologous recombination was low in S. brevicaulis so the local transcription factor was integrated randomly under a constitutive promoter, which led to a three to four-fold increase in scopularide A production. This indirectly verifies the identity of the proposed biosynthetic gene cluster.

  5. Ontology-Driven Co-clustering of Gene Expression Data

    Science.gov (United States)

    Cordero, Francesca; Pensa, Ruggero G.; Visconti, Alessia; Ienco, Dino; Botta, Marco

    The huge volume of gene expression data produced by microarrays and other high-throughput techniques has encouraged the development of new computational techniques to evaluate the data and to formulate new biological hypotheses. To this purpose, co-clustering techniques are widely used: these identify groups of genes that show similar activity patterns under a specific subset of the experimental conditions by measuring the similarity in expression within these groups. However, in many applications, distance metrics based only on expression levels fail in capturing biologically meaningful clusters.

  6. An Sp185/333 gene cluster from the purple sea urchin and putative microsatellite-mediated gene diversification

    Directory of Open Access Journals (Sweden)

    Buckley Katherine M

    2010-10-01

    Full Text Available Abstract Background The immune system of the purple sea urchin, Strongylocentrotus purpuratus, is complex and sophisticated. An important component of sea urchin immunity is the Sp185/333 gene family, which is significantly upregulated in immunologically challenged animals. The Sp185/333 genes are less than 2 kb with two exons and are members of a large diverse family composed of greater than 40 genes. The S. purpuratus genome assembly, however, contains only six Sp185/333 genes. This underrepresentation could be due to the difficulties that large gene families present in shotgun assembly, where multiple similar genes can be collapsed into a single consensus gene. Results To understand the genomic organization of the Sp185/333 gene family, a BAC insert containing Sp185/333 genes was assembled, with careful attention to avoiding artifacts resulting from collapse or artificial duplication/expansion of very similar genes. Twelve candidate BAC assemblies were generated with varying parameters and the optimal assembly was identified by PCR, restriction digests, and subclone sequencing. The validated assembly contained six Sp185/333 genes that were clustered in a 34 kb region at one end of the BAC with five of the six genes tightly clustered within 20 kb. The Sp185/333 genes in this cluster were no more similar to each other than to previously sequenced Sp185/333 genes isolated from three different animals. This was unexpected given their proximity and putative effects of gene homogenization in closely linked, similar genes. All six genes displayed significant similarity including both 5' and 3' flanking regions, which were bounded by microsatellites. Three of the Sp185/333 genes and their flanking regions were tandemly duplicated such that each repeated segment consisted of a gene plus 0.7 kb 5' and 2.4 kb 3' of the gene (4.5 kb total. Both edges of the segmental duplications were bounded by different microsatellites. Conclusions The high sequence

  7. Genetic localization and in vivo characterization of a Monascus azaphilone pigment biosynthetic gene cluster.

    Science.gov (United States)

    Balakrishnan, Bijinu; Karki, Suman; Chiu, Shih-Hau; Kim, Hyun-Ju; Suh, Jae-Won; Nam, Bora; Yoon, Yeo-Min; Chen, Chien-Chi; Kwon, Hyung-Jin

    2013-07-01

    Monascus spp. produce several well-known polyketides such as monacolin K, citrinin, and azaphilone pigments. In this study, the azaphilone pigment biosynthetic gene cluster was identified through T-DNA random mutagenesis in Monascus purpureus. The albino mutant W13 bears a T-DNA insertion upstream of a transcriptional regulator gene (mppR1). The transcription of mppR1 and the nearby polyketide synthase gene (MpPKS5) was significantly repressed in the W13 mutant. Targeted inactivation of MpPKS5 also gave rise to an albino mutant, confirming that mppR1 and MpPKS5 belong to an azaphilone pigment biosynthetic gene cluster. This M. purpureus sequence was used to identify the whole biosynthetic gene cluster in the Monascus pilosus genome. MpPKS5 contains SAT/KS/AT/PT/ACP/MT/R domains, and this domain organization is preserved in other azaphilone polyketide synthases. This biosynthetic gene cluster also encodes fatty acid synthase (FAS), which is predicted to assist the synthesis of 3-oxooactanoyl-CoA and 3-oxodecanoyl-CoA. These 3-oxoacyl compounds are proposed to be incorporated into the azaphilone backbone to complete the pigment biosynthesis. A monooxygenase gene (an azaH and tropB homolog) that is located far downstream of the FAS gene is proposed to be involved in pyrone ring formation. A homology search on other fungal genome sequences suggests that this azaphilone pigment gene cluster also exists in the Penicillium marneffei and Talaromyces stipitatus genomes.

  8. Functional identification of gene cluster for the aniline metabolic pathway mediated by transposable element

    Institute of Scientific and Technical Information of China (English)

    LIANG Quanfeng; Takeo Masahiro; LIN Min; CHEN Ming; XU Yuquan; ZHANG Wei; PING Shuzhen; LU Wei; SONG Xianlong; WANG Weiwei; GENG Lizhao

    2005-01-01

    A convenient and widely applicable method has been developed to clone aniline metabolic gene cluster in this study. Three positive recombinant plasmids pDA1, pDB2 and pDB11 were cloned from genomic library of aniline degradation strain AD9. The result of aniline dioxygenase (AD) activity and catechol 2,3-oxygenase (C23O) activity assay showed that pDA1 and pDB11 contain aniline dioxygenase genes and catechol 2,3-dioxygenase genes, respectively. The sequence analysis of the total 24.7-kb region revealed that this region contains 25 ORFs, of which 17 genes involve metabolism of aniline. In the gene cluster, the first five genes (tadQTA1A2B) and the subsequent gene (tadR1) were predicted to encode a multi-component aniline dioxygenase and a LysR-type regulator, respectively, while the others (tadD1C1D2C2EFGIJKL) were expected to encode meta- cleavage pathway enzymes for catechol degradation. The gene cluster was surrounded by two IS1071 sequences.

  9. EMAS Regulation in Italian Clusters: Investigating the Involvement of Local Stakeholders

    National Research Council Canada - National Science Library

    Roberto Merli; Michele Preziosi; Ilaria Massa

    2014-01-01

      The last revision of the EMAS (Eco Management and Audit Scheme) Regulation encouraged a cluster approach to increase the participation of the organizations and to involve local stakeholders in the commitment to sustainability...

  10. Simultaneous clustering of gene expression data with clinical chemistry and pathological evaluations reveals phenotypic prototypes

    Directory of Open Access Journals (Sweden)

    Wolfinger Russell D

    2007-02-01

    Full Text Available Abstract Background Commonly employed clustering methods for analysis of gene expression data do not directly incorporate phenotypic data about the samples. Furthermore, clustering of samples with known phenotypes is typically performed in an informal fashion. The inability of clustering algorithms to incorporate biological data in the grouping process can limit proper interpretation of the data and its underlying biology. Results We present a more formal approach, the modk-prototypes algorithm, for clustering biological samples based on simultaneously considering microarray gene expression data and classes of known phenotypic variables such as clinical chemistry evaluations and histopathologic observations. The strategy involves constructing an objective function with the sum of the squared Euclidean distances for numeric microarray and clinical chemistry data and simple matching for histopathology categorical values in order to measure dissimilarity of the samples. Separate weighting terms are used for microarray, clinical chemistry and histopathology measurements to control the influence of each data domain on the clustering of the samples. The dynamic validity index for numeric data was modified with a category utility measure for determining the number of clusters in the data sets. A cluster's prototype, formed from the mean of the values for numeric features and the mode of the categorical values of all the samples in the group, is representative of the phenotype of the cluster members. The approach is shown to work well with a simulated mixed data set and two real data examples containing numeric and categorical data types. One from a heart disease study and another from acetaminophen (an analgesic exposure in rat liver that causes centrilobular necrosis. Conclusion The modk-prototypes algorithm partitioned the simulated data into clusters with samples in their respective class group and the heart disease samples into two groups (sick and

  11. Evolution and differential expression of a vertebrate vitellogenin gene cluster

    Directory of Open Access Journals (Sweden)

    Kongshaug Heidi

    2009-01-01

    Full Text Available Abstract Background The multiplicity or loss of the vitellogenin (vtg gene family in vertebrates has been argued to have broad implications for the mode of reproduction (placental or non-placental, cleavage pattern (meroblastic or holoblastic and character of the egg (pelagic or benthic. Earlier proposals for the existence of three forms of vertebrate vtgs present conflicting models for their origin and subsequent duplication. Results By integrating phylogenetics of novel vtg transcripts from old and modern teleosts with syntenic analyses of all available genomic variants of non-metatherian vertebrates we identify the gene orthologies between the Sarcopterygii (tetrapod branch and Actinopterygii (fish branch. We argue that the vertebrate vtg gene cluster originated in proto-chromosome m, but that vtg genes have subsequently duplicated and rearranged following whole genome duplications. Sequencing of a novel fourth vtg transcript in labrid species, and the presence of duplicated paralogs in certain model organisms supports the notion that lineage-specific gene duplications frequently occur in teleosts. The data show that the vtg gene cluster is more conserved between acanthomorph teleosts and tetrapods, than in ostariophysan teleosts such as the zebrafish. The differential expression of the labrid vtg genes are further consistent with the notion that neofunctionalized Aa-type vtgs are important determinants of the pelagic or benthic character of the eggs in acanthomorph teleosts. Conclusion The vertebrate vtg gene cluster existed prior to the separation of Sarcopterygii from Actinopterygii >450 million years ago, a period associated with the second round of whole genome duplication. The presence of higher copy numbers in a more highly expressed subcluster is particularly prevalent in teleosts. The differential expression and latent neofunctionalization of vtg genes in acanthomorph teleosts is an adaptive feature associated with oocyte hydration

  12. Coupled Two-Way Clustering Analysis of Gene Microarray Data

    CERN Document Server

    Getz, G; Domany, E

    2000-01-01

    We present a novel coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant partitions emerge. The search for such subsets is a computationally complex task: we present an algorithm, based on iterative clustering, which performs such a search. This analysis is especially suitable for gene microarray data, where the contributions of a variety of biological mechanisms to the gene expression levels are entangled in a large body of experimental data. The method was applied to two gene microarray data sets, on colon cancer and leukemia. By identifying relevant subsets of the data and focusing on them we were able to discover partitions and correlations that were masked and hidden when the full dataset was used in the analysis. Some of these partitions have clear biological interpretation; others can serve to identify possible directions for future research.

  13. Coupled two-way clustering analysis of gene microarray data

    Science.gov (United States)

    Getz, Gad; Levine, Erel; Domany, Eytan

    2000-10-01

    We present a coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant partitions emerge. The search for such subsets is a computationally complex task. We present an algorithm, based on iterative clustering, that performs such a search. This analysis is especially suitable for gene microarray data, where the contributions of a variety of biological mechanisms to the gene expression levels are entangled in a large body of experimental data. The method was applied to two gene microarray data sets, on colon cancer and leukemia. By identifying relevant subsets of the data and focusing on them we were able to discover partitions and correlations that were masked and hidden when the full dataset was used in the analysis. Some of these partitions have clear biological interpretation; others can serve to identify possible directions for future research.

  14. Genotyping of Campylobacter jejuni strains from Danish broiler chickens by restriction fragment length polymorphism of the LPS gene cluster

    DEFF Research Database (Denmark)

    Knudsen, K.N.; Bang, Dang Duong; Nielsen, E.M.

    2005-01-01

    Aims: To apply and evaluate LG (LPS genes) genotyping, which is a genotyping method based on a cluster of genes involved in the synthesis of surface lipopolysaccharides (LPS) in Campylobacter species, for typing of Campylobacter jejuni isolates obtained from Danish broiler chickens. Furthermore, ...

  15. The entire β-globin gene cluster is deleted in a form of τδβ-thalassemia.

    NARCIS (Netherlands)

    E.R. Fearon; H.H.Jr. Kazazian; P.G. Waber (Pamela); J.I. Lee (Joseph); S.E. Antonarakis; S.H. Orkin (Stuart); E.F. Vanin; P.S. Henthorn; F.G. Grosveld (Frank); A.F. Scott; G.R. Buchanan

    1983-01-01

    textabstractWe have used restriction endonuclease mapping to study a deletion involving the beta-globin gene cluster in a Mexican-American family with gamma delta beta-thalassemia. Analysis of DNA polymorphisms demonstrated deletion of the beta-globin gene from the affected chromosome. Using a DNA

  16. The entire β-globin gene cluster is deleted in a form of τδβ-thalassemia.

    NARCIS (Netherlands)

    E.R. Fearon; H.H.Jr. Kazazian; P.G. Waber (Pamela); J.I. Lee (Joseph); S.E. Antonarakis; S.H. Orkin (Stuart); E.F. Vanin; P.S. Henthorn; F.G. Grosveld (Frank); A.F. Scott; G.R. Buchanan

    1983-01-01

    textabstractWe have used restriction endonuclease mapping to study a deletion involving the beta-globin gene cluster in a Mexican-American family with gamma delta beta-thalassemia. Analysis of DNA polymorphisms demonstrated deletion of the beta-globin gene from the affected chromosome. Using a DNA f

  17. Some statistical properties of gene expression clustering for array data

    DEFF Research Database (Denmark)

    Abreu, G C G; Pinheiro, A; Drummond, R D

    2010-01-01

    DNA array data without a corresponding statistical error measure. We propose an easy-to-implement and simple-to-use technique that uses bootstrap re-sampling to evaluate the statistical error of the nodes provided by SOM-based clustering. Comparisons between SOM and parametric clustering are presented...... for simulated as well as for two real data sets. We also implement a bootstrap-based pre-processing procedure for SOM, that improves the false discovery ratio of differentially expressed genes. Code in Matlab is freely available, as well as some supplementary material, at the following address: https...

  18. A Gene Selection Approach based on Clustering for Classification Tasks in Colon Cancer

    Directory of Open Access Journals (Sweden)

    José Antonio CASTELLANOS GARZÓN

    2016-06-01

    Full Text Available Gene selection (GS is an important research area in the analysis of DNA-microarray data, since it involves gene discovery meaningful for a particular target annotation or able to discriminate expression profiles of samples coming from different populations. In this context, a wide number of filter methods have been proposed in the literature to identify subsets of relevant genes in accordance with prefixed targets. Despite the fact that there is a wide number of proposals, the complexity imposed by this problem (GS remains a challenge. Hence, this paper proposes a novel approach for gene selection by using cluster techniques and filter methods on the found groupings to achieve informative gene subsets. As a result of applying our methodology to Colon cancer data, we have identified the best informative gene subset between several one subsets. According to the above, the reached results have proven the reliability of the approach given in this paper.

  19. Transcription mediated insulation and interference direct gene cluster expression switches.

    Science.gov (United States)

    Nguyen, Tania; Fischl, Harry; Howe, Françoise S; Woloszczuk, Ronja; Serra Barros, Ana; Xu, Zhenyu; Brown, David; Murray, Struan C; Haenni, Simon; Halstead, James M; O'Connor, Leigh; Shipkovenska, Gergana; Steinmetz, Lars M; Mellor, Jane

    2014-11-19

    In yeast, many tandemly arranged genes show peak expression in different phases of the metabolic cycle (YMC) or in different carbon sources, indicative of regulation by a bi-modal switch, but it is not clear how these switches are controlled. Using native elongating transcript analysis (NET-seq), we show that transcription itself is a component of bi-modal switches, facilitating reciprocal expression in gene clusters. HMS2, encoding a growth-regulated transcription factor, switches between sense- or antisense-dominant states that also coordinate up- and down-regulation of transcription at neighbouring genes. Engineering HMS2 reveals alternative mono-, di- or tri-cistronic and antisense transcription units (TUs), using different promoter and terminator combinations, that underlie state-switching. Promoters or terminators are excluded from functional TUs by read-through transcriptional interference, while antisense TUs insulate downstream genes from interference. We propose that the balance of transcriptional insulation and interference at gene clusters facilitates gene expression switches during intracellular and extracellular environmental change.

  20. Degradation of Benzene by Pseudomonas veronii 1YdBTEX2 and 1YB2 Is Catalyzed by Enzymes Encoded in Distinct Catabolism Gene Clusters

    Science.gov (United States)

    de Lima-Morales, Daiana; Chaves-Moreno, Diego; Wos-Oxley, Melissa L.; Jáuregui, Ruy; Vilchez-Vargas, Ramiro

    2015-01-01

    Pseudomonas veronii 1YdBTEX2, a benzene and toluene degrader, and Pseudomonas veronii 1YB2, a benzene degrader, have previously been shown to be key players in a benzene-contaminated site. These strains harbor unique catabolic pathways for the degradation of benzene comprising a gene cluster encoding an isopropylbenzene dioxygenase where genes encoding downstream enzymes were interrupted by stop codons. Extradiol dioxygenases were recruited from gene clusters comprising genes encoding a 2-hydroxymuconic semialdehyde dehydrogenase necessary for benzene degradation but typically absent from isopropylbenzene dioxygenase-encoding gene clusters. The benzene dihydrodiol dehydrogenase-encoding gene was not clustered with any other aromatic degradation genes, and the encoded protein was only distantly related to dehydrogenases of aromatic degradation pathways. The involvement of the different gene clusters in the degradation pathways was suggested by real-time quantitative reverse transcription PCR. PMID:26475106

  1. Transcriptome analysis identifies genes involved in ethanol response of Saccharomyces cerevisiae in Agave tequilana juice.

    Science.gov (United States)

    Ramirez-Córdova, Jesús; Drnevich, Jenny; Madrigal-Pulido, Jaime Alberto; Arrizon, Javier; Allen, Kirk; Martínez-Velázquez, Moisés; Alvarez-Maya, Ikuri

    2012-08-01

    During ethanol fermentation, yeast cells are exposed to stress due to the accumulation of ethanol, cell growth is altered and the output of the target product is reduced. For Agave beverages, like tequila, no reports have been published on the global gene expression under ethanol stress. In this work, we used microarray analysis to identify Saccharomyces cerevisiae genes involved in the ethanol response. Gene expression of a tequila yeast strain of S. cerevisiae (AR5) was explored by comparing global gene expression with that of laboratory strain S288C, both after ethanol exposure. Additionally, we used two different culture conditions, cells grown in Agave tequilana juice as a natural fermentation media or grown in yeast-extract peptone dextrose as artificial media. Of the 6368 S. cerevisiae genes in the microarray, 657 genes were identified that had different expression responses to ethanol stress due to strain and/or media. A cluster of 28 genes was found over-expressed specifically in the AR5 tequila strain that could be involved in the adaptation to tequila yeast fermentation, 14 of which are unknown such as yor343c, ylr162w, ygr182c, ymr265c, yer053c-a or ydr415c. These could be the most suitable genes for transforming tequila yeast to increase ethanol tolerance in the tequila fermentation process. Other genes involved in response to stress (RFC4, TSA1, MLH1, PAU3, RAD53) or transport (CYB2, TIP20, QCR9) were expressed in the same cluster. Unknown genes could be good candidates for the development of recombinant yeasts with ethanol tolerance for use in industrial tequila fermentation.

  2. A phase synchronization clustering algorithm for identifying interesting groups of genes from cell cycle expression data

    Directory of Open Access Journals (Sweden)

    Tcha Hong

    2008-01-01

    Full Text Available Abstract Background The previous studies of genome-wide expression patterns show that a certain percentage of genes are cell cycle regulated. The expression data has been analyzed in a number of different ways to identify cell cycle dependent genes. In this study, we pose the hypothesis that cell cycle dependent genes are considered as oscillating systems with a rhythm, i.e. systems producing response signals with period and frequency. Therefore, we are motivated to apply the theory of multivariate phase synchronization for clustering cell cycle specific genome-wide expression data. Results We propose the strategy to find groups of genes according to the specific biological process by analyzing cell cycle specific gene expression data. To evaluate the propose method, we use the modified Kuramoto model, which is a phase governing equation that provides the long-term dynamics of globally coupled oscillators. With this equation, we simulate two groups of expression signals, and the simulated signals from each group shares their own common rhythm. Then, the simulated expression data are mixed with randomly generated expression data to be used as input data set to the algorithm. Using these simulated expression data, it is shown that the algorithm is able to identify expression signals that are involved in the same oscillating process. We also evaluate the method with yeast cell cycle expression data. It is shown that the output clusters by the proposed algorithm include genes, which are closely associated with each other by sharing significant Gene Ontology terms of biological process and/or having relatively many known biological interactions. Therefore, the evaluation analysis indicates that the method is able to identify expression signals according to the specific biological process. Our evaluation analysis also indicates that some portion of output by the proposed algorithm is not obtainable by the traditional clustering algorithm with

  3. Host genes involved in Agrobacterium-mediated transformation

    NARCIS (Netherlands)

    Soltani, Jalal

    2009-01-01

    Agrobacterium is the nature’s genetic engineer that can transfer genes across the kingdom barriers to both prokaryotic and eukaryotic host cells. The host genes which are involved in Agrobacterium-mediated transformatiom (AMT) are not well known. Here, I studied in a systematic way to identify the

  4. Host genes involved in Agrobacterium-mediated transformation

    NARCIS (Netherlands)

    Soltani, Jalal

    2009-01-01

    Agrobacterium is the nature’s genetic engineer that can transfer genes across the kingdom barriers to both prokaryotic and eukaryotic host cells. The host genes which are involved in Agrobacterium-mediated transformatiom (AMT) are not well known. Here, I studied in a systematic way to identify the w

  5. EST analysis of Prorocentrum donghaiense with emphasis on genes involved in PCD

    Institute of Scientific and Technical Information of China (English)

    Zhang Xiufang; Liu Yongjian; Yang Guanpin; Zhu Mingyuan; Li Ruixiang

    2009-01-01

    Prorocentrum donghaiense has caused large-scale red tides off the Chinese coast in recent years. Expressed sequence tag (EST) analysis was carried out for this dinoflagellate in order to identify the functional genes involved in its biological processes. A cDNA library was constructed for P. donghaiense at exponential growth phase, and 565 usable sequencing reads were obtained from 700 clones selected randomly. Messenger RNA corresponding reads were clustered into 36 contigs and 272 singletons (EST groups). Twenty-two EST groups were found to tag the genes involved in diverse biological processes including programmed cell death (PCD). Two EST groups showed significant homologies with the encoding genes of cysteine protease (caspase) and proliferating cell nuclear antigen, respectively, two key proteins involved in PCD.

  6. Functional Analysis of the Fusarielin Biosynthetic Gene Cluster

    Directory of Open Access Journals (Sweden)

    Aida Droce

    2016-12-01

    Full Text Available Fusarielins are polyketides with a decalin core produced by various species of Aspergillus and Fusarium. Although the responsible gene cluster has been identified, the biosynthetic pathway remains to be elucidated. In the present study, members of the gene cluster were deleted individually in a Fusarium graminearum strain overexpressing the local transcription factor. The results suggest that a trans-acting enoyl reductase (FSL5 assists the polyketide synthase FSL1 in biosynthesis of a polyketide product, which is released by hydrolysis by a trans-acting thioesterase (FSL2. Deletion of the epimerase (FSL3 resulted in accumulation of an unstable compound, which could be the released product. A novel compound, named prefusarielin, accumulated in the deletion mutant of the cytochrome P450 monooxygenase FSL4. Unlike the known fusarielins from Fusarium, this compound does not contain oxygenized decalin rings, suggesting that FSL4 is responsible for the oxygenation.

  7. Evaluation of clustering algorithms for gene expression data using gene ontology annotations

    Institute of Scientific and Technical Information of China (English)

    MA Ning; ZHANG Zheng-guo

    2012-01-01

    Background Clustering is a useful exploratory technique for interpreting gene expression data to reveal groups of genes sharing common functional attributes.Biologists frequently face the problem of choosing an appropriate algorithm.We aimed to provide a standalone,easily accessible and biologically oriented criterion for expression data clustering evaluation.Methods An external criterion utilizing annotation based similarities between genes is proposed in this work.Gene ontology information is employed as the annotation source.Comparisons among six widely used clustering algorithms over various types of gene expression data sets were carried out based on the criterion proposed.Results The rank of these algorithms given by the criterion coincides with our common knowledge.Single-linkage has significantly poorer performance,even worse than the random algorithm.Ward's method archives the best performance in most cases.Conclusions The criterion proposed has a strong ability to distinguish among different clustering algorithms with different distance measurements.It is also demonstrated that analyzing main contributors of the criterion may offer some guidelines in finding local compact clusters.As an addition,we suggest using Ward's algorithm for gene expression data analysis.

  8. Comparative genomic analysis of sixty mycobacteriophage genomes: Genome clustering, gene acquisition and gene size

    Science.gov (United States)

    Hatfull, Graham F.; Jacobs-Sera, Deborah; Lawrence, Jeffrey G.; Pope, Welkin H.; Russell, Daniel A.; Ko, Ching-Chung; Weber, Rebecca J.; Patel, Manisha C.; Germane, Katherine L.; Edgar, Robert H.; Hoyte, Natasha N.; Bowman, Charles A.; Tantoco, Anthony T.; Paladin, Elizabeth C.; Myers, Marlana S.; Smith, Alexis L.; Grace, Molly S.; Pham, Thuy T.; O'Brien, Matthew B.; Vogelsberger, Amy M.; Hryckowian, Andrew J.; Wynalek, Jessica L.; Donis-Keller, Helen; Bogel, Matt W.; Peebles, Craig L.; Cresawn, Steve G.; Hendrix, Roger W.

    2010-01-01

    Mycobacteriophages are viruses that infect mycobacterial hosts. Expansion of a collection of sequenced phage genomes to a total of sixty – all infecting a common bacterial host – provides further insight into their diversity and evolution. Of the sixty phage genomes, 55 can be grouped into nine clusters according to their nucleotide sequence similarities, five of which can be further divided into subclusters; five genomes do not cluster with other phages. The sequence diversity between genomes within a cluster varies greatly; for example, the six genomes in cluster D share more than 97.5% average nucleotide similarity with each other. In contrast, similarity between the two genomes in Cluster I is barely detectable by diagonal plot analysis. The total of 6,858 predicted ORFs have been grouped into 1523 phamilies (phams) of related sequences, 46% of which possess only a single member. Only 18.8% of the phams have sequence similarity to non-mycobacteriophage database entries and fewer than 10% of all phams can be assigned functions based on database searching or synteny. Genome clustering facilitates the identification of genes that are in greatest genetic flux and are more likely to have been exchanged horizontally in relatively recent evolutionary time. Although mycobacteriophage genes exhibit smaller average size than genes of their host (205 residues compared to 315), phage genes in higher flux average only ∼100 amino acids, suggesting that the primary units of genetic exchange correspond to single protein domains. PMID:20064525

  9. Genome-scale analysis of positional clustering of mouse testis-specific genes

    Directory of Open Access Journals (Sweden)

    Lee Bernett TK

    2005-01-01

    Full Text Available Abstract Background Genes are not randomly distributed on a chromosome as they were thought even after removal of tandem repeats. The positional clustering of co-expressed genes is known in prokaryotes and recently reported in several eukaryotic organisms such as Caenorhabditis elegans, Drosophila melanogaster, and Homo sapiens. In order to further investigate the mode of tissue-specific gene clustering in higher eukaryotes, we have performed a genome-scale analysis of positional clustering of the mouse testis-specific genes. Results Our computational analysis shows that a large proportion of testis-specific genes are clustered in groups of 2 to 5 genes in the mouse genome. The number of clusters is much higher than expected by chance even after removal of tandem repeats. Conclusion Our result suggests that testis-specific genes tend to cluster on the mouse chromosomes. This provides another piece of evidence for the hypothesis that clusters of tissue-specific genes do exist.

  10. Transcriptome analysis reveals key differentially expressed genes involved in wheat grain development

    Institute of Scientific and Technical Information of China (English)

    Yonglong Yu; Dong Zhu; Chaoying Ma; Hui Cao; Yaping Wang; Yanhao Xu; Wenying Zhang; Yueming Yan

    2016-01-01

    Wheat seed development is an important physiological process of seed maturation and directly affects wheat yield and quality. In this study, we performed dynamic transcriptome microarray analysis of an elite Chinese bread wheat cultivar (Jimai 20) during grain development using the GeneChip Wheat Genome Array. Grain morphology and scanning electron microscope observations showed that the period of 11–15 days post-anthesis (DPA) was a key stage for the synthesis and accumulation of seed starch. Genome-wide transcriptional profiling and significance analysis of microarrays revealed that the period from 11 to 15 DPA was more important than the 15–20 DPA stage for the synthesis and accumulation of nutritive reserves. Series test of cluster analysis of differential genes revealed five statistically significant gene expression profiles. Gene ontology annotation and enrichment analysis gave further informa-tion about differentially expressed genes, and MapMan analysis revealed expression changes within functional groups during seed development. Metabolic pathway network analysis showed that major and minor metabolic pathways regulate one another to ensure regular seed development and nutritive reserve accumulation. We performed gene co-expression network analysis to identify genes that play vital roles in seed development and identified several key genes involved in important metabolic pathways. The transcriptional expression of eight key genes involved in starch and protein synthesis and stress defense was further validated by qRT-PCR. Our results provide new insight into the molecular mechanisms of wheat seed development and the determinants of yield and quality.

  11. Transcriptome analysis reveals key differentially expressed genes involved in wheat grain development

    Directory of Open Access Journals (Sweden)

    Yonglong Yu

    2016-04-01

    Full Text Available Wheat seed development is an important physiological process of seed maturation and directly affects wheat yield and quality. In this study, we performed dynamic transcriptome microarray analysis of an elite Chinese bread wheat cultivar (Jimai 20 during grain development using the GeneChip Wheat Genome Array. Grain morphology and scanning electron microscope observations showed that the period of 11–15 days post-anthesis (DPA was a key stage for the synthesis and accumulation of seed starch. Genome-wide transcriptional profiling and significance analysis of microarrays revealed that the period from 11 to 15 DPA was more important than the 15–20 DPA stage for the synthesis and accumulation of nutritive reserves. Series test of cluster analysis of differential genes revealed five statistically significant gene expression profiles. Gene ontology annotation and enrichment analysis gave further information about differentially expressed genes, and MapMan analysis revealed expression changes within functional groups during seed development. Metabolic pathway network analysis showed that major and minor metabolic pathways regulate one another to ensure regular seed development and nutritive reserve accumulation. We performed gene co-expression network analysis to identify genes that play vital roles in seed development and identified several key genes involved in important metabolic pathways. The transcriptional expression of eight key genes involved in starch and protein synthesis and stress defense was further validated by qRT-PCR. Our results provide new insight into the molecular mechanisms of wheat seed development and the determinants of yield and quality.

  12. Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

    OpenAIRE

    Zhimin Dai; Xue Guo; Huaqun Yin; Yili Liang; Jing Cong; Xueduan Liu

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large...

  13. Identification of genes directly involved in shell formation and their functions in pearl oyster, Pinctada fucata.

    Directory of Open Access Journals (Sweden)

    Dong Fang

    Full Text Available Mollusk shell formation is a fascinating aspect of biomineralization research. Shell matrix proteins play crucial roles in the control of calcium carbonate crystallization during shell formation in the pearl oyster, Pinctada fucata. Characterization of biomineralization-related genes during larval development could enhance our understanding of shell formation. Genes involved in shell biomineralization were isolated by constructing three suppression subtractive hybridization (SSH libraries that represented genes expressed at key points during larval shell formation. A total of 2,923 ESTs from these libraries were sequenced and gave 990 unigenes. Unigenes coding for secreted proteins and proteins with tandem-arranged repeat units were screened in the three SSH libraries. A set of sequences coding for genes involved in shell formation was obtained. RT-PCR and in situ hybridization assays were carried out on five genes to investigate their spatial expression in several tissues, especially the mantle tissue. They all showed a different expression pattern from known biomineralization-related genes. Inhibition of the five genes by RNA interference resulted in different defects of the nacreous layer, indicating that they all were involved in aragonite crystallization. Intriguingly, one gene (UD_Cluster94.seq.Singlet1 was restricted to the 'aragonitic line'. The current data has yielded for the first time, to our knowledge, a suite of biomineralization-related genes active during the developmental stages of P. fucata, five of which were responsible for nacreous layer formation. This provides a useful starting point for isolating new genes involved in shell formation. The effects of genes on the formation of the 'aragonitic line', and other areas of the nacreous layer, suggests a different control mechanism for aragonite crystallization initiation from that of mature aragonite growth.

  14. Cloning large natural product gene clusters from the environment: Piecing environmental DNA gene clusters back together with TAR

    OpenAIRE

    Kim, Jeffrey H.; Feng, Zhiyang; Bauer, John D.; Kallifidas, Dimitris; Calle, Paula Y.; Brady, Sean F

    2010-01-01

    A single gram of soil can contain thousands of unique bacterial species, of which only a small fraction is regularly cultured in the laboratory. Although the fermentation of cultured microorganisms has provided access to numerous bioactive secondary metabolites, with these same methods it is not possible to characterize the natural products encoded by the uncultured majority. The heterologous expression of biosynthetic gene clusters cloned from DNA extracted directly from environmental sample...

  15. Acquisition and Evolution of Plant Pathogenesis–Associated Gene Clusters and Candidate Determinants of Tissue-Specificity in Xanthomonas

    Science.gov (United States)

    Van Sluys, Marie-Anne; White, Frank F.; Ryan, Robert P.; Dow, J. Maxwell; Rabinowicz, Pablo; Salzberg, Steven L.; Leach, Jan E.; Sonti, Ramesh; Brendel, Volker; Bogdanove, Adam J.

    2008-01-01

    Background Xanthomonas is a large genus of plant-associated and plant-pathogenic bacteria. Collectively, members cause diseases on over 392 plant species. Individually, they exhibit marked host- and tissue-specificity. The determinants of this specificity are unknown. Methodology/Principal Findings To assess potential contributions to host- and tissue-specificity, pathogenesis-associated gene clusters were compared across genomes of eight Xanthomonas strains representing vascular or non-vascular pathogens of rice, brassicas, pepper and tomato, and citrus. The gum cluster for extracellular polysaccharide is conserved except for gumN and sequences downstream. The xcs and xps clusters for type II secretion are conserved, except in the rice pathogens, in which xcs is missing. In the otherwise conserved hrp cluster, sequences flanking the core genes for type III secretion vary with respect to insertion sequence element and putative effector gene content. Variation at the rpf (regulation of pathogenicity factors) cluster is more pronounced, though genes with established functional relevance are conserved. A cluster for synthesis of lipopolysaccharide varies highly, suggesting multiple horizontal gene transfers and reassortments, but this variation does not correlate with host- or tissue-specificity. Phylogenetic trees based on amino acid alignments of gum, xps, xcs, hrp, and rpf cluster products generally reflect strain phylogeny. However, amino acid residues at four positions correlate with tissue specificity, revealing hpaA and xpsD as candidate determinants. Examination of genome sequences of xanthomonads Xylella fastidiosa and Stenotrophomonas maltophilia revealed that the hrp, gum, and xcs clusters are recent acquisitions in the Xanthomonas lineage. Conclusions/Significance Our results provide insight into the ancestral Xanthomonas genome and indicate that differentiation with respect to host- and tissue-specificity involved not major modifications or wholesale

  16. Acquisition and evolution of plant pathogenesis-associated gene clusters and candidate determinants of tissue-specificity in xanthomonas.

    Directory of Open Access Journals (Sweden)

    Hong Lu

    Full Text Available BACKGROUND: Xanthomonas is a large genus of plant-associated and plant-pathogenic bacteria. Collectively, members cause diseases on over 392 plant species. Individually, they exhibit marked host- and tissue-specificity. The determinants of this specificity are unknown. METHODOLOGY/PRINCIPAL FINDINGS: To assess potential contributions to host- and tissue-specificity, pathogenesis-associated gene clusters were compared across genomes of eight Xanthomonas strains representing vascular or non-vascular pathogens of rice, brassicas, pepper and tomato, and citrus. The gum cluster for extracellular polysaccharide is conserved except for gumN and sequences downstream. The xcs and xps clusters for type II secretion are conserved, except in the rice pathogens, in which xcs is missing. In the otherwise conserved hrp cluster, sequences flanking the core genes for type III secretion vary with respect to insertion sequence element and putative effector gene content. Variation at the rpf (regulation of pathogenicity factors cluster is more pronounced, though genes with established functional relevance are conserved. A cluster for synthesis of lipopolysaccharide varies highly, suggesting multiple horizontal gene transfers and reassortments, but this variation does not correlate with host- or tissue-specificity. Phylogenetic trees based on amino acid alignments of gum, xps, xcs, hrp, and rpf cluster products generally reflect strain phylogeny. However, amino acid residues at four positions correlate with tissue specificity, revealing hpaA and xpsD as candidate determinants. Examination of genome sequences of xanthomonads Xylella fastidiosa and Stenotrophomonas maltophilia revealed that the hrp, gum, and xcs clusters are recent acquisitions in the Xanthomonas lineage. CONCLUSIONS/SIGNIFICANCE: Our results provide insight into the ancestral Xanthomonas genome and indicate that differentiation with respect to host- and tissue-specificity involved not major

  17. ESTs analysis reveals putative genes involved in symbiotic seed germination in Dendrobium officinale.

    Directory of Open Access Journals (Sweden)

    Ming-Ming Zhao

    Full Text Available Dendrobiumofficinale (Orchidaceae is one of the world's most endangered plants with great medicinal value. In nature, D. officinale seeds must establish symbiotic relationships with fungi to germinate. However, the molecular events involved in the interaction between fungus and plant during this process are poorly understood. To isolate the genes involved in symbiotic germination, a suppression subtractive hybridization (SSH cDNA library of symbiotically germinated D. officinale seeds was constructed. From this library, 1437 expressed sequence tags (ESTs were clustered to 1074 Unigenes (including 902 singletons and 172 contigs, which were searched against the NCBI non-redundant (NR protein database (E-value cutoff, e(-5. Based on sequence similarity with known proteins, 579 differentially expressed genes in D. officinale were identified and classified into different functional categories by Gene Ontology (GO, Clusters of orthologous Groups of proteins (COGs and Kyoto Encyclopedia of Genes and Genomes (KEGG pathways. The expression levels of 15 selected genes emblematic of symbiotic germination were confirmed via real-time quantitative PCR. These genes were classified into various categories, including defense and stress response, metabolism, transcriptional regulation, transport process and signal transduction pathways. All transcripts were upregulated in the symbiotically germinated seeds (SGS. The functions of these genes in symbiotic germination were predicted. Furthermore, two fungus-induced calcium-dependent protein kinases (CDPKs, which were upregulated 6.76- and 26.69-fold in SGS compared with un-germinated seeds (UGS, were cloned from D. officinale and characterized for the first time. This study provides the first global overview of genes putatively involved in D. officinale symbiotic seed germination and provides a foundation for further functional research regarding symbiotic relationships in orchids.

  18. Nitrate assimilation gene cluster from the heterocyst-forming cyanobacterium Anabaena sp. strain PCC 7120.

    Science.gov (United States)

    Frías, J E; Flores, E; Herrero, A

    1997-01-01

    A region of the genome of the filamentous, nitrogen-fixing, heterocyst-forming cyanobacterium Anabaena sp. strain PCC 7120 that contains a cluster of genes involved in nitrate assimilation has been identified. The genes nir, encoding nitrite reductase, and nrtABC, encoding elements of a nitrate permease, have been cloned. Insertion of a gene cassette into the nir-nrtA region impaired expression of narB, the nitrate reductase structural gene which together with nrtD is found downstream from nrtC in the gene cluster. This indicates that the nir-nrtABCD-narB genes are cotranscribed, thus constituting an operon. Expression of the nir operon in strain PCC 7120 is subjected to ammonium-promoted repression and takes place from an NtcA-activated promoter located 460 bp upstream from the start of the nir gene. In the absence of ammonium, cellular levels of the products of the nir operon are higher in the presence of nitrate than in the absence of combined nitrogen.

  19. A polyketide synthase-peptide synthetase gene cluster from an uncultured bacterial symbiont of Paederus beetles.

    Science.gov (United States)

    Piel, Jörn

    2002-10-29

    Many drug candidates from marine and terrestrial invertebrates are suspected metabolites of uncultured bacterial symbionts. The antitumor polyketides of the pederin family, isolated from beetles and sponges, are an example. Drug development from such sources is commonly hampered by low yields and the difficulty of sustaining invertebrate cultures. To obtain insight into the true producer and find alternative supplies of these rare drug candidates, the putative pederin biosynthesis genes were cloned from total DNA of Paederus fuscipes beetles, which use this compound for chemical defense. Sequence analysis of the gene cluster and adjacent regions revealed the presence of ORFs with typical bacterial architecture and homologies. The ped cluster, which is present only in beetle specimens with high pederin content, is located on a 54-kb region bordered by transposase pseudogenes and encodes a mixed modular polyketide synthase/nonribosomal peptide synthetase. Notably, none of the modules contains regions with homology to acyltransferase domains, but two copies of isolated monodomain acyltransferase genes were found at the upstream end of the cluster. In line with an involvement in pederin biosynthesis, the upstream cluster region perfectly mirrors pederin structure. The unexpected presence of additional polyketide synthase/nonribosomal peptide synthetase modules reveals surprising insights into the evolutionary relationship between pederin-type pathways in beetles and sponges.

  20. Engineered Streptomyces avermitilis host for heterologous expression of biosynthetic gene cluster for secondary metabolites.

    Science.gov (United States)

    Komatsu, Mamoru; Komatsu, Kyoko; Koiwai, Hanae; Yamada, Yuuki; Kozone, Ikuko; Izumikawa, Miho; Hashimoto, Junko; Takagi, Motoki; Omura, Satoshi; Shin-ya, Kazuo; Cane, David E; Ikeda, Haruo

    2013-07-19

    An industrial microorganism, Streptomyces avermitilis, which is a producer of anthelmintic macrocyclic lactones, avermectins, has been constructed as a versatile model host for heterologous expression of genes encoding secondary metabolite biosynthesis. Twenty of the entire biosynthetic gene clusters for secondary metabolites were successively cloned and introduced into a versatile model host S. avermitilis SUKA17 or 22. Almost all S. avermitilis transformants carrying the entire gene cluster produced metabolites as a result of the expression of biosynthetic gene clusters introduced. A few transformants were unable to produce metabolites, but their production was restored by the expression of biosynthetic genes using an alternative promoter or the expression of a regulatory gene in the gene cluster that controls the expression of biosynthetic genes in the cluster using an alternative promoter. Production of metabolites in some transformants of the versatile host was higher than that of the original producers, and cryptic biosynthetic gene clusters in the original producer were also expressed in a versatile host.

  1. Metabolic diversification--independent assembly of operon-like gene clusters in different plants.

    Science.gov (United States)

    Field, Ben; Osbourn, Anne E

    2008-04-25

    Operons are clusters of unrelated genes with related functions that are a feature of prokaryotic genomes. Here, we report on an operon-like gene cluster in the plant Arabidopsis thaliana that is required for triterpene synthesis (the thalianol pathway). The clustered genes are coexpressed, as in bacterial operons. However, despite the resemblance to a bacterial operon, this gene cluster has been assembled from plant genes by gene duplication, neofunctionalization, and genome reorganization, rather than by horizontal gene transfer from bacteria. Furthermore, recent assembly of operon-like gene clusters for triterpene synthesis has occurred independently in divergent plant lineages (Arabidopsis and oat). Thus, selection pressure may act during the formation of certain plant metabolic pathways to drive gene clustering.

  2. Clusters of Factors Identify A High Prevalence of Pregnancy Involvement Among US Adolescent Males.

    Science.gov (United States)

    Lau, May; Lin, Hua; Flores, Glenn

    2015-08-01

    The study purpose was to use recursive partitioning analysis (RPA) to identify factors that, when clustered, are associated with a high prevalence of pregnancy involvement among US adolescent males. The National Survey of Family Growth is a nationally representative survey of individuals 15-44 years old. RPA was done for the 2002 and 2006-2010 cycles to identify factors which, when combined, identify adolescent males with the highest prevalence of pregnancy involvement. Pregnancy-involvement prevalence among adolescent males was 6 %. Two clusters of adolescent males have the highest pregnancy-involvement prevalence, at 84-87 %. In RPA, the highest pregnancy-involvement prevalence (87 %) was seen in adolescent males who ever HIV tested, had >4 lifetime sexual partners, reported less than an almost certain chance of feeling less physical pleasure with condom use, had an educational attainment of 4 lifetime sexual partners, reported less than an almost certain chance of feeling less physical pleasure with condom use, had an educational attainment ≥11th grade, were >17 years old, and had their first contraceptive education ≥10th grade, had a pregnancy-involvement prevalence of 84 %. Pregnancy-prevention efforts among adolescent males who have been involved in a pregnancy may need to target risk factors identified in clusters with the highest pregnancy prevalence to prevent subsequent pregnancies in these adolescent males and improve their future outcomes.

  3. New type IV pili-related genes involved in early stages of Ralstonia solanacearum potato infection.

    Science.gov (United States)

    Siri, María Inés; Sanabria, Analía; Boucher, Christian; Pianzzola, María Julia

    2014-07-01

    This study provides insights into the pathogenesis of Ralstonia solanacearum, in particular with regards to strains belonging to phylotype IIB, sequevar 1 (IIB-1) and their interaction with potato, its natural host. We performed a comparative genomic analysis among IIB-1 R. solanacearum strains with different levels of virulence in order to identify candidate virulence genes. With this approach, we identified a 33.7-kb deletion in a strain showing reduced virulence on potato. This region contains a cluster of six genes putatively involved in type IV pili (Tfp) biogenesis. Functional analysis suggests that these proteins contribute to several Tfp-related functions such as twitching motility and biofilm formation. In addition, this genetic cluster was found to contribute to early bacterial wilt pathogenesis and colonization fitness of potato roots.

  4. Data Preprocessing in Cluster Analysis of Gene Expression

    Institute of Scientific and Technical Information of China (English)

    杨春梅; 万柏坤; 高晓峰

    2003-01-01

    Considering that the DNA microarray technology has generated explosive gene expression data and that it is urgent to analyse and to visualize such massive datasets with efficient methods, we investigate the data preprocessing methods used in cluster analysis, normalization or logarithm of the matrix, by using hierarchical clustering, principal component analysis (PCA) and self-organizing maps (SOMs). The results illustrate that when using the Euclidean distance as measuring metrics, logarithm of relative expression level is the best preprocessing method, while data preprocessed by normalization cannot attain the expected results because the data structure is ruined. If there are only a few principal components, the PCA is an effective method to extract the frame structure, while SOMs are more suitable for a specific structure.

  5. Global Analysis of miRNA Gene Clusters and Gene Families Reveals Dynamic and Coordinated Expression

    Directory of Open Access Journals (Sweden)

    Li Guo

    2014-01-01

    Full Text Available To further understand the potential expression relationships of miRNAs in miRNA gene clusters and gene families, a global analysis was performed in 4 paired tumor (breast cancer and adjacent normal tissue samples using deep sequencing datasets. The compositions of miRNA gene clusters and families are not random, and clustered and homologous miRNAs may have close relationships with overlapped miRNA species. Members in the miRNA group always had various expression levels, and even some showed larger expression divergence. Despite the dynamic expression as well as individual difference, these miRNAs always indicated consistent or similar deregulation patterns. The consistent deregulation expression may contribute to dynamic and coordinated interaction between different miRNAs in regulatory network. Further, we found that those clustered or homologous miRNAs that were also identified as sense and antisense miRNAs showed larger expression divergence. miRNA gene clusters and families indicated important biological roles, and the specific distribution and expression further enrich and ensure the flexible and robust regulatory network.

  6. Identification and structural analysis of a novel snoRNA gene cluster from Arabidopsis thaliana

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    A Z2 snoRNA gene cluster,consisting of four antisense snoRNA genes, was identified from Arabidopsis thaliana. The sequence and structural analysis showed that the Z2 snoRNA gene cluster might be transcribed as a polycistronic precursor from an upstream promoter, and the intergenic spacers of the gene cluster encode the 'hairpin' structures similar to the processing recognition signals of yeast Saccharomyces cerevisiae polycistronic snoRNA precursor. The results also revealed that plant snoRNA gene with multiple copies is a characteristic in common, and provides a good system for further revealing the transcription and expression mechanism of plant snoRNA gene cluster.

  7. Genes involved in convergent evolution of eusociality in bees.

    Science.gov (United States)

    Woodard, S Hollis; Fischman, Brielle J; Venkat, Aarti; Hudson, Matt E; Varala, Kranthi; Cameron, Sydney A; Clark, Andrew G; Robinson, Gene E

    2011-05-03

    Eusociality has arisen independently at least 11 times in insects. Despite this convergence, there are striking differences among eusocial lifestyles, ranging from species living in small colonies with overt conflict over reproduction to species in which colonies contain hundreds of thousands of highly specialized sterile workers produced by one or a few queens. Although the evolution of eusociality has been intensively studied, the genetic changes involved in the evolution of eusociality are relatively unknown. We examined patterns of molecular evolution across three independent origins of eusociality by sequencing transcriptomes of nine socially diverse bee species and combining these data with genome sequence from the honey bee Apis mellifera to generate orthologous sequence alignments for 3,647 genes. We found a shared set of 212 genes with a molecular signature of accelerated evolution across all eusocial lineages studied, as well as unique sets of 173 and 218 genes with a signature of accelerated evolution specific to either highly or primitively eusocial lineages, respectively. These results demonstrate that convergent evolution can involve a mosaic pattern of molecular changes in both shared and lineage-specific sets of genes. Genes involved in signal transduction, gland development, and carbohydrate metabolism are among the most prominent rapidly evolving genes in eusocial lineages. These findings provide a starting point for linking specific genetic changes to the evolution of eusociality.

  8. Bacillus sp.CDB3 isolated from cattle dip-sites possesses two ars gene clusters

    Institute of Scientific and Technical Information of China (English)

    Somanath Bhat; Xi Luo; Zhiqiang Xu; Lixia Liu; Ren Zhang

    2011-01-01

    Contamination of soil and water by arsenic is a global problem.In Australia, the dipping of cattle in arsenic-containing solution to control cattle ticks in last centenary has left many sites heavily contaminated with arsenic and other toxicants.We had previously isolated five soil bacterial strains (CDB1-5) highly resistant to arsenic.To understand the resistance mechanism, molecular studies have been carried out.Two chromosome-encoded arsenic resistance (ars) gene clusters have been cloned from CDB3 (Bacillus sp.).They both function in Escherichia coli and cluster 1 exerts a much higher resistance to the toxic metalloid.Cluster 2 is smaller possessing four open reading frames (ORFs) arsRorf2BC, similar to that identified in Bacillus subtilis Skin element.Among the eight ORFs in cluster 1 five are analogs of common ars genes found in other bacteria, however, organized in a unique order arsRBCDA instead of arsRDABC.Three other putative genes are located directly downstream and designated as arsTIP based on the homologies of their theoretical translation sequences respectively to thioredoxin reductases, iron-sulphur cluster proteins and protein phosphatases.The latter two are novel of any known ars operons.The arsD gene from Bacillus species was cloned for the first time and the predict protein differs from the well studied E.coli ArsD by lacking two pairs of C-terrninal cysteine residues.Its functional involvement in arsenic resistance has been confirmed by a deletion experiment.There exists also an inverted repeat in the intergenic region between arsC and arsD implying some unknown transcription regulation.

  9. [Obesity based on mutation of genes involved in energy balance].

    Science.gov (United States)

    Hainerová, I

    2007-01-01

    Within the last decade an intensive research led to an identification of several genes which are involved in a regulation of energy balance. In most cases, carriers of these gene mutations do not exhibit further characteristic phenotypic features except for a severe obesity. Obesity based on mutation of one gene product is called monogenic obesity. Mutations in genes for leptin, leptin receptor, proopiomelanocortin, prohormone convertase 1, melanocortin 4 and 3 receptor disrupt the physiological humoral signalization between peripheral signals and the hypothalamic centres of satiety and hunger. Defects of all above mentioned genes lead to phenotype of abnormal eating behaviour followed by a development of severe early-onset obesity. Mutations of melanocortin 4 receptor gene represent the most common cause of monogenic obesity because they are detected in almost 6 % children with early-onset severe obesity. Mutations of the other genes involved in energy homeostasis are very rare. Although these mutations are sporadic we assume that further research of monogenic forms of obesity might lead to our understanding of physiology and pathophysiology of regulation of the energy homeostasis and eating behaviour. Additionally, they may open new approach to the management of eating behaviour and to the treatment of obesity.

  10. Involvement of astrocyte and oligodendrocyte gene sets in migraine.

    Science.gov (United States)

    Eising, Else; de Leeuw, Christiaan; Min, Josine L; Anttila, Verneri; Verheijen, Mark Hg; Terwindt, Gisela M; Dichgans, Martin; Freilinger, Tobias; Kubisch, Christian; Ferrari, Michel D; Smit, August B; de Vries, Boukje; Palotie, Aarno; van den Maagdenberg, Arn Mjm; Posthuma, Danielle

    2016-06-01

    Migraine is a common episodic brain disorder characterized by recurrent attacks of severe unilateral headache and additional neurological symptoms. Two main migraine types can be distinguished based on the presence of aura symptoms that can accompany the headache: migraine with aura and migraine without aura. Multiple genetic and environmental factors confer disease susceptibility. Recent genome-wide association studies (GWAS) indicate that migraine susceptibility genes are involved in various pathways, including neurotransmission, which have already been implicated in genetic studies of monogenic familial hemiplegic migraine, a subtype of migraine with aura. To further explore the genetic background of migraine, we performed a gene set analysis of migraine GWAS data of 4954 clinic-based patients with migraine, as well as 13,390 controls. Curated sets of synaptic genes and sets of genes predominantly expressed in three glial cell types (astrocytes, microglia and oligodendrocytes) were investigated. Our results show that gene sets containing astrocyte- and oligodendrocyte-related genes are associated with migraine, which is especially true for gene sets involved in protein modification and signal transduction. Observed differences between migraine with aura and migraine without aura indicate that both migraine types, at least in part, seem to have a different genetic background. © International Headache Society 2015.

  11. Developmental expression and gene/enzyme identifications in the alpha esterase gene cluster of Drosophila melanogaster.

    Science.gov (United States)

    Campbell, P M; de Q Robin, G C; Court, L N; Dorrian, S J; Russell, R J; Oakeshott, J G

    2003-10-01

    Here we show how the 10 genes of the alpha esterase cluster of Drosophila melanogaster have diverged substantially in their expression profiles. Together with previously described sequence divergence this suggests substantial functional diversification. By peptide mass fingerprinting and in vitro gene expression we have also shown that two of the genes encode the isozymes EST9 (formerly ESTC) and EST23. EST9 is the major 'alpha staining' esterase in zymograms of gut tissues in feeding stages while orthologues of EST23 confer resistance to organophosphorus insecticides in other higher Diptera. The results for EST9 and EST23 concur with previous suggestions that the products of the alpha esterase cluster function in digestion and detoxification of xenobiotic esters. However, many of the other genes in the cluster show developmental or tissue-specific expression that seems inconsistent with such roles. Furthermore, there is generally poor correspondence between the mRNA expression patterns of the remaining eight genes and isozymes previously characterized by standard techniques of electrophoresis and staining, suggesting that the alpha cluster might only account for a small minority of the esterase isozyme profile.

  12. Self-cloning in Streptomyces griseus of an str gene cluster for streptomycin biosynthesis and streptomycin resistance.

    OpenAIRE

    Ohnuki, T; Imanaka, T; Aiba, S

    1985-01-01

    An str gene cluster containing at least four genes (strR, strA, strB, and strC) involved in streptomycin biosynthesis or streptomycin resistance or both was self-cloned in Streptomyces griseus by using plasmid pOA154. The strA gene was verified to encode streptomycin 6-phosphotransferase, a streptomycin resistance factor in S. griseus, by examining the gene product expressed in Escherichia coli. The other three genes were determined by complementation tests with streptomycin-nonproducing muta...

  13. Coupled Two-Way Clustering Analysis of Breast Cancer and Colon Cancer Gene Expression Data

    CERN Document Server

    Getz, G; Kela, I; Domany, E; Notterman, D A; Getz, Gad; Gal, Hilah; Kela, Itai; Domany, Eytan; Notterman, Dan A.

    2003-01-01

    We present and review Coupled Two Way Clustering, a method designed to mine gene expression data. The method identifies submatrices of the total expression matrix, whose clustering analysis reveals partitions of samples (and genes) into biologically relevant classes. We demonstrate, on data from colon and breast cancer, that we are able to identify partitions that elude standard clustering analysis.

  14. Copy number of pilus gene clusters in Haemophilus influenzae and variation in the hifE pilin gene.

    Science.gov (United States)

    Read, T D; Satola, S W; Opdyke, J A; Farley, M M

    1998-04-01

    Brazilian purpuric fever (BPF)-associated Haemophilus influenzae biogroup aegyptius strain F3031 contains two identical copies of a five gene cluster (hifA to hifE) encoding pili similar to well-characterized Hif fimbriae of H. influenzae type b. HifE, the putative pilus tip adhesin of F3031, shares only 40% amino acid sequence similarity with the same molecule from type b strains, whereas the other four proteins have 75 to 95% identity. To determine whether pilus cluster duplication and the hifE(F3031) allele were special features of BPF-associated bacteria, we analyzed a collection of H. influenzae strains by PCR with hifA- and hifE-specific oligonucleotides, by Southern hybridization with a hifC gene probe, and by nucleotide sequencing. The presence of two pilus clusters was limited to some H. influenzae biogroup aegyptius strains. The hifE(F3031) allele was limited to H. influenzae biogroup aegyptius. Two strains contained one copy of hifE(F3031) and one copy of a variant hifE allele. We determined the nucleotide sequences of four hifE genes from H. influenzae biogroup aegyptius and H. influenzae capsule serotypes a and c. The predicted proteins produced by these genes demonstrated only 35 to 70% identity to the three published HifE proteins from nontypeable H. influenzae, serotype b, and BPF strains. The C-terminal third of the molecules implicated in chaperone binding was the most highly conserved region. Three conserved domains in the otherwise highly variable N-terminal putative receptor-binding region of HifE were similar to conserved portions in the N terminus of Neisseria pilus adhesin PilC. We concluded that two pilus clusters and hifE(F3031) were not specific for BPF-causing H. influenzae, and we also identified portions of HifE possibly involved in binding mammalian cell receptors.

  15. Identification and Functional Analysis of the Mycophenolic Acid Gene Cluster of Penicillium roqueforti.

    Directory of Open Access Journals (Sweden)

    Abdiel Del-Cid

    Full Text Available The filamentous fungus Penicillium roqueforti is widely known as the ripening agent of blue-veined cheeses. Additionally, this fungus is able to produce several secondary metabolites, including the meroterpenoid compound mycophenolic acid (MPA. Cheeses ripened with P. roqueforti are usually contaminated with MPA. On the other hand, MPA is a commercially valuable immunosuppressant. However, to date the molecular basis of the production of MPA by P. roqueforti is still unknown. Using a bioinformatic approach, we have identified a genomic region of approximately 24.4 kbp containing a seven-gene cluster that may be involved in the MPA biosynthesis in P. roqueforti. Gene silencing of each of these seven genes (named mpaA, mpaB, mpaC, mpaDE, mpaF, mpaG and mpaH resulted in dramatic reductions in MPA production, confirming that all of these genes are involved in the biosynthesis of the compound. Interestingly, the mpaF gene, originally described in P. brevicompactum as a MPA self-resistance gene, also exerts the same function in P. roqueforti, suggesting that this gene has a dual function in MPA metabolism. The knowledge of the biosynthetic pathway of MPA in P. roqueforti will be important for the future control of MPA contamination in cheeses and the improvement of MPA production for commercial purposes.

  16. Cloning and characterization of the polyether salinomycin biosynthesis gene cluster of Streptomyces albus XM211.

    Science.gov (United States)

    Jiang, Chunyan; Wang, Hougen; Kang, Qianjin; Liu, Jing; Bai, Linquan

    2012-02-01

    Salinomycin is widely used in animal husbandry as a food additive due to its antibacterial and anticoccidial activities. However, its biosynthesis had only been studied by feeding experiments with isotope-labeled precursors. A strategy with degenerate primers based on the polyether-specific epoxidase sequences was successfully developed to clone the salinomycin gene cluster. Using this strategy, a putative epoxidase gene, slnC, was cloned from the salinomycin producer Streptomyces albus XM211. The targeted replacement of slnC and subsequent trans-complementation proved its involvement in salinomycin biosynthesis. A 127-kb DNA region containing slnC was sequenced, including genes for polyketide assembly and release, oxidative cyclization, modification, export, and regulation. In order to gain insight into the salinomycin biosynthesis mechanism, 13 gene replacements and deletions were conducted. Including slnC, 7 genes were identified as essential for salinomycin biosynthesis and putatively responsible for polyketide chain release, oxidative cyclization, modification, and regulation. Moreover, 6 genes were found to be relevant to salinomycin biosynthesis and possibly involved in precursor supply, removal of aberrant extender units, and regulation. Sequence analysis and a series of gene replacements suggest a proposed pathway for the biosynthesis of salinomycin. The information presented here expands the understanding of polyether biosynthesis mechanisms and paves the way for targeted engineering of salinomycin activity and productivity.

  17. Coordinated evolution of co-expressed gene clusters in the Drosophila transcriptome

    Directory of Open Access Journals (Sweden)

    Jones Corbin D

    2008-01-01

    Full Text Available Abstract Background Co-expression of genes that physically cluster together is a common characteristic of eukaryotic transcriptomes. This organization of transcriptomes suggests that coordinated evolution of gene expression for clustered genes may also be common. Clusters where expression evolution of each gene is not independent of their neighbors are important units for understanding transcriptome evolution. Results We used a common microarray platform to measure gene expression in seven closely related species in the Drosophila melanogaster subgroup, accounting for confounding effects of sequence divergence. To summarize the correlation structure among genes in a chromosomal region, we analyzed the fraction of variation along the first principal component of the correlation matrix. We analyzed the correlation for blocks of consecutive genes to assess patterns of correlation that may be manifest at different scales of coordinated expression. We find that expression of physically clustered genes does evolve in a coordinated manner in many locations throughout the genome. Our analysis shows that relatively few of these clusters are near heterochromatin regions and that these clusters tend to be over-dispersed relative to the rest of the genome. This suggests that these clusters are not the byproduct of local gene clustering. We also analyzed the pattern of co-expression among neighboring genes within a single Drosophila species: D. simulans. For the co-expression clusters identified within this species, we find an under-representation of genes displaying a signature of recurrent adaptive amino acid evolution consistent with previous findings. However, clusters displaying co-evolution of expression among species are enriched for adaptively evolving genes. This finding points to a tie between adaptive sequence evolution and evolution of the transcriptome. Conclusion Our results demonstrate that co-evolution of expression in gene clusters is

  18. Evolutionary formation of gene clusters by reorganization: the meleagrin/roquefortine paradigm in different fungi.

    Science.gov (United States)

    Martín, Juan F; Liras, Paloma

    2016-02-01

    The biosynthesis of secondary metabolites in fungi is catalyzed by enzymes encoded by genes linked in clusters that are frequently co-regulated at the transcriptional level. Formation of gene clusters may take place by de novo assembly of genes recruited from other cellular functions, but also novel gene clusters are formed by reorganization of progenitor clusters and are distributed by horizontal gene transfer. This article reviews (i) the published information on the roquefortine/meleagrin/neoxaline gene clusters of Penicillium chrysogenum (Penicillium rubens) and the short roquefortine cluster of Penicillium roqueforti, and (ii) the correlation of the genes present in those clusters with the enzymes and metabolites derived from these pathways. The P. chrysogenum roq/mel cluster consists of seven genes and includes a gene (roqT) encoding a 12-TMS transporter protein of the MFS family. Interestingly, the orthologous P. roquefortine gene cluster has only four genes and the roqT gene is present as a residual pseudogene that encodes only small peptides. Two of the genes present in the central region of the P. chrysogenum roq/mel cluster have been lost during the evolutionary formation of the short cluster and the order of the structural genes in the cluster has been rearranged. The two lost genes encode a N1 atom hydroxylase (nox) and a roquefortine scaffold-reorganizing oxygenase (sro). As a consequence P. roqueforti has lost the ability to convert the roquefortine-type carbon skeleton to the glandicoline/meleagrin-type scaffold and is unable to produce glandicoline B, meleagrin and neoxaline. The loss of this genetic information is not recent and occurred probably millions of years ago when a progenitor Penicillium strain got adapted to life in a few rich habitats such as cheese, fermented cereal grains or silage. P. roqueforti may be considered as a "domesticated" variant of a progenitor common to contemporary P. chrysogenum and related Penicillia.

  19. Functional clustering of time series gene expression data by Granger causality

    Science.gov (United States)

    2012-01-01

    Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425

  20. Functional clustering of time series gene expression data by Granger causality

    Directory of Open Access Journals (Sweden)

    Fujita André

    2012-10-01

    Full Text Available Abstract Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them.

  1. A study of genes involved in adipocyte differentiation.

    Science.gov (United States)

    Zhu, Shunming; Cheng, Gong; Zhu, Huolan; Guan, Gongchang

    2015-01-01

    With the use of the microarray technique, genes expressed in the late phase of adipocyte differentiation were investigated. These genes play an important role in stimulating adipocyte growth and lipid droplet formation. Therefore, they contribute a great deal to the onset of obesity. With the use of SW872 adipocytes and the microarray technique, genes related to adipocyte differentiation were tested and compared with undifferentiated preadipocytes 14 days after induction. Real-time reverse transcription polymerase chain reaction (RT-PCR) was used for confirmation. More than 21,329 transcriptors were expressed and determined, of which 1326 increased and 687 decreased undifferentiated adipocytes. Among them, 21 were highly expressed by more than 10-fold. With RT-PCR, 12 were confirmed, including apelin, CIDEC, PID1, LYRM1, ADD1, PPARγ2, ANGPTL4, ADIPOQ, ACOX1, FIP1L1, MAP3K2 and PEX14. Furthermore, genes involved in lipid metabolism, signal transduction, DNA replication, redox status and transcription factors were determined as well. Novel genes involved in adipogenesis (e.g., apelin) were detected. A variety of genes were discovered and validated with RT-PCR at the late phase of adipocyte differentiation. This may help us better understand the onset of obesity and the potential role of adipocytes in other organs.

  2. Gene prioritization and clustering by multi-view text mining.

    Science.gov (United States)

    Yu, Shi; Tranchevent, Leon-Charles; De Moor, Bart; Moreau, Yves

    2010-01-14

    Text mining has become a useful tool for biologists trying to understand the genetics of diseases. In particular, it can help identify the most interesting candidate genes for a disease for further experimental analysis. Many text mining approaches have been introduced, but the effect of disease-gene identification varies in different text mining models. Thus, the idea of incorporating more text mining models may be beneficial to obtain more refined and accurate knowledge. However, how to effectively combine these models still remains a challenging question in machine learning. In particular, it is a non-trivial issue to guarantee that the integrated model performs better than the best individual model. We present a multi-view approach to retrieve biomedical knowledge using different controlled vocabularies. These controlled vocabularies are selected on the basis of nine well-known bio-ontologies and are applied to index the vast amounts of gene-based free-text information available in the MEDLINE repository. The text mining result specified by a vocabulary is considered as a view and the obtained multiple views are integrated by multi-source learning algorithms. We investigate the effect of integration in two fundamental computational disease gene identification tasks: gene prioritization and gene clustering. The performance of the proposed approach is systematically evaluated and compared on real benchmark data sets. In both tasks, the multi-view approach demonstrates significantly better performance than other comparing methods. In practical research, the relevance of specific vocabulary pertaining to the task is usually unknown. In such case, multi-view text mining is a superior and promising strategy for text-based disease gene identification.

  3. Alanylclavam Biosynthetic Genes Are Clustered Together with One Group of Clavulanic Acid Biosynthetic Genes in Streptomyces clavuligerus▿ §

    Science.gov (United States)

    Zelyas, Nathan J.; Cai, Hui; Kwong, Thomas; Jensen, Susan E.

    2008-01-01

    Streptomyces clavuligerus produces at least five different clavam metabolites, including clavulanic acid and the methionine antimetabolite, alanylclavam. In vitro transposon mutagenesis was used to analyze a 13-kb region upstream of the known paralogue gene cluster. The paralogue cluster includes one group of clavulanic acid biosynthetic genes in S. clavuligerus. Twelve open reading frames (ORFs) were found in this area, and mutants were generated in each using either in vitro transposon or PCR-targeted mutagenesis. Mutants with defects in any of the genes orfA, orfB, orfC, or orfD were unable to produce alanylclavam but could produce all of the other clavams, including clavulanic acid. orfA encodes a predicted hydroxymethyltransferase, orfB encodes a YjgF/YER057c/UK114-family regulatory protein, orfC encodes an aminotransferase, and orfD encodes a dehydratase. All of these types of proteins are normally involved in amino acid metabolism. Mutants in orfC or orfD also accumulated a novel clavam metabolite instead of alanylclavam, and a complemented orfC mutant was able to produce trace amounts of alanylclavam while still producing the novel clavam. Mass spectrometric analyses, together with consideration of the enzymes involved in its production, led to tentative identification of the novel clavam as 8-OH-alanylclavam, an intermediate in the proposed alanylclavam biosynthetic pathway. PMID:18931110

  4. Gravitation field algorithm and its application in gene cluster

    Directory of Open Access Journals (Sweden)

    Zheng Ming

    2010-09-01

    Full Text Available Abstract Background Searching optima is one of the most challenging tasks in clustering genes from available experimental data or given functions. SA, GA, PSO and other similar efficient global optimization methods are used by biotechnologists. All these algorithms are based on the imitation of natural phenomena. Results This paper proposes a novel searching optimization algorithm called Gravitation Field Algorithm (GFA which is derived from the famous astronomy theory Solar Nebular Disk Model (SNDM of planetary formation. GFA simulates the Gravitation field and outperforms GA and SA in some multimodal functions optimization problem. And GFA also can be used in the forms of unimodal functions. GFA clusters the dataset well from the Gene Expression Omnibus. Conclusions The mathematical proof demonstrates that GFA could be convergent in the global optimum by probability 1 in three conditions for one independent variable mass functions. In addition to these results, the fundamental optimization concept in this paper is used to analyze how SA and GA affect the global search and the inherent defects in SA and GA. Some results and source code (in Matlab are publicly available at http://ccst.jlu.edu.cn/CSBG/GFA.

  5. Adaptive evolution of the FADS gene cluster within Africa.

    Directory of Open Access Journals (Sweden)

    Rasika A Mathias

    Full Text Available Long chain polyunsaturated fatty acids (LC-PUFAs are essential for brain structure, development, and function, and adequate dietary quantities of LC-PUFAs are thought to have been necessary for both brain expansion and the increase in brain complexity observed during modern human evolution. Previous studies conducted in largely European populations suggest that humans have limited capacity to synthesize brain LC-PUFAs such as docosahexaenoic acid (DHA from plant-based medium chain (MC PUFAs due to limited desaturase activity. Population-based differences in LC-PUFA levels and their product-to-substrate ratios can, in part, be explained by polymorphisms in the fatty acid desaturase (FADS gene cluster, which have been associated with increased conversion of MC-PUFAs to LC-PUFAs. Here, we show evidence that these high efficiency converter alleles in the FADS gene cluster were likely driven to near fixation in African populations by positive selection ∼85 kya. We hypothesize that selection at FADS variants, which increase LC-PUFA synthesis from plant-based MC-PUFAs, played an important role in allowing African populations obligatorily tethered to marine sources for LC-PUFAs in isolated geographic regions, to rapidly expand throughout the African continent 60-80 kya.

  6. Studies of Genes Involved in Congenital Heart Disease

    Directory of Open Access Journals (Sweden)

    Tushar K. Ghosh

    2014-05-01

    Full Text Available Congenital heart disease (CHD affects the intricate structure and function of the heart and is one of the leading causes of death in newborns. The genetic basis of CHD is beginning to emerge. Our laboratory has been engaged in identifying mutations in genes linked to CHD both in families and in sporadic cases. Over the last two decades, we have employed linkage analysis, targeted gene sequencing and genome wide association studies to identify genes involved in CHDs. Cardiac specific genes that encode transcription factors and sarcomeric proteins have been identified and linked to CHD. Functional analysis of the relevant mutant proteins has established the molecular mechanisms of CHDs in our studies.

  7. Characterisation of Campylobacter jejuni genes potentially involved in phosphonate degradation

    Directory of Open Access Journals (Sweden)

    Hartley Lauren E

    2009-06-01

    Full Text Available Abstract Potential biological roles of the Campylobacter jejuni genes cj0641, cj0774c and cj1663 were investigated. The proteins encoded by these genes showed sequence similarities to the phosphonate utilisation PhnH, K and L gene products of Escherichia coli. The genes cj0641, cj0774c and cj1663 were amplified from the pathogenic C. jejuni strain 81116, sequenced, and cloned into pGEM-T Easy vectors. Recombinant plasmids were used to disrupt each one of the genes by inserting a kanamycin resistance (KmR cassette employing site-directed mutagenesis or inverse PCR. Campylobacter jejuni 81116 isogenic mutants were generated by integration of the mutated genes into the genome of the wild-type strain. The C. jejuni mutants grew on primary isolation plates, but they could not be purified by subsequent passages owing to cell death. The mutant C. jejuni strains survived and proliferated in co-cultures with wild-type bacteria or in media in which wild-type C. jejuni had been previously grown. PCR analyses of mixed wild-type/mutant cultures served to verify the presence of the mutated gene in the genome of a fraction of the total bacterial population. The data suggested that each mutation inactivated a gene essential for survival. Rates of phosphonate catabolism in lysates of E. coli strain DH5α were determined using proton nuclear magnetic resonance spectroscopy. Whole-cell lysates of the wild-type degraded phosphonoacetate, phenylphosphonate and aminomethylphosphonate. Significant differences in the rates of phosphonate degradation were observed between lysates of wild-type E. coli, and of bacteria transformed with each one of the vectors carrying one of the C. jejuni genes, suggesting that these genes were involved in phosphonate catabolism.

  8. Inter-MAR association contributes to transcriptionally active looping events in human beta-globin gene cluster.

    Science.gov (United States)

    Wang, Li; Di, Li-Jun; Lv, Xiang; Zheng, Wei; Xue, Zheng; Guo, Zhi-Chen; Liu, De-Pei; Liang, Chi-Chuan

    2009-01-01

    Matrix attachment regions (MARs) are important in chromatin organization and gene regulation. Although it is known that there are a number of MAR elements in the beta-globin gene cluster, it is unclear that how these MAR elements are involved in regulating beta-globin genes expression. Here, we report the identification of a new MAR element at the LCR (locus control region) of human beta-globin gene cluster and the detection of the inter-MAR association within the beta-globin gene cluster. Also, we demonstrate that SATB1, a protein factor that has been implicated in the formation of network like higher order chromatin structures at some gene loci, takes part in beta-globin specific inter-MAR association through binding the specific MARs. Knocking down of SATB1 obviously reduces the binding of SATB1 to the MARs and diminishes the frequency of the inter-MAR association. As a result, the ACH establishment and the alpha-like globin genes and beta-like globin genes expressions are affected either. In summary, our results suggest that SATB1 is a regulatory factor of hemoglobin genes, especially the early differentiation genes at least through affecting the higher order chromatin structure.

  9. Hierarchical clustering of breast cancer methylomes revealed differentially methylated and expressed breast cancer genes.

    Directory of Open Access Journals (Sweden)

    I-Hsuan Lin

    Full Text Available Oncogenic transformation of normal cells often involves epigenetic alterations, including histone modification and DNA methylation. We conducted whole-genome bisulfite sequencing to determine the DNA methylomes of normal breast, fibroadenoma, invasive ductal carcinomas and MCF7. The emergence, disappearance, expansion and contraction of kilobase-sized hypomethylated regions (HMRs and the hypomethylation of the megabase-sized partially methylated domains (PMDs are the major forms of methylation changes observed in breast tumor samples. Hierarchical clustering of HMR revealed tumor-specific hypermethylated clusters and differential methylated enhancers specific to normal or breast cancer cell lines. Joint analysis of gene expression and DNA methylation data of normal breast and breast cancer cells identified differentially methylated and expressed genes associated with breast and/or ovarian cancers in cancer-specific HMR clusters. Furthermore, aberrant patterns of X-chromosome inactivation (XCI was found in breast cancer cell lines as well as breast tumor samples in the TCGA BRCA (breast invasive carcinoma dataset. They were characterized with differentially hypermethylated XIST promoter, reduced expression of XIST, and over-expression of hypomethylated X-linked genes. High expressions of these genes were significantly associated with lower survival rates in breast cancer patients. Comprehensive analysis of the normal and breast tumor methylomes suggests selective targeting of DNA methylation changes during breast cancer progression. The weak causal relationship between DNA methylation and gene expression observed in this study is evident of more complex role of DNA methylation in the regulation of gene expression in human epigenetics that deserves further investigation.

  10. Time-series clustering of gene expression in irradiated and bystander fibroblasts: an application of FBPA clustering

    Directory of Open Access Journals (Sweden)

    Markatou Marianthi

    2011-01-01

    Full Text Available Abstract Background The radiation bystander effect is an important component of the overall biological response of tissues and organisms to ionizing radiation, but the signaling mechanisms between irradiated and non-irradiated bystander cells are not fully understood. In this study, we measured a time-series of gene expression after α-particle irradiation and applied the Feature Based Partitioning around medoids Algorithm (FBPA, a new clustering method suitable for sparse time series, to identify signaling modules that act in concert in the response to direct irradiation and bystander signaling. We compared our results with those of an alternate clustering method, Short Time series Expression Miner (STEM. Results While computational evaluations of both clustering results were similar, FBPA provided more biological insight. After irradiation, gene clusters were enriched for signal transduction, cell cycle/cell death and inflammation/immunity processes; but only FBPA separated clusters by function. In bystanders, gene clusters were enriched for cell communication/motility, signal transduction and inflammation processes; but biological functions did not separate as clearly with either clustering method as they did in irradiated samples. Network analysis confirmed p53 and NF-κB transcription factor-regulated gene clusters in irradiated and bystander cells and suggested novel regulators, such as KDM5B/JARID1B (lysine (K-specific demethylase 5B and HDACs (histone deacetylases, which could epigenetically coordinate gene expression after irradiation. Conclusions In this study, we have shown that a new time series clustering method, FBPA, can provide new leads to the mechanisms regulating the dynamic cellular response to radiation. The findings implicate epigenetic control of gene expression in addition to transcription factor networks.

  11. Recurrent adenylation domain replacement in the microcystin synthetase gene cluster

    Directory of Open Access Journals (Sweden)

    Laakso Kati

    2007-10-01

    Full Text Available Abstract Background Microcystins are small cyclic heptapeptide toxins produced by a range of distantly related cyanobacteria. Microcystins are synthesized on large NRPS-PKS enzyme complexes. Many structural variants of microcystins are produced simulatenously. A recombination event between the first module of mcyB (mcyB1 and mcyC in the microcystin synthetase gene cluster is linked to the simultaneous production of microcystin variants in strains of the genus Microcystis. Results Here we undertook a phylogenetic study to investigate the order and timing of recombination between the mcyB1 and mcyC genes in a diverse selection of microcystin producing cyanobacteria. Our results provide support for complex evolutionary processes taking place at the mcyB1 and mcyC adenylation domains which recognize and activate the amino acids found at X and Z positions. We find evidence for recent recombination between mcyB1 and mcyC in strains of the genera Anabaena, Microcystis, and Hapalosiphon. We also find clear evidence for independent adenylation domain conversion of mcyB1 by unrelated peptide synthetase modules in strains of the genera Nostoc and Microcystis. The recombination events replace only the adenylation domain in each case and the condensation domains of mcyB1 and mcyC are not transferred together with the adenylation domain. Our findings demonstrate that the mcyB1 and mcyC adenylation domains are recombination hotspots in the microcystin synthetase gene cluster. Conclusion Recombination is thought to be one of the main mechanisms driving the diversification of NRPSs. However, there is very little information on how recombination takes place in nature. This study demonstrates that functional peptide synthetases are created in nature through transfer of adenylation domains without the concomitant transfer of condensation domains.

  12. Repression of genes involved in melanocyte differentiation in uveal melanoma

    Science.gov (United States)

    Bergeron, Marjorie-Allison; Champagne, Sophie; Gaudreault, Manon; Deschambeault, Alexandre

    2012-01-01

    Purpose Uveal melanoma (UM) has been the subject of intense interest due to its distinctive metastatic pattern, which involves hematogenous dissemination of cancerous cells toward the liver in 50% of patients. To search for new UM prognostic markers, the Suppressive Subtractive Hybridization (SSH) technique was used to isolate genes that are differentially expressed between UM primary tumors and normal uveal melanocytes (UVM). Methods A subtracted cDNA library was prepared using cDNA from uncultured UM primary tumors and UVM. The expression level of selected genes was further validated by cDNA microarray, semi-quantitative reverse transcription polymerase chain reaction (RT-PCR), and immunofluorescence analyses. Results One hundred-fifteen genes were identified using the SSH technique. Microarray analyses comparing the gene expression profiles of UM primary tumors to UVM validated a significant differential expression for 48% of these genes. The expression pattern of selected genes was then analyzed by semi-quantitative RT–PCR and was found to be consistent with the SSH and cDNA microarray findings. A down-regulation of genes associated with melanocyte differentiation was confirmed in UM primary tumors. Presence of undifferentiated cells in the UM was demonstrated by the expression of stem cell markers ATP-binding cassette sub-family G member 2 (ABCG2) and octamer-binding protein 4 (OCT4). Conclusions We demonstrated that the SSH technique is efficient to detect differentially expressed genes between UM and UVM. The genes identified in this study represent valuable candidates for further functional analysis in UM and should be informative in studying the biology of this tumor. In addition, deregulation of the melanocyte differentiation pathway revealed the presence of UM cells exhibiting a stem cell-like phenotype. PMID:22815634

  13. Constitutional Chromothripsis Rearrangements Involve Clustered Double-Stranded DNA Breaks and Nonhomologous Repair Mechanisms

    Directory of Open Access Journals (Sweden)

    Wigard P. Kloosterman

    2012-06-01

    Full Text Available Chromothripsis represents a novel phenomenon in the structural variation landscape of cancer genomes. Here, we analyze the genomes of ten patients with congenital disease who were preselected to carry complex chromosomal rearrangements with more than two breakpoints. The rearrangements displayed unanticipated complexity resembling chromothripsis. We find that eight of them contain hallmarks of multiple clustered double-stranded DNA breaks (DSBs on one or more chromosomes. In addition, nucleotide resolution analysis of 98 breakpoint junctions indicates that break repair involves nonhomologous or microhomology-mediated end joining. We observed that these eight rearrangements are balanced or contain sporadic deletions ranging in size between a few hundred base pairs and several megabases. The two remaining complex rearrangements did not display signs of DSBs and contain duplications, indicative of rearrangement processes involving template switching. Our work provides detailed insight into the characteristics of chromothripsis and supports a role for clustered DSBs driving some constitutional chromothripsis rearrangements.

  14. Unusual Gene Order and Organization of the Sea Urchin Hox Cluster

    OpenAIRE

    Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew; Rowen, Lee; Nesbitt, Ryan; Bloom, Scott; Rast, Jonathan P.; Berney, Kevin; Arenas-Mena, Cesar; Martinez, Pedro; Davidson, Eric H.; Peterson, Kevin J.; Hood, Leroy

    2005-01-01

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and...

  15. A hybrid distance measure for clustering expressed sequence tags originating from the same gene family.

    Directory of Open Access Journals (Sweden)

    Keng-Hoong Ng

    Full Text Available BACKGROUND: Clustering is a key step in the processing of Expressed Sequence Tags (ESTs. The primary goal of clustering is to put ESTs from the same transcript of a single gene into a unique cluster. Recent EST clustering algorithms mostly adopt the alignment-free distance measures, where they tend to yield acceptable clustering accuracies with reasonable computational time. Despite the fact that these clustering methods work satisfactorily on a majority of the EST datasets, they have a common weakness. They are prone to deliver unsatisfactory clustering results when dealing with ESTs from the genes derived from the same family. The root cause is the distance measures applied on them are not sensitive enough to separate these closely related genes. METHODOLOGY/PRINCIPAL FINDINGS: We propose a hybrid distance measure that combines the global and local features extracted from ESTs, with the aim to address the clustering problem faced by ESTs derived from the same gene family. The clustering process is implemented using the DBSCAN algorithm. We test the hybrid distance measure on the ten EST datasets, and the clustering results are compared with the two alignment-free EST clustering tools, i.e. wcd and PEACE. The clustering results indicate that the proposed hybrid distance measure performs relatively better (in terms of clustering accuracy than both EST clustering tools. CONCLUSIONS/SIGNIFICANCE: The clustering results provide support for the effectiveness of the proposed hybrid distance measure in solving the clustering problem for ESTs that originate from the same gene family. The improvement of clustering accuracies on the experimental datasets has supported the claim that the sensitivity of the hybrid distance measure is sufficient to solve the clustering problem.

  16. Polymorphisms in genes involved in neurotransmission in relation to smoking.

    Science.gov (United States)

    Arinami, T; Ishiguro, H; Onaivi, E S

    2000-12-27

    Smoking behavior is influenced by both genetic and environmental factors. The genetic contribution to smoking behavior is at least as great as its contribution to alcoholism. Much progress has been achieved in genomic research related to cigarette-smoking within recent years. Linkage studies indicate that there are several loci linked to smoking, and candidate genes that are related to neurotransmission have been examined. Possible associated genes include cytochrome P450 subfamily polypeptide 6 (CYP2A6), dopamine D(1), D(2), and D(4) receptors, dopamine transporter, and serotonin transporter genes. There are other important candidate genes but studies evaluating the link with smoking have not been reported. These include genes encoding the dopamine D(3) and D(5) receptors, serotonin receptors, tyrosine hydroxylase, trytophan 2,3-dioxygenase, opioid receptors, and cannabinoid receptors. Since smoking-related factors are extremely complex, studies of diverse populations and of many aspects of smoking behavior including initiation, maintenance, cessation, relapse, and influence of environmental factors are needed to identify smoking-associated genes. We now review genetic polymorphisms reported to be involved in neurotransmission in relation to smoking.

  17. Regulator of complement activation (RCA) gene cluster in Xenopus tropicalis.

    Science.gov (United States)

    Oshiumi, Hiroyuki; Suzuki, Yuzuru; Matsumoto, Misako; Seya, Tsukasa

    2009-05-01

    Genome and expressed sequence tag information of Xenopus tropicalis suggested that short-consensus repeat (SCR)-containing proteins are encoded by three genes that are mapped within a 300-kb downstream of PFKFB2, which is a marker gene for the regulator of complement activation (RCA) loci in human and chicken. Based on this observation, we cloned the three cDNAs of these proteins using 3'- or 5'-RACE technique. Since their primary structures and locations of the proximity to the PFKFB2 locus, we named them amphibian RCA protein (ARC) 1, 2, and 3. Expression in human HEK293 or CHO cells suggested that ARC1 is a soluble protein of Mr approximately 67 kDa, ARC2 is a membrane protein with Mr 44 kDa, and ARC3 a secretary protein with a putative transmembrane region. They were N-glycosylated during maturation. In human and chicken RCA clusters, the order in which genes for soluble, GPI-anchored, and membrane forms of SCR proteins are arranged is from the distant to proximity to the PFKFB2 gene. However, the amphibian ARC1, 2, and 3 resembled one another and did not reflect the same order found in human and chicken RCA genes. This may be due to self-duplication of ARCs to form a family, and it evolved after the amphibia separated from the ancestor of the amniotes, which possessed soluble, GPI-anchored, and membrane forms of SCR protein members. Taken together, frog possesses a RCA locus, but the constitution of the ARC proteins differs from that of the amniotes with a unique self-resemblance.

  18. Comparisons of Graph-structure Clustering Methods for Gene Expression Data

    Institute of Scientific and Technical Information of China (English)

    Zhuo FANG; Lei LIU; Jiong YANG; Qing-Ming LUO; Yi-Xue LI

    2006-01-01

    Although many numerical clustering algorithms have been applied to gene expression data analysis, the essential step is still biological interpretation by manual inspection. The correlation between genetic co-regulation and affiliation to a common biological process is what biologists expect. Here, we introduce some clustering algorithms that are based on graph structure constituted by biological knowledge. After applying a widely used dataset, we compared the result clusters of two of these algorithms in terms of the homogeneity of clusters and coherence of annotation and matching ratio. The results show that the clusters of knowledge-guided analysis are the kernel parts of the clusters of Gene Ontology (GO)-Cluster software, which contains the genes that are most expression correlative and most consistent with biological functions. Moreover, knowledge-guided analysis seems much more applicable than GO-Cluster in a larger dataset.

  19. Plant Genes Involved in Symbiotic Sinal Perception/Signal Transduction

    DEFF Research Database (Denmark)

    Binder, A; Soyano, T; Hayashi, H

    2014-01-01

    to nodule primordia formation, and the infection thread initiation in the root hairs guiding bacteria towards dividing cortical cells. This chapter focuses on the plant genes involved in the recognition of the symbiotic signal produced by rhizobia, and the downstream genes, which are part of a complex......A host genetic programme that is initiated upon recognition of specific rhizobial Nod factors governs the symbiosis of legumes with nitrogen-fixing bacteria. This programme coordinates two major developmental processes that run in parallel in legume roots: de novo cortical cell division leading...... symbiotic signalling pathway that leads to the generation of calcium spiking in the nuclear regions and activation of transcription factors controlling symbiotic genes induction...

  20. Extended genetic effects of ADH cluster genes on the risk of alcohol dependence: from GWAS to replication.

    Science.gov (United States)

    Park, Byung Lae; Kim, Jee Wook; Cheong, Hyun Sub; Kim, Lyoung Hyo; Lee, Boung Chul; Seo, Cheong Hoon; Kang, Tae-Cheon; Nam, Young-Woo; Kim, Goon-Bo; Shin, Hyoung Doo; Choi, Ihn-Geun

    2013-06-01

    Alcohol dependence (AD) is a multifactorial and polygenic disorder involving complex gene-to-gene and gene-to-environment interactions. Several genome-wide association studies have reported numerous risk factors for AD, but replication results following these studies have been controversial. To identify new candidate genes, the present study used GWAS and replication studies in a Korean cohort with AD. Genome-wide association analysis revealed that two chromosome regions on Chr. 4q22-q23 (ADH gene cluster, including ADH5, ADH4, ADH6, ADH1A, ADH1B, and ADH7) and Chr. 12q24 (ALDH2) showed multiple association signals for the risk of AD. To investigate detailed genetic effects of these ADH genes on AD, a follow-up study of the ADH gene cluster on 4q22-q23 was performed. A total of 90 SNPs, including ADH1B rs1229984 (H47R), were genotyped in an additional 975 Korean subjects. In case-control analysis, ADH1B rs1229984 (H47R) showed the most significant association with the risk of AD (p = 2.63 × 10(-21), OR = 2.35). Moreover, subsequent conditional analyses revealed that all positive associations of other ADH genes in the cluster disappeared, which suggested that ADH1B rs1229984 (H47R) might be the sole functional genetic marker across the ADH gene cluster. Our findings could provide additional information on the ADH gene cluster regarding the risk of AD, as well as a new and important insight into the genetic factors associated with AD.

  1. Burkholderia thailandensis harbors two identical rhl gene clusters responsible for the biosynthesis of rhamnolipids

    Directory of Open Access Journals (Sweden)

    Woods Donald E

    2009-12-01

    Full Text Available Abstract Background Rhamnolipids are surface active molecules composed of rhamnose and β-hydroxydecanoic acid. These biosurfactants are produced mainly by Pseudomonas aeruginosa and have been thoroughly investigated since their early discovery. Recently, they have attracted renewed attention because of their involvement in various multicellular behaviors. Despite this high interest, only very few studies have focused on the production of rhamnolipids by Burkholderia species. Results Orthologs of rhlA, rhlB and rhlC, which are responsible for the biosynthesis of rhamnolipids in P. aeruginosa, have been found in the non-infectious Burkholderia thailandensis, as well as in the genetically similar important pathogen B. pseudomallei. In contrast to P. aeruginosa, both Burkholderia species contain these three genes necessary for rhamnolipid production within a single gene cluster. Furthermore, two identical, paralogous copies of this gene cluster are found on the second chromosome of these bacteria. Both Burkholderia spp. produce rhamnolipids containing 3-hydroxy fatty acid moieties with longer side chains than those described for P. aeruginosa. Additionally, the rhamnolipids produced by B. thailandensis contain a much larger proportion of dirhamnolipids versus monorhamnolipids when compared to P. aeruginosa. The rhamnolipids produced by B. thailandensis reduce the surface tension of water to 42 mN/m while displaying a critical micelle concentration value of 225 mg/L. Separate mutations in both rhlA alleles, which are responsible for the synthesis of the rhamnolipid precursor 3-(3-hydroxyalkanoyloxyalkanoic acid, prove that both copies of the rhl gene cluster are functional, but one contributes more to the total production than the other. Finally, a double ΔrhlA mutant that is completely devoid of rhamnolipid production is incapable of swarming motility, showing that both gene clusters contribute to this phenotype. Conclusions Collectively, these

  2. Plant genes involved in harbouring symbiotic rhizobia or pathogenic nematodes.

    Science.gov (United States)

    Damiani, Isabelle; Baldacci-Cresp, Fabien; Hopkins, Julie; Andrio, Emilie; Balzergue, Sandrine; Lecomte, Philippe; Puppo, Alain; Abad, Pierre; Favery, Bruno; Hérouart, Didier

    2012-04-01

    The establishment and development of plant-microorganism interactions involve impressive transcriptomic reprogramming of target plant genes. The symbiont (Sinorhizobium meliloti) and the root knot-nematode pathogen (Meloidogyne incognita) induce the formation of new root organs, the nodule and the gall, respectively. Using laser-assisted microdissection, we specifically monitored, at the cell level, Medicago gene expression in nodule zone II cells, which are preparing to receive rhizobia, and in gall giant and surrounding cells, which play an essential role in nematode feeding and constitute the typical root swollen structure, respectively. We revealed an important reprogramming of hormone pathways and C1 metabolism in both interactions, which may play key roles in nodule and gall neoformation, rhizobia endocytosis and nematode feeding. Common functions targeted by rhizobia and nematodes were mainly down-regulated, whereas the specificity of the interaction appeared to involve up-regulated genes. Our transcriptomic results provide powerful datasets to unravel the mechanisms involved in the accommodation of rhizobia and root-knot nematodes. Moreover, they raise the question of host specificity and the evolution of plant infection mechanisms by a symbiont and a pathogen.

  3. Sites and gene products involved in lambdoid phage DNA packaging.

    Science.gov (United States)

    Smith, M P; Feiss, M

    1993-04-01

    21 is a temperate lambdoid coliphage, and the genes that encode the head proteins of lambda and 21 are descended from a common ancestral bacteriophage. The sequencing of terminase genes 1 and 2 of 21 was completed, along with that of a segment at the right end of 21 DNA that includes the R4 sequence. The R4 sequence, a site that is likely involved in termination of DNA packaging, was found to be very similar to the R4 sequences of lambda and phi 80, suggesting that R4 is a recognition site that is not phage specific. DNA packaging by 21 is dependent on a host protein, integration host factor. A series of mutations in gene 1 (her mutations), which allow integration host factor-independent DNA packaging by 21, were found to be missense changes that affect predicted alpha-helixes in gp1. gp2, the large terminase subunit, is predicted to contain an ATP-binding domain and, perhaps, a second domain important for the cos-cutting activity of terminase. orf1, an open reading frame analogous in position to FI, a lambda gene involved in DNA packaging, shares some sequence identity with FI. orf1 was inactivated with nonsense and insertion mutations; these mutations were found not to affect phage growth. 21 was also not able to complement a lambda FI mutant.

  4. Selections of data preprocessing methods and similarity metrics for gene cluster analysis

    Institute of Scientific and Technical Information of China (English)

    YANG Chunmei; WAN Baikun; GAO Xiaofeng

    2006-01-01

    Clustering is one of the major exploratory techniques for gene expression data analysis. Only with suitable similarity metrics and when datasets are properly preprocessed, can results of high quality be obtained in cluster analysis. In this study, gene expression datasets with external evaluation criteria were preprocessed as normalization by line, normalization by column or logarithm transformation by base-2, and were subsequently clustered by hierarchical clustering, k-means clustering and self-organizing maps (SOMs) with Pearson correlation coefficient or Euclidean distance as similarity metric. Finally, the quality of clusters was evaluated by adjusted Rand index. The results illustrate that k-means clustering and SOMs have distinct advantages over hierarchical clustering in gene clustering, and SOMs are a bit better than k-means when randomly initialized. It also shows that hierarchical clustering prefers Pearson correlation coefficient as similarity metric and dataset normalized by line. Meanwhile, k-means clustering and SOMs can produce better clusters with Euclidean distance and logarithm transformed datasets. These results will afford valuable reference to the implementation of gene expression cluster analysis.

  5. Sucrose in bloom-forming cyanobacteria: loss and gain of genes involved in its biosynthesis.

    Science.gov (United States)

    Kolman, María A; Salerno, Graciela L

    2016-02-01

    Bloom-forming cyanobacteria are widely distributed in freshwater ecosystems. To cope with salinity fluctuations, cyanobacteria synthesize compatible solutes, such as sucrose, to maintain the intracellular osmotic balance. The screening of cyanobacterial genomes revealed that homologues to sucrose metabolism-related genes only occur in few bloom-forming strains, mostly belonging to Nostocales and Stigonematales orders. Remarkably, among Chroococcales and Oscillatoriales strains, homologues were only found in M. aeruginosa PCC 7806 and Leptolyngbya boryana PCC 6306, suggesting a massive loss of sucrose metabolism in bloom-forming strains of these orders. After a complete functional characterization of sucrose genes in M. aeruginosa PCC 7806, we showed that sucrose metabolism depends on the expression of a gene cluster that defines a transcriptional unit, unique among all sucrose-containing cyanobacteria. It was also demonstrated that the expression of the encoding genes of sucrose-related proteins is stimulated by salt. In view of its ancestral origin in cyanobacteria, the fact that most bloom-forming strains lack sucrose metabolism indicates that the genes involved might have been lost during evolution. However, in a particular strain, like M. aeruginosa PCC 7806, sucrose synthesis genes were probably regained by horizontal gene transfer, which could be hypothesized as a response to salinity fluctuations.

  6. Enzymology of aminoglycoside biosynthesis-deduction from gene clusters.

    Science.gov (United States)

    Wehmeier, Udo F; Piepersberg, Wolfgang

    2009-01-01

    The classical aminoglycosides are, with very few exceptions, typically actinobacterial secondary metabolites with antimicrobial activities all mediated by inhibiting translation on the 30S subunit of the bacterial ribosome. Some chemically related natural products inhibit glucosidases by mimicking oligo-alpha-1,4-glucosides. The biochemistry of the aminoglycoside biosynthetic pathways is still a developing field since none of the pathways has been analyzed to completeness as yet. In this chapter we treat the enzymology of aminoglycoside biosyntheses as far as it becomes apparent from recent investigations based on the availability of DNA sequence data of biosynthetic gene clusters for all major structural classes of these bacterial metabolites. We give a more general overview of the field, including descriptions of some key enzymes in various aminoglycoside pathways, whereas in Chapter 20 provides a detailed account of the better-studied enzymology thus far known for the neomycin and butirosin pathways.

  7. pySAPC, a python package for sparse affinity propagation clustering: Application to odontogenesis whole genome time series gene-expression data.

    Science.gov (United States)

    Cao, Huojun; Amendt, Brad A

    2016-11-01

    Developmental dental anomalies are common forms of congenital defects. The molecular mechanisms of dental anomalies are poorly understood. Systematic approaches such as clustering genes based on similar expression patterns could identify novel genes involved in dental anomalies and provide a framework for understanding molecular regulatory mechanisms of these genes during tooth development (odontogenesis). A python package (pySAPC) of sparse affinity propagation clustering algorithm for large datasets was developed. Whole genome pair-wise similarity was calculated based on expression pattern similarity based on 45 microarrays of several stages during odontogenesis. pySAPC identified 743 gene clusters based on expression pattern similarity during mouse tooth development. Three clusters are significantly enriched for genes associated with dental anomalies (with FDR odontogenesis. Clustering genes based on similar expression profiles recovered several known regulatory relationships for genes involved in odontogenesis, as well as many novel genes that may be involved with the same genetic pathways as genes that have already been shown to contribute to dental defects. By using sparse similarity matrix, pySAPC use much less memory and CPU time compared with the original affinity propagation program that uses a full similarity matrix. This python package will be useful for many applications where dataset(s) are too large to use full similarity matrix. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016. Published by Elsevier B.V.

  8. Transmission permeability of tuberculosis involving immigrants, revealed by a multicentre analysis of clusters.

    Science.gov (United States)

    Alonso Rodríguez, N; Chaves, F; Iñigo, J; Bouza, E; García de Viedma, D; Andrés, S; Cías, R; Daza, R; Domingo, D; Esteban, J; García, J; Gómez Mampaso, E; Herranz, M; Palenque, E; Ruiz Serrano, M J

    2009-05-01

    In recent years, the number of cases of tuberculosis (TB) among immigrants in Spain has increased markedly, and led to this analysis of the recent transmission patterns of TB in the immigrant population in Madrid. The countries from which the highest number of immigrant cases have been reported were Ecuador (21%), Romania (16%), Morocco (12%), Peru (11%) and Bolivia (9%). Fifty-one per cent of the cases were from South America. In a multicentre study (2004-2006), IS6110 restriction fragment length polymorphism and spoligotyping were used to genotype the Mycobacterium tuberculosis isolates from 632 immigrant cases from 47 countries. A total of 183 cases (29%) were grouped into 59 clusters, which are markers of potential transmission events. Most of the clusters (81%) included patients living in different healthcare districts, and 54% of the clusters were multinational. When a sample of 478 autochthonous cases was included, 53% of the clusters involving immigrants also included autochthonous cases. This study revealed marked transmission permeability among nationalities and between the immigrant and the autochthonous populations.

  9. The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

    Science.gov (United States)

    Gautier, Aude; Le Gac, Florence; Lareyre, Jean-Jacques

    2011-02-01

    display a different cellular localization compared to that of the gsdf gene indicating that the later gene is not co-regulated. Interestingly, our study identifies new clustered genes that are specifically expressed in previtellogenic oocytes (nup54, aff1, klhl8, sdad1).

  10. Mining genes involved in insecticide resistance of Liposcelis bostrychophila Badonnel by transcriptome and expression profile analysis.

    Directory of Open Access Journals (Sweden)

    Wei Dou

    Full Text Available BACKGROUND: Recent studies indicate that infestations of psocids pose a new risk for global food security. Among the psocids species, Liposcelis bostrychophila Badonnel has gained recognition in importance because of its parthenogenic reproduction, rapid adaptation, and increased worldwide distribution. To date, the molecular data available for L. bostrychophila is largely limited to genes identified through homology. Also, no transcriptome data relevant to psocids infection is available. METHODOLOGY AND PRINCIPAL FINDINGS: In this study, we generated de novo assembly of L. bostrychophila transcriptome performed through the short read sequencing technology (Illumina. In a single run, we obtained more than 51 million sequencing reads that were assembled into 60,012 unigenes (mean size = 711 bp by Trinity. The transcriptome sequences from different developmental stages of L. bostrychophila including egg, nymph and adult were annotated with non-redundant (Nr protein database, gene ontology (GO, cluster of orthologous groups of proteins (COG, and KEGG orthology (KO. The analysis revealed three major enzyme families involved in insecticide metabolism as differentially expressed in the L. bostrychophila transcriptome. A total of 49 P450-, 31 GST- and 21 CES-specific genes representing the three enzyme families were identified. Besides, 16 transcripts were identified to contain target site sequences of resistance genes. Furthermore, we profiled gene expression patterns upon insecticide (malathion and deltamethrin exposure using the tag-based digital gene expression (DGE method. CONCLUSION: The L. bostrychophila transcriptome and DGE data provide gene expression data that would further our understanding of molecular mechanisms in psocids. In particular, the findings of this investigation will facilitate identification of genes involved in insecticide resistance and designing of new compounds for control of psocids.

  11. Strategies to identify long noncoding RNAs involved in gene regulation

    Directory of Open Access Journals (Sweden)

    Lee Catherine

    2012-11-01

    Full Text Available Abstract Long noncoding RNAs (lncRNAs have been detected in nearly every cell type and found to be fundamentally involved in many biological processes. The characterization of lncRNAs has immense potential to advance our comprehensive understanding of cellular processes and gene regulation, along with implications for the treatment of human disease. The recent ENCODE (Encyclopedia of DNA Elements study reported 9,640 lncRNA loci in the human genome, which corresponds to around half the number of protein-coding genes. Because of this sheer number and their functional diversity, it is crucial to identify a pool of potentially relevant lncRNAs early on in a given study. In this review, we evaluate the methods for isolating lncRNAs by immunoprecipitation and review the advantages, disadvantages, and applications of three widely used approaches – microarray, tiling array, and RNA-seq – for identifying lncRNAs involved in gene regulation. We also look at ways in which data from publicly available databases such as ENCODE can support the study of lncRNAs.

  12. Physical and genetic map of the major nif gene cluster from Azotobacter vinelandii.

    OpenAIRE

    Jacobson, M R; Brigle, K E; Bennett, L T; Setterquist, R A; Wilson, M. S.; Cash, V L; Beynon, J.; Newton, W.E.; Dean, D R

    1989-01-01

    Determination of a 28,793-base-pair DNA sequence of a region from the Azotobacter vinelandii genome that includes and flanks the nitrogenase structural gene region was completed. This information was used to revise the previously proposed organization of the major nif cluster. The major nif cluster from A. vinelandii encodes 15 nif-specific genes whose products bear significant structural identity to the corresponding nif-specific gene products from Klebsiella pneumoniae. These genes include ...

  13. Physical and genetic map of the major nif gene cluster from Azotobacter vinelandii.

    OpenAIRE

    Jacobson, M. R.; Brigle, K E; Bennett, L T; Setterquist, R. A.; Wilson, M S; Cash, V L; Beynon, J; Newton, W E; Dean, D. R.

    1989-01-01

    Determination of a 28,793-base-pair DNA sequence of a region from the Azotobacter vinelandii genome that includes and flanks the nitrogenase structural gene region was completed. This information was used to revise the previously proposed organization of the major nif cluster. The major nif cluster from A. vinelandii encodes 15 nif-specific genes whose products bear significant structural identity to the corresponding nif-specific gene products from Klebsiella pneumoniae. These genes include ...

  14. MitoCOGs: clusters of orthologous genes from mitochondria and implications for the evolution of eukaryotes.

    Science.gov (United States)

    Kannan, Sivakumar; Rogozin, Igor B; Koonin, Eugene V

    2014-11-25

    Mitochondria are ubiquitous membranous organelles of eukaryotic cells that evolved from an alpha-proteobacterial endosymbiont and possess a small genome that encompasses from 3 to 106 genes. Accumulation of thousands of mitochondrial genomes from diverse groups of eukaryotes provides an opportunity for a comprehensive reconstruction of the evolution of the mitochondrial gene repertoire. Clusters of orthologous mitochondrial protein-coding genes (MitoCOGs) were constructed from all available mitochondrial genomes and complemented with nuclear orthologs of mitochondrial genes. With minimal exceptions, the mitochondrial gene complements of eukaryotes are subsets of the superset of 66 genes found in jakobids. Reconstruction of the evolution of mitochondrial genomes indicates that the mitochondrial gene set of the last common ancestor of the extant eukaryotes was slightly larger than that of jakobids. This superset of mitochondrial genes likely represents an intermediate stage following the loss and transfer to the nucleus of most of the endosymbiont genes early in eukaryote evolution. Subsequent evolution in different lineages involved largely parallel transfer of ancestral endosymbiont genes to the nuclear genome. The intron density in nuclear orthologs of mitochondrial genes typically is nearly the same as in the rest of the genes in the respective genomes. However, in land plants, the intron density in nuclear orthologs of mitochondrial genes is almost 1.5-fold lower than the genomic mean, suggestive of ongoing transfer of functional genes from mitochondria to the nucleus. The MitoCOGs are expected to become an important resource for the study of mitochondrial evolution. The nearly complete superset of mitochondrial genes in jakobids likely represents an intermediate stage in the evolution of eukaryotes after the initial, extensive loss and transfer of the endosymbiont genes. In addition, the bacterial multi-subunit RNA polymerase that is encoded in the jakobid

  15. Recursive Cluster Elimination (RCE for classification and feature selection from gene expression data

    Directory of Open Access Journals (Sweden)

    Showe Louise C

    2007-05-01

    Full Text Available Abstract Background Classification studies using gene expression datasets are usually based on small numbers of samples and tens of thousands of genes. The selection of those genes that are important for distinguishing the different sample classes being compared, poses a challenging problem in high dimensional data analysis. We describe a new procedure for selecting significant genes as recursive cluster elimination (RCE rather than recursive feature elimination (RFE. We have tested this algorithm on six datasets and compared its performance with that of two related classification procedures with RFE. Results We have developed a novel method for selecting significant genes in comparative gene expression studies. This method, which we refer to as SVM-RCE, combines K-means, a clustering method, to identify correlated gene clusters, and Support Vector Machines (SVMs, a supervised machine learning classification method, to identify and score (rank those gene clusters for the purpose of classification. K-means is used initially to group genes into clusters. Recursive cluster elimination (RCE is then applied to iteratively remove those clusters of genes that contribute the least to the classification performance. SVM-RCE identifies the clusters of correlated genes that are most significantly differentially expressed between the sample classes. Utilization of gene clusters, rather than individual genes, enhances the supervised classification accuracy of the same data as compared to the accuracy when either SVM or Penalized Discriminant Analysis (PDA with recursive feature elimination (SVM-RFE and PDA-RFE are used to remove genes based on their individual discriminant weights. Conclusion SVM-RCE provides improved classification accuracy with complex microarray data sets when it is compared to the classification accuracy of the same datasets using either SVM-RFE or PDA-RFE. SVM-RCE identifies clusters of correlated genes that when considered together

  16. Pneumococcal gene complex involved in resistance to extracellular oxidative stress.

    Science.gov (United States)

    Andisi, Vahid Farshchi; Hinojosa, Cecilia A; de Jong, Anne; Kuipers, Oscar P; Orihuela, Carlos J; Bijlsma, Jetta J E

    2012-03-01

    Streptococcus pneumoniae is a gram-positive bacterium which is a member of the normal human nasopharyngeal flora but can also cause serious disease such as pneumonia, bacteremia, and meningitis. Throughout its life cycle, S. pneumoniae is exposed to significant oxidative stress derived from endogenously produced hydrogen peroxide (H(2)O(2)) and from the host through the oxidative burst. How S. pneumoniae, an aerotolerant anaerobic bacterium that lacks catalase, protects itself against hydrogen peroxide stress is still unclear. Bioinformatic analysis of its genome identified a hypothetical open reading frame belonging to the thiol-specific antioxidant (TlpA/TSA) family, located in an operon consisting of three open reading frames. For all four strains tested, deletion of the gene resulted in an approximately 10-fold reduction in survival when strains were exposed to external peroxide stress. However, no role for this gene in survival of internal superoxide stress was observed. Mutagenesis and complementation analysis demonstrated that all three genes are necessary and sufficient for protection against oxidative stress. Interestingly, in a competitive index mouse pneumonia model, deletion of the operon had no impact shortly after infection but was detrimental during the later stages of disease. Thus, we have identified a gene complex involved in the protection of S. pneumoniae against external oxidative stress, which plays an important role during invasive disease.

  17. Fragmentation of an aflatoxin-like gene cluster in a forest pathogen

    Science.gov (United States)

    Secondary metabolic pathway genes are typically clustered in fungi. An exception to this paradigm is seen for genes required for the production of dothistromin, an aflatoxin-like virulence factor produced by the pine needle pathogen Dothistroma septosporum. In contrast to the tight clustering of gen...

  18. Characterization of the biosynthetic gene cluster of rebeccamycin from Lechevalieria aerocolonigenes ATCC 39243.

    Science.gov (United States)

    Onaka, Hiroyasu; Taniguchi, Shin-ichi; Igarashi, Yasuhiro; Furumai, Tamotsu

    2003-01-01

    The biosynthetic gene cluster for rebeccamycin, an indolocarbazole antibiotic, from Lechevalieria aerocolonigenes ATCC 39243 has 11 ORFs. To clarify their functions, mutants with rebG, rebD, rebC, rebP, rebM, rebR, rebH, rebT, or orfD2 disrupted were constructed, and the gene products were examined. rebP disruptants produced 11,11'-dichlorochromopyrrolic acid, found to be a biosynthetic intermediate by a bioconversion experiment. Other genes encoded N-glycosyltransferase (rebG), monooxygenase (rebC), methyltransferase (rebM), a transcriptional activator (rebR), and halogenase (rebH). rebT disruptants produced rebeccamycin as much as the wild strain, so rebT was probably not involved in rebeccamycin production. Biosynthetic genes of staurosporine, an another indolocarbazole antibiotic, were cloned from Streptomyces sp. TP-A0274. staO, staD, and staP were similar to rebO, rebD, and rebP, respectively, all of which are responsible for indolocarbazole biosynthesis, But a rebC homolog, encoding a putative enzyme oxidizing the C-7 site of pyrrole rings, was not found in the staurosporine biosynthetic gene cluster. These results suggest that indolocarbazole is constructed by oxidative decarboxylation of chromopyrrolic acid (11,11'-dichlorochromopyrrolic acid in rebeccamycin) generated from two molecules of tryptophan by coupling and that the oxidation state at the C-7 position depends on the additional enzyme(s) encoded by the biosynthetic genes.

  19. A phylogenomic gene cluster resource: The phylogeneticallyinferred groups (PhlGs) database

    Energy Technology Data Exchange (ETDEWEB)

    Dehal, Paramvir S.; Boore, Jeffrey L.

    2005-08-25

    We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community.

  20. antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification

    DEFF Research Database (Denmark)

    Blin, Kai; Wolf, Thomas; Chevrette, Marc G.

    2017-01-01

    Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding......, including prediction of gene cluster boundaries using the ClusterFinder method or the newly integrated CASSIS algorithm, improved substrate specificity prediction for non-ribosomal peptide synthetase adenylation domains based on the new SANDPUMA algorithm, improved predictions for terpene and ribosomally...

  1. Characterization of the Biosynthetic Gene Cluster for Benzoxazole Antibiotics A33853 Reveals Unusual Assembly Logic.

    Science.gov (United States)

    Lv, Meinan; Zhao, Junfeng; Deng, Zixin; Yu, Yi

    2015-10-22

    A33853, which shows excellent bioactivity against Leishmania, is a benzoxazole-family compound formed from two moieties of 3-hydroxyanthranilic acid and one 3-hydroxypicolinic acid. In this study, we have identified the gene cluster responsible for the biosynthesis of A33853 in Streptomyces sp. NRRL12068 through genome mining and heterologous expression. Bioinformatics analysis and functional characterization of the orfs contained in the gene cluster revealed that the biosynthesis of A33853 is directed by a group of unusual enzymes. In particular, BomK, annotated as a ketosynthase, was found to catalyze the amide bond formation between 3-hydroxypicolinic and 3-hydroxyanthranilic acid during the assembly of A33853. BomJ, a putative ATP-dependent coenzyme A ligase, and BomN, a putative amidohydrolase, were further proposed to be involved in the benzoxazole formation in A33853 according to gene deletion experiments. Finally, we have successfully utilized mutasynthesis to generate two analogs of A33853, which were reported previously to possess excellent anti-leishmanial activity.

  2. Similar Microbial Consortia and Genes Are Involved in the Biodegradation of Benzalkonium Chlorides in Different Environments.

    Science.gov (United States)

    Ertekin, Emine; Hatt, Janet K; Konstantinidis, Konstantinos T; Tezel, Ulas

    2016-04-19

    Benzalkonium chlorides (BACs) are emerging pollutants. Identification of microorganisms and the genes involved in the biodegradation of BACs is crucial for better understanding the fate of BACs in the environment and developing treatment strategies. Four microbial communities degrading BACs were developed from sewage (SEW), activated sludge (AS), soil (SOIL) and sea sediment (SEA) samples. According to 16S rRNA pyrosequencing and shotgun metagenome sequencing analyses, the most abundant species represented uncharacterized members of the Pseudomonas and Achromobacter genera. BAC biotransformation rates of the enriched microbial communities were 2.8, 3.2, 17.8, and 24.3 μM hr(-1) for SEA, AS, SOIL, and SEW, respectively, and were positively correlated with the relative abundance of a particular Pseudomonas sp. strain, BIOMIG1. The strain BIOMIG1 mineralizes BACs at a rate up to 2.40 μmol hr(-1) 10(-11) cells. Genomes of four BAC degrading and nondegrading BIOMIG1 phenotypes were sequenced and differentially compared with each other. As a result, a gene cluster encoding for transporters, an integrase and a dioxygenase were involved in BAC biotransformation. Our results suggest that BIOMIG1 plays a key role on the fate of BACs in the environment and genes, other than those reported to date, are involved in BAC biotransformation in various habitats.

  3. Base J represses genes at the end of polycistronic gene clusters in Leishmania major by promoting RNAP II termination.

    Science.gov (United States)

    Reynolds, David L; Hofmeister, Brigitte T; Cliffe, Laura; Siegel, T Nicolai; Anderson, Britta A; Beverley, Stephen M; Schmitz, Robert J; Sabatini, Robert

    2016-08-01

    The genomes of kinetoplastids are organized into polycistronic gene clusters that are flanked by the modified DNA base J. Previous work has established a role of base J in promoting RNA polymerase II termination in Leishmania spp. where the loss of J leads to termination defects and transcription into adjacent gene clusters. It remains unclear whether these termination defects affect gene expression and whether read through transcription is detrimental to cell growth, thus explaining the essential nature of J. We now demonstrate that reduction of base J at specific sites within polycistronic gene clusters in L. major leads to read through transcription and increased expression of downstream genes in the cluster. Interestingly, subsequent transcription into the opposing polycistronic gene cluster does not lead to downregulation of sense mRNAs. These findings indicate a conserved role for J regulating transcription termination and expression of genes within polycistronic gene clusters in trypanosomatids. In contrast to the expectations often attributed to opposing transcription, the essential nature of J in Leishmania spp. is related to its role in gene repression rather than preventing transcriptional interference resulting from read through and dual strand transcription.

  4. The Involvement of hybrid cluster protein 4, HCP4, in Anaerobic Metabolism in Chlamydomonas reinhardtii.

    Science.gov (United States)

    Olson, Adam C; Carter, Clay J

    2016-01-01

    The unicellular green algae Chlamydomonas reinhardtii has long been studied for its unique fermentation pathways and has been evaluated as a candidate organism for biofuel production. Fermentation in C. reinhardtii is facilitated by a network of three predominant pathways producing four major byproducts: formate, ethanol, acetate and hydrogen. Previous microarray studies identified many genes as being highly up-regulated during anaerobiosis. For example, hybrid cluster protein 4 (HCP4) was found to be one of the most highly up-regulated genes under anoxic conditions. Hybrid cluster proteins have long been studied for their unique spectroscopic properties, yet their biological functions remain largely unclear. To probe its role during anaerobiosis, HCP4 was silenced using artificial microRNAs (ami-hcp4) followed by extensive phenotypic analyses of cells grown under anoxic conditions. Both the expression of key fermentative enzymes and their respective metabolites were significantly altered in ami-hcp4, with nitrogen uptake from the media also being significantly different than wild-type cells. The results strongly suggest a role for HCP4 in regulating key fermentative and nitrogen utilization pathways.

  5. The Involvement of hybrid cluster protein 4, HCP4, in Anaerobic Metabolism in Chlamydomonas reinhardtii.

    Directory of Open Access Journals (Sweden)

    Adam C Olson

    Full Text Available The unicellular green algae Chlamydomonas reinhardtii has long been studied for its unique fermentation pathways and has been evaluated as a candidate organism for biofuel production. Fermentation in C. reinhardtii is facilitated by a network of three predominant pathways producing four major byproducts: formate, ethanol, acetate and hydrogen. Previous microarray studies identified many genes as being highly up-regulated during anaerobiosis. For example, hybrid cluster protein 4 (HCP4 was found to be one of the most highly up-regulated genes under anoxic conditions. Hybrid cluster proteins have long been studied for their unique spectroscopic properties, yet their biological functions remain largely unclear. To probe its role during anaerobiosis, HCP4 was silenced using artificial microRNAs (ami-hcp4 followed by extensive phenotypic analyses of cells grown under anoxic conditions. Both the expression of key fermentative enzymes and their respective metabolites were significantly altered in ami-hcp4, with nitrogen uptake from the media also being significantly different than wild-type cells. The results strongly suggest a role for HCP4 in regulating key fermentative and nitrogen utilization pathways.

  6. A joint finite mixture model for clustering genes from independent Gaussian and beta distributed data

    Directory of Open Access Journals (Sweden)

    Yli-Harja Olli

    2009-05-01

    Full Text Available Abstract Background Cluster analysis has become a standard computational method for gene function discovery as well as for more general explanatory data analysis. A number of different approaches have been proposed for that purpose, out of which different mixture models provide a principled probabilistic framework. Cluster analysis is increasingly often supplemented with multiple data sources nowadays, and these heterogeneous information sources should be made as efficient use of as possible. Results This paper presents a novel Beta-Gaussian mixture model (BGMM for clustering genes based on Gaussian distributed and beta distributed data. The proposed BGMM can be viewed as a natural extension of the beta mixture model (BMM and the Gaussian mixture model (GMM. The proposed BGMM method differs from other mixture model based methods in its integration of two different data types into a single and unified probabilistic modeling framework, which provides a more efficient use of multiple data sources than methods that analyze different data sources separately. Moreover, BGMM provides an exceedingly flexible modeling framework since many data sources can be modeled as Gaussian or beta distributed random variables, and it can also be extended to integrate data that have other parametric distributions as well, which adds even more flexibility to this model-based clustering framework. We developed three types of estimation algorithms for BGMM, the standard expectation maximization (EM algorithm, an approximated EM and a hybrid EM, and propose to tackle the model selection problem by well-known model selection criteria, for which we test the Akaike information criterion (AIC, a modified AIC (AIC3, the Bayesian information criterion (BIC, and the integrated classification likelihood-BIC (ICL-BIC. Conclusion Performance tests with simulated data show that combining two different data sources into a single mixture joint model greatly improves the clustering

  7. Identification and characterization of a novel diterpene gene cluster in Aspergillus nidulans.

    Directory of Open Access Journals (Sweden)

    Kirsi Bromann

    Full Text Available Fungal secondary metabolites are a rich source of medically useful compounds due to their pharmaceutical and toxic properties. Sequencing of fungal genomes has revealed numerous secondary metabolite gene clusters, yet products of many of these biosynthetic pathways are unknown since the expression of the clustered genes usually remains silent in normal laboratory conditions. Therefore, to discover new metabolites, it is important to find ways to induce the expression of genes in these otherwise silent biosynthetic clusters. We discovered a novel secondary metabolite in Aspergillus nidulans by predicting a biosynthetic gene cluster with genomic mining. A Zn(II(2Cys(6-type transcription factor, PbcR, was identified, and its role as a pathway-specific activator for the predicted gene cluster was demonstrated. Overexpression of pbcR upregulated the transcription of seven genes in the identified cluster and led to the production of a diterpene compound, which was characterized with GC/MS as ent-pimara-8(14,15-diene. A change in morphology was also observed in the strains overexpressing pbcR. The activation of a cryptic gene cluster by overexpression of its putative Zn(II(2Cys(6-type transcription factor led to discovery of a novel secondary metabolite in Aspergillus nidulans. Quantitative real-time PCR and DNA array analysis allowed us to predict the borders of the biosynthetic gene cluster. Furthermore, we identified a novel fungal pimaradiene cyclase gene as well as genes encoding 3-hydroxy-3-methyl-glutaryl-coenzyme A (HMG-CoA reductase and a geranylgeranyl pyrophosphate (GGPP synthase. None of these genes have been previously implicated in the biosynthesis of terpenes in Aspergillus nidulans. These results identify the first Aspergillus nidulans diterpene gene cluster and suggest a biosynthetic pathway for ent-pimara-8(14,15-diene.

  8. Identification and Characterization of a Novel Diterpene Gene Cluster in Aspergillus nidulans

    Science.gov (United States)

    Bromann, Kirsi; Toivari, Mervi; Viljanen, Kaarina; Vuoristo, Anu; Ruohonen, Laura; Nakari-Setälä, Tiina

    2012-01-01

    Fungal secondary metabolites are a rich source of medically useful compounds due to their pharmaceutical and toxic properties. Sequencing of fungal genomes has revealed numerous secondary metabolite gene clusters, yet products of many of these biosynthetic pathways are unknown since the expression of the clustered genes usually remains silent in normal laboratory conditions. Therefore, to discover new metabolites, it is important to find ways to induce the expression of genes in these otherwise silent biosynthetic clusters. We discovered a novel secondary metabolite in Aspergillus nidulans by predicting a biosynthetic gene cluster with genomic mining. A Zn(II)2Cys6–type transcription factor, PbcR, was identified, and its role as a pathway-specific activator for the predicted gene cluster was demonstrated. Overexpression of pbcR upregulated the transcription of seven genes in the identified cluster and led to the production of a diterpene compound, which was characterized with GC/MS as ent-pimara-8(14),15-diene. A change in morphology was also observed in the strains overexpressing pbcR. The activation of a cryptic gene cluster by overexpression of its putative Zn(II)2Cys6–type transcription factor led to discovery of a novel secondary metabolite in Aspergillus nidulans. Quantitative real-time PCR and DNA array analysis allowed us to predict the borders of the biosynthetic gene cluster. Furthermore, we identified a novel fungal pimaradiene cyclase gene as well as genes encoding 3-hydroxy-3-methyl-glutaryl-coenzyme A (HMG-CoA) reductase and a geranylgeranyl pyrophosphate (GGPP) synthase. None of these genes have been previously implicated in the biosynthesis of terpenes in Aspergillus nidulans. These results identify the first Aspergillus nidulans diterpene gene cluster and suggest a biosynthetic pathway for ent-pimara-8(14),15-diene. PMID:22506079

  9. Nonlinear biosynthetic gene cluster dose effect on penicillin production by Penicillium chrysogenum.

    Science.gov (United States)

    Nijland, Jeroen G; Ebbendorf, Bjorg; Woszczynska, Marta; Boer, Rémon; Bovenberg, Roel A L; Driessen, Arnold J M

    2010-11-01

    Industrial penicillin production levels by the filamentous fungus Penicillium chrysogenum increased dramatically by classical strain improvement. High-yielding strains contain multiple copies of the penicillin biosynthetic gene cluster that encodes three key enzymes of the β-lactam biosynthetic pathway. We have analyzed the gene cluster dose effect on penicillin production using the high-yielding P. chrysogenum strain DS17690 that was cured from its native clusters. The amount of penicillin V produced increased with the penicillin biosynthetic gene cluster number but was saturated at high copy numbers. Likewise, transcript levels of the biosynthetic genes pcbAB [δ-(l-α-aminoadipyl)-l-cysteinyl-d-valine synthetase], pcbC (isopenicillin N synthase), and penDE (acyltransferase) correlated with the cluster copy number. Remarkably, the protein level of acyltransferase, which localizes to peroxisomes, was saturated already at low cluster copy numbers. At higher copy numbers, intracellular levels of isopenicillin N increased, suggesting that the acyltransferase reaction presents a limiting step at a high gene dose. Since the number and appearance of the peroxisomes did not change significantly with the gene cluster copy number, we conclude that the acyltransferase activity is limiting for penicillin biosynthesis at high biosynthetic gene cluster copy numbers. These results suggest that at a high penicillin production level, productivity is limited by the peroxisomal acyltransferase import activity and/or the availability of coenzyme A (CoA)-activated side chains.

  10. Identification of Phytophthora sojae genes involved in asexual sporogenesis

    Indian Academy of Sciences (India)

    Ziying Wang; Xhaoxia Wang; Jie Shen; Guangyue Wang; Xiaoxi Zhu; Hongxia Lu

    2009-08-01

    To explore the molecular mechanisms involved in asexual spore development in Phytophthora sojae, the zoospores of strain PS26 were treated with ultraviolet (UV) irradiation. After selection, a mutant progeny, termed PS26-U03, was obtained and demonstrated to exhibit no oospore production. A suppression subtractive hybridization (SSH) approach was developed to investigate differences in gene expression between PS26 and PS26-U03 during asexual sporogenesis. Of the 126 sequences chosen for examination, 39 putative unigenes were identified that exhibit high expression in PS26. These sequences are predicted to encode proteins involved in metabolism, cell cycle, protein biosynthesis, cell signalling, cell defence, and transcription regulation. Seven clones were selected for temporal expression analysis using RT-PCR based on the results of the dot-blot screens. Three of the selected genes, developmental protein DG1037 (UB88), glycoside hydrolase (UB149) and a hypothetical protein (UB145), were expressed only in PS26, whereas the transcripts of phosphatidylinositol-4-phosphate 5-kinase (UB36), FAD-dependent pyridine nucleotide-disulphide oxidoreductase (UB226) and sugar transporter (UB256) were expressed at very low levels in PS26-U03 but at high levels in PS26.

  11. Evolution of C2H2-zinc finger genes and subfamilies in mammals: Species-specific duplication and loss of clusters, genes and effector domains

    Directory of Open Access Journals (Sweden)

    Aubry Muriel

    2008-06-01

    Full Text Available Abstract Background C2H2 zinc finger genes (C2H2-ZNF constitute the largest class of transcription factors in humans and one of the largest gene families in mammals. Often arranged in clusters in the genome, these genes are thought to have undergone a massive expansion in vertebrates, primarily by tandem duplication. However, this view is based on limited datasets restricted to a single chromosome or a specific subset of genes belonging to the large KRAB domain-containing C2H2-ZNF subfamily. Results Here, we present the first comprehensive study of the evolution of the C2H2-ZNF family in mammals. We assembled the complete repertoire of human C2H2-ZNF genes (718 in total, about 70% of which are organized into 81 clusters across all chromosomes. Based on an analysis of their N-terminal effector domains, we identified two new C2H2-ZNF subfamilies encoding genes with a SET or a HOMEO domain. We searched for the syntenic counterparts of the human clusters in other mammals for which complete gene data are available: chimpanzee, mouse, rat and dog. Cross-species comparisons show a large variation in the numbers of C2H2-ZNF genes within homologous mammalian clusters, suggesting differential patterns of evolution. Phylogenetic analysis of selected clusters reveals that the disparity in C2H2-ZNF gene repertoires across mammals not only originates from differential gene duplication but also from gene loss. Further, we discovered variations among orthologs in the number of zinc finger motifs and association of the effector domains, the latter often undergoing sequence degeneration. Combined with phylogenetic studies, physical maps and an analysis of the exon-intron organization of genes from the SCAN and KRAB domains-containing subfamilies, this result suggests that the SCAN subfamily emerged first, followed by the SCAN-KRAB and finally by the KRAB subfamily. Conclusion Our results are in agreement with the "birth and death hypothesis" for the evolution of

  12. CTDGFinder: A Novel Homology-Based Algorithm for Identifying Closely Spaced Clusters of Tandemly Duplicated Genes.

    Science.gov (United States)

    Ortiz, Juan F; Rokas, Antonis

    2017-01-01

    Closely spaced clusters of tandemly duplicated genes (CTDGs) contribute to the diversity of many phenotypes, including chemosensation, snake venom, and animal body plans. CTDGs have traditionally been identified subjectively as genomic neighborhoods containing several gene duplicates in close proximity; however, CTDGs are often highly variable with respect to gene number, intergenic distance, and synteny. This lack of formal definition hampers the study of CTDG evolutionary dynamics and the discovery of novel CTDGs in the exponentially growing body of genomic data. To address this gap, we developed a novel homology-based algorithm, CTDGFinder, which formalizes and automates the identification of CTDGs by examining the physical distribution of individual members of families of duplicated genes across chromosomes. Application of CTDGFinder accurately identified CTDGs for many well-known gene clusters (e.g., Hox and beta-globin gene clusters) in the human, mouse and 20 other mammalian genomes. Differences between previously annotated gene clusters and our inferred CTDGs were due to the exclusion of nonhomologs that have historically been considered parts of specific gene clusters, the inclusion or absence of genes between the CTDGs and their corresponding gene clusters, and the splitting of certain gene clusters into distinct CTDGs. Examination of human genes showing tissue-specific enhancement of their expression by CTDGFinder identified members of several well-known gene clusters (e.g., cytochrome P450s and olfactory receptors) and revealed that they were unequally distributed across tissues. By formalizing and automating CTDG identification, CTDGFinder will facilitate understanding of CTDG evolutionary dynamics, their functional implications, and how they are associated with phenotypic diversity. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e

  13. A transcriptomic approach to identify regulatory genes involved in fruit set of wild-type and parthenocarpic tomato genotypes.

    Science.gov (United States)

    Ruiu, Fabrizio; Picarella, Maurizio Enea; Imanishi, Shunsuke; Mazzucato, Andrea

    2015-10-01

    The tomato parthenocarpic fruit (pat) mutation associates a strong competence for parthenocarpy with homeotic transformation of anthers and aberrancy of ovules. To dissect this complex floral phenotype, genes involved in the pollination-independent fruit set of the pat mutant were investigated by microarray analysis using wild-type and mutant ovaries. Normalized expression data were subjected to one-way ANOVA and 2499 differentially expressed genes (DEGs) displaying a >1.5 log-fold change in at least one of the pairwise comparisons analyzed were detected. DEGs were categorized into 20 clusters and clusters classified into five groups representing transcripts with similar expression dynamics. The "regulatory function" group (685 DEGs) contained putative negative or positive fruit set regulators, "pollination-dependent" (411 DEGs) included genes activated by pollination, "fruit growth-related" (815 DEGs) genes activated at early fruit growth. The last groups listed genes with different or similar expression pattern at all stages in the two genotypes. qRT-PCR validation of 20 DEGs plus other four selected genes assessed the high reliability of microarray expression data; the average correlation coefficient for the 20 DEGs was 0.90. In all the groups were evidenced relevant transcription factors encoding proteins regulating meristem differentiation and floral organ development, genes involved in metabolism, transport and response of hormones, genes involved in cell division and in primary and secondary metabolism. Among pathways related to secondary metabolites emerged genes related to the synthesis of flavonoids, supporting the recent evidence that these compounds are important at the fruit set phase. Selected genes showing a de-regulated expression pattern in pat were studied in other four parthenocarpic genotypes either genetically anonymous or carrying lesions in known gene sequences. This comparative approach offered novel insights for improving the present

  14. Fetal exposure to teratogens: evidence of genes involved in autism.

    Science.gov (United States)

    Dufour-Rainfray, Diane; Vourc'h, Patrick; Tourlet, Sébastien; Guilloteau, Denis; Chalon, Sylvie; Andres, Christian R

    2011-04-01

    Environmental challenges during the prenatal period can result in behavioral abnormalities and cognitive deficits that appear later in life such as autism. Prenatal exposure to valproic acid, ethanol, thalidomide and misoprostol has been shown to be associated with an increased incidence of autism. In addition, rodents exposed in utero to some of these drugs show autism-like abnormalities, including brain changes and lifelong behavior dysfunction. Our aim is to summarize current understanding of the relationship between in utero exposure to these drugs and autism in humans and in autism-like animal model phenotypes. It also highlights the importance of these models to understanding the neurobiology of autism, particularly in the identification of susceptibility genes. These drugs are able to modulate the expression of many genes involved in processes such as proliferation, apoptosis, neuronal differentiation and migration, synaptogenesis and synaptic activity. It seems essential to focus research on genes expressed during early neurodevelopment which may be the target of mutations or affected by drugs such as those included in this review.

  15. GenCLiP: a software program for clustering gene lists by literature profiling and constructing gene co-occurrence networks related to custom keywords

    Directory of Open Access Journals (Sweden)

    Zhou Yi-Bo

    2008-07-01

    Full Text Available Abstract Background Biomedical researchers often want to explore pathogenesis and pathways regulated by abnormally expressed genes, such as those identified by microarray analyses. Literature mining is an important way to assist in this task. Many literature mining tools are now available. However, few of them allows the user to make manual adjustments to zero in on what he/she wants to know in particular. Results We present our software program, GenCLiP (Gene Cluster with Literature Profiles, which is based on the methods presented by Chaussabel and Sher (Genome Biol 2002, 3(10:RESEARCH0055 that search gene lists to identify functional clusters of genes based on up-to-date literature profiling. Four features were added to this previously described method: the ability to 1 manually curate keywords extracted from the literature, 2 search genes and gene co-occurrence networks related to custom keywords, 3 compare analyzed gene results with negative and positive controls generated by GenCLiP, and 4 calculate probabilities that the resulting genes and gene networks are randomly related. In this paper, we show with a set of differentially expressed genes between keloids and normal control, how implementation of functions in GenCLiP successfully identified keywords related to the pathogenesis of keloids and unknown gene pathways involved in the pathogenesis of keloids. Conclusion With regard to the identification of disease-susceptibility genes, GenCLiP allows one to quickly acquire a primary pathogenesis profile and identify pathways involving abnormally expressed genes not previously associated with the disease.

  16. Adaptive evolution of the chrysanthemyl diphosphate synthase gene involved in irregular monoterpene metabolism

    Directory of Open Access Journals (Sweden)

    Liu Ping-Li

    2012-11-01

    Full Text Available Abstract Background Chrysanthemyl diphosphate synthase (CDS is a key enzyme in biosynthetic pathways producing pyrethrins and irregular monoterpenes. These compounds are confined to plants of the tribe Anthemideae of the Asteraceae, and play an important role in defending the plants against herbivorous insects. It has been proposed that the CDS genes arose from duplication of the farnesyl diphosphate synthase (FDS gene and have different function from FDSs. However, the duplication time toward the origin of CDS and the evolutionary force behind the functional divergence of the CDS gene are still unknown. Results Two duplication events were detected in the evolutionary history of the FDS gene family in the Asteraceae, and the second duplication led to the origin of CDS. CDS occurred after the divergence of the tribe Mutisieae from other tribes of Asteraceae but before the birth of the Anthemideae tribe. After its origin, CDS accumulated four mutations in sites homologous to the substrate-binding and catalysis sites of FDS. Of these, two sites were involved in the binding of the nucleophilic substrate isopentenyl diphosphate in FDS. Maximum likelihood analyses showed that some sites in CDS were under positive selection and were scattered throughout primary sequences, whereas in the three-dimensional structure model they clustered in the large central cavity. Conclusion Positive selection associated with gene duplication played a major role in the evolution of CDS.

  17. Identification and structural analysis of a novel snoRNA gene cluster from Arabidopsis thaliana

    Institute of Scientific and Technical Information of China (English)

    周惠; 孟清; 屈良鹄

    2000-01-01

    A 22 snoRNA gene cluster, consisting of four antisense snoRNA genes, was identified from Arabidopsis thaliana. The sequence and structural analysis showed that the 22 snoRNA gene cluster might be transcribed as a polycistronic precursor from an upstream promoter, and the in-tergenic spacers of the gene cluster encode the ’hairpin’ structures similar to the processing recognition signals of yeast Saccharomyces cerevisiae polycistronic snoRNA precursor. The results also revealed that plant snoRNA gene with multiple copies is a characteristic in common, and provides a good system for further revealing the transcription and expression mechanism of plant snoRNA gene cluster.

  18. Effects of gene disruptions in the nisin gene cluster of Lactococcus lactis on nisin production and producer immunity

    NARCIS (Netherlands)

    Ra, Runar; Beerthuyzen, Marke M.; Vos, Willem M. de; Saris, Per E.J.; Kuipers, Oscar P.

    1999-01-01

    The lantibiotic nisin is produced by several strains of Lactococcus lactis subsp. lactis. The chromosomally located gene cluster nisABTCIPRKFEG is required for biosynthesis, development of immunity, and regulation of gene expression. In-frame deletions in the nisB and nisT genes, and disruption of

  19. A compendium of human genes regulating feeding behavior and body weight, its functional characterization and identification of GWAS genes involved in brain-specific PPI network.

    Science.gov (United States)

    Ignatieva, Elena V; Afonnikov, Dmitry A; Saik, Olga V; Rogaev, Evgeny I; Kolchanov, Nikolay A

    2016-12-22

    Obesity is heritable. It predisposes to many diseases. The objectives of this study were to create a compendium of genes relevant to feeding behavior (FB) and/or body weight (BW) regulation; to construct and to analyze networks formed by associations between genes/proteins; and to identify the most significant genes, biological processes/pathways, and tissues/organs involved in BW regulation. The compendium of genes controlling FB or BW includes 578 human genes. Candidate genes were identified from various sources, including previously published original research and review articles, GWAS meta-analyses, and OMIM (Online Mendelian Inheritance in Man). All genes were ranked according to knowledge about their biological role in body weight regulation and classified according to expression patterns or functional characteristics. Substantial and overrepresented numbers of genes from the compendium encoded cell surface receptors, signaling molecules (hormones, neuropeptides, cytokines), transcription factors, signal transduction proteins, cilium and BBSome components, and lipid binding proteins or were present in the brain-specific list of tissue-enriched genes identified with TSEA tool. We identified 27 pathways from KEGG, REACTOME and BIOCARTA whose genes were overrepresented in the compendium. Networks formed by physical interactions or homological relationships between proteins or interactions between proteins involved in biochemical/signaling pathways were reconstructed and analyzed. Subnetworks and clusters identified by the MCODE tool included genes/proteins associated with cilium morphogenesis, signal transduction proteins (particularly, G protein-coupled receptors, kinases or proteins involved in response to insulin stimulus) and transcription regulation (particularly nuclear receptors). We ranked GWAS genes according to the number of neighbors in three networks and revealed 22 GWAS genes involved in the brain-specific PPI network. On the base of the most

  20. A putative gene cluster from a Lyngbya wollei bloom that encodes paralytic shellfish toxin biosynthesis.

    Directory of Open Access Journals (Sweden)

    Troco K Mihali

    Full Text Available Saxitoxin and its analogs cause the paralytic shellfish-poisoning syndrome, adversely affecting human health and coastal shellfish industries worldwide. Here we report the isolation, sequencing, annotation, and predicted pathway of the saxitoxin biosynthetic gene cluster in the cyanobacterium Lyngbya wollei. The gene cluster spans 36 kb and encodes enzymes for the biosynthesis and export of the toxins. The Lyngbya wollei saxitoxin gene cluster differs from previously identified saxitoxin clusters as it contains genes that are unique to this cluster, whereby the carbamoyltransferase is truncated and replaced by an acyltransferase, explaining the unique toxin profile presented by Lyngbya wollei. These findings will enable the creation of toxin probes, for water monitoring purposes, as well as proof-of-concept for the combinatorial biosynthesis of these natural occurring alkaloids for the production of novel, biologically active compounds.

  1. AutoSOME: a clustering method for identifying gene expression modules without prior knowledge of cluster number

    Directory of Open Access Journals (Sweden)

    Cooper James B

    2010-03-01

    Full Text Available Abstract Background Clustering the information content of large high-dimensional gene expression datasets has widespread application in "omics" biology. Unfortunately, the underlying structure of these natural datasets is often fuzzy, and the computational identification of data clusters generally requires knowledge about cluster number and geometry. Results We integrated strategies from machine learning, cartography, and graph theory into a new informatics method for automatically clustering self-organizing map ensembles of high-dimensional data. Our new method, called AutoSOME, readily identifies discrete and fuzzy data clusters without prior knowledge of cluster number or structure in diverse datasets including whole genome microarray data. Visualization of AutoSOME output using network diagrams and differential heat maps reveals unexpected variation among well-characterized cancer cell lines. Co-expression analysis of data from human embryonic and induced pluripotent stem cells using AutoSOME identifies >3400 up-regulated genes associated with pluripotency, and indicates that a recently identified protein-protein interaction network characterizing pluripotency was underestimated by a factor of four. Conclusions By effectively extracting important information from high-dimensional microarray data without prior knowledge or the need for data filtration, AutoSOME can yield systems-level insights from whole genome microarray expression studies. Due to its generality, this new method should also have practical utility for a variety of data-intensive applications, including the results of deep sequencing experiments. AutoSOME is available for download at http://jimcooperlab.mcdb.ucsb.edu/autosome.

  2. The cylindrospermopsin gene cluster of Aphanizomenon sp. strain 10E6: organization and recombination.

    Science.gov (United States)

    Stüken, Anke; Jakobsen, Kjetill S

    2010-08-01

    Cylindrospermopsin (CYN), a potent hepatoxin, occurs in freshwaters worldwide. Several cyanobacterial species produce the toxin, but the producing species vary between geographical regions. Aphanizomenon flos-aquae, a common algae species in temperate fresh and brackish waters, is one of the three well-documented CYN producers in European waters. So far, no genetic information on the CYN genes of this species has been available. Here, we describe the complete CYN gene cluster, including flanking regions from the German Aphanizomenon sp. strain 10E6 using a full genome sequencing approach by 454 pyrosequencing and bioinformatic identification of the gene cluster. In addition, we have sequenced a approximately 7 kb fragment covering the genes cyrC (partially), cyrA and cyrB (partially) of the same gene cluster in the CYN-producing Aphanizomenon sp. strains 10E9 and 22D11. Comparisons with the orthologous gene clusters of the Australian Cylindrospermopsis raciborskii strains AWT205 and CS505 and the partial gene cluster of the Israeli Aphanizomenon ovalisporum strain ILC-146 revealed a high gene sequence similarity, but also extensive rearrangements of gene order. The high sequence similarity (generally higher than that of 16S rRNA gene fragments from the same strains), atypical GC-content and signs of transposase activities support the suggestion that the CYN genes have been horizontally transferred.

  3. Sequencing and comparative analysis of fugu protocadherin clusters reveal diversity of protocadherin genes among teleosts

    Directory of Open Access Journals (Sweden)

    Rajasegaran Vikneswari

    2007-03-01

    Full Text Available Abstract Background The synaptic cell adhesion molecules, protocadherins, are a vertebrate innovation that accompanied the emergence of the neural tube and the elaborate central nervous system. In mammals, the protocadherins are encoded by three closely-linked clusters (α, β and γ of tandem genes and are hypothesized to provide a molecular code for specifying the remarkably-diverse neural connections in the central nervous system. Like mammals, the coelacanth, a lobe-finned fish, contains a single protocadherin locus, also arranged into α, β and γ clusters. Zebrafish, however, possesses two protocadherin loci that contain more than twice the number of genes as the coelacanth, but arranged only into α and γ clusters. To gain further insight into the evolutionary history of protocadherin clusters, we have sequenced and analyzed protocadherin clusters from the compact genome of the pufferfish, Fugu rubripes. Results Fugu contains two unlinked protocadherin loci, Pcdh1 and Pcdh2, that collectively consist of at least 77 genes. The fugu Pcdh1 locus has been subject to extensive degeneration, resulting in the complete loss of Pcdh1γ cluster. The fugu Pcdh genes have undergone lineage-specific regional gene conversion processes that have resulted in a remarkable regional sequence homogenization among paralogs in the same subcluster. Phylogenetic analyses show that most protocadherin genes are orthologous between fugu and zebrafish either individually or as paralog groups. Based on the inferred phylogenetic relationships of fugu and zebrafish genes, we have reconstructed the evolutionary history of protocadherin clusters in the teleost fish lineage. Conclusion Our results demonstrate the exceptional evolutionary dynamism of protocadherin genes in vertebrates in general, and in teleost fishes in particular. Besides the 'fish-specific' whole genome duplication, the evolution of protocadherin genes in teleost fishes is influenced by lineage

  4. DNACLUST: accurate and efficient clustering of phylogenetic marker genes

    Directory of Open Access Journals (Sweden)

    Liu Bo

    2011-06-01

    Full Text Available Abstract Background Clustering is a fundamental operation in the analysis of biological sequence data. New DNA sequencing technologies have dramatically increased the rate at which we can generate data, resulting in datasets that cannot be efficiently analyzed by traditional clustering methods. This is particularly true in the context of taxonomic profiling of microbial communities through direct sequencing of phylogenetic markers (e.g. 16S rRNA - the domain that motivated the work described in this paper. Many analysis approaches rely on an initial clustering step aimed at identifying sequences that belong to the same operational taxonomic unit (OTU. When defining OTUs (which have no universally accepted definition, scientists must balance a trade-off between computational efficiency and biological accuracy, as accurately estimating an environment's phylogenetic composition requires computationally-intensive analyses. We propose that efficient and mathematically well defined clustering methods can benefit existing taxonomic profiling approaches in two ways: (i the resulting clusters can be substituted for OTUs in certain applications; and (ii the clustering effectively reduces the size of the data-sets that need to be analyzed by complex phylogenetic pipelines (e.g., only one sequence per cluster needs to be provided to downstream analyses. Results To address the challenges outlined above, we developed DNACLUST, a fast clustering tool specifically designed for clustering highly-similar DNA sequences. Given a set of sequences and a sequence similarity threshold, DNACLUST creates clusters whose radius is guaranteed not to exceed the specified threshold. Underlying DNACLUST is a greedy clustering strategy that owes its performance to novel sequence alignment and k-mer based filtering algorithms. DNACLUST can also produce multiple sequence alignments for every cluster, allowing users to manually inspect clustering results, and enabling more

  5. Genetic Algorithms Applied to Multi-Class Clustering for Gene Expression Data

    Institute of Scientific and Technical Information of China (English)

    Haiyan Pan; Jun Zhu; Danfu Han

    2003-01-01

    A hybrid GA (genetic algorithm)-based clustering (HGACLUS) schema, combining merits of the Simulated Annealing, was described for finding an optimal or near-optimal set of medoids. This schema maximized the clustering success by achieving internal cluster cohesion and external cluster isolation. The performance of HGACLUS and other methods was compared by using simulated data and open microarray gene-expression datasets. HGACLUS was generally found to be more accurate and robust than other methods discussed in this paper by the exact validation strategy and the explicit cluster number.

  6. Directed natural product biosynthesis gene cluster capture and expression in the model bacterium Bacillus subtilis

    Science.gov (United States)

    Li, Yongxin; Li, Zhongrui; Yamanaka, Kazuya; Xu, Ying; Zhang, Weipeng; Vlamakis, Hera; Kolter, Roberto; Moore, Bradley S.; Qian, Pei-Yuan

    2015-03-01

    Bacilli are ubiquitous low G+C environmental Gram-positive bacteria that produce a wide assortment of specialized small molecules. Although their natural product biosynthetic potential is high, robust molecular tools to support the heterologous expression of large biosynthetic gene clusters in Bacillus hosts are rare. Herein we adapt transformation-associated recombination (TAR) in yeast to design a single genomic capture and expression vector for antibiotic production in Bacillus subtilis. After validating this direct cloning ``plug-and-play'' approach with surfactin, we genetically interrogated amicoumacin biosynthetic gene cluster from the marine isolate Bacillus subtilis 1779. Its heterologous expression allowed us to explore an unusual maturation process involving the N-acyl-asparagine pro-drug intermediates preamicoumacins, which are hydrolyzed by the asparagine-specific peptidase into the active component amicoumacin A. This work represents the first direct cloning based heterologous expression of natural products in the model organism B. subtilis and paves the way to the development of future genome mining efforts in this genus.

  7. Microbial communication leading to the activation of silent fungal secondary metabolite gene clusters

    Directory of Open Access Journals (Sweden)

    Tina eNetzker

    2015-04-01

    Full Text Available Microorganisms form diverse multispecies communities in various ecosystems. The high abundance of fungal and bacterial species in these consortia results in specific communication between the microorganisms. A key role in this communication is played by secondary metabolites (SMs, which are also called natural products. Recently, it was shown that interspecies ‘talk’ between microorganisms represents a physiological trigger to activate silent gene clusters leading to the formation of novel SMs by the involved species. This review focuses on mixed microbial cultivation, mainly between bacteria and fungi, with a special emphasis on the induced formation of fungal SMs in co-cultures. In addition, the role of chromatin remodeling in the induction is examined, and methodical perspectives for the analysis of natural products are presented. As an example for an intermicrobial interaction elucidated at the molecular level, we discuss the specific interaction between the filamentous fungi Aspergillus nidulans and Aspergillus fumigatus with the soil bacterium Streptomyces rapamycinicus, which provides an excellent model system to enlighten molecular concepts behind regulatory mechanisms and will pave the way to a novel avenue of drug discovery through targeted activation of silent SM gene clusters through co-cultivations of microorganisms.

  8. Directed natural product biosynthesis gene cluster capture and expression in the model bacterium Bacillus subtilis

    KAUST Repository

    Li, Yongxin

    2015-03-24

    Bacilli are ubiquitous low G+C environmental Gram-positive bacteria that produce a wide assortment of specialized small molecules. Although their natural product biosynthetic potential is high, robust molecular tools to support the heterologous expression of large biosynthetic gene clusters in Bacillus hosts are rare. Herein we adapt transformation-associated recombination (TAR) in yeast to design a single genomic capture and expression vector for antibiotic production in Bacillus subtilis. After validating this direct cloning plug-and-playa approach with surfactin, we genetically interrogated amicoumacin biosynthetic gene cluster from the marine isolate Bacillus subtilis 1779. Its heterologous expression allowed us to explore an unusual maturation process involving the N-acyl-asparagine pro-drug intermediates preamicoumacins, which are hydrolyzed by the asparagine-specific peptidase into the active component amicoumacin A. This work represents the first direct cloning based heterologous expression of natural products in the model organism B. subtilis and paves the way to the development of future genome mining efforts in this genus.

  9. The dppBCDF gene cluster of Haemophilus influenzae: Role in heme utilization

    Directory of Open Access Journals (Sweden)

    Morton Daniel J

    2009-08-01

    Full Text Available Abstract Background Haemophilus influenzae requires a porphyrin source for aerobic growth and possesses multiple mechanisms to obtain this essential nutrient. This porphyrin requirement may be satisfied by either heme alone, or protoporphyrin IX in the presence of an iron source. One protein involved in heme acquisition by H. influenzae is the periplasmic heme binding protein HbpA. HbpA exhibits significant homology to the dipeptide and heme binding protein DppA of Escherichia coli. DppA is a component of the DppABCDF peptide-heme permease of E. coli. H. influenzae homologs of dppBCDF are located in the genome at a point distant from hbpA. The object of this study was to investigate the potential role of the H. influenzae dppBCDF locus in heme utilization. Findings An insertional mutation in dppC was constructed and the impact of the mutation on the utilization of both free heme and various proteinaceous heme sources as well as utilization of protoporphyrin IX was determined in growth curve studies. The dppC insertion mutant strain was significantly impacted in utilization of all tested heme sources and protoporphyin IX. Complementation of the dppC mutation with an intact dppCBDF gene cluster in trans corrected the growth defects seen in the dppC mutant strain. Conclusion The dppCBDF gene cluster constitutes part of the periplasmic heme-acquisition systems of H. influenzae.

  10. Identification of certain cancer-mediating genes using Gaussian fuzzy cluster validity index

    Indian Academy of Sciences (India)

    Anupam Ghosh; Rajat K De

    2015-10-01

    In this article, we have used an index, called Gaussian fuzzy index (GFI), recently developed by the authors, based on the notion of fuzzy set theory, for validating the clusters obtained by a clustering algorithm applied on cancer gene expression data. GFI is then used for the identification of genes that have altered quite significantly from normal state to carcinogenic state with respect to their mRNA expression patterns. The effectiveness of the methodology has been demonstrated on three gene expression cancer datasets dealing with human lung, colon and leukemia. The performance of GFI is compared with 19 exiting cluster validity indices. The results are appropriately validated biologically and statistically. In this context, we have used biochemical pathways, -value statistics of GO attributes, -test and -score for the validation of the results. It has been reported that GFI is capable of identifying high-quality enriched clusters of genes, and thereby is able to select more cancer-mediating genes.

  11. A rough set based rational clustering framework for determining correlated genes.

    Science.gov (United States)

    Jeyaswamidoss, Jeba Emilyn; Thangaraj, Kesavan; Ramar, Kadarkarai; Chitra, Muthusamy

    2016-06-01

    Cluster analysis plays a foremost role in identifying groups of genes that show similar behavior under a set of experimental conditions. Several clustering algorithms have been proposed for identifying gene behaviors and to understand their significance. The principal aim of this work is to develop an intelligent rough clustering technique, which will efficiently remove the irrelevant dimensions in a high-dimensional space and obtain appropriate meaningful clusters. This paper proposes a novel biclustering technique that is based on rough set theory. The proposed algorithm uses correlation coefficient as a similarity measure to simultaneously cluster both the rows and columns of a gene expression data matrix and mean squared residue to generate the initial biclusters. Furthermore, the biclusters are refined to form the lower and upper boundaries by determining the membership of the genes in the clusters using mean squared residue. The algorithm is illustrated with yeast gene expression data and the experiment proves the effectiveness of the method. The main advantage is that it overcomes the problem of selection of initial clusters and also the restriction of one object belonging to only one cluster by allowing overlapping of biclusters.

  12. An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information

    Directory of Open Access Journals (Sweden)

    Ao Li

    2009-04-01

    Full Text Available Motivation: Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression and gene regulation information is desirable for clustering and analyzing. Methods: By incorporating gene regulatory information with gene expression data, we define regulated expression values (REV as indicators of how a gene is regulated by a specific factor. Existing bi-clustering methods are extended to a three dimensional data space by developing a heuristic TRI-Clustering algorithm. An additional approach named Automatic Boundary Searching algorithm (ABS is introduced to automatically determine the boundary threshold. Results: Results based on incorporating ChIP-chip data representing transcription factor-gene interactions show that the algorithms are efficient and robust for detecting tri-clusters. Detailed analysis of the tri-cluster extracted from yeast sporulation REV data shows genes in this cluster exhibited significant differences during the middle and late stages. The implicated regulatory network was then reconstructed for further study of defined regulatory mechanisms. Topological and statistical analysis of this network demonstrated evidence of significant changes of TF activities during the different stages of yeast sporulation, and suggests this approach might be a general way to study regulatory networks undergoing transformations.

  13. HLA genes and other candidate genes involved in susceptibility for (pre)neoplastic cervical disease

    NARCIS (Netherlands)

    Zoodsma, M; Nolte, IM; Meerman, GJT; De Vries, EGE; Van Der Zee, AGJ

    2005-01-01

    This review focuses on common and genetic risk factors such as HLA and other genes that may be involved in susceptibility for (pre)neoplastic cervical disease. The goal of this review is the evaluation of polymorphisms that are either associated with cervical intraepithelial neoplasia (CIN) and/or c

  14. HLA genes and other candidate genes involved in susceptibility for (pre)neoplastic cervical disease

    NARCIS (Netherlands)

    Zoodsma, M; Nolte, IM; Meerman, GJT; De Vries, EGE; Van Der Zee, AGJ

    2005-01-01

    This review focuses on common and genetic risk factors such as HLA and other genes that may be involved in susceptibility for (pre)neoplastic cervical disease. The goal of this review is the evaluation of polymorphisms that are either associated with cervical intraepithelial neoplasia (CIN) and/or c

  15. Extracellular complementation and the identification of additional genes involved in aerial mycelium formation in Streptomyces coelicolor.

    Science.gov (United States)

    Nodwell, J R; Yang, M; Kuo, D; Losick, R

    1999-02-01

    Morphogenesis in the bacterium Streptomyces coelicolor involves the formation of a lawn of hair-like aerial hyphae on the colony surface that stands up in the air and differentiates into chains of spores. bld mutants are defective in the formation of this aerial mycelium and grow as smooth, hairless colonies. When certain pairs of bld mutants are grown close to one another on rich sporulation medium, they exhibit extracellular complementation such that one mutant restores aerial mycelium formation to the other. The extracellular complementation relationships of most of the previously isolated bld mutants placed them in a hierarchy of extracellular complementation groups. We have screened for further bld mutants with precautions intended to maximize the discovery of additional genes. Most of the 50 newly isolated mutant strains occupy one of three of the previously described positions in the hierarchy, behaving like bldK, bldC, or bldD mutants. We show that the mutations in some of the strains that behave like bldK are bldK alleles but that others fall in a cluster at a position on the chromosome distinct from that of any known bld gene. We name this locus bldL. By introducing cloned genes into the strains that exhibit bldC or bldD-like extracellular complementation phenotypes, we show that most of these strains are likely to contain mutations in genes other than bldC or bldD. These results indicate that the genetic control of aerial mycelium formation is more complex than previously recognized and support the idea that a high proportion of bld genes are directly or indirectly involved in the production of substances that are exchanged between cells during morphological differentiation.

  16. Functional Analysis of Promoters in the Nisin Gene Cluster of Lactococcus lactis

    NARCIS (Netherlands)

    Ruyter, Pascalle G.G.A. de; Kuipers, Oscar P.; Beerthuyzen, Marke M.; Alen-Boerrigter, Ingrid van; Vos, Willem M. de

    1996-01-01

    The promoters in the nisin gene cluster nisABTCIPRKFEG of Lactococcus lactis were characterized by primer extension and transcriptional fusions to the Escherichia coli promoterless β-glucuronidase gene (gusA). Three promoters preceding the nisA, nisR, and nisF genes, which all give rise to gusA expr

  17. Leveraging long sequencing reads to investigate R-gene clustering and variation in sugar beet

    Science.gov (United States)

    Host-pathogen interactions are of prime importance to modern agriculture. Plants utilize various types of resistance genes to mitigate pathogen damage. Identification of the specific gene responsible for a specific resistance can be difficult due to duplication and clustering within R-gene families....

  18. Horizontal transfer of a nitrate assimilation gene cluster and ecological transitions in fungi: a phylogenetic study.

    Directory of Open Access Journals (Sweden)

    Jason C Slot

    Full Text Available High affinity nitrate assimilation genes in fungi occur in a cluster (fHANT-AC that can be coordinately regulated. The clustered genes include nrt2, which codes for a high affinity nitrate transporter; euknr, which codes for nitrate reductase; and NAD(PH-nir, which codes for nitrite reductase. Homologs of genes in the fHANT-AC occur in other eukaryotes and prokaryotes, but they have only been found clustered in the oomycete Phytophthora (heterokonts. We performed independent and concatenated phylogenetic analyses of homologs of all three genes in the fHANT-AC. Phylogenetic analyses limited to fungal sequences suggest that the fHANT-AC has been transferred horizontally from a basidiomycete (mushrooms and smuts to an ancestor of the ascomycetous mold Trichoderma reesei. Phylogenetic analyses of sequences from diverse eukaryotes and eubacteria, and cluster structure, are consistent with a hypothesis that the fHANT-AC was assembled in a lineage leading to the oomycetes and was subsequently transferred to the Dikarya (Ascomycota+Basidiomycota, which is a derived fungal clade that includes the vast majority of terrestrial fungi. We propose that the acquisition of high affinity nitrate assimilation contributed to the success of Dikarya on land by allowing exploitation of nitrate in aerobic soils, and the subsequent transfer of a complete assimilation cluster improved the fitness of T. reesei in a new niche. Horizontal transmission of this cluster of functionally integrated genes supports the "selfish operon" hypothesis for maintenance of gene clusters.

  19. Fatal and serious road crashes involving young New Zealand drivers: a latent class clustering approach

    DEFF Research Database (Denmark)

    Weiss, Harold B.; Kaplan, Sigal; Prato, Carlo Giacomo

    2016-01-01

    a holistic overview of crash patterns useful for designing youth-targeted road safety programmes. We compiled a database of 8644 New Zealand crashes from 2002 to 2011 involving at least one 15–24-year-old driver and a fatal or serious injury for at least one road user. We considered crash location......The over-representation of young drivers in road crashes remains an important concern worldwide. Cluster analysis has been applied to young driver sub-groups, but its application by analysing crash occurrence is just emerging. We present a classification analysis that advances the field through......, infrastructure characteristics, environmental conditions, demographic characteristics, driving behaviour, and pre-crash manoeuvres. The analysis yielded 15 and 8 latent classes of, respectively, single-vehicle and multi-vehicle crashes, and average posterior probabilities measured the odds of correct...

  20. Bayesian History Reconstruction of Complex Human Gene Clusters on a Phylogeny

    CERN Document Server

    Vinař, Tomáš; Song, Giltae; Siepel, Adam

    2009-01-01

    Clusters of genes that have evolved by repeated segmental duplication present difficult challenges throughout genomic analysis, from sequence assembly to functional analysis. Improved understanding of these clusters is of utmost importance, since they have been shown to be the source of evolutionary innovation, and have been linked to multiple diseases, including HIV and a variety of cancers. Previously, Zhang et al. (2008) developed an algorithm for reconstructing parsimonious evolutionary histories of such gene clusters, using only human genomic sequence data. In this paper, we propose a probabilistic model for the evolution of gene clusters on a phylogeny, and an MCMC algorithm for reconstruction of duplication histories from genomic sequences in multiple species. Several projects are underway to obtain high quality BAC-based assemblies of duplicated clusters in multiple species, and we anticipate that our method will be useful in analyzing these valuable new data sets.

  1. Divergence and transcriptional analysis of the division cell wall (dcw) gene cluster in Neisseria spp.

    Science.gov (United States)

    Snyder, Lori A S; Shafer, William M; Saunders, Nigel J

    2003-01-01

    Three of the 18 open reading frames in the division and cell wall synthesis cluster of the pathogenic Neisseria spp. are not present in the clusters of other bacterial species. The region containing two of these, dcaB and dcaC, displays interstrain and interspecies variability uncharacteristic of such clusters. 3' of dcaB is a Correia repeat enclosed element (CREE), which is only present in some strains. It has been suggested that this CREE is a transcriptional terminator, although we demonstrate otherwise. A gearbox-like promoter within this CREE is active in Escherichia coli but not in Neisseria meningitidis. There is an active promoter 5' of dcaC, although its sequence is not conserved. The presence of similarly located promoters has not been demonstrated in other species. In Neisseria lactamica, this promoter involves another dcw-associated CREE, the first demonstration of active promoter generation at the 5' end of this common intergenic, apparently mobile, element. Upstream of this promoter is an inverted pair of neisserial uptake signal sequences, which are commonly considered to be transcriptional terminators. It has been proposed to terminate transcription in this location, although we have demonstrated transcript extending through this uptake signal sequence. dcaC contains a 108 bp tandem repeat, which is present in different copy numbers in the neisserial strains examined. This investigation reveals extensive sequence variation, disputes the presence of transcriptional terminators and identifies active internal promoters in this normally highly conserved cluster of essential genes, and addresses the transcriptional activity of two common neisserial intergenic components.

  2. MADIBA: A web server toolkit for biological interpretation of Plasmodium and plant gene clusters

    Directory of Open Access Journals (Sweden)

    Louw Abraham I

    2008-02-01

    Full Text Available Abstract Background Microarray technology makes it possible to identify changes in gene expression of an organism, under various conditions. Data mining is thus essential for deducing significant biological information such as the identification of new biological mechanisms or putative drug targets. While many algorithms and software have been developed for analysing gene expression, the extraction of relevant information from experimental data is still a substantial challenge, requiring significant time and skill. Description MADIBA (MicroArray Data Interface for Biological Annotation facilitates the assignment of biological meaning to gene expression clusters by automating the post-processing stage. A relational database has been designed to store the data from gene to pathway for Plasmodium, rice and Arabidopsis. Tools within the web interface allow rapid analyses for the identification of the Gene Ontology terms relevant to each cluster; visualising the metabolic pathways where the genes are implicated, their genomic localisations, putative common transcriptional regulatory elements in the upstream sequences, and an analysis specific to the organism being studied. Conclusion MADIBA is an integrated, online tool that will assist researchers in interpreting their results and understand the meaning of the co-expression of a cluster of genes. Functionality of MADIBA was validated by analysing a number of gene clusters from several published experiments – expression profiling of the Plasmodium life cycle, and salt stress treatments of Arabidopsis and rice. In most of the cases, the same conclusions found by the authors were quickly and easily obtained after analysing the gene clusters with MADIBA.

  3. A murine specific expansion of the Rhox cluster involved in embryonic stem cell biology is under natural selection

    Directory of Open Access Journals (Sweden)

    Keebler Jon

    2006-08-01

    Full Text Available Abstract Background The rodent specific reproductive homeobox (Rhox gene cluster on the X chromosome has been reported to contain twelve homeobox-containing genes, Rhox1-12. Results We have identified a 40 kb genomic region within the Rhox cluster that is duplicated eight times in tandem resulting in the presence of eight paralogues of Rhox2 and Rhox3 and seven paralogues of Rhox4. Transcripts have been identified for the majority of these paralogues and all but three are predicted to produce full-length proteins with functional potential. We predict that there are a total of thirty-two Rhox genes at this genomic location, making it the most gene-rich homoeobox cluster identified in any species. From the 95% sequence similarity between the eight duplicated genomic regions and the synonymous substitution rate of the Rhox2, 3 and 4 paralogues we predict that the duplications occurred after divergence of mouse and rat and represent the youngest homoeobox cluster identified to date. Molecular evolutionary analysis reveals that this cluster is an actively evolving region with Rhox2 and 4 paralogues under diversifying selection and Rhox3 evolving neutrally. The biological importance of this duplication is emphasised by the identification of an important role for Rhox2 and Rhox4 in regulating the initial stages of embryonic stem (ES cell differentiation. Conclusion The gene rich Rhox cluster provides the mouse with significant biological novelty that we predict could provide a substrate for speciation. Moreover, this unique cluster may explain species differences in ES cell derivation and maintenance between mouse, rat and human.

  4. Functional analysis of soybean genes involved in flavonoid biosynthesis by virus-induced gene silencing.

    Science.gov (United States)

    Nagamatsu, Atsushi; Masuta, Chikara; Senda, Mineo; Matsuura, Hideyuki; Kasai, Atsushi; Hong, Jin-Sung; Kitamura, Keisuke; Abe, Jun; Kanazawa, Akira

    2007-11-01

    Virus-induced gene silencing (VIGS) is a powerful tool for functional analysis of genes in plants. A wide-host-range VIGS vector, which was developed based on the Cucumber mosaic virus (CMV), was tested for its ability to silence endogenous genes involved in flavonoid biosynthesis in soybean. Symptomless infection was established using a pseudorecombinant virus, which enabled detection of specific changes in metabolite content by VIGS. It has been demonstrated that the yellow seed coat phenotype of various cultivated soybean lines that lack anthocyanin pigmentation is induced by natural degradation of chalcone synthase (CHS) mRNA. When soybean plants with brown seed coats were infected with a virus that contains the CHS gene sequence, the colour of the seed coats changed to yellow, which indicates that the naturally occurring RNA silencing is reproduced by VIGS. In addition, CHS VIGS consequently led to a decrease in isoflavone content in seeds. VIGS was also tested on the putative flavonoid 3'-hydroxylase (F3'H) gene in the pathway. This experiment resulted in a decrease in the content of quercetin relative to kaempferol in the upper leaves after viral infection, which suggests that the putative gene actually encodes the F3'H protein. In both experiments, a marked decrease in the target mRNA and accumulation of short interfering RNAs were detected, indicating that sequence-specific mRNA degradation was induced. The present report is a successful demonstration of the application of VIGS for genes involved in flavonoid biosynthesis in plants; the CMV-based VIGS system provides an efficient tool for functional analysis of soybean genes.

  5. Identifying genes and gene networks involved in chromium metabolism and detoxification in Crambe abyssinica

    Energy Technology Data Exchange (ETDEWEB)

    Zulfiqar, Asma, E-mail: asmazulfiqar08@yahoo.com [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Paulose, Bibin, E-mail: bpaulose@psis.umass.edu [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Chhikara, Sudesh, E-mail: sudesh@psis.umass.edu [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Dhankher, Om Parkash, E-mail: parkash@psis.umass.edu [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States)

    2011-10-15

    Chromium pollution is a serious environmental problem with few cost-effective remediation strategies available. Crambe abyssinica (a member of Brassicaseae), a non-food, fast growing high biomass crop, is an ideal candidate for phytoremediation of heavy metals contaminated soils. The present study used a PCR-Select Suppression Subtraction Hybridization approach in C. abyssinica to isolate differentially expressed genes in response to Cr exposure. A total of 72 differentially expressed subtracted cDNAs were sequenced and found to represent 43 genes. The subtracted cDNAs suggest that Cr stress significantly affects pathways related to stress/defense, ion transporters, sulfur assimilation, cell signaling, protein degradation, photosynthesis and cell metabolism. The regulation of these genes in response to Cr exposure was further confirmed by semi-quantitative RT-PCR. Characterization of these differentially expressed genes may enable the engineering of non-food, high-biomass plants, including C. abyssinica, for phytoremediation of Cr-contaminated soils and sediments. - Highlights: > Molecular mechanism of Cr uptake and detoxification in plants is not well known. > We identified differentially regulated genes upon Cr exposure in Crambe abyssinica. > 72 Cr-induced subtracted cDNAs were sequenced and found to represent 43 genes. > Pathways linked to stress, ion transport, and sulfur assimilation were affected. > This is the first Cr transcriptome study in a crop with phytoremediation potential. - This study describes the identification and isolation of differentially expressed genes involved in chromium metabolism and detoxification in a non-food industrial oil crop Crambe abyssinica.

  6. Characterization of the fumonisin B2 biosynthetic gene cluster in Aspergillus niger and A. awamori.

    Science.gov (United States)

    Aspergillus niger and A. awamori strains isolated from grapes cultivated in Mediterranean basin were examined for fumonisin B2 (FB2) production and presence/absence of sequences within the fumonisin biosynthetic gene (fum) cluster. Presence of 13 regions in the fum cluster was evaluated by PCR assay...

  7. Bayesian hierarchical clustering for studying cancer gene expression data with unknown statistics.

    Directory of Open Access Journals (Sweden)

    Korsuk Sirinukunwattana

    Full Text Available Clustering analysis is an important tool in studying gene expression data. The Bayesian hierarchical clustering (BHC algorithm can automatically infer the number of clusters and uses Bayesian model selection to improve clustering quality. In this paper, we present an extension of the BHC algorithm. Our Gaussian BHC (GBHC algorithm represents data as a mixture of Gaussian distributions. It uses normal-gamma distribution as a conjugate prior on the mean and precision of each of the Gaussian components. We tested GBHC over 11 cancer and 3 synthetic datasets. The results on cancer datasets show that in sample clustering, GBHC on average produces a clustering partition that is more concordant with the ground truth than those obtained from other commonly used algorithms. Furthermore, GBHC frequently infers the number of clusters that is often close to the ground truth. In gene clustering, GBHC also produces a clustering partition that is more biologically plausible than several other state-of-the-art methods. This suggests GBHC as an alternative tool for studying gene expression data. The implementation of GBHC is available at https://sites.google.com/site/gaussianbhc/

  8. AntiSMASH 4.0 - improvements in chemistry prediction and gene cluster boundary identification

    NARCIS (Netherlands)

    Blin, Kai; Wolf, Thomas; Chevrette, Marc G.; Lu, Xiaowen; Schwalen, Christopher J.; Kautsar, Satria A.; Suarez Duran, Hernando G.; Los Santos, De Emmanuel L.C.; Kim, Hyun Uk; Nave, Mariana; Dickschat, Jeroen S.; Mitchell, Douglas A.; Shelest, Ekaterina; Breitling, Rainer; Takano, Eriko; Lee, Sang Yup; Weber, Tilmann; Medema, Marnix H.

    2017-01-01

    Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the production

  9. Comparative genomic analysis of secondary metabolite biosynthetic gene clusters in 207 isolates of Fusarium

    Science.gov (United States)

    Fusarium species are known for their ability to produce secondary metabolites (SMs), including plant hormones, pigments, mycotoxins, and other compounds with potential agricultural, pharmaceutical, and biotechnological impact. Understanding the distribution of SM biosynthetic gene clusters across th...

  10. Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture.

    Science.gov (United States)

    Johnson, Timothy A; Stedtfeld, Robert D; Wang, Qiong; Cole, James R; Hashsham, Syed A; Looft, Torey; Zhu, Yong-Guan; Tiedje, James M

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance genes if

  11. Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

    Energy Technology Data Exchange (ETDEWEB)

    Data Analysis and Visualization (IDAV) and the Department of Computer Science, University of California, Davis, One Shields Avenue, Davis CA 95616, USA,; nternational Research Training Group ``Visualization of Large and Unstructured Data Sets,' ' University of Kaiserslautern, Germany; Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA; Genomics Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA; Life Sciences Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA,; Computer Science Division,University of California, Berkeley, CA, USA,; Computer Science Department, University of California, Irvine, CA, USA,; All authors are with the Berkeley Drosophila Transcription Network Project, Lawrence Berkeley National Laboratory,; Rubel, Oliver; Weber, Gunther H.; Huang, Min-Yu; Bethel, E. Wes; Biggin, Mark D.; Fowlkes, Charless C.; Hendriks, Cris L. Luengo; Keranen, Soile V. E.; Eisen, Michael B.; Knowles, David W.; Malik, Jitendra; Hagen, Hans; Hamann, Bernd

    2008-05-12

    The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii) evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors.

  12. Chordate Hox and ParaHox gene clusters differ dramatically in their repetitive element content.

    Science.gov (United States)

    Osborne, Peter W; Ferrier, David E K

    2010-02-01

    The ParaHox and Hox gene clusters control aspects of animal anterior-posterior development and are related as paralogous evolutionary sisters. Despite this relationship, it is not clear if the clusters operate in similar ways, with similar constraints. To compare clusters, we examined the transposable-element (TE) content of amphioxus and mammalian ParaHox and Hox clusters. Chordate Hox clusters are known to be largely devoid of TEs, possibly due to gene regulation and constraints on clustering in these animals. Here, we describe several novel amphioxus TEs and show that the amphioxus ParaHox cluster is a hotspot for TE insertion. TE contents of mammalian ParaHox loci are at background levels, in stark contrast to chordate Hox clusters. This marks a significant difference between Hox and ParaHox clusters. The presence of so many potentially disruptive elements implies selection constrains these ParaHox clusters as they have not dispersed despite 500 My of evolution for each lineage.

  13. Visualization of mappings between the gene ontology and cluster trees

    Science.gov (United States)

    Jusufi, Ilir; Kerren, Andreas; Aleksakhin, Vladyslav; Schreiber, Falk

    2012-01-01

    Ontologies and hierarchical clustering are both important tools in biology and medicine to study high-throughput data such as transcriptomics and metabolomics data. Enrichment of ontology terms in the data is used to identify statistically overrepresented ontology terms, giving insight into relevant biological processes or functional modules. Hierarchical clustering is a standard method to analyze and visualize data to find relatively homogeneous clusters of experimental data points. Both methods support the analysis of the same data set, but are usually considered independently. However, often a combined view is desired: visualizing a large data set in the context of an ontology under consideration of a clustering of the data. This paper proposes a new visualization method for this task.

  14. Prevalence of the lmo0036-0043 gene cluster encoding arginine deiminase and agmatine deiminase systems in Listeria monocytogenes.

    Science.gov (United States)

    Chen, Jianshun; Chen, Fan; Cheng, Changyong; Fang, Weihuan

    2013-04-01

    Arginine deiminase and agmatine deiminase systems are involved in acid tolerance, and their encoding genes form the cluster lmo0036-0043 in Listeria monocytogenes. While lmo0042 and lmo0043 were conserved in all L. monocytogenes strains, the lmo0036-0041 region of this cluster was identified in all lineages I and II, and the majority of lineage IV (83.3%) strains, but absent in all lineage III and a small fraction of lineage IV (16.7%) strains, suggesting that the presence of the complete lmo0036-0043 cluster is dependent on lineages. lmo0036-0043-complete and -deficient lineage IV strains exhibit specific ascB-dapE profiles, which might represent two subpopulations with distinct genetic characteristics.

  15. Gene expression correlation analysis predicts involvement of high- and low-confidence risk genes in different stages of prostate carcinogenesis.

    Science.gov (United States)

    Yano, Kojiro

    2010-12-01

    Whole genome association studies have identified many loci associated with the risk of prostate cancer (PC). However, very few of the genes associated with these loci have been related to specific processes of prostate carcinogenesis. Therefore I inferred biological functions associated with these risk genes using gene expression correlation analysis. PC risk genes reported in the literature were classified as having high (Plow (Phigh-confidence genes and other genes in the microarray dataset, whereas correlation between low-confidence genes and other genes in PC showed smaller decrease. Genes involved in developmental processes were significantly correlated with all risk gene categories. Ectoderm development genes, which may be related to squamous metaplasia, and genes enriched in fetal prostate stem cells (PSCs) showed strong association with the high-confidence genes. The association between the PSC genes and the low-confidence genes was weak, but genes related to neural system genes showed strong association with low-confidence genes. The high-confidence risk genes may be associated with an early stage of prostate carcinogenesis, possibly involving PSCs and squamous metaplasia. The low-confidence genes may be involved in a later stage of carcinogenesis. © 2010 Wiley-Liss, Inc.

  16. Yeast homologous recombination-based promoter engineering for the activation of silent natural product biosynthetic gene clusters.

    Science.gov (United States)

    Montiel, Daniel; Kang, Hahk-Soo; Chang, Fang-Yuan; Charlop-Powers, Zachary; Brady, Sean F

    2015-07-21

    Large-scale sequencing of prokaryotic (meta)genomic DNA suggests that most bacterial natural product gene clusters are not expressed under common laboratory culture conditions. Silent gene clusters represent a promising resource for natural product discovery and the development of a new generation of therapeutics. Unfortunately, the characterization of molecules encoded by these clusters is hampered owing to our inability to express these gene clusters in the laboratory. To address this bottleneck, we have developed a promoter-engineering platform to transcriptionally activate silent gene clusters in a model heterologous host. Our approach uses yeast homologous recombination, an auxotrophy complementation-based yeast selection system and sequence orthogonal promoter cassettes to exchange all native promoters in silent gene clusters with constitutively active promoters. As part of this platform, we constructed and validated a set of bidirectional promoter cassettes consisting of orthogonal promoter sequences, Streptomyces ribosome binding sites, and yeast selectable marker genes. Using these tools we demonstrate the ability to simultaneously insert multiple promoter cassettes into a gene cluster, thereby expediting the reengineering process. We apply this method to model active and silent gene clusters (rebeccamycin and tetarimycin) and to the silent, cryptic pseudogene-containing, environmental DNA-derived Lzr gene cluster. Complete promoter refactoring and targeted gene exchange in this "dead" cluster led to the discovery of potent indolotryptoline antiproliferative agents, lazarimides A and B. This potentially scalable and cost-effective promoter reengineering platform should streamline the discovery of natural products from silent natural product biosynthetic gene clusters.

  17. Unusual Gene Order and Organization of the Sea Urchin Hox Cluster

    Energy Technology Data Exchange (ETDEWEB)

    Cameron, R A; Rowen, L; Nesbitt, R; Bloom, S; Rast, J P; Berney, K; Arenas-Mena, C; Martinez, P; Lucas, S; Richardson, P M; Davidson, E H; Peterson, K J; Hood, L

    2005-10-11

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3 gene is Hox5. (The gene order is : 5-Hox1, 2, 3, 11/13c, 11/13b, 11/13a, 9/10, 8, 7, 6, 5 - 3). The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.

  18. Unusual Gene Order and Organization of the Sea Urchin HoxCluster

    Energy Technology Data Exchange (ETDEWEB)

    Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew; Rowen,Lee; Nesbitt, Ryan; Bloom, Scott; Rast, Jonathan P.; Berney, Kevin; Arenas-Mena, Cesar; Martinez, Pedro; Davidson, Eric H.; Peterson, KevinJ.; Hood, Leroy

    2005-05-10

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is : 5'-Hox1,2, 3, 11/13c, 11/13b, '11/13a, 9/10, 8, 7, 6, 5 - 3)'. The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.

  19. Sequencing rare marine actinomycete genomes reveals high density of unique natural product biosynthetic gene clusters.

    Science.gov (United States)

    Schorn, Michelle A; Alanjary, Mohammad M; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R; Ziemert, Nadine; Moore, Bradley S

    2016-12-01

    Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites.

  20. Natural product proteomining, a quantitative proteomics platform, allows rapid discovery of biosynthetic gene clusters for different classes of natural products.

    Science.gov (United States)

    Gubbens, Jacob; Zhu, Hua; Girard, Geneviève; Song, Lijiang; Florea, Bogdan I; Aston, Philip; Ichinose, Koji; Filippov, Dmitri V; Choi, Young H; Overkleeft, Herman S; Challis, Gregory L; van Wezel, Gilles P

    2014-06-19

    Information on gene clusters for natural product biosynthesis is accumulating rapidly because of the current boom of available genome sequencing data. However, linking a natural product to a specific gene cluster remains challenging. Here, we present a widely applicable strategy for the identification of gene clusters for specific natural products, which we name natural product proteomining. The method is based on using fluctuating growth conditions that ensure differential biosynthesis of the bioactivity of interest. Subsequent combination of metabolomics and quantitative proteomics establishes correlations between abundance of natural products and concomitant changes in the protein pool, which allows identification of the relevant biosynthetic gene cluster. We used this approach to elucidate gene clusters for different natural products in Bacillus and Streptomyces, including a novel juglomycin-type antibiotic. Natural product proteomining does not require prior knowledge of the gene cluster or secondary metabolite and therefore represents a general strategy for identification of all types of gene clusters.

  1. Lampreys, the jawless vertebrates, contain only two ParaHox gene clusters.

    Science.gov (United States)

    Zhang, Huixian; Ravi, Vydianathan; Tay, Boon-Hui; Tohari, Sumanty; Pillai, Nisha E; Prasad, Aravind; Lin, Qiang; Brenner, Sydney; Venkatesh, Byrappa

    2017-08-22

    ParaHox genes (Gsx, Pdx, and Cdx) are an ancient family of developmental genes closely related to the Hox genes. They play critical roles in the patterning of brain and gut. The basal chordate, amphioxus, contains a single ParaHox cluster comprising one member of each family, whereas nonteleost jawed vertebrates contain four ParaHox genomic loci with six or seven ParaHox genes. Teleosts, which have experienced an additional whole-genome duplication, contain six ParaHox genomic loci with six ParaHox genes. Jawless vertebrates, represented by lampreys and hagfish, are the most ancient group of vertebrates and are crucial for understanding the origin and evolution of vertebrate gene families. We have previously shown that lampreys contain six Hox gene loci. Here we report that lampreys contain only two ParaHox gene clusters (designated as α- and β-clusters) bearing five ParaHox genes (Gsxα, Pdxα, Cdxα, Gsxβ, and Cdxβ). The order and orientation of the three genes in the α-cluster are identical to that of the single cluster in amphioxus. However, the orientation of Gsxβ in the β-cluster is inverted. Interestingly, Gsxβ is expressed in the eye, unlike its homologs in jawed vertebrates, which are expressed mainly in the brain. The lamprey Pdxα is expressed in the pancreas similar to jawed vertebrate Pdx genes, indicating that the pancreatic expression of Pdx was acquired before the divergence of jawless and jawed vertebrate lineages. It is likely that the lamprey Pdxα plays a crucial role in pancreas specification and insulin production similar to the Pdx of jawed vertebrates.

  2. IMP2, a gene involved in the expression of glucose-repressible genes in Saccharomyces cerevisiae.

    Science.gov (United States)

    Lodi, T; Goffrini, P; Ferrero, I; Donnini, C

    1995-09-01

    Two mutants carrying different deletions of the IMP2 coding sequence of Saccharomyces cerevisiae, delta T1, which encodes a protein lacking the last 26 C-terminal amino acids, and delta T2, which completely lacks the coding region, were analysed for derepression of glucose-repressible maltose, galactose, raffinose and ethanol utilization pathways in response to glucose limitation. The role of the IMP2 gene product in the regulation of carbon catabolite repressible enzymes maltase, invertase, alcohol dehydrogenase, NAD-dependent glutamate dehydrogenase (NAD-GDH) and L-lactate:ferricytochrome-c oxidoreductase (L-LCR) was also analysed. The IMP2 gene product is required for the rapid glucose derepression of all above-mentioned carbon source utilization pathways and of all the enzymes except for L-LCR. NAD-GDH is regulated by IMP2 in the opposite way and, in fact, this enzyme was released at higher levels in both imp2 mutants than in the wild-type strain. Therefore, the product of IMP2 appears to be involved in positive and negative regulation. Both deletions result in growth and catalytic defects; in some cases partial modification of the gene product yielded more dramatic effects than its complete absence. Moreover, evidence is provided that the IMP2 gene product regulates galactose- and maltose-inducible genes at the transcriptional level and is a positive regulator of maltase, maltose permease and galactose permease gene expression.

  3. Characterization of the urease gene cluster from Rhizobium leguminosarum bv. viciae.

    Science.gov (United States)

    Toffanin, Annita; Cadahia, Esther; Imperial, Juan; Ruiz-Argüeso, Tomás; Palacios, Manuel

    2002-04-01

    Moderate levels of urease activity (ca. 300 mU mg(-1)) were detected in Rhizobium leguminosarum bv. viciae UPM791 vegetative cells. This activity did not require urea for induction and was partially repressed by the addition of ammonium into the medium. Lower levels of urease activity (ca. 100 mU mg(-1)) were detected also in pea bacteroids. A DNA region of ca. 9 kb containing the urease structural genes ( ureA, ureB and ureC), accessory genes ( ureD, ureE, ureF, and ureG), and five additional ORFs ( orf83, orf135, orf207, orf223, and orf287) encoding proteins of unknown function was sequenced. Three of these ORFs ( orf83, orf135 and orf207) have a homologous counterpart in a gene cluster from Sinorhizobium meliloti, reported to be involved in urease and hydrogenase activities. R. leguminosarum mutant strains carrying Tn 5 insertions within this region exhibited a urease-negative phenotype, but induced wild-type levels of hydrogenase and nitrogenase activities in bacteroids. orf287 encodes a potential transmembrane protein with a C-terminal GGDEF domain. A mutant affected in orf287 exhibited normal levels of urease activity in culture cells. Experiments aimed at cross-complementing Ni-binding proteins required for urease and hydrogenase synthesis (UreE and HypB, respectively) indicated that these two proteins are not functionally interchangeable in R. leguminosarum.

  4. Identification and comparative analyses of Siamois cluster genes in Xenopus laevis and tropicalis.

    Science.gov (United States)

    Haramoto, Yoshikazu; Saijyo, Tomohito; Tanaka, Toshiaki; Furuno, Nobuaki; Suzuki, Atsushi; Ito, Yuzuru; Kondo, Mariko; Taira, Masanori; Takahashi, Shuji

    2017-06-15

    Two siamois-related homeobox genes siamois (sia1) and twin (sia2), have been reported in Xenopus laevis. These genes are expressed in the blastula chordin- and noggin-expressing (BCNE) center and the Nieuwkoop center, and have complete secondary axis-inducing activity when over-expressed on the ventral side of the embryo. Using whole genome sequences of X. tropicalis and X. laevis, we identified two additional siamois-related genes, which are tandemly duplicated near sia1 and sia2 to form the siamois gene cluster. Four siamois genes in X. tropicalis are transcribed at blastula to gastrula stages. In X. laevis, the siamois gene cluster is present on both homeologous chromosomes, XLA3L and XLA3S. Transcripts from seven siamois genes (three on XLA3L and four on XLA3S) in X. laevis were detected at blastula to gastrula stages. A transcribed gene, sia1p. S, encodes an inactive protein without a homeodomain. When over-expressed ventrally, all siamois-related genes tested in this study except for sia1p. S induced a complete secondary axis, indicating that X. tropicalis and X. laevis have four and six active siamois-related genes, respectively. Of note, each gene required different amounts of mRNA for full activity. These results suggest the possibility that siamois cluster genes have functional redundancy to endow robustness and quickness to organizer formation in Xenopus species. Copyright © 2017. Published by Elsevier Inc.

  5. The Local Maximum Clustering Method and Its Application in Microarray Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    Chen Yidong

    2004-01-01

    Full Text Available An unsupervised data clustering method, called the local maximum clustering (LMC method, is proposed for identifying clusters in experiment data sets based on research interest. A magnitude property is defined according to research purposes, and data sets are clustered around each local maximum of the magnitude property. By properly defining a magnitude property, this method can overcome many difficulties in microarray data clustering such as reduced projection in similarities, noises, and arbitrary gene distribution. To critically evaluate the performance of this clustering method in comparison with other methods, we designed three model data sets with known cluster distributions and applied the LMC method as well as the hierarchic clustering method, the -mean clustering method, and the self-organized map method to these model data sets. The results show that the LMC method produces the most accurate clustering results. As an example of application, we applied the method to cluster the leukemia samples reported in the microarray study of Golub et al. (1999.

  6. Integration of gene expression and GWAS results supports involvement of calcium signaling in Schizophrenia.

    Science.gov (United States)

    Hertzberg, L; Katsel, P; Roussos, P; Haroutunian, V; Domany, E

    2015-05-01

    The number of Genome Wide Association Studies (GWAS) of schizophrenia is rapidly growing. However, the small effect of individual genes limits the number of reliably implicated genes, which are too few and too diverse to perform reliable pathway analysis; hence the biological roles of the genes implicated in schizophrenia are unclear. To overcome these limitations we combine GWAS with genome-wide expression data from human post-mortem brain samples of schizophrenia patients and controls, taking these steps: 1) Identify 36 GWAS-based genes which are expressed in our dataset. 2) Find a cluster of 19 genes with highly correlated expression. We show that this correlation pattern is robust and statistically significant. 3) GO-enrichment analysis of these 19 genes reveals significant enrichment of ion channels and calcium-related processes. This finding (based on analyzing a small number of coherently expressed genes) is validated and enhanced in two ways: First, the emergence of calcium channels and calcium signaling is corroborated by identifying proteins that interact with those encoded by the cluster of 19. Second, extend the 19 cluster genes into 1028 genes, whose expression is highly correlated with the cluster's average profile. When GO-enrichment analysis is performed on this extended set, many schizophrenia related pathways appear, with calcium-related processes enriched with high statistical significance. Our results give further, expression-based validation to GWAS results, support a central role of calcium-signaling in the pathogenesis of schizophrenia, and point to additional pathways potentially related to the disease.

  7. Motif-independent de novo detection of secondary metabolite gene clusters-toward identification from filamentous fungi.

    Science.gov (United States)

    Umemura, Myco; Koike, Hideaki; Machida, Masayuki

    2015-01-01

    Secondary metabolites are produced mostly by clustered genes that are essential to their biosynthesis. The transcriptional expression of these genes is often cooperatively regulated by a transcription factor located inside or close to a cluster. Most of the secondary metabolism biosynthesis (SMB) gene clusters identified to date contain so-called core genes with distinctive sequence features, such as polyketide synthase (PKS) and non-ribosomal peptide synthetase (NRPS). Recent efforts in sequencing fungal genomes have revealed far more SMB gene clusters than expected based on the number of core genes in the genomes. Several bioinformatics tools have been developed to survey SMB gene clusters using the sequence motif information of the core genes, including SMURF and antiSMASH. More recently, accompanied by the development of sequencing techniques allowing to obtain large-scale genomic and transcriptomic data, motif-independent prediction methods of SMB gene clusters, including MIDDAS-M, have been developed. Most these methods detect the clusters in which the genes are cooperatively regulated at transcriptional levels, thus allowing the identification of novel SMB gene clusters regardless of the presence of the core genes. Another type of the method, MIPS-CG, uses the characteristics of SMB genes, which are highly enriched in non-syntenic blocks (NSBs), enabling the prediction even without transcriptome data although the results have not been evaluated in detail. Considering that large portion of SMB gene clusters might be sufficiently expressed only in limited uncommon conditions, it seems that prediction of SMB gene clusters by bioinformatics and successive experimental validation is an only way to efficiently uncover hidden SMB gene clusters. Here, we describe and discuss possible novel approaches for the determination of SMB gene clusters that have not been identified using conventional methods.

  8. New Alzheimer amyloid beta responsive genes identified in human neuroblastoma cells by hierarchical clustering.

    Directory of Open Access Journals (Sweden)

    Markus Uhrig

    Full Text Available Alzheimer's disease (AD is characterized by neuronal degeneration and cell loss. Abeta(42, in contrast to Abeta(40, is thought to be the pathogenic form triggering the pathological cascade in AD. In order to unravel overall gene regulation we monitored the transcriptomic responses to increased or decreased Abeta(40 and Abeta(42 levels, generated and derived from its precursor C99 (C-terminal fragment of APP comprising 99 amino acids in human neuroblastoma cells. We identified fourteen differentially expressed transcripts by hierarchical clustering and discussed their involvement in AD. These fourteen transcripts were grouped into two main clusters each showing distinct differential expression patterns depending on Abeta(40 and Abeta(42 levels. Among these transcripts we discovered an unexpected inverse and strong differential expression of neurogenin 2 (NEUROG2 and KIAA0125 in all examined cell clones. C99-overexpression had a similar effect on NEUROG2 and KIAA0125 expression as a decreased Abeta(42/Abeta(40 ratio. Importantly however, an increased Abeta(42/Abeta(40 ratio, which is typical of AD, had an inverse expression pattern of NEUROG2 and KIAA0125: An increased Abeta(42/Abeta(40 ratio up-regulated NEUROG2, but down-regulated KIAA0125, whereas the opposite regulation pattern was observed for a decreased Abeta(42/Abeta(40 ratio. We discuss the possibilities that the so far uncharacterized KIAA0125 might be a counter player of NEUROG2 and that KIAA0125 could be involved in neurogenesis, due to the involvement of NEUROG2 in developmental neural processes.

  9. Two Horizontally Transferred Xenobiotic Resistance Gene Clusters Associated with Detoxification of Benzoxazolinones by Fusarium Species

    Science.gov (United States)

    Glenn, Anthony E.; Davis, C. Britton; Gao, Minglu; Gold, Scott E.; Mitchell, Trevor R.; Proctor, Robert H.; Stewart, Jane E.; Snook, Maurice E.

    2016-01-01

    Microbes encounter a broad spectrum of antimicrobial compounds in their environments and often possess metabolic strategies to detoxify such xenobiotics. We have previously shown that Fusarium verticillioides, a fungal pathogen of maize known for its production of fumonisin mycotoxins, possesses two unlinked loci, FDB1 and FDB2, necessary for detoxification of antimicrobial compounds produced by maize, including the γ-lactam 2-benzoxazolinone (BOA). In support of these earlier studies, microarray analysis of F. verticillioides exposed to BOA identified the induction of multiple genes at FDB1 and FDB2, indicating the loci consist of gene clusters. One of the FDB1 cluster genes encoded a protein having domain homology to the metallo-β-lactamase (MBL) superfamily. Deletion of this gene (MBL1) rendered F. verticillioides incapable of metabolizing BOA and thus unable to grow on BOA-amended media. Deletion of other FDB1 cluster genes, in particular AMD1 and DLH1, did not affect BOA degradation. Phylogenetic analyses and topology testing of the FDB1 and FDB2 cluster genes suggested two horizontal transfer events among fungi, one being transfer of FDB1 from Fusarium to Colletotrichum, and the second being transfer of the FDB2 cluster from Fusarium to Aspergillus. Together, the results suggest that plant-derived xenobiotics have exerted evolutionary pressure on these fungi, leading to horizontal transfer of genes that enhance fitness or virulence. PMID:26808652

  10. β-globin gene cluster haplotypes in ethnic minority populations of southwest China

    Science.gov (United States)

    Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing

    2017-01-01

    The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, haplotype 9 frequency was significantly high in the southwest populations, indicating their difference with other Chinese. The interpopulation differentiation of southwest Chinese minority populations is less than those in populations of northern China and other continents. Phylogenetic analysis shows that populations sharing same ethnic origin or language clustered to each other, indicating current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625

  11. Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological features

    Directory of Open Access Journals (Sweden)

    Bissell MJ

    2006-03-01

    Full Text Available Abstract Background Ensemble attribute profile clustering is a novel, text-based strategy for analyzing a user-defined list of genes and/or proteins. The strategy exploits annotation data present in gene-centered corpora and utilizes ideas from statistical information retrieval to discover and characterize properties shared by subsets of the list. The practical utility of this method is demonstrated by employing it in a retrospective study of two non-overlapping sets of genes defined by a published investigation as markers for normal human breast luminal epithelial cells and myoepithelial cells. Results Each genetic locus was characterized using a finite set of biological properties and represented as a vector of features indicating attributes associated with the locus (a gene attribute profile. In this study, the vector space models for a pre-defined list of genes were constructed from the Gene Ontology (GO terms and the Conserved Domain Database (CDD protein domain terms assigned to the loci by the gene-centered corpus LocusLink. This data set of GO- and CDD-based gene attribute profiles, vectors of binary random variables, was used to estimate multiple finite mixture models and each ensuing model utilized to partition the profiles into clusters. The resultant partitionings were combined using a unanimous voting scheme to produce consensus clusters, sets of profiles that co-occured consistently in the same cluster. Attributes that were important in defining the genes assigned to a consensus cluster were identified. The clusters and their attributes were inspected to ascertain the GO and CDD terms most associated with subsets of genes and in conjunction with external knowledge such as chromosomal location, used to gain functional insights into human breast biology. The 52 luminal epithelial cell markers and 89 myoepithelial cell markers are disjoint sets of genes. Ensemble attribute profile clustering-based analysis indicated that both lists

  12. Clustering based gene expression feature selection method: A computational approach to enrich the classifier efficiency of differentially expressed genes

    KAUST Repository

    Abusamra, Heba

    2016-07-20

    The native nature of high dimension low sample size of gene expression data make the classification task more challenging. Therefore, feature (gene) selection become an apparent need. Selecting a meaningful and relevant genes for classifier not only decrease the computational time and cost, but also improve the classification performance. Among different approaches of feature selection methods, however most of them suffer from several problems such as lack of robustness, validation issues etc. Here, we present a new feature selection technique that takes advantage of clustering both samples and genes. Materials and methods We used leukemia gene expression dataset [1]. The effectiveness of the selected features were evaluated by four different classification methods; support vector machines, k-nearest neighbor, random forest, and linear discriminate analysis. The method evaluate the importance and relevance of each gene cluster by summing the expression level for each gene belongs to this cluster. The gene cluster consider important, if it satisfies conditions depend on thresholds and percentage otherwise eliminated. Results Initial analysis identified 7120 differentially expressed genes of leukemia (Fig. 15a), after applying our feature selection methodology we end up with specific 1117 genes discriminating two classes of leukemia (Fig. 15b). Further applying the same method with more stringent higher positive and lower negative threshold condition, number reduced to 58 genes have be tested to evaluate the effectiveness of the method (Fig. 15c). The results of the four classification methods are summarized in Table 11. Conclusions The feature selection method gave good results with minimum classification error. Our heat-map result shows distinct pattern of refines genes discriminating between two classes of leukemia.

  13. The genome of tolypocladium inflatum: evolution, organization, and expression of the cyclosporin biosynthetic gene cluster.

    Science.gov (United States)

    Bushley, Kathryn E; Raja, Rajani; Jaiswal, Pankaj; Cumbie, Jason S; Nonogaki, Mariko; Boyd, Alexander E; Owensby, C Alisha; Knaus, Brian J; Elser, Justin; Miller, Daniel; Di, Yanming; McPhail, Kerry L; Spatafora, Joseph W

    2013-06-01

    The ascomycete fungus Tolypocladium inflatum, a pathogen of beetle larvae, is best known as the producer of the immunosuppressant drug cyclosporin. The draft genome of T. inflatum strain NRRL 8044 (ATCC 34921), the isolate from which cyclosporin was first isolated, is presented along with comparative analyses of the biosynthesis of cyclosporin and other secondary metabolites in T. inflatum and related taxa. Phylogenomic analyses reveal previously undetected and complex patterns of homology between the nonribosomal peptide synthetase (NRPS) that encodes for cyclosporin synthetase (simA) and those of other secondary metabolites with activities against insects (e.g., beauvericin, destruxins, etc.), and demonstrate the roles of module duplication and gene fusion in diversification of NRPSs. The secondary metabolite gene cluster responsible for cyclosporin biosynthesis is described. In addition to genes necessary for cyclosporin biosynthesis, it harbors a gene for a cyclophilin, which is a member of a family of immunophilins known to bind cyclosporin. Comparative analyses support a lineage specific origin of the cyclosporin gene cluster rather than horizontal gene transfer from bacteria or other fungi. RNA-Seq transcriptome analyses in a cyclosporin-inducing medium delineate the boundaries of the cyclosporin cluster and reveal high levels of expression of the gene cluster cyclophilin. In medium containing insect hemolymph, weaker but significant upregulation of several genes within the cyclosporin cluster, including the highly expressed cyclophilin gene, was observed. T. inflatum also represents the first reference draft genome of Ophiocordycipitaceae, a third family of insect pathogenic fungi within the fungal order Hypocreales, and supports parallel and qualitatively distinct radiations of insect pathogens. The T. inflatum genome provides additional insight into the evolution and biosynthesis of cyclosporin and lays a foundation for further investigations of the role

  14. Mutational analysis of the thienamycin biosynthetic gene cluster from Streptomyces cattleya.

    Science.gov (United States)

    Rodríguez, Miriam; Núñez, Luz Elena; Braña, Alfredo F; Méndez, Carmen; Salas, José A; Blanco, Gloria

    2011-04-01

    The generation of non-thienamycin-producing mutants with mutations in the thnL, thnN, thnO, and thnI genes within the thn gene cluster from Streptomyces cattleya and their involvement in thienamycin biosynthesis and regulation were previously reported. Four additional mutations were independently generated in the thnP, thnG, thnR, and thnT genes by insertional inactivation. Only the first two genes were found to play a role in thienamycin biosynthesis, since these mutations negatively or positively affect antibiotic production. A mutation of thnP results in the absence of thienamycin production, whereas a 2- to 3-fold increase in thienamycin production was observed for the thnG mutant. On the other hand, mutations in thnR and thnT showed that although these genes were previously reported to participate in this pathway, they seem to be nonessential for thienamycin biosynthesis, as thienamycin production was not affected in these mutants. High-performance liquid chromatography (HPLC)-mass spectrometry (MS) analysis of all available mutants revealed some putative intermediates in the thienamycin biosynthetic pathway. A compound with a mass corresponding to carbapenam-3-carboxylic acid was detected in some of the mutants, suggesting that the assembly of the bicyclic nucleus of thienamycin might proceed in a way analogous to that of the simplest natural carbapenem, 1-carbapen-2-em-3-carboxylic acid biosynthesis. The accumulation of a compound with a mass corresponding to 2,3-dihydrothienamycin in the thnG mutant suggests that it might be the last intermediate in the biosynthetic pathway. These data, together with the establishment of cross-feeding relationships by the cosynthesis analysis of the non-thienamycin-producing mutants, lead to a proposal for some enzymatic steps during thienamycin assembly.

  15. Genome-wide significant association between alcohol dependence and a variant in the ADH gene cluster.

    Science.gov (United States)

    Frank, Josef; Cichon, Sven; Treutlein, Jens; Ridinger, Monika; Mattheisen, Manuel; Hoffmann, Per; Herms, Stefan; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Mössner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Scherbaum, Norbert; Schmäl, Christine; Steffens, Michael; Lucae, Susanne; Ising, Marcus; Müller-Myhsok, Bertram; Nöthen, Markus M; Mann, Karl; Kiefer, Falk; Rietschel, Marcella

    2012-01-01

    Alcohol dependence (AD) is an important contributory factor to the global burden of disease. The etiology of AD involves both environmental and genetic factors, and the disorder has a heritability of around 50%. The aim of the present study was to identify susceptibility genes for AD by performing a genome-wide association study (GWAS). The sample comprised 1333 male in-patients with severe AD according to the Diagnostic and Statistical Manual of Mental Disorders, 4th edition, and 2168 controls. These included 487 patients and 1358 controls from a previous GWAS study by our group. All individuals were of German descent. Single-marker tests and a polygenic score-based analysis to assess the combined contribution of multiple markers with small effects were performed. The single nucleotide polymorphism (SNP) rs1789891, which is located between the ADH1B and ADH1C genes, achieved genome-wide significance [P = 1.27E-8, odds ratio (OR) = 1.46]. Other markers from this region were also associated with AD, and conditional analyses indicated that these made a partially independent contribution. The SNP rs1789891 is in complete linkage disequilibrium with the functional Arg272Gln variant (P = 1.24E-7, OR = 1.31) of the ADH1C gene, which has been reported to modify the rate of ethanol oxidation to acetaldehyde in vitro. A polygenic score-based approach produced a significant result (P = 9.66E-9). This is the first GWAS of AD to provide genome-wide significant support for the role of the ADH gene cluster and to suggest a polygenic component to the etiology of AD. The latter result may indicate that many more AD susceptibility genes still await identification.

  16. Co-expression analysis reveals a group of genes potentially involved in regulation of plant response to iron-deficiency.

    Science.gov (United States)

    Li, Hua; Wang, Lei; Yang, Zhi Min

    2015-01-01

    Iron (Fe) is an essential element for plant growth and development. Iron deficiency results in abnormal metabolisms from respiration to photosynthesis. Exploration of Fe-deficient responsive genes and their networks is critically important to understand molecular mechanisms leading to the plant adaptation to soil Fe-limitation. Co-expression genes are a cluster of genes that have a similar expression pattern to execute relatively biological functions at a stage of development or under a certain environmental condition. They may share a common regulatory mechanism. In this study, we investigated Fe-starved-related co-expression genes from Arabidopsis. From the biological process GO annotation of TAIR (The Arabidopsis Information Resource), 180 iron-deficient responsive genes were detected. Using ATTED-II database, we generated six gene co-expression networks. Among these, two modules of PYE and IRT1 were successfully constructed. There are 30 co-expression genes that are incorporated in the two modules (12 in PYE-module and 18 in IRT1-module). Sixteen of the co-expression genes were well characterized. The remaining genes (14) are poorly or not functionally identified with iron stress. Validation of the 14 genes using real-time PCR showed differential expression under iron-deficiency. Most of the co-expression genes (23/30) could be validated in pye and fit mutant plants with iron-deficiency. We further identified iron-responsive cis-elements upstream of the co-expression genes and found that 22 out of 30 genes contain the iron-responsive motif IDE1. Furthermore, some auxin and ethylene-responsive elements were detected in the promoters of the co-expression genes. These results suggest that some of the genes can be also involved in iron stress response through the phytohormone-responsive pathways.

  17. Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

    Directory of Open Access Journals (Sweden)

    Li Weizhong

    2008-04-01

    Full Text Available Abstract Background The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. Results We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net. Conclusion The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.

  18. Biosynthesis of Akaeolide and Lorneic Acids and Annotation of Type I Polyketide Synthase Gene Clusters in the Genome of Streptomyces sp. NPS554

    Directory of Open Access Journals (Sweden)

    Tao Zhou

    2015-01-01

    Full Text Available The incorporation pattern of biosynthetic precursors into two structurally unique polyketides, akaeolide and lorneic acid A, was elucidated by feeding experiments with 13C-labeled precursors. In addition, the draft genome sequence of the producer, Streptomyces sp. NPS554, was performed and the biosynthetic gene clusters for these polyketides were identified. The putative gene clusters contain all the polyketide synthase (PKS domains necessary for assembly of the carbon skeletons. Combined with the 13C-labeling results, gene function prediction enabled us to propose biosynthetic pathways involving unusual carbon-carbon bond formation reactions. Genome analysis also indicated the presence of at least ten orphan type I PKS gene clusters that might be responsible for the production of new polyketides.

  19. Shared gene structures and clusters of mutually exclusive spliced exons within the metazoan muscle myosin heavy chain genes.

    Directory of Open Access Journals (Sweden)

    Martin Kollmar

    Full Text Available Multicellular animals possess two to three different types of muscle tissues. Striated muscles have considerable ultrastructural similarity and contain a core set of proteins including the muscle myosin heavy chain (Mhc protein. The ATPase activity of this myosin motor protein largely dictates muscle performance at the molecular level. Two different solutions to adjusting myosin properties to different muscle subtypes have been identified so far: Vertebrates and nematodes contain many independent differentially expressed Mhc genes while arthropods have single Mhc genes with clusters of mutually exclusive spliced exons (MXEs. The availability of hundreds of metazoan genomes now allowed us to study whether the ancient bilateria already contained MXEs, how MXE complexity subsequently evolved, and whether additional scenarios to control contractile properties in different muscles could be proposed, By reconstructing the Mhc genes from 116 metazoans we showed that all intron positions within the motor domain coding regions are conserved in all bilateria analysed. The last common ancestor of the bilateria already contained a cluster of MXEs coding for part of the loop-2 actin-binding sequence. Subsequently the protostomes and later the arthropods gained many further clusters while MXEs got completely lost independently in several branches (vertebrates and nematodes and species (for example the annelid Helobdella robusta and the salmon louse Lepeophtheirus salmonis. Several bilateria have been found to encode multiple Mhc genes that might all or in part contain clusters of MXEs. Notable examples are a cluster of six tandemly arrayed Mhc genes, of which two contain MXEs, in the owl limpet Lottia gigantea and four Mhc genes with three encoding MXEs in the predatory mite Metaseiulus occidentalis. Our analysis showed that similar solutions to provide different myosin isoforms (multiple genes or clusters of MXEs or both have independently been developed

  20. Molecular population genetics of the -esterase gene cluster of Drosophila melanogaster

    Indian Academy of Sciences (India)

    Evgeniy S. Balakirev; Francisco J. Ayala

    2003-12-01

    We have investigated nucleotide polymorphism at the -esterase gene cluster including the Est-6 gene and Est-6 putative pseudogene in four samples of Drosophila melanogaster derived from natural populations of southern Africa (Zimbabwe), Europe (Spain), North America (USA: California), and South America (Venezuela). A complex haplo-type structure is revealed in both Est-6 and Est-6. Total nucleotide diversity is twice in Est-6 as in Est-6; diversity is higher in the African sample than in the non-African ones. Strong linkage disequilibrium occurs within the -esterase gene cluster in non-African samples, but not in the African one. Intragenic gene conversion events are detected within Est-6 and, to a much greater extent, within Est-6; intergenic gene conversion events are rare. Tests of neutrality with recombination are significant for the -esterase gene cluster in the non-African samples but not significant in the African one. We suggest that the demographic history (bottleneck and admixture of genetically differentiated populations) is the major factor shaping the pattern of nucleotide polymorphism in the -esterase gene cluster. However there are some ‘footprints’ of directional and balancing selection shaping specific distribution of nucleotide polymorphism within the cluster. Intergenic epistatic selection between Est-6 and Est-6 may play an important role in the evolution of the -esterase gene cluster preserving the putative pseudogene from degenerative destruction and reflecting possible functional interaction between the functional gene and the putative pseudogene. Est-6 and Est-6 may represent an indivisible intergenic complex (‘intergene’) in which each single component (Est-6 or Est-6) cannot separately carry out the full functional role.

  1. Mycobiota and identification of aflatoxin gene cluster in marketed spices in West Africa

    DEFF Research Database (Denmark)

    Gnonlonfin, G. J. B.; Adjovi, Y. C.; Tokpo, A. F.

    2013-01-01

    of Aspergillus were dominant on all marketed dried and milled spices irrespective of country. Gene characterization and amplification analysis showed that most of the Aspergillus flavus isolates possess the cluster genes for aflatoxin production. Aflatoxin B1 assessment by Thin Layer Chromatography showed...

  2. The clustering of functionally related genes contributes to CNV-mediated disease

    NARCIS (Netherlands)

    Andrews, T.; Honti, F.; Pfundt, R.P.; Leeuw, N. de; Hehir, J.Y.; Vulto-van Silfhout, A.T.; Vries, B. de; Webber, C.

    2015-01-01

    Clusters of functionally related genes can be disrupted by a single copy number variant (CNV). We demonstrate that the simultaneous disruption of multiple functionally related genes is a frequent and significant characteristic of de novo CNVs in patients with developmental disorders (P = 1 x 10(-3))

  3. MicroRNAs located in the Hox gene clusters are implicated in huntington's disease pathogenesis.

    Directory of Open Access Journals (Sweden)

    Andrew G Hoss

    2014-02-01

    Full Text Available Transcriptional dysregulation has long been recognized as central to the pathogenesis of Huntington's disease (HD. MicroRNAs (miRNAs represent a major system of post-transcriptional regulation, by either preventing translational initiation or by targeting transcripts for storage or for degradation. Using next-generation miRNA sequencing in prefrontal cortex (Brodmann Area 9 of twelve HD and nine controls, we identified five miRNAs (miR-10b-5p, miR-196a-5p, miR-196b-5p, miR-615-3p and miR-1247-5p up-regulated in HD at genome-wide significance (FDR q-value<0.05. Three of these, miR-196a-5p, miR-196b-5p and miR-615-3p, were expressed at near zero levels in control brains. Expression was verified for all five miRNAs using reverse transcription quantitative PCR and all but miR-1247-5p were replicated in an independent sample (8HD/8C. Ectopic miR-10b-5p expression in PC12 HTT-Q73 cells increased survival by MTT assay and cell viability staining suggesting increased expression may be a protective response. All of the miRNAs but miR-1247-5p are located in intergenic regions of Hox clusters. Total mRNA sequencing in the same samples identified fifteen of 55 genes within the Hox cluster gene regions as differentially expressed in HD, and the Hox genes immediately adjacent to the four Hox cluster miRNAs as up-regulated. Pathway analysis of mRNA targets of these miRNAs implicated functions for neuronal differentiation, neurite outgrowth, cell death and survival. In regression models among the HD brains, huntingtin CAG repeat size, onset age and age at death were independently found to be inversely related to miR-10b-5p levels. CAG repeat size and onset age were independently inversely related to miR-196a-5p, onset age was inversely related to miR-196b-5p and age at death was inversely related to miR-615-3p expression. These results suggest these Hox-related miRNAs may be involved in neuroprotective response in HD. Recently, miRNAs have shown promise as

  4. Sequence breakpoints in the aflatoxin biosynthesis gene cluster and flanking regions in nonaflatoxigenic Aspergillus flavus isolates.

    Science.gov (United States)

    Chang, Perng-Kuang; Horn, Bruce W; Dorner, Joe W

    2005-11-01

    Aspergillus flavus populations are genetically diverse. Isolates that produce either, neither, or both aflatoxins and cyclopiazonic acid (CPA) are present in the field. We investigated defects in the aflatoxin gene cluster in 38 nonaflatoxigenic A. flavus isolates collected from southern United States. PCR assays using aflatoxin-gene-specific primers grouped these isolates into eight (A-H) deletion patterns. Patterns C, E, G, and H, which contain 40 kb deletions, were examined for their sequence breakpoints. Pattern C has one breakpoint in the cypA 3' untranslated region (UTR) and another in the verA coding region. Pattern E has a breakpoint in the amdA coding region and another in the ver1 5'UTR. Pattern G contains a deletion identical to the one found in pattern C and has another deletion that extends from the cypA coding region to one end of the chromosome as suggested by the presence of telomeric sequence repeats, CCCTAATGTTGA. Pattern H has a deletion of the entire aflatoxin gene cluster from the hexA coding region in the sugar utilization gene cluster to the telomeric region. Thus, deletions in the aflatoxin gene cluster among A. flavus isolates are not rare, and the patterns appear to be diverse. Genetic drift may be a driving force that is responsible for the loss of the entire aflatoxin gene cluster in nonaflatoxigenic A. flavus isolates when aflatoxins have lost their adaptive value in nature.

  5. A putatively phase variable gene (dca) required for natural competence in Neisseria gonorrhoeae but not Neisseria meningitidis is located within the division cell wall (dcw) gene cluster.

    Science.gov (United States)

    Snyder, L A; Saunders, N J; Shafer, W M

    2001-02-01

    A cluster of 18 open reading frames (ORFs), 15 of which are homologous to genes involved in division and cell wall synthesis, has been identified in Neisseria gonorrhoeae and Neisseria meningitidis. The three additional ORFs, internal to the dcw cluster, are not homologous to dcw-related genes present in other bacterial species. Analysis of the N. meningitidis strain MC58 genome for foreign DNA suggests that these additional ORFs have not been acquired by recent horizontal exchange, indicating that they are a long-standing, integral part of the neisserial dcw gene cluster. Reverse transcription-PCR analysis of RNA extracted from N. gonorrhoeae strain FA19 confirmed that all three ORFs are transcribed in gonococci. One of these ORFs (dca, for division cluster competence associated), located between murE and murF, was studied in detail and found to be essential for competence in the gonococcal but not in the meningococcal strains tested. Computer analysis predicts that dca encodes an inner membrane protein similar to hypothetical proteins produced by other gram-negative bacteria. In some meningococcal strains dca is prematurely terminated following a homopolymeric tract of G's, the length of which differs between isolates of N. meningitidis, suggesting that dca is phase variable in this species. A deletion and insertional mutation was made in the dca gene of N. gonorrhoeae strain FA19 and N. meningitidis strain NMB. This mutation abrogated the ability of the gonococci to be transformed with chromosomal DNA. Thus, we conclude that the dca-encoded gene product is an essential competence factor for gonococci.

  6. Fast Gene Ontology based clustering for microarray experiments

    OpenAIRE

    Ovaska Kristian; Laakso Marko; Hautaniemi Sampsa

    2008-01-01

    Abstract Background Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. Results We present fa...

  7. Structural variation of the ribosomal gene cluster within the class Insecta

    Energy Technology Data Exchange (ETDEWEB)

    Mukha, D.V.; Sidorenko, A.P.; Lazebnaya, I.V. [Vavilov Institute of General Genetics, Moscow (Russian Federation)] [and others

    1995-09-01

    General estimation of ribosomal DNA variation within the class Insecta is presented. It is shown that, using blot-hybridization, one can detect differences in the structure of the ribosomal gene cluster not only between genera within an order, but also between species within a genera, including sibling species. Structure of the ribosomal gene cluster of the Coccinellidae family (ladybirds) is analyzed. It is shown that cloned highly conservative regions of ribosomal DNA of Tetrahymena pyriformis can be used as probes for analyzing ribosomal genes in insects. 24 refs., 4 figs.

  8. Identification and manipulation of the pleuromutilin gene cluster from Clitopilus passeckerianus for increased rapid antibiotic production

    Science.gov (United States)

    Bailey, Andy M.; Alberti, Fabrizio; Kilaru, Sreedhar; Collins, Catherine M.; de Mattos-Shipley, Kate; Hartley, Amanda J.; Hayes, Patrick; Griffin, Alison; Lazarus, Colin M.; Cox, Russell J.; Willis, Christine L.; O’Dwyer, Karen; Spence, David W.; Foster, Gary D.

    2016-05-01

    Semi-synthetic derivatives of the tricyclic diterpene antibiotic pleuromutilin from the basidiomycete Clitopilus passeckerianus are important in combatting bacterial infections in human and veterinary medicine. These compounds belong to the only new class of antibiotics for human applications, with novel mode of action and lack of cross-resistance, representing a class with great potential. Basidiomycete fungi, being dikaryotic, are not generally amenable to strain improvement. We report identification of the seven-gene pleuromutilin gene cluster and verify that using various targeted approaches aimed at increasing antibiotic production in C. passeckerianus, no improvement in yield was achieved. The seven-gene pleuromutilin cluster was reconstructed within Aspergillus oryzae giving production of pleuromutilin in an ascomycete, with a significant increase (2106%) in production. This is the first gene cluster from a basidiomycete to be successfully expressed in an ascomycete, and paves the way for the exploitation of a metabolically rich but traditionally overlooked group of fungi.

  9. A CLUSTERING OF DJA STOCKS - THE APPLICATION IN FINANCE OF A METHOD FIRST USED IN GENE TRAJECTORY STUDY

    Directory of Open Access Journals (Sweden)

    Silaghi Gheorghe Cosmin

    2009-05-01

    Full Text Available Previously we employed the Gene Trajectory Clustering methodology to search for different associations of the stocks composing the DJA index, with the aim of finding different, logic clusters, supported by economic reasons, preferably different than the

  10. Two genes, rif15 and rif16, of the rifamycin biosynthetic gene cluster in Amycolatopsis mediterranei likely encode a transketolase and a P450 monooxygenase,respectively, both essential for the conversion of rifamycin SV into B

    Institute of Scientific and Technical Information of China (English)

    Hua Yuan; Wei Zhao; Yi Zhong; Jin Wang; Zhongiun Qin; Xiaoming Ding; Guo-Ping Zhao

    2011-01-01

    Amycolatopsis mediterranei produces an important antibiotic rifamycin,the biosynthesis of which involves many unusual modifications.Previous work suggested a putative P450 enzyme encoded by rif16 within the rifamycin biosynthetic gene cluster (rif) was required for the conversion of the intermediate rifamycin SV into the end product rifamycin B.In this study,we genetically proved that a putative transketolase encoded by rif15 is another essential enzyme for this conversion.Expression of merely rif15 and rif16 in a rif cluster null mutant ofA.mediterranei U32 was able to convert rifamycin SV into B.However,this Rifl5- and Rifl6-mediated conversion was only detected in intact cells of A.meidterranei,but not in Streptomyce coelicolor or Mycobacterium smegmatis,suggesting that yet-characterized gene(s) in A.mediterranei other than those encoded by the rif cluster should be involved in this process.

  11. Identification of a Gene Cluster Enabling Lactobacillus casei BL23 To Utilize myo-Inositol▿ †

    Science.gov (United States)

    Yebra, María Jesús; Zúñiga, Manuel; Beaufils, Sophie; Pérez-Martínez, Gaspar; Deutscher, Josef; Monedero, Vicente

    2007-01-01

    Genome analysis of Lactobacillus casei BL23 revealed that, compared to L. casei ATCC 334, it carries a 12.8-kb DNA insertion containing genes involved in the catabolism of the cyclic polyol myo-inositol (MI). Indeed, L. casei ATCC 334 does not ferment MI, whereas strain BL23 is able to utilize this carbon source. The inserted DNA consists of an iolR gene encoding a DeoR family transcriptional repressor and a divergently transcribed iolTABCDG1G2EJK operon, encoding a complete MI catabolic pathway, in which the iolK gene probably codes for a malonate semialdehyde decarboxylase. The presence of iolK suggests that L. casei has two alternative pathways for the metabolism of malonic semialdehyde: (i) the classical MI catabolic pathway in which IolA (malonate semialdehyde dehydrogenase) catalyzes the formation of acetyl-coenzyme A from malonic semialdehyde and (ii) the conversion of malonic semialdehyde to acetaldehyde catalyzed by the product of iolK. The function of the iol genes was verified by the disruption of iolA, iolT, and iolD, which provided MI-negative strains. By contrast, the disruption of iolK resulted in a strain with no obvious defect in MI utilization. Transcriptional analyses conducted with different mutant strains showed that the iolTABCDG1G2EJK cluster is regulated by substrate-specific induction mediated by the inactivation of the transcriptional repressor IolR and by carbon catabolite repression mediated by the catabolite control protein A (CcpA). This is the first example of an operon for MI utilization in lactic acid bacteria and illustrates the versatility of carbohydrate utilization in L. casei BL23. PMID:17449687

  12. A genomics based discovery of secondary metabolite biosynthetic gene clusters in Aspergillus ustus.

    Directory of Open Access Journals (Sweden)

    Borui Pi

    Full Text Available Secondary metabolites (SMs produced by Aspergillus have been extensively studied for their crucial roles in human health, medicine and industrial production. However, the resulting information is almost exclusively derived from a few model organisms, including A. nidulans and A. fumigatus, but little is known about rare pathogens. In this study, we performed a genomics based discovery of SM biosynthetic gene clusters in Aspergillus ustus, a rare human pathogen. A total of 52 gene clusters were identified in the draft genome of A. ustus 3.3904, such as the sterigmatocystin biosynthesis pathway that was commonly found in Aspergillus species. In addition, several SM biosynthetic gene clusters were firstly identified in Aspergillus that were possibly acquired by horizontal gene transfer, including the vrt cluster that is responsible for viridicatumtoxin production. Comparative genomics revealed that A. ustus shared the largest number of SM biosynthetic gene clusters with A. nidulans, but much fewer with other Aspergilli like A. niger and A. oryzae. These findings would help to understand the diversity and evolution of SM biosynthesis pathways in genus Aspergillus, and we hope they will also promote the development of fungal identification methodology in clinic.

  13. MS/MS networking guided analysis of molecule and gene cluster families.

    Science.gov (United States)

    Nguyen, Don Duy; Wu, Cheng-Hsuan; Moree, Wilna J; Lamsa, Anne; Medema, Marnix H; Zhao, Xiling; Gavilan, Ronnie G; Aparicio, Marystella; Atencio, Librada; Jackson, Chanaye; Ballesteros, Javier; Sanchez, Joel; Watrous, Jeramie D; Phelan, Vanessa V; van de Wiel, Corine; Kersten, Roland D; Mehnaz, Samina; De Mot, René; Shank, Elizabeth A; Charusanti, Pep; Nagarajan, Harish; Duggan, Brendan M; Moore, Bradley S; Bandeira, Nuno; Palsson, Bernhard Ø; Pogliano, Kit; Gutiérrez, Marcelino; Dorrestein, Pieter C

    2013-07-09

    The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations to 60 microbes at one time instead of connecting one molecule to one organism at a time, such as how it is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779(T). The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family-gene cluster families of hundreds or more diverse organisms in one single MS/MS network.

  14. Genome mining demonstrates the widespread occurrence of gene clusters encoding bacteriocins in cyanobacteria.

    Science.gov (United States)

    Wang, Hao; Fewer, David P; Sivonen, Kaarina

    2011-01-01

    Cyanobacteria are a rich source of natural products with interesting biological activities. Many of these are peptides and the end products of a non-ribosomal pathway. However, several cyanobacterial peptide classes were recently shown to be produced through the proteolytic cleavage and post-translational modification of short precursor peptides. A new class of bacteriocins produced through the proteolytic cleavage and heterocyclization of precursor proteins was recently identified from marine cyanobacteria. Here we show the widespread occurrence of bacteriocin gene clusters in cyanobacteria through comparative analysis of 58 cyanobacterial genomes. A total of 145 bacteriocin gene clusters were discovered through genome mining. These clusters encoded 290 putative bacteriocin precursors. They ranged in length from 28 to 164 amino acids with very little sequence conservation of the core peptide. The gene clusters could be classified into seven groups according to their gene organization and domain composition. This classification is supported by phylogenetic analysis, which further indicated independent evolutionary trajectories of gene clusters in different groups. Our data suggests that cyanobacteria are a prolific source of low-molecular weight post-translationally modified peptides.

  15. Genome mining demonstrates the widespread occurrence of gene clusters encoding bacteriocins in cyanobacteria.

    Directory of Open Access Journals (Sweden)

    Hao Wang

    Full Text Available Cyanobacteria are a rich source of natural products with interesting biological activities. Many of these are peptides and the end products of a non-ribosomal pathway. However, several cyanobacterial peptide classes were recently shown to be produced through the proteolytic cleavage and post-translational modification of short precursor peptides. A new class of bacteriocins produced through the proteolytic cleavage and heterocyclization of precursor proteins was recently identified from marine cyanobacteria. Here we show the widespread occurrence of bacteriocin gene clusters in cyanobacteria through comparative analysis of 58 cyanobacterial genomes. A total of 145 bacteriocin gene clusters were discovered through genome mining. These clusters encoded 290 putative bacteriocin precursors. They ranged in length from 28 to 164 amino acids with very little sequence conservation of the core peptide. The gene clusters could be classified into seven groups according to their gene organization and domain composition. This classification is supported by phylogenetic analysis, which further indicated independent evolutionary trajectories of gene clusters in different groups. Our data suggests that cyanobacteria are a prolific source of low-molecular weight post-translationally modified peptides.

  16. MS/MS networking guided analysis of molecule and gene cluster families

    Science.gov (United States)

    Nguyen, Don Duy; Wu, Cheng-Hsuan; Moree, Wilna J.; Lamsa, Anne; Medema, Marnix H.; Zhao, Xiling; Gavilan, Ronnie G.; Aparicio, Marystella; Atencio, Librada; Jackson, Chanaye; Ballesteros, Javier; Sanchez, Joel; Watrous, Jeramie D.; Phelan, Vanessa V.; van de Wiel, Corine; Kersten, Roland D.; Mehnaz, Samina; De Mot, René; Shank, Elizabeth A.; Charusanti, Pep; Nagarajan, Harish; Duggan, Brendan M.; Moore, Bradley S.; Bandeira, Nuno; Palsson, Bernhard Ø.; Pogliano, Kit; Gutiérrez, Marcelino; Dorrestein, Pieter C.

    2013-01-01

    The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations to 60 microbes at one time instead of connecting one molecule to one organism at a time, such as how it is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779T. The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family–gene cluster families of hundreds or more diverse organisms in one single MS/MS network. PMID:23798442

  17. Operon and non-operon gene clusters in the C. elegans genome.

    Science.gov (United States)

    Blumenthal, Thomas; Davis, Paul; Garrido-Lecca, Alfonso

    2015-04-28

    Nearly 15% of the ~20,000 C. elegans genes are contained in operons, multigene clusters controlled by a single promoter. The vast majority of these are of a type where the genes in the cluster are ~100 bp apart and the pre-mRNA is processed by 3' end formation accompanied by trans-splicing. A spliced leader, SL2, is specialized for operon processing. Here we summarize current knowledge on several variations on this theme including: (1) hybrid operons, which have additional promoters between genes; (2) operons with exceptionally long (> 1 kb) intercistronic regions; (3) operons with a second 3' end formation site close to the trans-splice site; (4) alternative operons, in which the exons are sometimes spliced as a single gene and sometimes as two genes; (5) SL1-type operons, which use SL1 instead of SL2 to trans-splice and in which there is no intercistronic space; (6) operons that make dicistronic mRNAs; and (7) non-operon gene clusters, in which either two genes use a single exon as the 3' end of one and the 5' end of the next, or the 3' UTR of one gene serves as the outron of the next. Each of these variations is relatively infrequent, but together they show a remarkable variety of tight-linkage gene arrangements in the C. elegans genome.

  18. Expression Analysis of Genes in the Nif Cluster of Clostridium beijerinckii

    OpenAIRE

    2007-01-01

    The nif genes of Clostridium beijerinckii NRRL B593 occupy a region of about 16 kilobases. Besides the two glnB-like genes, five other genes are interspersed between the nifNB and the nifVw genes. An expression analysis of the nif genes in nitrogen-fixing and non-nitrogen-fixing cells with probes generated from various regions of the nif cluster by northern blot analysis revealed the presence of four different transcripts in nitrogen-fixing cells. Two of these transcripts had the predicted si...

  19. Sleeping Beauty mouse models identify candidate genes involved in gliomagenesis.

    Science.gov (United States)

    Vyazunova, Irina; Maklakova, Vilena I; Berman, Samuel; De, Ishani; Steffen, Megan D; Hong, Won; Lincoln, Hayley; Morrissy, A Sorana; Taylor, Michael D; Akagi, Keiko; Brennan, Cameron W; Rodriguez, Fausto J; Collier, Lara S

    2014-01-01

    Genomic studies of human high-grade gliomas have discovered known and candidate tumor drivers. Studies in both cell culture and mouse models have complemented these approaches and have identified additional genes and processes important for gliomagenesis. Previously, we found that mobilization of Sleeping Beauty transposons in mice ubiquitously throughout the body from the Rosa26 locus led to gliomagenesis with low penetrance. Here we report the characterization of mice in which transposons are mobilized in the Glial Fibrillary Acidic Protein (GFAP) compartment. Glioma formation in these mice did not occur on an otherwise wild-type genetic background, but rare gliomas were observed when mobilization occurred in a p19Arf heterozygous background. Through cloning insertions from additional gliomas generated by transposon mobilization in the Rosa26 compartment, several candidate glioma genes were identified. Comparisons to genetic, epigenetic and mRNA expression data from human gliomas implicates several of these genes as tumor suppressor genes and oncogenes in human glioblastoma.

  20. Molecular characterization of the PR-toxin gene cluster in Penicillium roqueforti and Penicillium chrysogenum: cross talk of secondary metabolite pathways.

    Science.gov (United States)

    Hidalgo, Pedro I; Ullán, Ricardo V; Albillos, Silvia M; Montero, Olimpio; Fernández-Bodega, María Ángeles; García-Estrada, Carlos; Fernández-Aguado, Marta; Martín, Juan-Francisco

    2014-01-01

    The PR-toxin is a potent mycotoxin produced by Penicillium roqueforti in moulded grains and grass silages and may contaminate blue-veined cheese. The PR-toxin derives from the 15 carbon atoms sesquiterpene aristolochene formed by the aristolochene synthase (encoded by ari1). We have cloned and sequenced a four gene cluster that includes the ari1 gene from P. roqueforti. Gene silencing of each of the four genes (named prx1 to prx4) resulted in a reduction of 65-75% in the production of PR-toxin indicating that the four genes encode enzymes involved in PR-toxin biosynthesis. Interestingly the four silenced mutants overproduce large amounts of mycophenolic acid, an antitumor compound formed by an unrelated pathway suggesting a cross-talk of PR-toxin and mycophenolic acid production. An eleven gene cluster that includes the above mentioned four prx genes and a 14-TMS drug/H(+) antiporter was found in the genome of Penicillium chrysogenum. This eleven gene cluster has been reported to be very poorly expressed in a transcriptomic study of P. chrysogenum genes under conditions of penicillin production (strongly aerated cultures). We found that this apparently silent gene cluster is able to produce PR-toxin in P. chrysogenum under static culture conditions on hydrated rice medium. Noteworthily, the production of PR-toxin was 2.6-fold higher in P. chrysogenum npe10, a strain deleted in the 56.8kb amplifiable region containing the pen gene cluster, than in the parental strain Wisconsin 54-1255 providing another example of cross-talk between secondary metabolite pathways in this fungus. A detailed PR-toxin biosynthesis pathway is proposed based on all available evidence.

  1. Three highly similar formate dehydrogenase genes located in the vicinity of the B4 resistance gene cluster are differentially expressed under biotic and abiotic stresses in Phaseolus vulgaris.

    Science.gov (United States)

    David, Perrine; des Francs-Small, Catherine Colas; Sévignac, Mireille; Thareau, Vincent; Macadré, Catherine; Langin, Thierry; Geffroy, Valérie

    2010-06-01

    In higher plants, formate dehydrogenase (FDH, EC1.2.1.2.) catalyzes the NAD-linked oxidation of formate to CO(2), and FDH transcript accumulation has been reported after various abiotic stresses. By sequencing a Phaseolus vulgaris BAC clone encompassing a CC-NBS-LRR gene rich region of the B4 resistance gene cluster, we identified three FDH-encoding genes. FDH is present as a single copy gene in the Arabidopsis thaliana genome, and public database searches confirm that FDH is a low copy gene in plant genomes, since only 33 FDH homologs were identified from 27 plant species. Three independent prediction programs (Predotar, TargetP and Mitoprot) used on this large subset of 33 plant FDHs, revealed that mitochondrial localization of FDH might be the rule in higher plants. A phylogenetic analysis suggests a scenario of local FDH gene duplication in an ancestor of the Phaseoleae followed by another more recent duplication event after bean/soybean divergence. The expression levels of two common bean FDH genes under different treatments were investigated by quantitative RT-PCR analysis. FDH genes are differentially up-regulated after biotic and abiotic stresses (infection with the fungus Colletotrichum lindemuthianum, and dark treatment, respectively). The present study provides the first report of FDH transcript accumulation after biotic stress, suggesting the involvement of FDH in the pathogen resistance process.

  2. Identification and characterization of another 4-nitrophenol degradation gene cluster, nps, in Rhodococcus sp. strain PN1.

    Science.gov (United States)

    Yamamoto, Kenta; Nishimura, Munehiro; Kato, Dai-ichiro; Takeo, Masahiro; Negoro, Seiji

    2011-06-01

    4-Nitrophenol (4-NP) is a toxic compound formed in soil by the hydrolysis of organophosphorous pesticides, such as parathion. We previously reported the presence of the 4-NP degradation gene cluster (nphRA1A2) in Rhodococcus sp. strain PN1, which encodes a two-component 4-NP hydroxylase system that oxidizes 4-NP into 4-nitrocatechol. In the current study, another gene cluster (npsC and npsRA2A1B) encoding a similar 4-NP hydroxylase system was cloned from strain PN1. The enzymes from this 4-NP hydroxylase system (NpsA1 and NpsA2) were purified as histidine-tagged (His-) proteins and then characterized. His-NpsA2 showed NADH/FAD oxidoreductase activity, and His-NpsA1 showed 4-NP oxidizing activity in the presence of His-NpsA2. In the 4-NP oxidation using the reconstituted enzyme system (His-NpsA1 and His-NpsA2), hydroquinone (35% of 4-NP disappeared) and hydroxyquinol (59% of 4-NP disappeared) were detected in the presence of ascorbic acid as a reducing reagent, suggesting that, without the reducing reagent, 4-NP was converted into their oxidized forms, 1,4-benzoquinone and 2-hydroxy-1,4-benzoquinone. In addition, in the cell extract of recombinant Escherichia coli expressing npsB, a typical spectral change showing conversion of hydroxyquinol into maleylacetate was observed. These results indicate that this nps gene cluster, in addition to the nph gene cluster, is also involved in 4-NP degradation in strain PN1.

  3. Sequencing and mapping hemoglobin gene clusters in the australian model dasyurid marsupial sminthopsis macroura

    Energy Technology Data Exchange (ETDEWEB)

    De Leo, A.A.; Wheeler, D.; Lefevre, C.; Cheng, Jan-Fang; Hope, R.; Kuliwaba, J.; Nicholas, K.R.; Westermanc, M.; Graves, J.A.M.

    2004-07-26

    Comparing globin genes and their flanking sequences across many species has allowed globin gene evolution to be reconstructed in great detail. Marsupial globin sequences have proved to be of exceptional significance. A previous finding of a beta-like omega gene in the alpha cluster in the tammar wallaby suggested that the alpha and beta cluster evolved via genome duplication and loss rather than tandem duplication. To confirm and extend this important finding we isolated and sequenced BACs containing the alpha and beta loci from the distantly related Australian marsupial Sminthopsis macroura. We report that the alpha gene lies in the same BAC as the beta-like omega gene, implying that the alpha-omega juxtaposition is likely to be conserved in all marsupials. The LUC7L gene was found 3' of the S. macroura alpha locus, a gene order shared with humans but not mouse, chicken or fugu. Sequencing a BAC contig that contained the S. macroura beta globin and epsilon globin loci showed that the globin cluster is flanked by olfactory genes, demonstrating a gene arrangement conserved for over 180 MY. Analysis of the region 5' to the S. macroura epsilon globin gene revealed a region similar to the eutherian LCR, containing sequences and potential transcription factor binding sites with homology to eutherian hypersensitive sites 1 to 5. FISH mapping of BACs containing S. macroura alpha and beta globin genes located the beta globin cluster on chromosome 3q and the alpha locus close to the centromere on 1q, resolving contradictory map locations obtained by previous radioactive in situ hybridization.

  4. Epigenetic characterization of the growth hormone gene identifies SmcHD1 as a regulator of autosomal gene clusters.

    Directory of Open Access Journals (Sweden)

    Shabnam Massah

    Full Text Available Regulatory elements for the mouse growth hormone (GH gene are located distally in a putative locus control region (LCR in addition to key elements in the promoter proximal region. The role of promoter DNA methylation for GH gene regulation is not well understood. Pit-1 is a POU transcription factor required for normal pituitary development and obligatory for GH gene expression. In mammals, Pit-1 mutations eliminate GH production resulting in a dwarf phenotype. In this study, dwarf mice illustrated that Pit-1 function was obligatory for GH promoter hypomethylation. By monitoring promoter methylation levels during developmental GH expression we found that the GH promoter became hypomethylated coincident with gene expression. We identified a promoter differentially methylated region (DMR that was used to characterize a methylation-dependent DNA binding activity. Upon DNA affinity purification using the DMR and nuclear extracts, we identified structural maintenance of chromosomes hinge domain containing -1 (SmcHD1. To better understand the role of SmcHD1 in genome-wide gene expression, we performed microarray analysis and compared changes in gene expression upon reduced levels of SmcHD1 in human cells. Knock-down of SmcHD1 in human embryonic kidney (HEK293 cells revealed a disproportionate number of up-regulated genes were located on the X-chromosome, but also suggested regulation of genes on non-sex chromosomes. Among those, we identified several genes located in the protocadherin β cluster. In addition, we found that imprinted genes in the H19/Igf2 cluster associated with Beckwith-Wiedemann and Silver-Russell syndromes (BWS & SRS were dysregulated. For the first time using human cells, we showed that SmcHD1 is an important regulator of imprinted and clustered genes.

  5. Transcriptome profiling of the testis reveals genes involved in spermatogenesis and marker discovery in the oriental fruit fly, Bactrocera dorsalis.

    Science.gov (United States)

    Wei, D; Li, H-M; Yang, W-J; Wei, D-D; Dou, W; Huang, Y; Wang, J-J

    2015-02-01

    The testis is a highly specialized tissue that plays a vital role in ensuring fertility by producing spermatozoa, which are transferred to the female during mating. Spermatogenesis is a complex process, resulting in the production of mature sperm, and involves significant structural and biochemical changes in the seminiferous epithelium of the adult testis. The identification of genes involved in spermatogenesis of Bactrocera dorsalis (Hendel) is critical for a better understanding of its reproductive development. In this study, we constructed a cDNA library of testes from male B. dorsalis adults at different ages, and performed de novo transcriptome sequencing to produce a comprehensive transcript data set, using Illumina sequencing technology. The analysis yielded 52 016 732 clean reads, including a total of 4.65 Gb of nucleotides. These reads were assembled into 47 677 contigs (average 443 bp) and then clustered into 30 516 unigenes (average 756 bp). Based on BLAST hits with known proteins in different databases, 20 921 unigenes were annotated with a cut-off E-value of 10(-5). The transcriptome sequences were further annotated using the Clusters of Orthologous Groups, Gene Orthology and the Kyoto Encyclopedia of Genes and Genomes databases. Functional genes involved in spermatogenesis were analysed, including cell cycle proteins, metalloproteins, actin, and ubiquitin and antihyperthermia proteins. Several testis-specific genes were also identified. The transcripts database will help us to understand the molecular mechanisms underlying spermatogenesis in B. dorsalis. Furthermore, 2913 simple sequence repeats and 151 431 single nucleotide polymorphisms were identified, which will be useful for investigating the genetic diversity of B. dorsalis in the future. © 2014 The Royal Entomological Society.

  6. Characterization of head transcriptome and analysis of gene expression involved in caste differentiation and aggression in Odontotermes formosanus (Shiraki.

    Directory of Open Access Journals (Sweden)

    Qiuying Huang

    Full Text Available BACKGROUND: The subterranean termite Odontotermes formosanus (Shiraki is a serious insect pest of trees and dams in China. To date, very little is known about genomic or transcriptomic data for caste differentiation and aggression in O. formosanus. Hence, studies on transcriptome and gene expression profiling are helpful to better understand molecular basis underlying caste differentiation and aggressive behavior in O. formosanus. METHODOLOGY AND PRINCIPAL FINDINGS: Using the Illumina sequencing, we obtained more than 57 million sequencing reads derived from the heads of O. formosanus. These reads were assembled into 116,885 unique sequences (mean size  =  536 bp. Of the unigenes, 30,646 (26.22% had significant similarity with proteins in the NCBI nonredundant protein database and Swiss-Prot database (E-value<10(-5. Of these annotated unigenes, 10,409 and 9,009 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In total, 19,611 (25.52% unigenes were mapped onto 242 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG. A total of 11,661 simple sequence repeats (SSRs were predicted from the current transcriptome database. Moreover, we detected seven putative genes involved in caste differentiation and six putative genes involved in aggression. The qPCR analysis showed that there were significant differences in the expression levels of the three putative genes hexamerin 2, β-glycosidase and bicaudal D involved in caste differentiation and one putative gene Cyp6a20 involved in aggression among workers, soldiers and larvae of O. formosanus. CONCLUSIONS: To our knowledge, this is the first study to characterize the complete head transcriptome of a higher fungus-cultivating termite using high-throughput sequencing. Our study has provided the comprehensive sequence resources available for elucidating molecular basis underlying caste differentiation and aggressive

  7. Methods for simultaneously identifying coherent local clusters with smooth global patterns in gene expression profiles

    Directory of Open Access Journals (Sweden)

    Lee Yun-Shien

    2008-03-01

    Full Text Available Abstract Background The hierarchical clustering tree (HCT with a dendrogram 1 and the singular value decomposition (SVD with a dimension-reduced representative map 2 are popular methods for two-way sorting the gene-by-array matrix map employed in gene expression profiling. While HCT dendrograms tend to optimize local coherent clustering patterns, SVD leading eigenvectors usually identify better global grouping and transitional structures. Results This study proposes a flipping mechanism for a conventional agglomerative HCT using a rank-two ellipse (R2E, an improved SVD algorithm for sorting purpose seriation by Chen 3 as an external reference. While HCTs always produce permutations with good local behaviour, the rank-two ellipse seriation gives the best global grouping patterns and smooth transitional trends. The resulting algorithm automatically integrates the desirable properties of each method so that users have access to a clustering and visualization environment for gene expression profiles that preserves coherent local clusters and identifies global grouping trends. Conclusion We demonstrate, through four examples, that the proposed method not only possesses better numerical and statistical properties, it also provides more meaningful biomedical insights than other sorting algorithms. We suggest that sorted proximity matrices for genes and arrays, in addition to the gene-by-array expression matrix, can greatly aid in the search for comprehensive understanding of gene expression structures. Software for the proposed methods can be obtained at http://gap.stat.sinica.edu.tw/Software/GAP.

  8. Identification of the Fucose Synthetase Gene in the Colanic Acid Gene Cluster of Escherichia coli K-12

    OpenAIRE

    Andrianopoulos, Kanella; Wang, Lei; Reeves, Peter R.

    1998-01-01

    GDP–l-fucose, the substrate for fucosyltransferases for addition of fucose to polysaccharides or glycoproteins in both procaryotes and eucaryotes, is made from GDP–d-mannose. l-Fucose is a component of bacterial surface antigens, including the extracellular polysaccharide colanic acid produced by most Escherichia coli strains. We previously sequenced the E. coli colanic acid gene cluster and identified one of the GDP–l-fucose biosynthetic pathway genes, gmd. We report here the identification ...

  9. Organization and Differential Regulation of a Cluster of Lignin Peroxidase Genes of Phanerochaete chrysosporium

    Science.gov (United States)

    Stewart, Philip; Cullen, Daniel

    1999-01-01

    The lignin peroxidases of Phanerochaete chrysosporium are encoded by a minimum of 10 closely related genes. Physical and genetic mapping of a cluster of eight lip genes revealed six genes occurring in pairs and transcriptionally convergent, suggesting that portions of the lip family arose by gene duplication events. The completed sequence of lipG and lipJ, together with previously published sequences, allowed phylogenetic and intron/exon classifications, indicating two main branches within the lip family. Competitive reverse transcription-PCR was used to assess lip transcript levels in both carbon- and nitrogen-limited media. Transcript patterns showed differential regulation of lip genes in response to medium composition. No apparent correlation was observed between genomic organization and transcript levels. Both constitutive and upregulated transcripts, structurally unrelated to peroxidases, were identified within the lip cluster. PMID:10348854

  10. Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives

    Directory of Open Access Journals (Sweden)

    S. Déjean

    2007-06-01

    Full Text Available Microarray data acquired during time-course experiments allow the temporal variations in gene expression to be monitored. An original postprandial fasting experiment was conducted in the mouse and the expression of 200 genes was monitored with a dedicated macroarray at 11 time points between 0 and 72 hours of fasting. The aim of this study was to provide a relevant clustering of gene expression temporal profiles. This was achieved by focusing on the shapes of the curves rather than on the absolute level of expression. Actually, we combined spline smoothing and first derivative computation with hierarchical and partitioning clustering. A heuristic approach was proposed to tune the spline smoothing parameter using both statistical and biological considerations. Clusters are illustrated a posteriori through principal component analysis and heatmap visualization. Most results were found to be in agreement with the literature on the effects of fasting on the mouse liver and provide promising directions for future biological investigations.

  11. Identification of conserved gene clusters in multiple genomes based on synteny and homology

    Directory of Open Access Journals (Sweden)

    Nikolski Macha

    2011-10-01

    Full Text Available Abstract Background Uncovering the relationship between the conserved chromosomal segments and the functional relatedness of elements within these segments is an important question in computational genomics. We build upon the series of works on gene teams and homology teams. Results Our primary contribution is a local sliding-window SYNS (SYNtenic teamS algorithm that refines an existing family structure into orthologous sub-families by analyzing the neighborhoods around the members of a given family with a locally sliding window. The neighborhood analysis is done by computing conserved gene clusters. We evaluate our algorithm on the existing homologous families from the Genolevures database over five genomes of the Hemyascomycete phylum. Conclusions The result is an efficient algorithm that works on multiple genomes, considers paralogous copies of genes and is able to uncover orthologous clusters even in distant genomes. Resulting orthologous clusters are comparable to those obtained by manual curation.

  12. Clustering Time Series Gene Expression Data Based on Sum-of-Exponentials Fitting

    Directory of Open Access Journals (Sweden)

    Giurcăneanu Ciprian Doru

    2005-01-01

    Full Text Available This paper presents a method based on fitting a sum-of-exponentials model to the nonuniformly sampled data, for clustering the time series of gene expression data. The structure of the model is estimated by using the minimum description length (MDL principle for nonlinear regression, in a new form, incorporating a normalized maximum-likelihood (NML model for a subset of the parameters. The performance of the structure estimation method is studied using simulated data, and the superiority of the new selection criterion over earlier criteria is demonstrated. The accuracy of the nonlinear estimates of the model parameters is analyzed with respect to the Cramér-Rao lower bounds. Clustering examples of gene expression data sets from a developmental biology application are presented, revealing gene grouping into clusters according to functional classes.

  13. Form gene clustering method about pan-ethnic-group products based on emotional semantic

    Science.gov (United States)

    Chen, Dengkai; Ding, Jingjing; Gao, Minzhuo; Ma, Danping; Liu, Donghui

    2016-09-01

    The use of pan-ethnic-group products form knowledge primarily depends on a designer's subjective experience without user participation. The majority of studies primarily focus on the detection of the perceptual demands of consumers from the target product category. A pan-ethnic-group products form gene clustering method based on emotional semantic is constructed. Consumers' perceptual images of the pan-ethnic-group products are obtained by means of product form gene extraction and coding and computer aided product form clustering technology. A case of form gene clustering about the typical pan-ethnic-group products is investigated which indicates that the method is feasible. This paper opens up a new direction for the future development of product form design which improves the agility of product design process in the era of Industry 4.0.

  14. Organization, expression and evolution of a disease resistance gene cluster in soybean.

    Science.gov (United States)

    Graham, Michelle A; Marek, Laura Fredrick; Shoemaker, Randy C

    2002-01-01

    PCR amplification was previously used to identify a cluster of resistance gene analogues (RGAs) on soybean linkage group J. Resistance to powdery mildew (Rmd-c), Phytophthora stem and root rot (Rps2), and an ineffective nodulation gene (Rj2) map within this cluster. BAC fingerprinting and RGA-specific primers were used to develop a contig of BAC clones spanning this region in cultivar "Williams 82" [rps2, Rmd (adult onset), rj2]. Two cDNAs with homology to the TIR/NBD/LRR family of R-genes have also been mapped to opposite ends of a BAC in the contig Gm_Isb001_091F11 (BAC 91F11). Sequence analyses of BAC 91F11 identified 16 different resistance-like gene (RLG) sequences with homology to the TIR/NBD/LRR family of disease resistance genes. Four of these RLGs represent two potentially novel classes of disease resistance genes: TIR/NBD domains fused inframe to a putative defense-related protein (NtPRp27-like) and TIR domains fused inframe to soybean calmodulin Ca(2+)-binding domains. RT-PCR analyses using gene-specific primers allowed us to monitor the expression of individual genes in different tissues and developmental stages. Three genes appeared to be constitutively expressed, while three were differentially expressed. Analyses of the R-genes within this BAC suggest that R-gene evolution in soybean is a complex and dynamic process. PMID:12524363

  15. A remarkably stable TipE gene cluster: evolution of insect Para sodium channel auxiliary subunits

    Directory of Open Access Journals (Sweden)

    Li Jia

    2011-11-01

    Full Text Available Abstract Background First identified in fruit flies with temperature-sensitive paralysis phenotypes, the Drosophila melanogaster TipE locus encodes four voltage-gated sodium (NaV channel auxiliary subunits. This cluster of TipE-like genes on chromosome 3L, and a fifth family member on chromosome 3R, are important for the optional expression and functionality of the Para NaV channel but appear quite distinct from auxiliary subunits in vertebrates. Here, we exploited available arthropod genomic resources to trace the origin of TipE-like genes by mapping their evolutionary histories and examining their genomic architectures. Results We identified a remarkably conserved synteny block of TipE-like orthologues with well-maintained local gene arrangements from 21 insect species. Homologues in the water flea, Daphnia pulex, suggest an ancestral pancrustacean repertoire of four TipE-like genes; a subsequent gene duplication may have generated functional redundancy allowing gene losses in the silk moth and mosquitoes. Intronic nesting of the insect TipE gene cluster probably occurred following the divergence from crustaceans, but in the flour beetle and silk moth genomes the clusters apparently escaped from nesting. Across Pancrustacea, TipE gene family members have experienced intronic nesting, escape from nesting, retrotransposition, translocation, and gene loss events while generally maintaining their local gene neighbourhoods. D. melanogaster TipE-like genes exhibit coordinated spatial and temporal regulation of expression distinct from their host gene but well-correlated with their regulatory target, the Para NaV channel, suggesting that functional constraints may preserve the TipE gene cluster. We identified homology between TipE-like NaV channel regulators and vertebrate Slo-beta auxiliary subunits of big-conductance calcium-activated potassium (BKCa channels, which suggests that ion channel regulatory partners have evolved distinct lineage

  16. Functions of genes and enzymes involved in phenalinolactone biosynthesis.

    Science.gov (United States)

    Daum, Martina; Schnell, Hans-Jörg; Herrmann, Simone; Günther, Andreas; Murillo, Renato; Müller, Rolf; Bisel, Philippe; Müller, Michael; Bechthold, Andreas

    2010-07-05

    Phenalinolactones are novel terpene glycoside antibiotics produced by Streptomyces sp. Tü6071. Inactivation of three oxygenase genes (plaO2, plaO3 and plaO5), two dehydrogenase genes (plaU, plaZ) and one putative acetyltransferase gene (plaV) led to the production of novel phenalinolactone derivatives (PL HS6, PL HS7, PL HS2 and PL X1). Furthermore, the exact biosynthetic functions of two enzymes were determined, and their in vitro activities were demonstrated. PlaO1, an Fe(II)/alpha-ketoglutarate-dependent dioxygenase, is responsible for the key step in gamma-butyrolactone formation, whereas PlaO5, a cytochrome P450-dependent monooxygenase, catalyses the 1-C-hydroxylation of phenalinolactone D. In addition, stable isotope feeding experiments with biosynthetic precursors shed light on the origin of the carbons in the gamma-butyrolactone moiety.

  17. Sex-related gene expression profiles in the adrenal cortex in the mature rat: microarray analysis with emphasis on genes involved in steroidogenesis.

    Science.gov (United States)

    Trejter, Marcin; Hochol, Anna; Tyczewska, Marianna; Ziolkowska, Agnieszka; Jopek, Karol; Szyszka, Marta; Malendowicz, Ludwik K; Rucinski, Marcin

    2015-03-01

    Notable sex-related differences exist in mammalian adrenal cortex structure and function. In adult rats, the adrenal weight and the average volume of zona fasciculata cells of females are larger and secrete greater amounts of corticosterone than those of males. The molecular bases of these sex-related differences are poorly understood. In this study, to explore the molecular background of these differences, we defined zone- and sex-specific transcripts in adult male and female (estrous cycle phase) rats. Twelve-week-old rats of both genders were used and samples were taken from the zona glomerulosa (ZG) and zona fasciculata/reticularis (ZF/R) zones. Transcriptome identification was carried out using the Affymetrix(®) Rat Gene 1.1 ST Array. The microarray data were compared by fold change with significance according to moderated t-statistics. Subsequently, we performed functional annotation clustering using the Gene Ontology (GO) and Database for Annotation, Visualization and Integrated Discovery (DAVID). In the first step, we explored differentially expressed transcripts in the adrenal ZG and ZF/R. The number of differentially expressed transcripts was notably higher in the female than in the male rats (702 vs. 571). The differentially expressed genes which were significantly enriched included genes involved in steroid hormone metabolism, and their expression levels in the ZF/R of adult female rats were significantly higher compared with those in the male rats. In the female ZF/R, when compared with that of the males, prevailing numbers of genes linked to cell fraction, oxidation/reduction processes, response to nutrients and to extracellular stimuli or steroid hormone stimuli were downregulated. The microarray data for key genes involved directly in steroidogenesis were confirmed by qPCR. Thus, when compared with that of the males, in the female ZF/R, higher expression levels of genes involved directly in steroid hormone synthesis were accompanied by lower

  18. Physical and genetic map of the major nif gene cluster from Azotobacter vinelandii.

    Science.gov (United States)

    Jacobson, M R; Brigle, K E; Bennett, L T; Setterquist, R A; Wilson, M S; Cash, V L; Beynon, J; Newton, W E; Dean, D R

    1989-02-01

    Determination of a 28,793-base-pair DNA sequence of a region from the Azotobacter vinelandii genome that includes and flanks the nitrogenase structural gene region was completed. This information was used to revise the previously proposed organization of the major nif cluster. The major nif cluster from A. vinelandii encodes 15 nif-specific genes whose products bear significant structural identity to the corresponding nif-specific gene products from Klebsiella pneumoniae. These genes include nifH, nifD, nifK, nifT, nifY, nifE, nifN, nifX, nifU, nifS, nifV, nifW, nifZ, nifM, and nifF. Although there are significant spatial differences, the identified A. vinelandii nif-specific genes have the same sequential arrangement as the corresponding nif-specific genes from K. pneumoniae. Twelve other potential genes whose expression could be subject to nif-specific regulation were also found interspersed among the identified nif-specific genes. These potential genes do not encode products that are structurally related to the identified nif-specific gene products. Eleven potential nif-specific promoters were identified within the major nif cluster, and nine of these are preceded by an appropriate upstream activator sequence. A + T-rich regions were identified between 8 of the 11 proposed nif promoter sequences and their upstream activator sequences. Site-directed deletion-and-insertion mutagenesis was used to establish a genetic map of the major nif cluster.

  19. Gene microarray data analysis using parallel point-symmetry-based clustering.

    Science.gov (United States)

    Sarkar, Anasua; Maulik, Ujjwal

    2015-01-01

    Identification of co-expressed genes is the central goal in microarray gene expression analysis. Point-symmetry-based clustering is an important unsupervised learning technique for recognising symmetrical convex- or non-convex-shaped clusters. To enable fast clustering of large microarray data, we propose a distributed time-efficient scalable approach for point-symmetry-based K-Means algorithm. A natural basis for analysing gene expression data using symmetry-based algorithm is to group together genes with similar symmetrical expression patterns. This new parallel implementation also satisfies linear speedup in timing without sacrificing the quality of clustering solution on large microarray data sets. The parallel point-symmetry-based K-Means algorithm is compared with another new parallel symmetry-based K-Means and existing parallel K-Means over eight artificial and benchmark microarray data sets, to demonstrate its superiority, in both timing and validity. The statistical analysis is also performed to establish the significance of this message-passing-interface based point-symmetry K-Means implementation. We also analysed the biological relevance of clustering solutions.

  20. Transcriptional analysis of the jamaicamide gene cluster from the marine cyanobacterium Lyngbya majuscula and identification of possible regulatory proteins

    Directory of Open Access Journals (Sweden)

    Dorrestein Pieter C

    2009-12-01

    Full Text Available Abstract Background The marine cyanobacterium Lyngbya majuscula is a prolific producer of bioactive secondary metabolites. Although biosynthetic gene clusters encoding several of these compounds have been identified, little is known about how these clusters of genes are transcribed or regulated, and techniques targeting genetic manipulation in Lyngbya strains have not yet been developed. We conducted transcriptional analyses of the jamaicamide gene cluster from a Jamaican strain of Lyngbya majuscula, and isolated proteins that could be involved in jamaicamide regulation. Results An unusually long untranslated leader region of approximately 840 bp is located between the jamaicamide transcription start site (TSS and gene cluster start codon. All of the intergenic regions between the pathway ORFs were transcribed into RNA in RT-PCR experiments; however, a promoter prediction program indicated the possible presence of promoters in multiple intergenic regions. Because the functionality of these promoters could not be verified in vivo, we used a reporter gene assay in E. coli to show that several of these intergenic regions, as well as the primary promoter preceding the TSS, are capable of driving β-galactosidase production. A protein pulldown assay was also used to isolate proteins that may regulate the jamaicamide pathway. Pulldown experiments using the intergenic region upstream of jamA as a DNA probe isolated two proteins that were identified by LC-MS/MS. By BLAST analysis, one of these had close sequence identity to a regulatory protein in another cyanobacterial species. Protein comparisons suggest a possible correlation between secondary metabolism regulation and light dependent complementary chromatic adaptation. Electromobility shift assays were used to evaluate binding of the recombinant proteins to the jamaicamide promoter region. Conclusion Insights into natural product regulation in cyanobacteria are of significant value to drug discovery

  1. Increased incidence of rare codon clusters at 5' and 3' gene termini:implications for function

    Directory of Open Access Journals (Sweden)

    Clark Patricia L

    2010-02-01

    Full Text Available Abstract Background The process of translation can be affected by the use of rare versus common codons within the mRNA transcript. Results Here, we show that rare codons are enriched at the 5' and 3' termini of genes from E. coli and other prokaryotes. Genes predicted to be secreted show significant enrichment in 5' rare codon clusters, but not 3' rare codon clusters. Surprisingly, no correlation between 5' mRNA structure and rare codon usage was observed. Conclusions Potential functional roles for the enrichment of rare codons at terminal positions are explored.

  2. plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters

    DEFF Research Database (Denmark)

    Kautsar, Satria A.; Suarez Duran, Hernando G.; Blin, Kai

    2017-01-01

    of predicted biosynthetic enzyme-coding genes, and facilitates comparative genomic analysis to study the evolutionary conservation of each cluster. Applied on 48 high-quality plant genomes, plantiSMASH identifies a rich diversity of candidate plant BGCs. These results will guide further experimental...... exploration of the nature and dynamics of gene clustering in plant metabolism. Moreover, spurred by the continuing decrease in costs of plant genome sequencing, they will allow genome mining technologies to be applied to plant natural product discovery. The plantiSMASH web server, precalculated results...

  3. Sleeping Beauty mouse models identify candidate genes involved in gliomagenesis.

    Directory of Open Access Journals (Sweden)

    Irina Vyazunova

    Full Text Available Genomic studies of human high-grade gliomas have discovered known and candidate tumor drivers. Studies in both cell culture and mouse models have complemented these approaches and have identified additional genes and processes important for gliomagenesis. Previously, we found that mobilization of Sleeping Beauty transposons in mice ubiquitously throughout the body from the Rosa26 locus led to gliomagenesis with low penetrance. Here we report the characterization of mice in which transposons are mobilized in the Glial Fibrillary Acidic Protein (GFAP compartment. Glioma formation in these mice did not occur on an otherwise wild-type genetic background, but rare gliomas were observed when mobilization occurred in a p19Arf heterozygous background. Through cloning insertions from additional gliomas generated by transposon mobilization in the Rosa26 compartment, several candidate glioma genes were identified. Comparisons to genetic, epigenetic and mRNA expression data from human gliomas implicates several of these genes as tumor suppressor genes and oncogenes in human glioblastoma.

  4. Sleeping Beauty Mouse Models Identify Candidate Genes Involved in Gliomagenesis

    Science.gov (United States)

    Vyazunova, Irina; Maklakova, Vilena I.; Berman, Samuel; De, Ishani; Steffen, Megan D.; Hong, Won; Lincoln, Hayley; Morrissy, A. Sorana; Taylor, Michael D.; Akagi, Keiko; Brennan, Cameron W.; Rodriguez, Fausto J.; Collier, Lara S.

    2014-01-01

    Genomic studies of human high-grade gliomas have discovered known and candidate tumor drivers. Studies in both cell culture and mouse models have complemented these approaches and have identified additional genes and processes important for gliomagenesis. Previously, we found that mobilization of Sleeping Beauty transposons in mice ubiquitously throughout the body from the Rosa26 locus led to gliomagenesis with low penetrance. Here we report the characterization of mice in which transposons are mobilized in the Glial Fibrillary Acidic Protein (GFAP) compartment. Glioma formation in these mice did not occur on an otherwise wild-type genetic background, but rare gliomas were observed when mobilization occurred in a p19Arf heterozygous background. Through cloning insertions from additional gliomas generated by transposon mobilization in the Rosa26 compartment, several candidate glioma genes were identified. Comparisons to genetic, epigenetic and mRNA expression data from human gliomas implicates several of these genes as tumor suppressor genes and oncogenes in human glioblastoma. PMID:25423036

  5. Genes involved in bovine milk-fat composition

    NARCIS (Netherlands)

    Schennink, A.

    2009-01-01

    The aim of the research described in this thesis was to identify genes that underlie the genetic variation in bovine milk-fat composition. The fat composition of milk samples from approximately 2,000 Dutch Holstein Friesian cows in their first lactation was measured by gas chromatography. Quantita

  6. Apple contains receptor-like genes homologous to the Cladosporium fulvum resistance gene family of tomato with a cluster of genes cosegregating with Vf apple scab resistance.

    Science.gov (United States)

    Vinatzer, B A; Patocchi, A; Gianfranceschi, L; Tartarini, S; Zhang, H B; Gessler, C; Sansavini, S

    2001-04-01

    Scab caused by the fungal pathogen Venturia inaequalis is the most common disease of cultivated apple (Malus x domestica Borkh.). Monogenic resistance against scab is found in some small-fruited wild Malus species and has been used in apple breeding for scab resistance. Vf resistance of Malus floribunda 821 is the most widely used scab resistance source. Because breeding a high-quality cultivar in perennial fruit trees takes dozens of years, cloning disease resistance genes and using them in the transformation of high-quality apple varieties would be advantageous. We report the identification of a cluster of receptor-like genes with homology to the Cladosporium fulvum (Cf) resistance gene family of tomato on bacterial artificial chromosome clones derived from the Vf scab resistance locus. Three members of the cluster were sequenced completely. Similar to the Cf gene family of tomato, the deduced amino acid sequences coded by these genes contain an extracellular leucine-rich repeat domain and a transmembrane domain. The transcription of three members of the cluster was determined by reverse transcriptionpolymerase chain reaction to be constitutive, and the transcription and translation start of one member was verified by 5' rapid amplification of cDNA ends. We discuss the parallels between Cf resistance of tomato and Vf resistance of apple and the possibility that one of the members of the gene cluster is the Vf gene. Cf homologs from other regions of the apple genome also were identified and are likely to present other scab resistance genes.

  7. Genes and Gene Networks Involved in Sodium Fluoride-Elicited Cell Death Accompanying Endoplasmic Reticulum Stress in Oral Epithelial Cells

    Directory of Open Access Journals (Sweden)

    Yoshiaki Tabuchi

    2014-05-01

    Full Text Available Here, to understand the molecular mechanisms underlying cell death induced by sodium fluoride (NaF, we analyzed gene expression patterns in rat oral epithelial ROE2 cells exposed to NaF using global-scale microarrays and bioinformatics tools. A relatively high concentration of NaF (2 mM induced cell death concomitant with decreases in mitochondrial membrane potential, chromatin condensation and caspase-3 activation. Using 980 probe sets, we identified 432 up-regulated and 548 down-regulated genes, that were differentially expressed by >2.5-fold in the cells treated with 2 mM of NaF and categorized them into 4 groups by K-means clustering. Ingenuity® pathway analysis revealed several gene networks from gene clusters. The gene networks Up-I and Up-II included many up-regulated genes that were mainly associated with the biological function of induction or prevention of cell death, respectively, such as Atf3, Ddit3 and Fos (for Up-I and Atf4 and Hspa5 (for Up-II. Interestingly, knockdown of Ddit3 and Hspa5 significantly increased and decreased the number of viable cells, respectively. Moreover, several endoplasmic reticulum (ER stress-related genes including, Ddit3, Atf4 and Hapa5, were observed in these gene networks. These findings will provide further insight into the molecular mechanisms of NaF-induced cell death accompanying ER stress in oral epithelial cells.

  8. Blast fungus-induction and developmental and tissuespecific expression of a rice P450 CYP72A gene cluster

    Institute of Scientific and Technical Information of China (English)

    WANG Yaling; LI Qun; HE Zuhua

    2004-01-01

    Cytochrome P450 gene superfamily is widely involved in diverse processes of plant development and environmental responses including defense response to pathogens. We previously isolated a rice cDNA fragment in a DD-PCR screening for blast fungus-induced genes. In the current study, we isolated a CYP72A gene cluster consisting of 7 P450 CYP72A genes (CYP72A17~23) with the conserved cDNA sequence through the public rice genome data. There are total 14 putative CYP72A members in the rice genome, with high diversity at N-terminal sequences while high homology at C-terminal sequences of those 14 putative proteins. We analyzed expression profiles of the cloned 7 CYP72A genes during pathogen infection and development. The results showed that expression of CYP72A18, 19, 22 and 23 was differentially regulated in the incompatible and compatible interactions between rice and blast fungus. Except CYP72A20, a pseudogene, other 6 CYP72A genes also exhibited temporal and spatial expression patterns, respectively. These findings provide fundamental data for rice P450 gene function analysis.

  9. An original SERPINA3 gene cluster: Elucidation of genomic organization and gene expression in the Bos taurus 21q24 region

    Directory of Open Access Journals (Sweden)

    Ouali Ahmed

    2008-04-01

    Full Text Available Abstract Background The superfamily of serine proteinase inhibitors (serpins is involved in numerous fundamental biological processes as inflammation, blood coagulation and apoptosis. Our interest is focused on the SERPINA3 sub-family. The major human plasma protease inhibitor, α1-antichymotrypsin, encoded by the SERPINA3 gene, is homologous to genes organized in clusters in several mammalian species. However, although there is a similar genic organization with a high degree of sequence conservation, the reactive-centre-loop domains, which are responsible for the protease specificity, show significant divergences. Results We provide additional information by analyzing the situation of SERPINA3 in the bovine genome. A cluster of eight genes and one pseudogene sharing a high degree of identity and the same structural organization was characterized. Bovine SERPINA3 genes were localized by radiation hybrid mapping on 21q24 and only spanned over 235 Kilobases. For all these genes, we propose a new nomenclature from SERPINA3-1 to SERPINA3-8. They share approximately 70% of identity with the human SERPINA3 homologue. In the cluster, we described an original sub-group of six members with an unexpected high degree of conservation for the reactive-centre-loop domain, suggesting a similar peptidase inhibitory pattern. Preliminary expression analyses of these bovSERPINA3s showed different tissue-specific patterns and diverse states of glycosylation and phosphorylation. Finally, in the context of phylogenetic analyses, we improved our knowledge on mammalian SERPINAs evolution. Conclusion Our experimental results update data of the bovine genome sequencing, substantially increase the bovSERPINA3 sub-family and enrich the phylogenetic tree of serpins. We provide new opportunities for future investigations to approach the biological functions of this unusual subset of serine proteinase inhibitors.

  10. Copy number variants in the kallikrein gene cluster.

    Directory of Open Access Journals (Sweden)

    Pernilla Lindahl

    Full Text Available The kallikrein gene family (KLK1-KLK15 is the largest contiguous group of protease genes within the human genome and is associated with both risk and outcome of cancer and other diseases. We searched for copy number variants in all KLK genes using quantitative PCR analysis and analysis of inheritance patterns of single nucleotide polymorphisms. Two deletions were identified: one 2235-bp deletion in KLK9 present in 1.2% of alleles, and one 3394-bp deletion in KLK15 present in 4.0% of alleles. Each deletion eliminated one complete exon and created out-of-frame coding that eliminated the catalytic triad of the resulting truncated gene product, which therefore likely is a non-functional protein. Deletion breakpoints identified by DNA sequencing located the KLK9 deletion breakpoint to a long interspersed element (LINE repeated sequence, while the deletion in KLK15 is located in a single copy sequence. To search for an association between each deletion and risk of prostate cancer (PC, we analyzed a cohort of 667 biopsied men (266 PC cases and 401 men with no evidence of PC at biopsy using short deletion-specific PCR assays. There was no association between evidence of PC in this cohort and the presence of either gene deletion. Haplotyping revealed a single origin of each deletion, with most recent common ancestor estimates of 3000-8000 and 6000-14 000 years for the deletions in KLK9 and KLK15, respectively. The presence of the deletions on the same haplotypes in 1000 Genomes data of both European and African populations indicate an early origin of both deletions. The old age in combination with homozygous presence of loss-of-function variants suggests that some kallikrein-related peptidases have non-essential functions.

  11. The evolution and maintenance of Hox gene clusters in vertebrates and the teleost-specific genome duplication.

    Science.gov (United States)

    Kuraku, Shigehiro; Meyer, Axel

    2009-01-01

    Hox genes are known to specify spatial identities along the anterior-posterior axis during embryogenesis. In vertebrates and most other deuterostomes, they are arranged in sets of uninterrupted clusters on chromosomes, and are in most cases expressed in a "colinear" fashion, in which genes closer to the 3-end of the Hox clusters are expressed earlier and more anteriorly and genes close to the 5-end of the clusters later and more posteriorly. In this review, we summarize the current understanding of how Hox gene clusters have been modified from basal lineages of deuterostomes to diverse taxa of vertebrates. Our parsimony reconstruction of Hox cluster architecture at various stages of vertebrate evolution highlights that the variation in Hox cluster structures among jawed vertebrates is mostly due to secondary lineage-specific gene losses and an additional genome duplication that occurred in the actinopterygian stem lineage, the teleost-specific genome duplication (TSGD).

  12. A gene cluster for biosynthesis of the sesquiterpenoid antibiotic pentalenolactone in Streptomyces avermitilis.

    Science.gov (United States)

    Tetzlaff, Charles N; You, Zheng; Cane, David E; Takamatsu, Satoshi; Omura, Satoshi; Ikeda, Haruo

    2006-05-16

    Streptomyces avermitilis, an industrial organism responsible for the production of the anthelminthic avermectins, harbors a 13.4 kb gene cluster containing 13 unidirectionally transcribed open reading frames corresponding to the apparent biosynthetic operon for the sesquiterpene antibiotic pentalenolactone. The advanced intermediate pentalenolactone F, along with the shunt metabolite pentalenic acid, could be isolated from cultures of S. avermitilis, thereby establishing that the pentalenolactone biosynthetic pathway is functional in S. avermitilis. Deletion of the entire 13.4 kb cluster from S. avermitilis abolished formation of pentalenolactone metabolites, while transfer of the intact cluster to the pentalenolactone nonproducer Streptomyces lividans 1326 resulted in production of pentalenic acid. Direct evidence for the biochemical function of the individual biosynthetic genes came from expression of the ptlA gene (SAV2998) in Escherichia coli. Assay of the resultant protein established that PtlA is a pentalenene synthase, catalyzing the cyclization of farnesyl diphosphate to pentalenene, the parent hydrocarbon of the pentalenolactone family of metabolites. The most upstream gene in the cluster, gap1 (SAV2990), was shown to correspond to the pentalenolactone resistance gene, based on expression in E. coli and demonstration that the resulting glyceraldehyde-3-phosphate dehydrogenase, the normal target of pentalenolactone, was insensitive to the antibiotic. Furthermore, a second GAPDH isozyme (gap2, SAV6296) has been expressed in E. coli and shown to be inactivated by pentalenolactone.

  13. Nucleotide sequence analysis of hypervariable junctions of Haemophilus influenzae pilus gene clusters.

    Science.gov (United States)

    Read, T D; Satola, S W; Farley, M M

    2000-12-01

    Haemophilus influenzae pili are surface structures that promote attachment to human epithelial cells. The five genes that encode pili, hifABCDE, are found inserted in genomes either between pmbA and hpt (hif-1) or between purE and pepN (hif-2). We determined the sequence between the ends of the pilus clusters and bordering genes in a number of H. influenzae strains. The junctions of the hif-1 cluster (limited to biogroup aegyptius isolates) are structurally simple. In contrast, hif-2 junctions are highly diverse, complex assemblies of conserved intergenic sequences (including genes hicA and hicB) with evidence of frequent recombination. Variation at hif-2 junctions seems to be tied to multiple copies of a 23-bp Haemophilus intergenic dyad sequence. The hif-1 cluster appears to have originated in biogroup aegyptius strains from invasion of the hpt-pmbA region by a DNA template containing the hif-2 genes with termini in the hairpin loop of flanking intergenic dyad sequences. The pilus gene clusters are an interesting model of a mobile "pathogenicity island" not associated with a phage, transposon, or insertion element.

  14. Sequencing, physical organization and kinetic expression of the patulin biosynthetic gene cluster from Penicillium expansum.

    Science.gov (United States)

    Tannous, Joanna; El Khoury, Rhoda; Snini, Selma P; Lippi, Yannick; El Khoury, André; Atoui, Ali; Lteif, Roger; Oswald, Isabelle P; Puel, Olivier

    2014-10-17

    Patulin is a polyketide-derived mycotoxin produced by numerous filamentous fungi. Among them, Penicillium expansum is by far the most problematic species. This fungus is a destructive phytopathogen capable of growing on fruit, provoking the blue mold decay of apples and producing significant amounts of patulin. The biosynthetic pathway of this mycotoxin is chemically well-characterized, but its genetic bases remain largely unknown with only few characterized genes in less economic relevant species. The present study consisted of the identification and positional organization of the patulin gene cluster in P. expansum strain NRRL 35695. Several amplification reactions were performed with degenerative primers that were designed based on sequences from the orthologous genes available in other species. An improved genome Walking approach was used in order to sequence the remaining adjacent genes of the cluster. RACE-PCR was also carried out from mRNAs to determine the start and stop codons of the coding sequences. The patulin gene cluster in P. expansum consists of 15 genes in the following order: patH, patG, patF, patE, patD, patC, patB, patA, patM, patN, patO, patL, patI, patJ, and patK. These genes share 60-70% of identity with orthologous genes grouped differently, within a putative patulin cluster described in a non-producing strain of Aspergillus clavatus. The kinetics of patulin cluster genes expression was studied under patulin-permissive conditions (natural apple-based medium) and patulin-restrictive conditions (Eagle's minimal essential medium), and demonstrated a significant association between gene expression and patulin production. In conclusion, the sequence of the patulin cluster in P. expansum constitutes a key step for a better understanding of the mechanisms leading to patulin production in this fungus. It will allow the role of each gene to be elucidated, and help to define strategies to reduce patulin production in apple-based products.

  15. When genome-based approach meets the ‘old but good’: revealing genes involved in the antibacterial activity of Pseudomonas sp. P482 against soft rot pathogens.

    Directory of Open Access Journals (Sweden)

    Dorota Magdalena Krzyżanowska

    2016-05-01

    Full Text Available Dickeya solani and Pectobacterium carotovorum subsp. brasili¬ense are recently established species of bacterial plant pathogens causing black leg and soft rot of many vegetables and ornamental plants. Pseudomonas sp. strain P482 inhibits the growth of these pathogens, a desired trait considering the limited measures to combat these diseases. In this study, we determined the genetic background of the antibacterial activity of P482, and established the phylogenetic position of this strain.Pseudomonas sp. P482 was classified as Pseudomonas donghuensis. Genome mining revealed that the P482 genome does not contain genes determining the synthesis of known antimicrobials. However, the ClusterFinder algorithm, designed to detect atypical or novel classes of secondary metabolite gene clusters, predicted 18 such clusters in the genome. Screening of a Tn5 mutant library yielded an antimicrobial negative transposon mutant. The transposon insertion was located in a gene encoding an HpcH/HpaI aldolase/citrate lyase family protein. This gene is located in a hypothetical cluster predicted by the ClusterFinder, together with the downstream homologues of four nfs genes, that confer production of a nonfluorescent siderophore by P. donghuensis HYST. Site-directed inactivation of the HpcH/HpaI aldolase gene, the adjacent short chain dehydrogenase gene, as well as a homologue of an essential nfs cluster gene, all abolished the antimicrobial activity of the P482, suggesting their involvement in a common biosynthesis pathway. However, none of the mutants showed a decreased siderophore yield, neither was the antimicrobial activity of the wild type P482 compromised by high iron bioavailability.A genomic region comprising the nfs cluster and three upstream genes is involved in the antibacterial activity of P. donghuensis P482 against D. solani and P. carotovorum subsp. brasiliense. The genes studied are unique to the two known P. donghuensis strains. This study

  16. Organization of the human keratin type II gene cluster at 12q13

    Energy Technology Data Exchange (ETDEWEB)

    Yoon, S.J.; LeBlanc-Straceski, J.; Krauter, K. [Albert Einstein College of Medicine, Bronx, NY (United States)] [and others

    1994-12-01

    Keratin proteins constitute intermediate filaments and are the major differentiation products of mammalian epithelial cells. The epithelial keratins are classified into two groups, type I and type II, and one member of each group is expressed in a given epithelial cell differentiation stage. Mutations in type I and type II keratin genes have now been implicated in three different human genetic disorders, epidermolysis bullosa simplex, epidermolytic hyperkeratosis, and epidermolytic palmoplantar keratoderma. Members of the type I keratins are mapped to human chromosome 17, and the type II keratin genes are mapped to chromosome 12. To understand the organization of the type II keratin genes on chromosome 12, we isolated several yeast artificial chromosomes carrying these keratin genes and examined them in detail. We show that eight already known type II keratin genes are located in a cluster at 12q13, and their relative organization reflects their evolutionary relationship. We also determined that a type I keratin gene, KRT8, is located next to its partner, KRT18, in this cluster. Careful examination of the cluster also revealed that there may be a number of additional keratin genes at this locus that have not been described previously. 41 refs., 3 figs., 1 tab.

  17. The impact of advertising patient and public involvement on trial recruitment: embedded cluster randomised recruitment trial.

    Science.gov (United States)

    Hughes-Morley, Adwoa; Hann, Mark; Fraser, Claire; Meade, Oonagh; Lovell, Karina; Young, Bridget; Roberts, Chris; Cree, Lindsey; More, Donna; O'Leary, Neil; Callaghan, Patrick; Waheed, Waquas; Bower, Peter

    2016-12-08

    Patient and public involvement in research (PPIR) may improve trial recruitment rates, but it is unclear how. Where trials use PPIR to improve design and conduct, many do not communicate this clearly to potential participants. Better communication of PPIR might encourage patient enrolment, as trials may be perceived as more socially valid, relevant and trustworthy. We aimed to evaluate the impact on recruitment of directly advertising PPIR to potential trial participants. This is a cluster trial, embedded within a host trial ('EQUIP') recruiting service users diagnosed with severe mental illness. The intervention was informed by a systematic review, a qualitative study, social comparison theory and a stakeholder workshop including service users and carers. Adopting Participatory Design approaches, we co-designed the recruitment intervention with PPIR partners using a leaflet to advertise the PPIR in EQUIP and sent potential participants invitations with the leaflet (intervention group) or not (control group). Primary outcome was the proportion of patients enrolled in EQUIP. Secondary outcomes included the proportions of patients who positively responded to the trial invitation. Thirty-four community mental health teams were randomised and 8182 service users invited. For the primary outcome, 4% of patients in the PPIR group were enrolled versus 5.3% of the control group. The intervention was not effective for improving recruitment rates (adjusted OR = 0.75, 95% CI = 0.53 to 1.07, p = 0.113). For the secondary outcome of positive response, the intervention was not effective, with 7.3% of potential participants in the intervention group responding positively versus 7.9% of the control group (adjusted OR = 0.74, 95% CI = 0.53 to 1.04, p = 0.082). We did not find a positive impact of directly advertising PPIR on any other outcomes. To our knowledge, this is the largest ever embedded trial to evaluate a recruitment or PPIR intervention

  18. Identification and validation of genes involved in cervical tumourigenesis

    Directory of Open Access Journals (Sweden)

    Bose Mayil

    2011-02-01

    Full Text Available Abstract Background Cervical cancer is the most common cancer among Indian women. This cancer has well defined pre-cancerous stages and evolves over 10-15 years or more. This study was undertaken to identify differentially expressed genes between normal, dysplastic and invasive cervical cancer. Materials and methods A total of 28 invasive cervical cancers, 4 CIN3/CIS, 4 CIN1/CIN2 and 5 Normal cervix samples were studied. We have used microarray technique followed by validation of the significant genes by relative quantitation using Taqman Low Density Array Real Time PCR. Immunohistochemistry was used to study the protein expression of MMP3, UBE2C and p16 in normal, dysplasia and cancers of the cervix. The effect of a dominant negative UBE2C on the growth of the SiHa cells was assessed using a MTT assay. Results Our study, for the first time, has identified 20 genes to be up-regulated and 14 down-regulated in cervical cancers and 5 up-regulated in CIN3. In addition, 26 genes identified by other studies, as to playing a role in cervical cancer, were also confirmed in our study. UBE2C, CCNB1, CCNB2, PLOD2, NUP210, MELK, CDC20 genes were overexpressed in tumours and in CIN3/CIS relative to both Normal and CIN1/CIN2, suggesting that they could have a role to play in the early phase of tumorigenesis. IL8, INDO, ISG15, ISG20, AGRN, DTXL, MMP1, MMP3, CCL18, TOP2A AND STAT1 were found to be upregulated in tumours. Using Immunohistochemistry, we showed over-expression of MMP3, UBE2C and p16 in cancers compared to normal cervical epithelium and varying grades of dysplasia. A dominant negative UBE2C was found to produce growth inhibition in SiHa cells, which over-expresses UBE2C 4 fold more than HEK293 cells. Conclusions Several novel genes were found to be differentially expressed in cervical cancer. MMP3, UBE2C and p16 protein overexpression in cervical cancers was confirmed by immunohistochemistry. These will need to be validated further in a larger

  19. Mapping of the {alpha}{sub 4} subunit gene (GABRA4) to human chromosome 4 defines an {alpha}{sub 2}-{alpha}{sub 4}-{beta}{sub 1}-{gamma}{sub 1} gene cluster: Further evidence that modern GABA{sub a} receptor gene clusters are derived from an ancestral cluster

    Energy Technology Data Exchange (ETDEWEB)

    McLean, P.J.; Farb, D.H.; Russek, S.J. [Boston Univ. School of Medicine, MA (United States)] [and others

    1995-04-10

    We demonstrated previously that an {alpha}{sub 1}-{beta}{sub 2}-{gamma}{sub 2} gene cluster of the {gamma}-aminobutyric acid (GABA{sub A}) receptor is located on human chromosome 5q34-q35 and that an ancestral {alpha}-{beta}-{gamma} gene cluster probably spawned clusters on chromosomes 4, 5, and 15. Here, we report that the {alpha}{sub 4} gene (GABRA4) maps to human chromosome 4p14-q12, defining a cluster comprising the {alpha}{sub 2}, {alpha}{sub 4}, {beta}{sub 1}, and {gamma}{sub 1} genes. The existence of an {alpha}{sub 2}-{alpha}{sub 4}-{beta}{sub 1}-{gamma}{sub 2} cluster on chromosome 4 and an {alpha}{sub 1}-{alpha}{sub 6}-{beta}{sub 2}-{gamma}{sub 2} cluster on chromosome 5 provides further evidence that the number of ancestral GABA{sub A} receptor subunit genes has been expanded by duplication within an ancestral gene cluster. Moreover, if duplication of the {alpha} gene occurred before duplication of the ancestral gene cluster, then a heretofore undiscovered subtype of a subunit should be located on human chromosome 15q11-q13 within an {alpha}{sub 5}-{alpha}{sub x}-{beta}{sub 3}-{gamma}{sub 3} gene cluster at the locus for Angelman and Prader-Willi syndromes. 34 refs., 6 figs., 1 tab.

  20. A species-specific cluster of defensin-like genes encodes diffusible pollen tube attractants in Arabidopsis.

    Directory of Open Access Journals (Sweden)

    Hidenori Takeuchi

    Full Text Available Genes directly involved in male/female and host/parasite interactions are believed to be under positive selection. The flowering plant Arabidopsis thaliana has more than 300 defensin-like (DEFL genes, which are likely to be involved in both natural immunity and cell-to-cell communication including pollen-pistil interactions. However, little is known of the relationship between the molecular evolution of DEFL genes and their functions. Here, we identified a recently evolved cluster of DEFL genes in A. thaliana and demonstrated that these DEFL (cysteine-rich peptide [CRP810_1] peptides, named AtLURE1 peptides, are pollen tube attractants guiding pollen tubes to the ovular micropyle. The AtLURE1 genes formed the sole species-specific cluster among DEFL genes compared to its close relative, A. lyrata. No evidence for positive selection was detected in AtLURE1 genes and their orthologs, implying neutral evolution of AtLURE1 genes. AtLURE1 peptides were specifically expressed in egg-accompanying synergid cells and secreted toward the funicular surface through the micropyle. Genetic analyses showed that gametophytic mutants defective in micropylar guidance (myb98, magatama3, and central cell guidance do not express AtLURE1 peptides. Downregulation of the expression of these peptides impaired precise pollen tube attraction to the micropylar opening of some populations of ovules. Recombinant AtLURE1 peptides attracted A. thaliana pollen tubes at a higher frequency compared to A. lyrata pollen tubes, suggesting that these peptides are species-preferential attractants in micropylar guidance. In support of this idea, the heterologous expression of a single AtLURE1 peptide in the synergid cell of Torenia fournieri was sufficient to guide A. thaliana pollen tubes to the T. fournieri embryo sac and to permit entry into it. Our results suggest the unique evolution of AtLURE1 genes, which are directly involved in male-female interaction among the DEFL multigene

  1. Mouse models for genes involved in impaired spermatogenesis.

    Science.gov (United States)

    O'Bryan, M K; de Kretser, D

    2006-02-01

    Since the introduction of molecular biology and gene ablation technologies there have been substantial advances in our understanding of how sperm are made and fertilization occurs. There have been at least 150 different models of specifically altered gene function produced that have resulted in male infertility spanning virtually all aspects of the spermatogenic, sperm maturation and fertilization processes. While each has, or potentially will reveal, novel aspects of these processes, there is still much of which we have little knowledge. The current review is by no means a comprehensive list of these mouse models, rather it gives an overview of the potential for such models which up to this point have generally been 'knockouts'; it presents alternative strategies for the production of new models and emphasizes the importance of thorough phenotypic analysis in order to extract a maximum amount of information from each model.

  2. Involvement of distinct PKC gene products in T cell functions

    Directory of Open Access Journals (Sweden)

    Gottfried eBaier

    2012-08-01

    Full Text Available It is well established that members of the Protein kinase C(PKC family seem to have important roles in T cells. Focusing on the physiological and non-redundant PKC functions established in primary mouse T cells via germline gene-targeting approaches, our current knowledge defines two particularly critical PKC gene products, PKCθ and PKCα, as the flavor of PKC in T cells that appear to have a positive role in signaling pathways that are necessary for full antigen receptor-mediated T cell activation ex vivo and T cell-mediated immunity in vivo. Consistently, in spite of the current dogma that PKCθ inhibition might be sufficient to achieve complete immunosuppressive effects, more recent results have indicated that the pharmacological inhibition of PKCθ, and additionally, at least PKCα, appears to be needed to provide a successful approach for the prevention of allograft rejection and treatment of autoimmune diseases.

  3. The expression of type III hyperlipoproteinemia: involvement of lipolysis genes

    Science.gov (United States)

    Henneman, Peter; van der Sman-de Beer, Femke; Moghaddam, Payman Hanifi; Huijts, Petra; Stalenhoef, Anton FH; Kastelein, John JP; van Duijn, Cornelia M; Havekes, Louis M; Frants, Rune R; van Dijk, Ko Willems; Smelt, Augustinus HM

    2009-01-01

    Type III hyperlipoproteinemia (HLP) is mainly found in homozygous apolipoprotein (APO) E2 (R158C) carriers. Genetic factors contributing to the expression of type III HLP were investigated in 113 hyper- and 52 normolipidemic E2/2 subjects, by testing for polymorphisms in APOC3, APOA5, HL (hepatic lipase) and LPL (lipoprotein lipase) genes. In addition, 188 normolipidemic Dutch control panels (NDCP) and 141 hypertriglyceridemic (HTG) patients were genotyped as well. No associations were found for four HL gene polymorphisms and two LPL gene polymorphisms and type III HLP. The frequency of the rare allele of APOC3 3238 G>C and APOA5 −1131 T>C (in linkage disequilibrium) was significantly higher in type III HLP patients when compared with normolipidemic E2/2 subjects, 15.6 vs 6.9% and 15.1 vs 5.8%, respectively, (PC polymorphism and LPL c.27 G>A mutation were higher in type III HLP patients, though not significant. Some 58% of the type III HLP patients carried either the APOA5 −1131 T>C, c.56 G>C and/or LPL c.27 G>A mutation as compared to 27% of the normolipidemic APOE2/2 subjects (odds ratio 3.7, 95% confidence interval=1.8–7.5, PC/APOA5 −1131 T>C polymorphism showed a more severe hyperlipidemia than patients without this polymorphism. Polymorphisms in lipolysis genes associate with the expression and severity of type III HLP in APOE2/2. PMID:19034316

  4. Genes involved in virulence of the entomopathogenic fungus Beauveria bassiana.

    Science.gov (United States)

    Valero-Jiménez, Claudio A; Wiegers, Harm; Zwaan, Bas J; Koenraadt, Constantianus J M; van Kan, Jan A L

    2016-01-01

    Pest insects cause severe damage to global crop production and pose a threat to human health by transmitting diseases. Traditionally, chemical pesticides (insecticides) have been used to control such pests and have proven to be effective only for a limited amount of time because of the rapid spread of genetic insecticide resistance. The basis of this resistance is mostly caused by (co)dominant mutations in single genes, which explains why insecticide use alone is an unsustainable solution. Therefore, robust solutions for insect pest control need to be sought in alternative methods such as biological control agents for which single-gene resistance is less likely to evolve. The entomopathogenic fungus Beauveria bassiana has shown potential as a biological control agent of insects, and insight into the mechanisms of virulence is essential to show the robustness of its use. With the recent availability of the whole genome sequence of B. bassiana, progress in understanding the genetics that constitute virulence toward insects can be made more quickly. In this review we divide the infection process into distinct steps and provide an overview of what is currently known about genes and mechanisms influencing virulence in B. bassiana. We also discuss the need for novel strategies and experimental methods to better understand the infection mechanisms deployed by entomopathogenic fungi. Such knowledge can help improve biocontrol agents, not only by selecting the most virulent genotypes, but also by selecting the genotypes that use combinations of virulence mechanisms for which resistance in the insect host is least likely to develop.

  5. Pathway-specific regulation revisited: cross-regulation of multiple disparate gene clusters by PAS-LuxR transcriptional regulators.

    Science.gov (United States)

    Vicente, Cláudia M; Payero, Tamara D; Santos-Aberturas, Javier; Barreales, Eva G; de Pedro, Antonio; Aparicio, Jesús F

    2015-06-01

    PAS-LuxR regulators are highly conserved proteins devoted to the control of antifungal production by binding to operators located in given promoters of polyene biosynthetic genes. The canonical operator of PimM, archetype of this class of regulators, has been used here to search for putative targets of orthologous protein PteF in the genome of Streptomyces avermitilis, finding 97 putative operators outside the pentaene filipin gene cluster (pte). The processes putatively affected included genetic information processing; energy, carbohydrate, and lipid metabolism; DNA replication and repair; morphological differentiation; secondary metabolite biosynthesis; and transcriptional regulation, among others. Seventeen of these operators were selected, and their binding to PimM DNA-binding domain was assessed by electrophoretic mobility shift assays. Strikingly, the protein bound all predicted operators suggesting a direct control over targeted processes. As a proof of concept, we studied the biosynthesis of the ATP-synthase inhibitor oligomycin whose gene cluster included two operators. Regulator mutants showed a severe loss of oligomycin production, whereas gene complementation of the mutant restored phenotype, and gene duplication in the wild-type strain boosted oligomycin production. Comparative gene expression analyses in parental and mutant strains by reverse transcription-quantitative polymerase chain reaction of selected olm genes corroborated production results. These results demonstrate that PteF is able to cross-regulate the biosynthesis of two related secondary metabolites, filipin and oligomycin, but might be extended to all the processes indicated above. This study highlights the complexity of the network of interactions in which PAS-LuxR regulators are involved and opens new possibilities for the manipulation of metabolite production in Streptomycetes.

  6. Identification of a gene cluster associated with triclosan catabolism.

    Science.gov (United States)

    Kagle, Jeanne M; Paxson, Clayton; Johnstone, Precious; Hay, Anthony G

    2015-06-01

    Aerobic degradation of bis-aryl ethers like the antimicrobial triclosan typically proceeds through oxygenase-dependent catabolic pathways. Although several studies have reported on bacteria capable of degrading triclosan aerobically, there are no reports describing the genes responsible for this process. In this study, a gene encoding the large subunit of a putative triclosan oxygenase, designated tcsA was identified in a triclosan-degrading fosmid clone from a DNA library of Sphingomonas sp. RD1. Consistent with tcsA's similarity to two-part dioxygenases, a putative FMN-dependent ferredoxin reductase, designated tcsB was found immediately downstream of tcsA. Both tcsAB were found in the midst of a putative chlorocatechol degradation operon. We show that RD1 produces hydroxytriclosan and chlorocatechols during triclosan degradation and that tcsA is induced by triclosan. This is the first study to report on the genetics of triclosan degradation.

  7. Isolation of Hox cluster genes from insects reveals an accelerated sequence evolution rate.

    Directory of Open Access Journals (Sweden)

    Heike Hadrys

    Full Text Available Among gene families it is the Hox genes and among metazoan animals it is the insects (Hexapoda that have attracted particular attention for studying the evolution of development. Surprisingly though, no Hox genes have been isolated from 26 out of 35 insect orders yet, and the existing sequences derive mainly from only two orders (61% from Hymenoptera and 22% from Diptera. We have designed insect specific primers and isolated 37 new partial homeobox sequences of Hox cluster genes (lab, pb, Hox3, ftz, Antp, Scr, abd-a, Abd-B, Dfd, and Ubx from six insect orders, which are crucial to insect phylogenetics. These new gene sequences provide a first step towards comparative Hox gene studies in insects. Furthermore, comparative distance analyses of homeobox sequences reveal a correlation between gene divergence rate and species radiation success with insects showing the highest rate of homeobox sequence evolution.

  8. Gene Expression Analysis for the Identification of Genes Involved in Early Tumour Development

    Science.gov (United States)

    Forte, Stefano; Scarpulla, Salvatore; Lagana, Alessandro; Memeo, Lorenzo; Gulisano, Massimo

    Prostatic tissues can undergo to cancer insurgence and prostate cancer is one of the most common types of malignancies affecting adult men in the United States. Primary adenocarcinoma of the seminal vesi-cles (SVCA) is a very rare neoplasm with only 48 histologically confirmed cases reported in the European and United States literature. Prostatic tissues, seminal vesicles and epididymis belongs all to the same microenvironment, shows a very close morphology and share the same embryological origin. Despite these common features the rate of cancer occurrence is very different. The understanding of molecular differences between non neoplastic prostatic tissues and non neoplastic epididymis or seminal vesicles may suggest potential mechanisms of resistance to tumour occurrence. The comparison of expression patterns of non neoplastic prostatic and seminal vesicles tissues to identify differentially expressed genes can help researchers in the identification of biological actors involved in the early stages of the tumour development.

  9. The gp63 Gene Cluster Is Highly Polymorphic in Natural Leishmania (Viannia) braziliensis Populations, but Functional Sites Are Conserved

    Science.gov (United States)

    Medina, Lilian S.; Souza, Bruno Araújo; Queiroz, Adriano; Guimarães, Luiz Henrique; Lima Machado, Paulo Roberto; M Carvalho, Edgar; Wilson, Mary Edythe; Schriefer, Albert

    2016-01-01

    GP63 or leishmanolysin is the major surface protease of Leishmania spp. involved in parasite virulence and host cell interaction. As such, GP63 is a potential target of eventual vaccines against these protozoa. In the current study we evaluate the polymorphism of gp63 in Leishmania (Viannia) braziliensis isolated from two sets of American tegumentary leishmaniasis (ATL) cases from Corte de Pedra, Brazil, including 35 cases diagnosed between 1994 and 2001 and 6 cases diagnosed between 2008 and 2011. Parasites were obtained from lesions by needle aspiration and cultivation. Genomic DNA was extracted, and 405 bp fragments, including sequences encoding the putative macrophage interacting sites, were amplified from gp63 genes of all isolates. DNA amplicons were cloned into plasmid vectors and ten clones per L. (V.) braziliensis isolate were sequenced. Alignment of cloned sequences showed extensive polymorphism among gp63 genes within, and between parasite isolates. Overall, 45 different polymorphic alleles were detected in all samples, which could be segregated into two clusters. Cluster one included 25, and cluster two included 20 such genotypes. The predicted peptides showed overall conservation below 50%. In marked contrast, the conservation at segments with putative functional domains approached 90% (Fisher’s exact test p<0.0001). These findings show that gp63 is very polymorphic even among parasites from a same endemic focus, but the functional domains interacting with the mammalian host environment are conserved. PMID:27648939

  10. The Fdb3 transcription factor of the Fusarium Detoxification of Benzoxazolinone gene cluster is required for MBOA but not BOA degradation in Fusarium pseudograminearum.

    Science.gov (United States)

    Kettle, Andrew J; Carere, Jason; Batley, Jacqueline; Manners, John M; Kazan, Kemal; Gardiner, Donald M

    2016-03-01

    A number of cereals produce the benzoxazolinone class of phytoalexins. Fusarium species pathogenic towards these hosts can typically degrade these compounds via an aminophenol intermediate, and the ability to do so is encoded by a group of genes found in the Fusarium Detoxification of Benzoxazolinone (FDB) cluster. A zinc finger transcription factor encoded by one of the FDB cluster genes (FDB3) has been proposed to regulate the expression of other genes in the cluster and hence is potentially involved in benzoxazolinone degradation. Herein we show that Fdb3 is essential for the ability of Fusarium pseudograminearum to efficiently detoxify the predominant wheat benzoxazolinone, 6-methoxy-benzoxazolin-2-one (MBOA), but not benzoxazoline-2-one (BOA). Furthermore, additional genes thought to be part of the FDB gene cluster, based upon transcriptional response to benzoxazolinones, are regulated by Fdb3. However, deletion mutants for these latter genes remain capable of benzoxazolinone degradation, suggesting that they are not essential for this process. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.

  11. The O28 Antigen Gene Clusters of Salmonella enterica subsp. enterica Serovar Dakar and Serovar Pomona Are Different

    Directory of Open Access Journals (Sweden)

    Clifford G. Clark

    2010-01-01

    Full Text Available A 10 kb O-antigen gene cluster was sequenced from a Salmonella enterica subsp. enterica Dakar O28 reference strain and from two S. Pomona serogroup O28 isolates. The two S. Pomona O antigen gene clusters showed only moderate identity with the S. Dakar O28 gene cluster, suggesting that the O antigen oligosaccharides may contain one or more sugars conferring the O28 epitope but may otherwise be different. These novel findings are absolutely critical for the correct interpretation of molecular serotyping assays targeting genes within the O antigen gene clusters of these Salmonella serotypes and suggest the possibility that the O antigen gene clusters of other Salmonella serovars may also be heterogenous.

  12. Characterization of a major cluster of nif, fix, and associated genes in a sugarcane endophyte, Acetobacter diazotrophicus.

    Science.gov (United States)

    Lee, S; Reth, A; Meletzus, D; Sevilla, M; Kennedy, C

    2000-12-01

    A major 30.5-kb cluster of nif and associated genes of Acetobacter diazotrophicus (syn. Gluconacetobacter diazotrophicus), a nitrogen-fixing endophyte of sugarcane, was sequenced and analyzed. This cluster represents the largest assembly of contiguous nif-fix and associated genes so far characterized in any diazotrophic bacterial species. Northern blots and promoter sequence analysis indicated that the genes are organized into eight transcriptional units. The overall arrangement of genes is most like that of the nif-fix cluster in Azospirillum brasilense, while the individual gene products are more similar to those in species of Rhizobiaceae or in Rhodobacter capsulatus.

  13. Cloning and Characterization of the Polyether Salinomycin Biosynthesis Gene Cluster of Streptomyces albus XM211

    OpenAIRE

    Jiang, Chunyan; Wang, Hougen; Kang, Qianjin; Jing LIU; Bai, Linquan

    2012-01-01

    Salinomycin is widely used in animal husbandry as a food additive due to its antibacterial and anticoccidial activities. However, its biosynthesis had only been studied by feeding experiments with isotope-labeled precursors. A strategy with degenerate primers based on the polyether-specific epoxidase sequences was successfully developed to clone the salinomycin gene cluster. Using this strategy, a putative epoxidase gene, slnC, was cloned from the salinomycin producer Streptomyces albus XM211...

  14. The Serratia gene cluster encoding biosynthesis of the red antibiotic, prodigiosin, shows species- and strain-dependent genome context variation

    DEFF Research Database (Denmark)

    Harris, Abigail K P; Williamson, Neil R; Slater, Holly

    2004-01-01

    The prodigiosin biosynthesis gene cluster (pig cluster) from two strains of Serratia (S. marcescens ATCC 274 and Serratia sp. ATCC 39006) has been cloned, sequenced and expressed in heterologous hosts. Sequence analysis of the respective pig clusters revealed 14 ORFs in S. marcescens ATCC 274 and...

  15. Genes Involved in Human Ribosome Biogenesis areTranscriptionally Upregulated in Colorectal Cancer

    DEFF Research Database (Denmark)

    Mansilla, Francisco; Lamy, Philippe; Ørntoft, Torben Falck

    2009-01-01

    Microarray gene expression profiling comprising 168 colorectal adenocarcinomas and 10 normal mucosas showed that over 79% of the genes involved in human ribosome biogenesis are significantly upregulated (log2>0.5, p<10-3) when compared to normal mucosa. Overexpression was independent of microsate......Microarray gene expression profiling comprising 168 colorectal adenocarcinomas and 10 normal mucosas showed that over 79% of the genes involved in human ribosome biogenesis are significantly upregulated (log2>0.5, p... of rRNA processing genes points towards a coordinated process enabling the overproduction of matured ribosomal structures....

  16. Cluster Analysis and Significance of Novel Genes Related to Molecular Classification of Glioma

    Institute of Scientific and Technical Information of China (English)

    Juxiang Chen; Yicheng Lu; Guohan Hu; Kehua Sun; Chun Luo; Meiqing Lou; Kang Ying; Yao Li

    2005-01-01

    OBJECTIVE To screen differentially expressed genes in the development of human glioma and establish a primary molecular classification of glioma based on gene expression using cDNA microarrays.METHODS Brain specimens were obtained from 18 patients with glioma, 10males and 8 females, ages 14~62 with an average age of 44.4. The total RNAs of these glioma specimens and two specimens of donated brain of normal adults were extracted. BioStarH140S microarrays (including 8,347old genes and 5,592 novel genes) were adopted and hybridized with probes which were prepared from the total RNAs. Differentially expressed genes between normal tissues and glioma tissues were assayed after scanning cDNA microarrays with ScanArray4000. Northern hybridization and in situ hybridization (ISH) were used to identify functions of novel genes. Those differentially expressed genes were studied with a Hierarchical method and molecular classification of glioma was preliminary carried out.RESULTS Among the 13,939 target genes, there were 1,200 (8.61%)differentially expressed genes, of which 395 (2.83%) were novel genes. A total of 348 genes were up-regulated and 852 genes were down-regulated in the gliomas. The results of bioinformatical analysis, Northern hybridization and ISH revealed that those novel genes were highly associated with gliomas. There were multiple genes, such as the MAP gene、cytoskeleton & matrix motility genes, etc, which were of relevance to classification by the Hierarchical method. Molecular classification of glioma using a Hierarchical cluster was in accordance with pathology and suggested a molecular process of tumorigenesis and development.CONCLUSION Multiple genes play important roles in development of glioma. cDNA microarray technology is a powerful technique in screening for differentially expressed genes between two different kinds of tissues. Further analysis of gene expression and novel genes would be helpful to understand the molecular mechanism of glioma

  17. Oxidative stress enhances the expression of sulfur assimilation genes: preliminary insights on the Enterococcus faecalis iron-sulfur cluster machinery regulation

    Science.gov (United States)

    Riboldi, Gustavo Pelicioli; Bierhals, Christine Garcia; de Mattos, Eduardo Preusser; Frazzon, Ana Paula Guedes; d‘Azevedo, Pedro Alves; Frazzon, Jeverson

    2014-01-01

    The Firmicutes bacteria participate extensively in virulence and pathological processes. Enterococcus faecalis is a commensal microorganism; however, it is also a pathogenic bacterium mainly associated with nosocomial infections in immunocompromised patients. Iron-sulfur [Fe-S] clusters are inorganic prosthetic groups involved in diverse biological processes, whose in vivo formation requires several specific protein machineries. Escherichia coli is one of the most frequently studied microorganisms regarding [Fe-S] cluster biogenesis and encodes the iron-sulfur cluster and sulfur assimilation systems. In Firmicutes species, a unique operon composed of the sufCDSUB genes is responsible for [Fe-S] cluster biogenesis. The aim of this study was to investigate the potential of the E. faecalis sufCDSUB system in the [Fe-S] cluster assembly using oxidative stress and iron depletion as adverse growth conditions. Quantitative real-time polymerase chain reaction demonstrated, for the first time, that Gram-positive bacteria possess an OxyR component responsive to oxidative stress conditions, as fully described for E. coli models. Likewise, strong expression of the sufCDSUB genes was observed in low concentrations of hydrogen peroxide, indicating that the lowest concentration of oxygen free radicals inside cells, known to be highly damaging to [Fe-S] clusters, is sufficient to trigger the transcriptional machinery for prompt replacement of [Fe-S] clusters. PMID:24936909

  18. WRKY Transcription Factors Involved in Activation of SA Biosynthesis Genes

    Directory of Open Access Journals (Sweden)

    Bol John F

    2011-05-01

    Full Text Available Abstract Background Increased defense against a variety of pathogens in plants is achieved through activation of a mechanism known as systemic acquired resistance (SAR. The broad-spectrum resistance brought about by SAR is mediated through salicylic acid (SA. An important step in SA biosynthesis in Arabidopsis is the conversion of chorismate to isochorismate through the action of isochorismate synthase, encoded by the ICS1 gene. Also AVRPPHB SUSCEPTIBLE 3 (PBS3 plays an important role in SA metabolism, as pbs3 mutants accumulate drastically reduced levels of SA-glucoside, a putative storage form of SA. Bioinformatics analysis previously performed by us identified WRKY28 and WRKY46 as possible regulators of ICS1 and PBS3. Results Expression studies with ICS1 promoter::β-glucuronidase (GUS genes in Arabidopsis thaliana protoplasts cotransfected with 35S::WRKY28 showed that over expression of WRKY28 resulted in a strong increase in GUS expression. Moreover, qRT-PCR analyses indicated that the endogenous ICS1 and PBS3 genes were highly expressed in protoplasts overexpressing WRKY28 or WRKY46, respectively. Electrophoretic mobility shift assays indentified potential WRKY28 binding sites in the ICS1 promoter, positioned -445 and -460 base pairs upstream of the transcription start site. Mutation of these sites in protoplast transactivation assays showed that these binding sites are functionally important for activation of the ICS1 promoter. Chromatin immunoprecipitation assays with haemagglutinin-epitope-tagged WRKY28 showed that the region of the ICS1 promoter containing the binding sites at -445 and -460 was highly enriched in the immunoprecipitated DNA. Conclusions The results obtained here confirm results from our multiple microarray co-expression analyses indicating that WRKY28 and WRKY46 are transcriptional activators of ICS1 and PBS3, respectively, and support this in silico screening as a powerful tool for identifying new components of stress

  19. The WSB1 gene is involved in pancreatic cancer progression.

    Directory of Open Access Journals (Sweden)

    Cendrine Archange

    Full Text Available BACKGROUND: Pancreatic cancer cells generate metastases because they can survive the stress imposed by the new environment of the host tissue. To mimic this process, pancreatic cancer cells which are not stressed in standard culture conditions are injected into nude mice. Because they develop xenografts, they should have developed adequate stress response. Characterizing that response might provide new strategies to interfere with pancreatic cancer metastasis. METHODOLOGY/PRINCIPAL FINDINGS: In the human pancreatic cancer cell lines Panc-1, Mia-PaCa2, Capan-1, Capan-2 and BxPC3, we used Affymetrix DNA microarrays to compare the expressions of 22.000 genes in vitro and in the corresponding xenografts. We identified 228 genes overexpressed in xenografts and characterized the implication of one of them, WSB1, in the control of apoptosis and cell proliferation. WSB1 generates 3 alternatively spliced transcripts encoding distinct protein isoforms. In xenografts and in human pancreatic tumors, global expression of WSB1 mRNA is modestly increased whereas isoform 3 is strongly overexpressed and isoforms 1 and 2 are down-regulated. Treating Mia-PaCa2 cells with stress-inducing agents induced similar changes. Whereas retrovirus-forced expression of WSB1 isoforms 1 and 2 promoted cell growth and sensitized the cells to gemcitabine- and doxorubicin-induced apoptosis, WSB1 isoform 3 expression reduced cell proliferation and enhanced resistance to apoptosis, showing that stress-induced modulation of WSB1 alternative splicing increases resistance to apoptosis of pancreatic cancer cells. CONCLUSIONS/SIGNIFICANCE: Data on WSB1 regulation support the hypothesis that activation of stress-response mechanisms helps cancer cells establishing metastases and suggest relevance to cancer development of other genes overexpressed in xenografts.

  20. Identification of genes involved in radioresistance of nasopharyngeal carcinoma by integrating gene ontology and protein-protein interaction networks.

    Science.gov (United States)

    Guo, Ya; Zhu, Xiao-Dong; Qu, Song; Li, Ling; Su, Fang; Li, Ye; Huang, Shi-Ting; Li, Dan-Rong

    2012-01-01

    Radioresistance remains one of the important factors in relapse and metastasis of nasopharyngeal carcinoma. Thus, it is imperative to identify genes involved in radioresistance and explore the underlying biological processes in the development of radioresistance. In this study, we used cDNA microarrays to select differential genes between radioresistant CNE-2R and parental CNE-2 cell lines. One hundred and eighty-three significantly differentially expressed genes (pgenes were upregulated and 45 genes were downregulated in CNE-2R. We further employed publicly available bioinformatics related software, such as GOEAST and STRING to examine the relationship among differentially expressed genes. The results show that these genes were involved in type I interferon-mediated signaling pathway biological processes; the nodes tended to have high connectivity with the EGFR pathway, IFN-related pathways, NF-κB. The node STAT1 has high connectivity with other nodes in the protein-protein interaction (PPI) networks. Finally, the reliability of microarray data was validated for selected genes by semi-quantitative RT-PCR and Western blotting. The results were consistent with the microarray data. Our study suggests that microarrays combined with gene ontology and protein interaction networks have great value in the identification of genes of radioresistance in nasopharyngeal carcinoma; genes involved in several biological processes and protein interaction networks may be relevant to NPC radioresistance; in particular, the verified genes CCL5, STAT1-α, STAT2 and GSTP1 may become potential biomarkers for predicting NPC response to radiotherapy.

  1. Accurate prediction of secondary metabolite gene clusters in filamentous fungi

    DEFF Research Database (Denmark)

    Andersen, Mikael Rørdam; Nielsen, Jakob Blæsbjerg; Klitgaard, Andreas

    2013-01-01

    Biosynthetic pathways of secondary metabolites from fungi are currently subject to an intense effort to elucidate the genetic basis for these compounds due to their large potential within pharmaceutics and synthetic biochemistry. The preferred method is methodical gene deletions to identify suppo...... used A. nidulans for our method development and validation due to the wealth of available biochemical data, but the method can be applied to any fungus with a sequenced and assembled genome, thus supporting further secondary metabolite pathway elucidation in the fungal kingdom....

  2. Evidence suggesting possible SCA1 gene involvement in schizophrenia

    Energy Technology Data Exchange (ETDEWEB)

    Diehl, S.R.; Wange, S.; Sun, C. [NIDR, Bethesda, MD (United States)] [and others

    1994-09-01

    Several findings suggest a possible role for the SCA1 gene on chromosome 6p in some cases of schizophrenia. First, linkage analyses in Irish pedigrees provided LOD scores up to 3.0 for one model tested using microsatellites closely linked to SCA1. Reanalysis of these data using affected sibpair methods yielded a significant result (p = 0.01) for one marker. An attempt to replicate this linkage finding was made using 44 NIMH families (206 individuals, 80 affected) and 12 Utah families (120 individuals, 49 affected). LOD scores were negative in these new families, even allowing for heterogeneity, as were results using affected sibpair methods. However, one Utah family provided a LOD score of 1.3. We also screened the SCA1 trinucleotide repeat to search for expansions characteristic of this disorder in these families and in 38 additional unrelated schizophrenics. We found 1 schizophrenic with 41 repeats, which is substantially larger than the maximum size of 36 repeats observed in previous studies of several hundred controls. We are now assessing whether the distribution of SCA1 repeats differs significantly in schizophrenia versus controls. Recent reports suggest possible anticipation in schizophrenia (also characteristic of SCA1) and a few cases of psychiatric symptoms suggesting schizophrenia have been observed in the highly related disorder DRPLA (SCA2), which is also based on trinucleotide repeat expansion. These findings suggest that further investigations of this gene and chromosome region may be a priority.

  3. GenClust: A genetic algorithm for clustering gene expression data

    Directory of Open Access Journals (Sweden)

    Raimondi Alessandra

    2005-12-01

    Full Text Available Abstract Background Clustering is a key step in the analysis of gene expression data, and in fact, many classical clustering algorithms are used, or more innovative ones have been designed and validated for the task. Despite the widespread use of artificial intelligence techniques in bioinformatics and, more generally, data analysis, there are very few clustering algorithms based on the genetic paradigm, yet that paradigm has great potential in finding good heuristic solutions to a difficult optimization problem such as clustering. Results GenClust is a new genetic algorithm for clustering gene expression data. It has two key features: (a a novel coding of the search space that is simple, compact and easy to update; (b it can be used naturally in conjunction with data driven internal validation methods. We have experimented with the FOM methodology, specifically conceived for validating clusters of gene expression data. The validity of GenClust has been assessed experimentally on real data sets, both with the use of validation measures and in comparison with other algorithms, i.e., Average Link, Cast, Click and K-means. Conclusion Experiments show that none of the algorithms we have used is markedly superior to the others across data sets and validation measures; i.e., in many cases the observed differences between the worst and best performing algorithm may be statistically insignificant and they could be considered equivalent. However, there are cases in which an algorithm may be better than others and therefore worthwhile. In particular, experiments for GenClust show that, although simple in its data representation, it converges very rapidly to a local optimum and that its ability to identify meaningful clusters is comparable, and sometimes superior, to that of more sophisticated algorithms. In addition, it is well suited for use in conjunction with data driven internal validation measures and, in particular, the FOM methodology.

  4. GenClust: a genetic algorithm for clustering gene expression data.

    Science.gov (United States)

    Di Gesú, Vito; Giancarlo, Raffaele; Lo Bosco, Giosué; Raimondi, Alessandra; Scaturro, Davide

    2005-12-07

    Clustering is a key step in the analysis of gene expression data, and in fact, many classical clustering algorithms are used, or more innovative ones have been designed and validated for the task. Despite the widespread use of artificial intelligence techniques in bioinformatics and, more generally, data analysis, there are very few clustering algorithms based on the genetic paradigm, yet that paradigm has great potential in finding good heuristic solutions to a difficult optimization problem such as clustering. GenClust is a new genetic algorithm for clustering gene expression data. It has two key features: (a) a novel coding of the search space that is simple, compact and easy to update; (b) it can be used naturally in conjunction with data driven internal validation methods. We have experimented with the FOM methodology, specifically conceived for validating clusters of gene expression data. The validity of GenClust has been assessed experimentally on real data sets, both with the use of validation measures and in comparison with other algorithms, i.e., Average Link, Cast, Click and K-means. Experiments show that none of the algorithms we have used is markedly superior to the others across data sets and validation measures; i.e., in many cases the observed differences between the worst and best performing algorithm may be statistically insignificant and they could be considered equivalent. However, there are cases in which an algorithm may be better than others and therefore worthwhile. In particular, experiments for GenClust show that, although simple in its data representation, it converges very rapidly to a local optimum and that its ability to identify meaningful clusters is comparable, and sometimes superior, to that of more sophisticated algorithms. In addition, it is well suited for use in conjunction with data driven internal validation measures and, in particular, the FOM methodology.

  5. antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters

    DEFF Research Database (Denmark)

    Weber, Tilmann; Blin, Kai; Duddela, Srikanth

    2015-01-01

    Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we...... introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration...... of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products...

  6. Identification and Characterization of a Gene Cluster Mediating Enteroaggregative Escherichia Coli Aggregative Adherence Fimbria I Biogenesis

    Science.gov (United States)

    1994-08-01

    adherent E. coli ( DAEC ). respectively. The LA ties to other known fimbrial biogenesis systems of pathogenic pattern is typified by the formation of...agg gene cluster is configured similarly to 60 to 80% of DAEC strains share relatedness with F1845 the determinants of members of the Dr adhesin

  7. Evolutionary history of the phl gene cluster in the plant-associated bacterium Pseudomonas fluorescens

    NARCIS (Netherlands)

    Moynihan, J.A.; Morrissey, J.P.; Coppoolse, E.; Stiekema, W.J.; O'Gara, F.; Boyd, E.F.

    2009-01-01

    Pseudomonas fluorescens is of agricultural and economic importance as a biological control agent largely because of its plant-association and production of secondary metabolites, in particular 2, 4-diacetylphloroglucinol (2, 4-DAPG). This polyketide, which is encoded by the eight gene phl cluster,

  8. Design-based re-engineering of biosynthetic gene clusters : plug-and-play in practice

    NARCIS (Netherlands)

    Frasch, Hans-Jörg; Medema, Marnix H.; Takano, Eriko; Breitling, Rainer; Gago, Federico; Parayil, Ajikumar

    2013-01-01

    Synthetic biology is revolutionizing the way in which the biosphere is explored for natural products. Through computational genome mining, thousands of biosynthetic gene clusters are being identified in microbial genomes, which constitute a rich source of potential novel pharmaceuticals. New methods

  9. SATB1 regulates {beta}-like globin genes through matrix related nuclear relocation of the cluster

    Energy Technology Data Exchange (ETDEWEB)

    Gong, Huan; Wang, Zhao; Zhao, Guo-wei; Lv, Xiang; Wei, Gong-hong; Wang, Li [National Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences (CAMS) and Peking Union Medical College (PUMC), 5 Dong Dan San Tiao, Beijing 100005 (China); Liu, De-pei, E-mail: liudp@pumc.edu.cn [National Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences (CAMS) and Peking Union Medical College (PUMC), 5 Dong Dan San Tiao, Beijing 100005 (China); Liang, Chih-chuan [National Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences (CAMS) and Peking Union Medical College (PUMC), 5 Dong Dan San Tiao, Beijing 100005 (China)

    2009-05-22

    The nuclear location and relocation of genes play crucial regulatory roles in gene expression. SATB1, a MAR-binding protein, has been found to regulate {beta}-like globin genes through chromatin remodeling. In this study, we generated K562 cells over-expressing wild-type or nuclear matrix targeting sequences (NMTS)-deficient SATB1 and found that like wild-type SATB1, NMTS-deficient SATB1 induces out loop of {beta}-globin cluster from its chromosome territory (CT), while it is unable to associate the cluster with the nuclear matrix as wild-type SATB1 does and had no regulatory functions to the {beta}-globin cluster. Besides, our data showed that the transacting factor occupancies and chromatin modifications at {beta}-globin cluster were differentially affected by wild-type and NMTS-deficient SATB1. These results indicate that SATB1 regulates {beta}-like globin genes at the nuclear level interlaced with chromatin and DNA level, and emphasize the nuclear matrix binding activity of SATB1 to its regulatory function.

  10. Design-based re-engineering of biosynthetic gene clusters : plug-and-play in practice

    NARCIS (Netherlands)

    Frasch, Hans-Jörg; Medema, Marnix H.; Takano, Eriko; Breitling, Rainer; Gago, Federico; Parayil, Ajikumar

    2013-01-01

    Synthetic biology is revolutionizing the way in which the biosphere is explored for natural products. Through computational genome mining, thousands of biosynthetic gene clusters are being identified in microbial genomes, which constitute a rich source of potential novel pharmaceuticals. New methods

  11. Gene ontology study of methyl jasmonate-treated and non-treated hairy roots of Panax ginseng to identify genes involved in secondary metabolic pathway.

    Science.gov (United States)

    Sathiyamoorthy, S; In, J G; Gayathri, S; Kim, Y Ju; Yang, D Ch

    2010-07-01

    The roots of Panax ginseng C.A. Meyer, known as Korean ginseng have been a valuable and important folk medicine in East Asian countries. It mainly used to maintain the homeostasis of the human body, with the presence ofginsenosides and non-saponin compounds like phenol compounds, acidic polysaccharides and polyethylene compounds. Functional genomics aid to annotate based on gene ontology. In this study, we focused on the genes involving in secondary metabolic pathways and to visualize temporal changes of gene expression in ginseng hairy roots with methyl ester methyl jasmonate (MeJA) along with non-treated hairy roots. A 5.774 EST clones were clustered and assembled as 501 contigs and 2.955 singletons. Annotations categorized with molecular functions, biological processes, cellular compounds of gene ontological terms and biochemical functions, enzyme commission to sequences were assigned to metabolic pathways of Kyoto Encyclopedia of Genes and Genomes database. Comparatively, EST sequences are assigned to cellular process, metabolic process, biotic and abiotic stress stimuli, developmental and biological regulations and transports are up-regulated 2-3 fold in MeJA treated hairy roots. 46 different sub groups of enzymes found in the MeJA treated plants. These annotated ESTs represents a significant proportion of the P. ginseng and provides molecular resource for developmental of microarrays for gene expression studies concerning development, metabolism and reproduction.

  12. Interrogating the function of metazoan histones using engineered gene clusters.

    Science.gov (United States)

    McKay, Daniel J; Klusza, Stephen; Penke, Taylor J R; Meers, Michael P; Curry, Kaitlin P; McDaniel, Stephen L; Malek, Pamela Y; Cooper, Stephen W; Tatomer, Deirdre C; Lieb, Jason D; Strahl, Brian D; Duronio, Robert J; Matera, A Gregory

    2015-02-09

    Histones and their posttranslational modifications influence the regulation of many DNA-dependent processes. Although an essential role for histone-modifying enzymes in these processes is well established, defining the specific contribution of individual histone residues remains a challenge because many histone-modifying enzymes have nonhistone targets. This challenge is exacerbated by the paucity of suitable approaches to genetically engineer histone genes in metazoans. Here, we describe a platform in Drosophila for generating and analyzing any desired histone genotype, and we use it to test the in vivo function of three histone residues. We demonstrate that H4K20 is neither essential for DNA replication nor for completion of development, unlike inferences drawn from analyses of H4K20 methyltransferases. We also show that H3K36 is required for viability and H3K27 is essential for maintenance of cellular identity but not for gene activation. These findings highlight the power of engineering histones to interrogate genome structure and function in animals. Copyright © 2015 Elsevier Inc. All rights reserved.

  13. Clustered array of ochratoxin A biosynthetic genes in Aspergillus steynii and their expression patterns in permissive conditions.

    Science.gov (United States)

    Gil-Serna, Jessica; Vázquez, Covadonga; González-Jaén, María Teresa; Patiño, Belén

    2015-12-01

    Aspergillus steynii is probably the most relevant species of section Circumdati producing ochratoxin A (OTA). This mycotoxin contaminates a wide number of commodities and it is highly toxic for humans and animals. Little is known on the biosynthetic genes and their regulation in Aspergillus species. In this work, we identified and analysed three contiguous genes in A. steynii using 5'-RACE and genome walking approaches which predicted a cytochrome P450 monooxygenase (p450ste), a non-ribosomal peptide synthetase (nrpsste) and a polyketide synthase (pksste). These three genes were contiguous within a 20742 bp long genomic DNA fragment. Their corresponding cDNA were sequenced and their expression was analysed in three A. steynii strains using real time RT-PCR specific assays in permissive conditions in in vitro cultures. OTA was also analysed in these cultures. Comparative analyses of predicted genomic, cDNA and amino acid sequences were performed with sequences of similar gene functions. All the results obtained in these analyses were consistent and point out the involvement of these three genes in OTA biosynthesis by A. steynii and showed a co-ordinated expression pattern. This is the first time that a clustered organization OTA biosynthetic genes has been reported in Aspergillus genus. The results also suggested that this situation might be common in Aspergillus OTA-producing species and distinct to the one described for Penicillium species.

  14. Characterization of genes involved in ceramide metabolism in the Pacific oyster (Crassostrea gigas

    Directory of Open Access Journals (Sweden)

    Timmins-Schiffman Emma

    2012-09-01

    Full Text Available Abstract Background The lipid signaling molecule, ceramide, is a key component of the vertebrate stress response, however, there is limited information concerning its role in invertebrate species. In order to identify genes involved in ceramide metabolism in bivalve molluscs, Pacific oyster genomic resources were examined for genes associated with ceramide metabolism and signaling. Results Several genes were identified including full-length sequences characterized for serine palmitoyltransferase-1, 3-ketodihydrosphingosine reductase, acid ceramidase, and ceramide glucosyltransferase. Genes involved in ceramide synthesis and metabolism are conserved across taxa in both form and function. Expression analysis as assessed by quantitative PCR indicated all genes were expressed at high levels in gill tissue. The role of the ceramide pathway genes in the invertebrate stress response was also explored by measuring expression levels in adult oysters exposed to Vibrio vulnificus. Two genes demonstrated increased expression during the bacterial challenge: a gene involved in hydrolytic breakdown of ceramide (acid ceramidase and a gene involved in de novo generation of ceramide (3-ketodihydrosphingosine reductase, suggesting a possible role of ceramide in the invertebrate stress and immune responses. Conclusions In silico and laboratory results support that Pacific oysters have the basic components of the ceramide metabolism pathway. These results also indicate that ceramide may have analogous functions in vertebrates and invertebrates. The gene expression pattern of acid ceramidase and 3-kethodihydrosphingosine reductase in response to bacterial exposure especially supports that ceramide and sphingolipid metabolism may be involved in the oyster’s stress and/or immune responses.

  15. Characterization of genes involved in ceramide metabolism in the Pacific oyster (Crassostrea gigas).

    Science.gov (United States)

    Timmins-Schiffman, Emma; Roberts, Steven

    2012-09-13

    The lipid signaling molecule, ceramide, is a key component of the vertebrate stress response, however, there is limited information concerning its role in invertebrate species. In order to identify genes involved in ceramide metabolism in bivalve molluscs, Pacific oyster genomic resources were examined for genes associated with ceramide metabolism and signaling. Several genes were identified including full-length sequences characterized for serine palmitoyltransferase-1, 3-ketodihydrosphingosine reductase, acid ceramidase, and ceramide glucosyltransferase. Genes involved in ceramide synthesis and metabolism are conserved across taxa in both form and function. Expression analysis as assessed by quantitative PCR indicated all genes were expressed at high levels in gill tissue. The role of the ceramide pathway genes in the invertebrate stress response was also explored by measuring expression levels in adult oysters exposed to Vibrio vulnificus. Two genes demonstrated increased expression during the bacterial challenge: a gene involved in hydrolytic breakdown of ceramide (acid ceramidase) and a gene involved in de novo generation of ceramide (3-ketodihydrosphingosine reductase), suggesting a possible role of ceramide in the invertebrate stress and immune responses. In silico and laboratory results support that Pacific oysters have the basic components of the ceramide metabolism pathway. These results also indicate that ceramide may have analogous functions in vertebrates and invertebrates. The gene expression pattern of acid ceramidase and 3-kethodihydrosphingosine reductase in response to bacterial exposure especially supports that ceramide and sphingolipid metabolism may be involved in the oyster's stress and/or immune responses.

  16. Characterization and biological role of the O-polysaccharide gene cluster of Yersinia enterocolitica serotype O : 9

    DEFF Research Database (Denmark)

    Skurnik, Mikael; Biedzka-Sarek, Marta; Lubeck, Peter S.

    2007-01-01

    as an attachment site for both the outer core (OC) hexasaccharide and the O-polysaccharide (OPS; a homopolymer of N-formylperosamine). In this work, we cloned the OPS gene cluster of O:9 and identified 12 genes organized into four operons upstream of the gnd gene. Ten genes were predicted to encode...... glycosyltransferases, the ATP-binding cassette polysaccharide translocators, or enzymes required for the biosynthesis of GDP-N-formylperosamine. The two remaining genes within the OPS gene cluster, galF and galU, were not ascribed a clear function in OPS biosynthesis; however, the latter gene appeared to be essential...

  17. Identification, characterization and metagenome analysis of oocyte-specific genes organized in clusters in the mouse genome

    Directory of Open Access Journals (Sweden)

    Vaiman Daniel

    2005-05-01

    Full Text Available Abstract Background Genes specifically expressed in the oocyte play key roles in oogenesis, ovarian folliculogenesis, fertilization and/or early embryonic development. In an attempt to identify novel oocyte-specific genes in the mouse, we have used an in silico subtraction methodology, and we have focused our attention on genes that are organized in genomic clusters. Results In the present work, five clusters have been studied: a cluster of thirteen genes characterized by an F-box domain localized on chromosome 9, a cluster of six genes related to T-cell leukaemia/lymphoma protein 1 (Tcl1 on chromosome 12, a cluster composed of a SPErm-associated glutamate (E-Rich (Speer protein expressed in the oocyte in the vicinity of four unknown genes specifically expressed in the testis on chromosome 14, a cluster composed of the oocyte secreted protein-1 (Oosp-1 gene and two Oosp-related genes on chromosome 19, all three being characterized by a partial N-terminal zona pellucida-like domain, and another small cluster of two genes on chromosome 19 as well, composed of a TWIK-Related spinal cord K+ channel encoding-gene, and an unknown gene predicted in silico to be testis-specific. The specificity of expression was confirmed by RT-PCR and in situ hybridization for eight and five of them, respectively. Finally, we showed by comparing all of the isolated and clustered oocyte-specific genes identified so far in the mouse genome, that the oocyte-specific clusters are significantly closer to telomeres than isolated oocyte-specific genes are. Conclusion We have studied five clusters of genes specifically expressed in female, some of them being also expressed in male germ-cells. Moreover, contrarily to non-clustered oocyte-specific genes, those that are organized in clusters tend to map near chromosome ends, suggesting that this specific near-telomere position of oocyte-clusters in rodents could constitute an evolutionary advantage. Understanding the biological

  18. Identification and validation of genes involved in gastric tumorigenesis

    Directory of Open Access Journals (Sweden)

    Shirley Sundersingh

    2010-11-01

    Full Text Available Abstract Background Gastric cancer is one of the common cancers seen in south India. Unfortunately more than 90% are advanced by the time they report to a tertiary centre in the country. There is an urgent need to characterize these cancers and try to identify potential biomarkers and novel therapeutic targets. Materials and methods We used 24 gastric cancers, 20 Paired normal (PN and 5 apparently normal gastric tissues obtained from patients with non-gastric cancers (Apparently normal - AN for the microarray study followed by validation of the significant genes (n = 63 by relative quantitation using Taqman Low Density Array Real Time PCR. We then used a custom made Quantibody protein array to validate the expression of 15 proteins in gastric tissues (4 AN, 9 PN and 9 gastric cancers. The same array format was used to study the plasma levels of these proteins in 58 patients with gastric cancers and 18 from patients with normal/non-malignant gastric conditions. Results Seventeen genes (ASPN, CCL15/MIP-1δ, MMP3, SPON2, PRSS2, CCL3, TMEPAI/PMEPAI, SIX3, MFNG, SOSTDC1, SGNE1, SST, IGHA1, AKR1B10, FCGBP, ATP4B, NCAPH2 were shown to be differentially expressed between the tumours and the paired normal, for the first time. EpCAM (p = 0.0001, IL8 (p = 0.0003, CCL4/MIP-1β (p = 0.0026, CCL20/MIP-3α (p = 0.039 and TIMP1 (p = 0.0017 tissue protein levels were significantly different (Mann Whitney U test between tumours versus AN & PN. In addition, median plasma levels of IL8, CXCL9/MIG, CCL3/MIP-1α, CCL20/MIP-3α, PDGFR-B and TIMP1 proteins were significantly different between the non-malignant group and the gastric cancer group. The post-surgical levels of EpCAM, IGFBP3, IL8, CXCL10/IP10, CXCL9/MIG, CCL3/MIP-1α, CCL20/MIP-3α, SPP1/OPN and PDGFR-B showed a uniform drop in all the samples studied. Conclusions Our study has identified several genes differentially expressed in gastric cancers, some for the first time. Some of these have been confirmed at

  19. Morphogenesis of the C. elegans Intestine Involves Axon Guidance Genes.

    Science.gov (United States)

    Asan, Alparsan; Raiders, Stephan A; Priess, James R

    2016-04-01

    Genetic and molecular studies have provided considerable insight into how various tissue progenitors are specified in early embryogenesis, but much less is known about how those progenitors create three-dimensional tissues and organs. The C. elegans intestine provides a simple system for studying how a single progenitor, the E blastomere, builds an epithelial tube of 20 cells. As the E descendants divide, they form a primordium that transitions between different shapes over time. We used cell contours, traced from confocal optical z-stacks, to build a 3D graphic reconstruction of intestine development. The reconstruction revealed several new aspects of morphogenesis that extend and clarify previous observations. The first 8 E descendants form a plane of four right cells and four left cells; the plane arises through oriented cell divisions and VANG-1/Van Gogh-dependent repositioning of any non-planar cells. LIN-12/Notch signaling affects the left cells in the E8 primordium, and initiates later asymmetry in cell packing. The next few stages involve cell repositioning and intercalation events that shuttle cells to their final positions, like shifting blocks in a Rubik's cube. Repositioning involves breaking and replacing specific adhesive contacts, and some of these events involve EFN-4/Ephrin, MAB-20/semaphorin-2a, and SAX-3/Robo. Once cells in the primordium align along a common axis and in the correct order, cells at the anterior end rotate clockwise around the axis of the intestine. The anterior rotation appears to align segments of the developing lumen into a continuous structure, and requires the secreted ligand UNC-6/netrin, the receptor UNC-40/DCC, and an interacting protein called MADD-2. Previous studies showed that rotation requires a second round of LIN-12/Notch signaling in cells on the right side of the primordium, and we show that MADD-2-GFP appears to be downregulated in those cells.

  20. Morphogenesis of the C. elegans Intestine Involves Axon Guidance Genes.

    Directory of Open Access Journals (Sweden)

    Alparsan Asan

    2016-04-01

    Full Text Available Genetic and molecular studies have provided considerable insight into how various tissue progenitors are specified in early embryogenesis, but much less is known about how those progenitors create three-dimensional tissues and organs. The C. elegans intestine provides a simple system for studying how a single progenitor, the E blastomere, builds an epithelial tube of 20 cells. As the E descendants divide, they form a primordium that transitions between different shapes over time. We used cell contours, traced from confocal optical z-stacks, to build a 3D graphic reconstruction of intestine development. The reconstruction revealed several new aspects of morphogenesis that extend and clarify previous observations. The first 8 E descendants form a plane of four right cells and four left cells; the plane arises through oriented cell divisions and VANG-1/Van Gogh-dependent repositioning of any non-planar cells. LIN-12/Notch signaling affects the left cells in the E8 primordium, and initiates later asymmetry in cell packing. The next few stages involve cell repositioning and intercalation events that shuttle cells to their final positions, like shifting blocks in a Rubik's cube. Repositioning involves breaking and replacing specific adhesive contacts, and some of these events involve EFN-4/Ephrin, MAB-20/semaphorin-2a, and SAX-3/Robo. Once cells in the primordium align along a common axis and in the correct order, cells at the anterior end rotate clockwise around the axis of the intestine. The anterior rotation appears to align segments of the developing lumen into a continuous structure, and requires the secreted ligand UNC-6/netrin, the receptor UNC-40/DCC, and an interacting protein called MADD-2. Previous studies showed that rotation requires a second round of LIN-12/Notch signaling in cells on the right side of the primordium, and we show that MADD-2-GFP appears to be downregulated in those cells.

  1. Enzymes and Genes Involved in Aerobic Alkane Degradation

    Directory of Open Access Journals (Sweden)

    Zongze eShao

    2013-05-01

    Full Text Available Alkanes are major constituents of crude oil. They are also present at low concentrations in diverse non-contaminated because many living organisms produce them as chemo-attractants or as protecting agents against water loss. Alkane degradation is a widespread phenomenon in nature. The numerous microorganisms, both prokaryotic and eukaryotic, capable of utilizing alkanes as a carbon and energy source, have been isolated and characterized. This review summarizes the current knowledge of how bacteria metabolize alkanes aerobically, with a particular emphasis on the oxidation of long-chain alkanes, including factors that are responsible for chemotaxis to alkanes , transport across cell membrane of alkanes , the regulation of alkane degradation gene and initial oxidation.

  2. Involvement of calcitonin gene-related peptide in migraine

    DEFF Research Database (Denmark)

    Lassen, L H; Jacobsen, V B; Haderslev, P A

    2008-01-01

    mug/min) or placebo for 20 min was studied in 12 patients with migraine without aura outside attacks. Xenon-133 inhalation SPECT-determined regional cerebral blood flow (rCBF) and transcranial Doppler (TCD)-determined blood velocity (V (mean)) in the middle cerebral artery (MCA), as well as the heart......Calcitonin gene-related peptide (CGRP)-containing nerves are closely associated with cranial blood vessels. CGRP is the most potent vasodilator known in isolated cerebral blood vessels. CGRP can induce migraine attacks, and two selective CGRP receptor antagonists are effective in the treatment...... of migraine attacks. It is therefore important to investigate its mechanism of action in patients with migraine. We here investigate the effects of intravenous human alpha-CGRP (halphaCGRP) on intracranial hemodynamics. In a double-blind, cross-over study, the effect of intravenous infusion of halphaCGRP (2...

  3. Genes involved in long-chain alkene biosynthesis in Micrococcus luteus

    Energy Technology Data Exchange (ETDEWEB)

    Beller, Harry R.; Goh, Ee-Been; Keasling, Jay D.

    2010-01-07

    Aliphatic hydrocarbons are highly appealing targets for advanced cellulosic biofuels, as they are already predominant components of petroleum-based gasoline and diesel fuels. We have studied alkene biosynthesis in Micrococcus luteus ATCC 4698, a close relative of Sarcina lutea (now Kocuria rhizophila), which four decades ago was reported to biosynthesize iso- and anteiso branched, long-chain alkenes. The underlying biochemistry and genetics of alkene biosynthesis were not elucidated in those studies. We show here that heterologous expression of a three-gene cluster from M. luteus (Mlut_13230-13250) in a fatty-acid overproducing E. coli strain resulted in production of long-chain alkenes, predominantly 27:3 and 29:3 (no. carbon atoms: no. C=C bonds). Heterologous expression of Mlut_13230 (oleA) alone produced no long-chain alkenes but unsaturated aliphatic monoketones, predominantly 27:2, and in vitro studies with the purified Mlut_13230 protein and tetradecanoyl-CoA produced the same C27 monoketone. Gas chromatography-time of flight mass spectrometry confirmed the elemental composition of all detected long-chain alkenes and monoketones (putative intermediates of alkene biosynthesis). Negative controls demonstrated that the M. luteus genes were responsible for production of these metabolites. Studies with wild-type M. luteus showed that the transcript copy number of Mlut_13230-13250 and the concentrations of 29:1 alkene isomers (the dominant alkenes produced by this strain) generally corresponded with bacterial population over time. We propose a metabolic pathway for alkene biosynthesis starting with acyl-CoA (or -ACP) thioesters and involving decarboxylative Claisen condensation as a key step, which we believe is catalyzed by OleA. Such activity is consistent with our data and with the homology (including the conserved Cys-His-Asn catalytic triad) of Mlut_13230 (OleA) to FabH (?-ketoacyl-ACP synthase III), which catalyzes decarboxylative Claisen condensation during

  4. Birth, death and horizontal transfer of the fumonisin biosynthetic gene cluster during the evolutionary diversification of Fusarium

    Science.gov (United States)

    In fungi, genes required for synthesis of secondary metabolites are often clustered. The FUM gene cluster is required for synthesis of a family of toxic secondary metabolites, fumonisins, produced by species of Fusarium in the Gibberella fujikuroi species complex (GFSC). Fumonisins are a health and ...

  5. The impact of polyploidy on the evolution of a complex NB-LRR resistance gene cluster in soybean

    Science.gov (United States)

    A comparative genomics approach was used to investigate the evolution of a complex NB-LRR gene cluster found in soybean (Glycine max), common bean (Phaseolus vulgaris), and other legumes. In soybean, the cluster is associated with several disease resistance (R) genes of known function including Rpg1...

  6. Complete Genome Sequence of the Filamentous Fungus Aspergillus westerdijkiae Reveals the Putative Biosynthetic Gene Cluster of Ochratoxin A

    Science.gov (United States)

    Chakrabortti, Alolika; Li, Jinming

    2016-01-01

    Ochratoxin A (OTA) is a common mycotoxin that contaminates food and agricultural products. Sequencing of the complete genome of Aspergillus westerdijkiae, a major producer of OTA, reveals more than 50 biosynthetic gene clusters, including a putative OTA biosynthetic gene cluster that encodes a dozen of enzymes, transporters, and regulatory proteins. PMID:27635003

  7. Clostridium botulinum strain Af84 contains three neurotoxin gene clusters: bont/A2, bont/F4 and bont/F5.

    Directory of Open Access Journals (Sweden)

    Nir Dover

    Full Text Available Sanger and shotgun sequencing of Clostridium botulinum strain Af84 type Af and its botulinum neurotoxin gene (bont clusters identified the presence of three bont gene clusters rather than the expected two. The three toxin gene clusters consisted of bont subtypes A2, F4 and F5. The bont/A2 and bont/F4 gene clusters were located within the chromosome (the latter in a novel location, while the bont/F5 toxin gene cluster was located within a large 246 kb plasmid. These findings are the first identification of a C. botulinum strain that contains three botulinum neurotoxin gene clusters.

  8. A brain-specific gene cluster isolated from the region of the mouse obesity locus is expressed in the adult hypothalamus and during mouse development

    Energy Technology Data Exchange (ETDEWEB)

    Laig-Webster, M.; Lim, M.E.; Chehab, F.F. [Univ. of California, San Francisco, CA (United States)

    1994-09-01

    The molecular defect underlying an autosomal recessive form of genetic obesity in a classical mouse model C57 BL/6J-ob/ob has not yet been elucidated. Whereas metabolic and physiological disturbances such as diabetes and hypertension are associated with obesity, the site of expression and the nature of the primary lesion responsible for this cascade of events remains elusive. Our efforts aimed at the positional cloning of the ob gene by YAC contig mapping and gene identification have resulted in the cloning of a brain-specific gene cluster from the ob critical region. The expression of this gene cluster is remarkably complex owing to the multitude of brain-specific mRNA transcripts detected on Northern blots. cDNA cloning of these transcripts suggests that they are expressed from different genes as well as by alternate splicing mechanisms. Furthermore, the genomic organization of the cluster appears to consist of at least two identical promoters displaying CpG islands characteristic of housekeeping genes, yet clearly involving tissue-specific expression. Sense and anti-sense synthetic RNA probes were derived from a common DNA sequence on 3 cDNA clones and hybridized to 8-16 days mouse embryonic stages and mouse adult brain sections. Expression in development was noticeable as of the 11th day of gestation and confined to the central nervous system mainly in the telencephalon and spinal cord. Coronal and sagittal sections of the adult mouse brain showed expression only in 3 different regions of the brain stem. In situ hybridization to mouse hypothalamus sections revealed the presence of a localized and specialized group of cells expressing high levels of mRNA, suggesting that this gene cluster may also be involved in the regulation of hypothalamic activities. The hypothalamus has long been hypothesized as a primary candidate tissue for the expression of the obesity gene mainly because of its well-established role in the regulation of energy metabolism and food intake.

  9. The methylcitric acid pathway in Ralstonia eutropha: new genes identified involved in propionate metabolism.

    Science.gov (United States)

    Brämer, C O; Steinbüchel, A

    2001-08-01

    From Ralstonia eutropha HF39 null-allele mutants were created by Tn5 mutagenesis and by homologous recombination which were impaired in growth on propionic acid and levulinic acid. From the molecular, physiological and enzymic analysis of these mutants it was concluded that in this bacterium propionic acid is metabolized via the methylcitric acid pathway. The genes encoding enzymes of this pathway are organized in a cluster in the order prpR, prpB, prpC, acnM, ORF5 and prpD, with prpR transcribed divergently from the other genes. (i) prpC encodes a 2-methylcitric acid synthase (42720 Da) as shown by the measurement of the respective enzyme activity, complementation of a prpC mutant of Salmonella enterica serovar Typhimurium and high sequence similarity. (ii) For the translational product of acnM the function of a 2-methyl-cis-aconitic acid hydratase (94726 Da) is proposed. This protein and also the ORF5 translational product are essential for growth on propionic acid, as revealed by the propionic-acid-negative phenotype of Tn5-insertion mutants, and are required for the conversion of 2-methylcitric acid into 2-methylisocitric acid as shown by the accumulation of the latter, which could be purified as its calcium salt from the supernatants of these mutants. In contrast, inactivation of prpD did not block the ability of the cell to use propionic acid as carbon and energy source, as shown by the propionic acid phenotype of a null-allele mutant. It is therefore unlikely that prpD from R. eutropha encodes a 2-methyl-cis-aconitic acid dehydratase as proposed recently for the homologous prpD gene from S. enterica. (iii) The translational product of prpB encodes 2-methylisocitric acid lyase (32314 Da) as revealed by measurement of the respective enzyme activity and by demonstrating accumulation of methylisocitric acid in the supernatant of a prpB null-allele mutant. (iv) The expression of prpC and probably also of the other enzymes is regulated and is induced during

  10. Evolutionary dynamics of rRNA gene clusters in cichlid fish

    Directory of Open Access Journals (Sweden)

    Nakajima Rafael T

    2012-10-01

    Full Text Available Abstract Background Among multigene families, ribosomal RNA (rRNA genes are the most frequently studied and have been explored as cytogenetic markers to study the evolutionary history of karyotypes among animals and plants. In this report, we applied cytogenetic and genomic methods to investigate the organization of rRNA genes among cichlid fishes. Cichlids are a group of fishes that are of increasing scientific interest due to their rapid and convergent adaptive radiation, which has led to extensive ecological diversity. Results The present paper reports the cytogenetic mapping of the 5S rRNA genes from 18 South American, 22 African and one Asian species and the 18S rRNA genes from 3 African species. The data obtained were comparatively analyzed with previously published information related to the mapping of rRNA genes in cichlids. The number of 5S rRNA clusters per diploid genome ranged from 2 to 15, with the most common pattern being the presence of 2 chromosomes bearing a 5S rDNA cluster. Regarding 18S rDNA mapping, the number of sites ranged from 2 to 6, with the most common pattern being the presence of 2 sites per diploid genome. Furthermore, searching the Oreochromis niloticus genome database led to the identification of a total of 59 copies of 5S rRNA and 38 copies of 18S rRNA genes that were distributed in several genomic scaffolds. The rRNA genes were frequently flanked by transposable elements (TEs and spread throughout the genome, complementing the FISH analysis that detect only clustered copies of rRNA genes. Conclusions The organization of rRNA gene clusters seems to reflect their intense and particular evolutionary pathway and not the evolutionary history of the associated taxa. The possible role of TEs as one source of rRNA gene movement, that could generates the spreading of ribosomal clusters/copies, is discussed. The present paper reinforces the notion that the integration of cytogenetic data and genomic analysis provides a

  11. Evolutionary dynamics of rRNA gene clusters in cichlid fish

    Science.gov (United States)

    2012-01-01

    Background Among multigene families, ribosomal RNA (rRNA) genes are the most frequently studied and have been explored as cytogenetic markers to study the evolutionary history of karyotypes among animals and plants. In this report, we applied cytogenetic and genomic methods to investigate the organization of rRNA genes among cichlid fishes. Cichlids are a group of fishes that are of increasing scientific interest due to their rapid and convergent adaptive radiation, which has led to extensive ecological diversity. Results The present paper reports the cytogenetic mapping of the 5S rRNA genes from 18 South American, 22 African and one Asian species and the 18S rRNA genes from 3 African species. The data obtained were comparatively analyzed with previously published information related to the mapping of rRNA genes in cichlids. The number of 5S rRNA clusters per diploid genome ranged from 2 to 15, with the most common pattern being the presence of 2 chromosomes bearing a 5S rDNA cluster. Regarding 18S rDNA mapping, the number of sites ranged from 2 to 6, with the most common pattern being the presence of 2 sites per diploid genome. Furthermore, searching the Oreochromis niloticus genome database led to the identification of a total of 59 copies of 5S rRNA and 38 copies of 18S rRNA genes that were distributed in several genomic scaffolds. The rRNA genes were frequently flanked by transposable elements (TEs) and spread throughout the genome, complementing the FISH analysis that detect only clustered copies of rRNA genes. Conclusions The organization of rRNA gene clusters seems to reflect their intense and particular evolutionary pathway and not the evolutionary history of the associated taxa. The possible role of TEs as one source of rRNA gene movement, that could generates the spreading of ribosomal clusters/copies, is discussed. The present paper reinforces the notion that the integration of cytogenetic data and genomic analysis provides a more complete picture for

  12. Pseudomonas aeruginosa IscR-Regulated Ferredoxin NADP(+ Reductase Gene (fprB Functions in Iron-Sulfur Cluster Biogenesis and Multiple Stress Response.

    Directory of Open Access Journals (Sweden)

    Adisak Romsang

    Full Text Available P. aeruginosa (PAO1 has two putative genes encoding ferredoxin NADP(+ reductases, denoted fprA and fprB. Here, the regulation of fprB expression and the protein's physiological roles in [4Fe-4S] cluster biogenesis and stress protection are characterized. The fprB mutant has defects in [4Fe-4S] cluster biogenesis, as shown by reduced activities of [4Fe-4S] cluster-containing enzymes. Inactivation of the gene resulted in increased sensitivity to oxidative, thiol, osmotic and metal stresses compared with the PAO1 wild type. The increased sensitivity could be partially or completely suppressed by high expression of genes from the isc operon, which are involved in [Fe-S] cluster biogenesis, indicating that stress sensitivity in the fprB mutant is partially caused by a reduction in levels of [4Fe-4S] clusters. The pattern and regulation of fprB expression are in agreement with the gene physiological roles; fprB expression was highly induced by redox cycling drugs and diamide and was moderately induced by peroxides, an iron chelator and salt stress. The stress-induced expression of fprB was abolished by a deletion of the iscR gene. An IscR DNA-binding site close to fprB promoter elements was identified and confirmed by specific binding of purified IscR. Analysis of the regulation of fprB expression supports the role of IscR in directly regulating fprB transcription as a transcription activator. The combination of IscR-regulated expression of fprB and the fprB roles in response to multiple stressors emphasizes the importance of [Fe-S] cluster homeostasis in both gene regulation and stress protection.

  13. Phenotype-Dependent Coexpression Gene Clusters: Application to Normal and Premature Ageing.

    Science.gov (United States)

    Wang, Kun; Das, Avinash; Xiong, Zheng-Mei; Cao, Kan; Hannenhalli, Sridhar

    2015-01-01

    Hutchinson Gilford progeria syndrome (HGPS) is a rare genetic disease with symptoms of aging at a very early age. Its molecular basis is not entirely clear, although profound gene expression changes have been reported, and there are some known and other presumed overlaps with normal aging process. Identification of genes with agingor HGPS-associated expression changes is thus an important problem. However, standard regression approaches are currently unsuitable for this task due to limited sample sizes, thus motivating development of alternative approaches. Here, we report a novel iterative multiple regression approach that leverages co-expressed gene clusters to identify gene clusters whose expression co-varies with age and/or HGPS. We have applied our approach to novel RNA-seq profiles in fibroblast cell cultures at three different cellular ages, both from HGPS patients and normal samples. After establishing the robustness of our approach, we perform a comparative investigation of biological processes underlying normal aging and HGPS. Our results recapitulate previously known processes underlying aging as well as suggest numerous unique processes underlying aging and HGPS. The approach could also be useful in detecting phenotype-dependent co-expression gene clusters in other contexts with limited sample sizes.

  14. Polymorphisms and linkage analysis for ICAM-1 and the selectin gene cluster

    Energy Technology Data Exchange (ETDEWEB)

    Vora, D.K.; Rosenbloom, C.L.; Cottingham, R.W. [Baylor College of Medicine, Houston, TX (United States)] [and others

    1994-06-01

    Genetic polymorphisms in leukocyte and endothelial cell adhesion molecules may be important variables with regard to susceptibility to multifactorial disease processes that include an inflammatory component. For this reason, polymorphisms were sought for intercellular adhesion molecule-1 (ICAM-1; gene symbol ICAM1) and for the three genes in the selectin cluster, P-selectin, L-selectin, and E-selectin (gene symbols SELP, SELL, and SELE, respectively). Two amino acid polymorphisms were identified for ICAM-1; Gly or Arg at codon 241 and Lys or Glu at codon 469. Dinucleotide repeat polymorphisms were identified in the 3{prime}-untranslated region for ICAM-1 and in intron 9 for P-selectin. Restriction fragment length polymorphisms were found using cDNAs for each of the three selectin genes as probes; E-selectin with BglII, P-selectin with ScaI, and L-selectin with HincII. Linkage analysis was performed for the selectin gene cluster and for ICAM-1 using the CEPH families; ICAM-1 is very tightly linked to the LDL receptor on chromosome 19, and the selectin cluster is linked to markers at chromosome 1q23. 41 refs., 2 tabs.

  15. Cloning of three genes involved in the flavonoid metabolic pathway and their expression during insect resistance in Pinus massoniana Lamb.

    Science.gov (United States)

    Yang, Z Q; Chen, H; Tan, J H; Xu, H L; Jia, J; Feng, Y H

    2016-12-23

    Pinus massoniana Lamb. is an important timber and turpentine-producing tree species in China. Dendrolimus punctatus and Dasychira axutha are leaf-eating pests that have harmful effects on P. massoniana production. Few studies have focused on the molecular mechanisms underlying pest resistance in P. massoniana. Based on sequencing analysis of the transcriptomes of insect-resistant P. massoniana, three key genes involved in the flavonoid metabolic pathway were identified in the present study (PmF3H, PmF3'5'H, and PmC4H). Structural domain analysis showed that the PmF3H gene contains typical binding sites for the 2OG-Fe (II) oxygenase superfamily, while PmF3'5'H and PmC4H both contain the cytochrome P450 structural domain, which is specific for P450 enzymes. Phylogenetic analysis showed that each of the three P. massoniana genes, and the homologous genes in gymnosperms, clustered into a group. Expression of these three genes was highest in the stems, and was higher in the insect-resistant P. massoniana varieties than in the controls. The extent of the increased expression in the insect-resistant P. massoniana varieties indicated that these three genes are involved in defense mechanisms against pests in this species. In the insect-resistant varieties, rapid induction of PmF3H increased the levels of PmF3'5'H and PmC4H expression. The enhanced anti-pest capability of the insect-resistant varieties could be related to temperature and humidity. In addition, these results suggest that these three genes maycontribute to the change in flower color during female cone development.

  16. In silico Analysis of Candidate Genes Involved in Sanfilippo Syndrome

    Directory of Open Access Journals (Sweden)

    Mehreen Zaka

    2015-04-01

    Full Text Available Sanfilippo syndrome is an autosomal recessive lysosomal storage disorder, caused by the deficiency of enzymes that play an important role in degradation of glycosaminoglycans and also called mucopolysaccharidosis III. Mucopolysaccharidosis is genetic disorder. Here, we searched the candidate genes for Sanfilippo syndrome by using BLAST with the query sequence. As no suitable homology was found against the query sequence we moved towards threading approach. The threading approach was carried out by employing online CPH models and LOMETS tools. Through present research, domains of the proteins were predicted by utilizing the Domain Sweep tools, GNS and two domains were reported. Motif search reported the maximum number of motifs for Type D protein as compared to other types. All four proteins were totally soluble proteins and no transmembrane domains were found. In future, these results and predicted 3D structures can be used for the molecular docking studies, binding activities and protein-protein interactions for all the four types of Sanfilippo syndrome.

  17. Diverse chromatin remodeling genes antagonize the Rb-involved SynMuv pathways in C. elegans.

    Directory of Open Access Journals (Sweden)

    Mingxue Cui

    2006-05-01

    Full Text Available In Caenorhabditis elegans, vulval cell-fate specification involves the activities of multiple signal transduction and regulatory pathways that include a receptor tyrosine kinase/Ras/mitogen-activated protein kinase pathway and synthetic multivulva (SynMuv pathways. Many genes in the SynMuv pathways encode transcription factors including the homologs of mammalian Rb, E2F, and components of the nucleosome-remodeling deacetylase complex. To further elucidate the functions of the SynMuv genes, we performed a genome-wide RNA interference (RNAi screen to search for genes that antagonize the SynMuv gene activities. Among those that displayed a varying degree of suppression of the SynMuv phenotype, 32 genes are potentially involved in chromatin remodeling (called SynMuv suppressor genes herein. Genetic mutations of two representative genes (zfp-1 and mes-4 were used to further characterize their positive roles in vulval induction and relationships with Ras function. Our analysis revealed antagonistic roles of the SynMuv suppressor genes and the SynMuv B genes in germline-soma distinction, RNAi, somatic transgene silencing, and tissue specific expression of pgl-1 and the lag-2/Delta genes. The opposite roles of these SynMuv B and SynMuv suppressor genes on transcriptional regulation were confirmed in somatic transgene silencing. We also report the identifications of ten new genes in the RNAi pathway and six new genes in germline silencing. Among the ten new RNAi genes, three encode homologs of proteins involved in both protein degradation and chromatin remodeling. Our findings suggest that multiple chromatin remodeling complexes are involved in regulating the expression of specific genes that play critical roles in developmental decisions.

  18. The hnRNP 2H9 gene, which is involved in the splicing reaction, is a multiply spliced gene

    DEFF Research Database (Denmark)

    Honoré, B

    2000-01-01

    The hnRNP 2H9 gene products are involved in the splicing process and participate in early heat shock-induced splicing arrest. By combining low/high stringency hybridisation, database search, Northern and Western blotting it is shown that the gene is alternatively spliced into at least six transcr...

  19. A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data

    Directory of Open Access Journals (Sweden)

    Scherer Stephen W

    2011-05-01

    Full Text Available Abstract Background Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. Results We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. Conclusions The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.

  20. Sequencing and transcriptional analysis of the biosynthesis gene cluster of putrescine-producing Lactococcus lactis.

    Science.gov (United States)

    Ladero, Victor; Rattray, Fergal P; Mayo, Baltasar; Martín, María Cruz; Fernández, María; Alvarez, Miguel A

    2011-09-01

    Lactococcus lactis is a prokaryotic microorganism with great importance as a culture starter and has become the model species among the lactic acid bacteria. The long and safe history of use of L. lactis in dairy fermentations has resulted in the classification of this species as GRAS (General Regarded As Safe) or QPS (Qualified Presumption of Safety). However, our group has identified several strains of L. lactis subsp. lactis and L. lactis subsp. cremoris that are able to produce putrescine from agmatine via the agmatine deiminase (AGDI) pathway. Putrescine is a biogenic amine that confers undesirable flavor characteristics and may even have toxic effects. The AGDI cluster of L. lactis is composed of a putative regulatory gene, aguR, followed by the genes (aguB, aguD, aguA, and aguC) encoding the catabolic enzymes. These genes are transcribed as an operon that is induced in the presence of agmatine. In some strains, an insertion (IS) element interrupts the transcription of the cluster, which results in a non-putrescine-producing phenotype. Based on this knowledge, a PCR-based test was developed in order to differentiate nonproducing L. lactis strains from those with a functional AGDI cluster. The analysis of the AGDI cluster and their flanking regions revealed that the capacity to produce putrescine via the AGDI pathway could be a specific characteristic that was lost during the adaptation to the milk environment by a process of reductive genome evolution.

  1. antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification

    DEFF Research Database (Denmark)

    Blin, Kai; Wolf, Thomas; Chevrette, Marc G.

    2017-01-01

    Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the product......Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding...... the production of such compounds. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' has assisted researchers in efficiently performing this, both as a web server and a standalone tool. Here, we present the thoroughly updated antiSMASH version 4, which adds several novel features......, including prediction of gene cluster boundaries using the ClusterFinder method or the newly integrated CASSIS algorithm, improved substrate specificity prediction for non-ribosomal peptide synthetase adenylation domains based on the new SANDPUMA algorithm, improved predictions for terpene and ribosomally...

  2. Evolution of the Genome 3D Organization: Comparison of Fused and Segregated Globin Gene Clusters.

    Science.gov (United States)

    Kovina, Anastasia P; Petrova, Natalia V; Gushchanskaya, Ekaterina S; Dolgushin, Konstantin V; Gerasimov, Evgeny S; Galitsyna, Aleksandra A; Penin, Alexey A; Flyamer, Ilya M; Ioudinkova, Elena S; Gavrilov, Alexey A; Vassetzky, Yegor S; Ulianov, Sergey V; Iarovaia, Olga V; Razin, Sergey V

    2017-06-01

    The genomes are folded in a complex three-dimensional (3D) structure. Some features of this organization are common for all eukaryotes, but little is known about its evolution. Here, we have studied the 3D organization and regulation of zebrafish globin gene domain and compared its organization and regulation with those of other vertebrate species. In birds and mammals, the α- and β-globin genes are segregated into separate clusters located on different chromosomes and organized into chromatin domains of different types, whereas in cold-blooded vertebrates, including Danio rerio, α- and β-globin genes are organized into common clusters. The major globin gene locus of Danio rerio is of particular interest as it is located in a genomic area that is syntenic in vertebrates and is controlled by a conserved enhancer. We have found that the major globin gene locus of Danio rerio is structurally and functionally segregated into two spatially distinct subloci harboring either adult or embryo-larval globin genes. These subloci demonstrate different organization at the level of chromatin domains and different modes of spatial organization, which appears to be due to selective interaction of the upstream enhancer with the sublocus harboring globin genes of the adult type. These data are discussed in terms of evolution of linear and 3D organization of gene clusters in vertebrates. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. Genome-wide upstream motif analysis of Cryptosporidium parvum genes clustered by expression profile.

    Science.gov (United States)

    Oberstaller, Jenna; Joseph, Sandeep J; Kissinger, Jessica C

    2013-07-29

    There are very few molecular genetic tools available to study the apicomplexan parasite Cryptosporidium parvum. The organism is not amenable to continuous in vitro cultivation or transfection, and purification of intracellular developmental stages in sufficient numbers for most downstream molecular applications is difficult and expensive since animal hosts are required. As such, very little is known about gene regulation in C. parvum. We have clustered whole-genome gene expression profiles generated from a previous study of seven post-infection time points of 3,281 genes to identify genes that show similar expression patterns throughout the first 72 hours of in vitro epithelial cell culture. We used the algorithms MEME, AlignACE and FIRE to identify conserved, overrepresented DNA motifs in the upstream promoter region of genes with similar expression profiles. The most overrepresented motifs were E2F (5'-TGGCGCCA-3'); G-box (5'-G.GGGG-3'); a well-documented ApiAP2 binding motif (5'-TGCAT-3'), and an unknown motif (5'-[A/C] AACTA-3'). We generated a recombinant C. parvum DNA-binding protein domain from a putative ApiAP2 transcription factor [CryptoDB: cgd8_810] and determined its binding specificity using protein-binding microarrays. We demonstrate that cgd8_810 can putatively bind the overrepresented G-box motif, implicating this ApiAP2 in the regulation of many gene clusters. Several DNA motifs were identified in the upstream sequences of gene clusters that might serve as potential cis-regulatory elements. These motifs, in concert with protein DNA binding site data, establish for the first time the beginnings of a global C. parvum gene regulatory map that will contribute to our understanding of the development of this zoonotic parasite.

  4. Complete sequence of a plasmid from a bovine methicillin-resistant Staphylococcus aureus harbouring a novel ica-like gene cluster in addition to antimicrobial and heavy metal resistance genes.

    Science.gov (United States)

    Feßler, Andrea T; Zhao, Qin; Schoenfelder, Sonja; Kadlec, Kristina; Brenner Michael, Geovana; Wang, Yang; Ziebuhr, Wilma; Shen, Jianzhong; Schwarz, Stefan

    2017-02-01

    The multiresistance plasmid pAFS11, obtained from a bovine methicillin-resistant Staphylococcus aureus (MRSA) isolate, was completely sequenced and analysed for its structure and organisation. Moreover, the susceptibility to the heavy metals cadmium and copper was determined by broth macrodilution. The 49,189-bp plasmid harboured the apramycin resistance gene apmA, two copies of the macrolide/lincosamide/streptogramin B resistance gene erm(B) (both located on remnants of a truncated transposon Tn917), the kanamycin/neomycin resistance gene aadD, the tetracycline resistance gene tet(L) and the trimethoprim resistance gene dfrK. The latter three genes were part of a 7,284-bp segment which was bracketed by two copies of IS431. In addition, the cadmium resistance operon cadDX as well as the copper resistance genes copA and mco were located on the plasmid and mediated a reduced susceptibility to cadmium and copper. Moreover, a complete novel ica-like gene cluster of so far unknown genetic origin was detected on this plasmid. The ica-like gene cluster comprised four different genes whose products showed 64.4-76.9% homology to the Ica proteins known to be involved in biofilm formation of the S. aureus strains Mu50, Mu3 and N315. However, 96.2-99.4% homology was seen to proteins from S. sciuri NS1 indicating an S. sciuri origin. The finding of five different antibiotic resistance genes co-located on a plasmid with heavy metal resistance genes and an ica-like gene cluster is alarming. With the acquisition of this plasmid, antimicrobial multiresistance, heavy metal resistances and potential virulence properties may be co-selected and spread via a single horizontal gene transfer event. Copyright © 2016 Elsevier B.V. All rights reserved.

  5. Genetic variations and haplotype diversity of the UGT1 gene cluster in the Chinese population.

    Directory of Open Access Journals (Sweden)

    Jing Yang

    Full Text Available Vertebrates require tremendous molecular diversity to defend against numerous small hydrophobic chemicals. UDP-glucuronosyltransferases (UGTs are a large family of detoxification enzymes that glucuronidate xenobiotics and endobiotics, facilitating their excretion from the body. The UGT1 gene cluster contains a tandem array of variable first exons, each preceded by a specific promoter, and a common set of downstream constant exons, similar to the genomic organization of the protocadherin (Pcdh, immunoglobulin, and T-cell receptor gene clusters. To assist pharmacogenomics studies in Chinese, we sequenced nine first exons, promoter and intronic regions, and five common exons of the UGT1 gene cluster in a population sample of 253 unrelated Chinese individuals. We identified 101 polymorphisms and found 15 novel SNPs. We then computed allele frequencies for each polymorphism and reconstructed their linkage disequilibrium (LD map. The UGT1 cluster can be divided into five linkage blocks: Block 9 (UGT1A9, Block 9/7/6 (UGT1A9, UGT1A7, and UGT1A6, Block 5 (UGT1A5, Block 4/3 (UGT1A4 and UGT1A3, and Block 3' UTR. Furthermore, we inferred haplotypes and selected their tagSNPs. Finally, comparing our data with those of three other populations of the HapMap project revealed ethnic specificity of the UGT1 genetic diversity in Chinese. These findings have important implications for future molecular genetic studies of the UGT1 gene cluster as well as for personalized medical therapies in Chinese.

  6. New Genes Involved in Osmotic Stress Tolerance in Saccharomyces cerevisiae

    Science.gov (United States)

    Gonzalez, Ramon; Morales, Pilar; Tronchoni, Jordi; Cordero-Bueso, Gustavo; Vaudano, Enrico; Quirós, Manuel; Novo, Maite; Torres-Pérez, Rafael; Valero, Eva

    2016-01-01

    Adaptation to changes in osmolarity is fundamental for the survival of living cells, and has implications in food and industrial biotechnology. It has been extensively studied in the yeast Saccharomyces cerevisiae, where the Hog1 stress activated protein kinase was discovered about 20 years ago. Hog1 is the core of the intracellular signaling pathway that governs the adaptive response to osmotic stress in this species. The main endpoint of this program is synthesis and intracellular retention of glycerol, as a compatible osmolyte. Despite many details of the signaling pathways and yeast responses to osmotic challenges have already been described, genome-wide approaches are contributing to refine our knowledge of yeast adaptation to hypertonic media. In this work, we used a quantitative fitness analysis approach in order to deepen our understanding of the interplay between yeast cells and the osmotic environment. Genetic requirements for proper growth under osmotic stress showed both common and specific features when hypertonic conditions were induced by either glucose or sorbitol. Tolerance to high-glucose content requires mitochondrial function, while defective protein targeting to peroxisome, GID-complex function (involved in negative regulation of gluconeogenesis), or chromatin dynamics, result in poor survival to sorbitol-induced osmotic stress. On the other side, the competitive disadvantage of yeast strains defective in the endomembrane system is relieved by hypertonic conditions. This finding points to the Golgi-endosome system as one of the main cell components negatively affected by hyperosmolarity. Most of the biological processes highlighted in this analysis had not been previously related to osmotic stress but are probably relevant in an ecological and evolutionary context. PMID:27733850

  7. The Histidine Decarboxylase Gene Cluster of Lactobacillus parabuchneri Was Gained by Horizontal Gene Transfer and Is Mobile within the Species

    Science.gov (United States)

    Wüthrich, Daniel; Berthoud, Hélène; Wechsler, Daniel; Eugster, Elisabeth; Irmler, Stefan; Bruggmann, Rémy

    2017-01-01

    Histamine in food can cause intolerance reactions in consumers. Lactobacillus parabuchneri (L. parabuchneri) is one of the major causes of elevated histamine levels in cheese. Despite its significant economic impact and negative influence on human health, no genomic study has been published so far. We sequenced and analyzed 18 L. parabuchneri strains of which 12 were histamine positive and 6 were histamine negative. We determined the complete genome of the histamine positive strain FAM21731 with PacBio as well as Illumina and the genomes of the remaining 17 strains using the Illumina technology. We developed the synteny aware ortholog finding algorithm SynOrf to compare the genomes and we show that the histidine decarboxylase (HDC) gene cluster is located in a genomic island. It is very likely that the HDC gene cluster was transferred from other lactobacilli, as it is highly conserved within several lactobacilli species. Furthermore, we have evidence that the HDC gene cluster was transferred within the L. parabuchneri species. PMID:28261177

  8. Modularity of Plant Metabolic Gene Clusters: A Trio of Linked Genes That Are Collectively Required for Acylation of Triterpenes in Oat[W][OA

    Science.gov (United States)

    Mugford, Sam T.; Louveau, Thomas; Melton, Rachel; Qi, Xiaoquan; Bakht, Saleha; Hill, Lionel; Tsurushima, Tetsu; Honkanen, Suvi; Rosser, Susan J.; Lomonossoff, George P.; Osbourn, Anne

    2013-01-01

    Operon-like gene clusters are an emerging phenomenon in the field of plant natural products. The genes encoding some of the best-characterized plant secondary metabolite biosynthetic pathways are scattered across plant genomes. However, an increasing number of gene clusters encoding the synthesis of diverse natural products have recently been reported in plant genomes. These clusters have arisen through the neo-functionalization and relocation of existing genes within the genome, and not by horizontal gene transfer from microbes. The reasons for clustering are not yet clear, although this form of gene organization is likely to facilitate co-inheritance and co-regulation. Oats (Avena spp) synthesize antimicrobial triterpenoids (avenacins) that provide protection against disease. The synthesis of these compounds is encoded by a gene cluster. Here we show that a module of three adjacent genes within the wider biosynthetic gene cluster is required for avenacin acylation. Through the characterization of these genes and their encoded proteins we present a model of the subcellular organization of triterpenoid biosynthesis. PMID:23532069

  9. Clusters of ancestrally related genes that show paralogy in whole or in part are a major feature of the genomes of humans and other species.

    Directory of Open Access Journals (Sweden)

    Michael B Walker

    Full Text Available Arrangements of genes along chromosomes are a product of evolutionary processes, and we can expect that preferable arrangements will prevail over the span of evolutionary time, often being reflected in the non-random clustering of structurally and/or functionally related genes. Such non-random arrangements can arise by two distinct evolutionary processes: duplications of DNA sequences that give rise to clusters of genes sharing both sequence similarity and common sequence features and the migration together of genes related by function, but not by common descent. To provide a background for distinguishing between the two, which is important for future efforts to unravel the evolutionary processes involved, we here provide a description of the extent to which ancestrally related genes are found in proximity.Towards this purpose, we combined information from five genomic datasets, InterPro, SCOP, PANTHER, Ensembl protein families, and Ensembl gene paralogs. The results are provided in publicly available datasets (http://cgd.jax.org/datasets/clustering/paraclustering.shtml describing the extent to which ancestrally related genes are in proximity beyond what is expected by chance (i.e. form paraclusters in the human and nine other vertebrate genomes, as well as the D. melanogaster, C. elegans, A. thaliana, and S. cerevisiae genomes. With the exception of Saccharomyces, paraclusters are a common feature of the genomes we examined. In the human genome they are estimated to include at least 22% of all protein coding genes. Paraclusters are far more prevalent among some gene families than others, are highly species or clade specific and can evolve rapidly, sometimes in response to environmental cues. Altogether, they account for a large portion of the functional clustering previously reported in several genomes.

  10. A Papaver somniferum 10-gene cluster for synthesis of the anticancer alkaloid noscapine.

    Science.gov (United States)

    Winzer, Thilo; Gazda, Valeria; He, Zhesi; Kaminski, Filip; Kern, Marcelo; Larson, Tony R; Li, Yi; Meade, Fergus; Teodor, Roxana; Vaistij, Fabián E; Walker, Carol; Bowser, Tim A; Graham, Ian A

    2012-06-29

    Noscapine is an antitumor alkaloid from opium poppy that binds tubulin, arrests metaphase, and induces apoptosis in dividing human cells. Elucidation of the biosynthetic pathway will enable improvement in the commercial production of noscapine and related bioactive molecules. Transcriptomic analysis revealed the exclusive expression of 10 genes encoding five distinct enzyme classes in a high noscapine-producing poppy variety, HN1. Analysis of an F(2) mapping population indicated that these genes are tightly linked in HN1, and bacterial artificial chromosome sequencing confirmed that they exist as a complex gene cluster for plant alkaloids. Virus-induced gene silencing resulted in accumulation of pathway intermediates, allowing gene function to be linked to noscapine synthesis and a novel biosynthetic pathway to be proposed.

  11. Identification of genes involved in indole-3-acetic acid biosynthesis by Gluconacetobacter diazotrophicus PAL5 strain using transposon mutagenesis

    Directory of Open Access Journals (Sweden)

    ELISETE PAINS RODRIGUES

    2016-10-01

    Full Text Available Gluconacetobacter diazotrophicus is a beneficial nitrogen-fixing endophyte found in association with sugarcane plants and other important crops. Beneficial effects of G. diazotrophicus on sugarcane growth and productivity have been attributed to biological nitrogen fixation process and production of phytohormones especially indole-3-acetic acid (IAA; however, information about the biosynthesis and function of IAA in G. diazotrophicus is still scarce. Therefore, the aim of this work was to identify genes and pathways involved in IAA biosynthesis in this bacterium. In our study, the screening of two independent Tn5 mutant libraries of PAL5T strain using the Salkowski colorimetric assay revealed two mutants (Gdiaa34 and Gdiaa01, which exhibited 95% less indolic compounds that the parental strain when grown in LGIP medium supplemented with L-tryptophan. HPLC chromatograms of the wild-type strain revealed the presence of IAA and of the biosynthetic intermediates indole-3-pyruvic acid (IPyA and indole-3-lactate (ILA. In contrast, the HPLC profiles of both mutants showed no IAA but only a large peak of non-metabolized tryptophan and low levels of IPyA and ILA were detected. Molecular characterization revealed that Gdiaa01 and Gdiaa34 mutants had unique Tn5 insertions at different sites within the GDI2456 open read frame, which is predicted to encode a L-amino acid oxidase (LAAO. GDI2456 (lao gene forms a cluster with GDI2455 and GDI2454 ORFs, which are predicted to encode a cytochrome C and an RidA protein, respectively. RT-qPCR showed that transcript levels of lao, cccA and ridA genes were reduced in the Gdiaa01 as compared to PAL5T. In addition, rice plants inoculated with Gdiaa01 showed significantly smaller root development (length, surface area, number of forks and tips than those plants inoculated with PAL5T. In conclusion, our study demonstrated that G. diazotrophicus PAL5T produces IAA via the IPyA pathway in cultures supplemented with

  12. Identification of Genes Involved in Indole-3-Acetic Acid Biosynthesis by Gluconacetobacter diazotrophicus PAL5 Strain Using Transposon Mutagenesis

    Science.gov (United States)

    Rodrigues, Elisete P.; Soares, Cleiton de Paula; Galvão, Patrícia G.; Imada, Eddie L.; Simões-Araújo, Jean L.; Rouws, Luc F. M.; de Oliveira, André L. M.; Vidal, Márcia S.; Baldani, José I.

    2016-01-01

    Gluconacetobacter diazotrophicus is a beneficial nitrogen-fixing endophyte found in association with sugarcane plants and other important crops. Beneficial effects of G. diazotrophicus on sugarcane growth and productivity have been attributed to biological nitrogen fixation process and production of phytohormones especially indole-3-acetic acid (IAA); however, information about the biosynthesis and function of IAA in G. diazotrophicus is still scarce. Therefore, the aim of this work was to identify genes and pathways involved in IAA biosynthesis in this bacterium. In our study, the screening of two independent Tn5 mutant libraries of PAL5T strain using the Salkowski colorimetric assay revealed two mutants (Gdiaa34 and Gdiaa01), which exhibited 95% less indolic compounds than the parental strain when grown in LGIP medium supplemented with L-tryptophan. HPLC chromatograms of the wild-type strain revealed the presence of IAA and of the biosynthetic intermediates indole-3-pyruvic acid (IPyA) and indole-3-lactate (ILA). In contrast, the HPLC profiles of both mutants showed no IAA but only a large peak of non-metabolized tryptophan and low levels of IPyA and ILA were detected. Molecular characterization revealed that Gdiaa01 and Gdiaa34 mutants had unique Tn5 insertions at different sites within the GDI2456 open read frame, which is predicted to encode a L-amino acid oxidase (LAAO). GDI2456 (lao gene) forms a cluster with GDI2455 and GDI2454 ORFs, which are predicted to encode a cytochrome C and an RidA protein, respectively. RT-qPCR showed that transcript levels of lao. cccA, and ridA genes were reduced in the Gdiaa01 as compared to PAL5T. In addition, rice plants inoculated with Gdiaa01 showed significantly smaller root development (length, surface area, number of forks and tips) than those plants inoculated with PAL5T. In conclusion, our study demonstrated that G. diazotrophicus PAL5T produces IAA via the IPyA pathway in cultures supplemented with tryptophan and

  13. MeSH key terms for validation and annotation of gene expression clusters

    Energy Technology Data Exchange (ETDEWEB)

    Rechtsteiner, A. (Andreas); Rocha, L. M. (Luis Mateus)

    2004-01-01

    Integration of different sources of information is a great challenge for the analysis of gene expression data, and for the field of Functional Genomics in general. As the availability of numerical data from high-throughput methods increases, so does the need for technologies that assist in the validation and evaluation of the biological significance of results extracted from these data. In mRNA assaying with microarrays, for example, numerical analysis often attempts to identify clusters of co-expressed genes. The important task to find the biological significance of the results and validate them has so far mostly fallen to the biological expert who had to perform this task manually. One of the most promising avenues to develop automated and integrative technology for such tasks lies in the application of modern Information Retrieval (IR) and Knowledge Management (KM) algorithms to databases with biomedical publications and data. Examples of databases available for the field are bibliographic databases c ntaining scientific publications (e.g. MEDLINE/PUBMED), databases containing sequence data (e.g. GenBank) and databases of semantic annotations (e.g. the Gene Ontology Consortium and Medical Subject Headings (MeSH)). We present here an approach that uses the MeSH terms and their concept hierarchies to validate and obtain functional information for gene expression clusters. The controlled and hierarchical MeSH vocabulary is used by the National Library of Medicine (NLM) to index all the articles cited in MEDLINE. Such indexing with a controlled vocabulary eliminates some of the ambiguity due to polysemy (terms that have multiple meanings) and synonymy (multiple terms have similar meaning) that would be encountered if terms would be extracted directly from the articles due to differing article contexts or author preferences and background. Further, the hierarchical organization of the MeSH terms can illustrate the conceptuallfunctional relationships of genes

  14. Genuine genetic redundancy in maleylacetate-reductase-encoding genes involved in degradation of haloaromatic compounds by Cupriavidus necator JMP134.

    Science.gov (United States)

    Pérez-Pantoja, Danilo; Donoso, Raúl A; Sánchez, Miguel A; González, Bernardo

    2009-11-01

    Maleylacetate reductases (MAR) are required for biodegradation of several substituted aromatic compounds. To date, the functionality of two MAR-encoding genes (tfdF(I) and tfdF(II)) has been reported in Cupriavidus necator JMP134(pJP4), a known degrader of aromatic compounds. These two genes are located in tfd gene clusters involved in the turnover of 2,4-dichlorophenoxyacetate (2,4-D) and 3-chlorobenzoate (3-CB). The C. necator JMP134 genome comprises at least three other genes that putatively encode MAR (tcpD, hqoD and hxqD), but confirmation of their functionality and their role in the catabolism of haloaromatic compounds has not been assessed. RT-PCR expression analyses of C. necator JMP134 cells exposed to 2,4-D, 3-CB, 2,4,6-trichlorophenol (2,4,6-TCP) or 4-fluorobenzoate (4-FB) showed that tfdF(I) and tfdF(II) are induced by haloaromatics channelled to halocatechols as intermediates. In contrast, 2,4,6-TCP only induces tcpD, and any haloaromatic compounds tested did not induce hxqD and hqoD. However, the tcpD, hxqD and hqoD gene products showed MAR activity in cell extracts and provided the MAR function for 2,4-D catabolism when heterologously expressed in MAR-lacking strains. Growth tests for mutants of the five MAR-encoding genes in strain JMP134 showed that none of these genes is essential for degradation of the tested compounds. However, the role of tfdF(I)/tfdF(II) and tcpD genes in the expression of MAR activity during catabolism of 2,4-D and 2,4,6-TCP, respectively, was confirmed by enzyme activity tests in mutants. These results reveal a striking example of genetic redundancy in the degradation of aromatic compounds.

  15. Evolution of coding and non-coding genes in HOX clusters of a marsupial

    Directory of Open Access Journals (Sweden)

    Yu Hongshi

    2012-06-01

    Full Text Available Abstract Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.

  16. Clustering Gene Expression Data Based on Predicted Differential Effects of G V Interaction

    Institute of Scientific and Technical Information of China (English)

    Hai-Yan Pan; Jun Zhu; Dan-Fu Han

    2005-01-01

    Microarray has become a popular biotechnology in biological and medical research.However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of G V (gene by variety)interaction using the adjusted unbiased prediction (AUP) method. The predicted G V interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.

  17. Structure and gene cluster of the o-antigen of Escherichia coli o96.

    Science.gov (United States)

    Guo, Xi; Senchenkova, Sof'ya N; Shashkov, Alexander S; Perepelov, Andrei V; Liu, Bin; Knirel, Yuriy A

    2016-02-01

    Mild acid degradation of the lipopolysaccharide of Escherichia coli O96 afforded a mixture of two polysaccharides. The following structure of the pentasaccharide repeating unit of the major polymer was established by sugar analysis, Smith degradation, and (1)H and (13)C NMR spectroscopy: [Formula: see text]. The O-antigen gene cluster of E. coli O96 between conserved galF and gnd genes was found to be consistent with this structure, and hence, the major polysaccharide represents the O96-antigen. The O96-antigen structure and gene cluster are similar to those of E. coli O170, and two proteins encoded in the gene clusters of both bacteria were putatively assigned a function of galactofuranosyltransferases. The minor polymer has the same structure as a peptidoglycan-related polysaccharide reported earlier in Providencia alcalifeciens O45 and several other O-serogoups of this species (Ovchinnikova OG, Liu B, Kocharova NA, Shashkov AS, Kondakova AN, Siwinska M, Feng L, Rozalski A, Wang L, Knirel YA. Biochemistry (Moscow) 2012;77:609-15) → 4)-β-D-GlcpNAc-(1 → 4)-β-D-GlcpNAc3(Rlac-lAla)-(1 → where Rlac-lAla indicates (R)-1-[(S)-1-carboxyethylaminocarbonyl]ethyl.

  18. Molecular cloning and characterization of