WorldWideScience

Sample records for ccl3l gene cluster

  1. Variations in CCL3L gene cluster sequence and non-specific gene copy numbers

    Directory of Open Access Journals (Sweden)

    Edberg Jeffrey C

    2010-03-01

    Full Text Available Abstract Background: Copy number variations (CNVs) of the gene CC chemokine ligand 3-like 1 (CCL3L1) have been implicated in HIV-1 susceptibility, but the association has been inconsistent. CCL3L1 shares homology with a cluster of genes localized to chromosome 17q12, namely CCL3, CCL3L2, and CCL3L3. These genes are involved in host defense and inflammatory processes. Several CNV assays have been developed for the CCL3L1 gene. Findings: Through pairwise and multiple alignments of these genes, we have shown that the homology between these genes ranges from 50% to 99% in complete gene sequences and from 70% to 100% in the exonic regions, with CCL3L1 and CCL3L3 being identical. By use of MEGA 4 and BioEdit, we aligned sense primers, anti-sense primers, and probes used in several previously described assays against pre-multiple alignments of all four chemokine genes. Each set of probes and primers aligned and matched with overlapping sequences in at least two of the four genes, indicating that previously utilized RT-PCR based CNV assays are not specific for only CCL3L1. The four available assays measured median copies of 2 and 3-4 in European and African American individuals, respectively. The concordance between the assays ranged from 0.44 to 0.83, suggesting individual discordant calls and inconsistencies with the assays from the expected gene coverage from the known sequence. Conclusions: This indicates that some of the inconsistencies in the association studies could be due to assays that provide heterogeneous results. Sequence information to determine CNV of the three genes separately would allow testing whether their association with the pathogenesis of a human disease or phenotype is affected by an individual gene or by a combination of these genes.
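
    The homology figures in this record come from pairwise comparisons of aligned sequences. As a rough illustration only, the sketch below computes percent identity between already-aligned sequences in plain Python. The function names and the toy sequences (geneA, geneB, geneC) are hypothetical and are not the real CCL3-like gene sequences, and counting only ungapped columns is an assumption rather than the definition used by MEGA 4 or BioEdit.

```python
def percent_identity(a, b, gap="-"):
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption: gapped columns are ignored; other tools may count them differently.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue  # skip columns containing a gap
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def pairwise_identities(alignment):
    """Return {(name1, name2): percent identity} for every pair in the alignment."""
    names = sorted(alignment)
    result = {}
    for i, p in enumerate(names):
        for q in names[i + 1:]:
            result[(p, q)] = percent_identity(alignment[p], alignment[q])
    return result


# Hypothetical toy alignment, for illustration only.
toy_alignment = {
    "geneA": "ATGCAGGTCTCC",
    "geneB": "ATGCAGGTCTCC",
    "geneC": "ATGAAG-TCTCA",
}

if __name__ == "__main__":
    for pair, identity in pairwise_identities(toy_alignment).items():
        print(pair, round(identity, 1))
```

    In this toy case geneA and geneB are identical, while geneA and geneC differ at two of eleven ungapped columns.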

  2. Human CCL3L1 copy number variation, gene expression, and the role of the CCL3L1-CCR5 axis in lung function [version 1; referees: 2 approved]

    Directory of Open Access Journals (Sweden)

    Adeolu B. Adewoye

    2018-02-01

    Full Text Available Background: The CCL3L1-CCR5 signaling axis is important in a number of inflammatory responses, including macrophage function and T-cell-dependent immune responses. Small molecule CCR5 antagonists exist, including the approved antiretroviral drug maraviroc, and therapeutic monoclonal antibodies are in development. Repositioning of drugs and targets into new disease areas can accelerate the availability of new therapies and substantially reduce costs. As it has been shown that drug targets with genetic evidence supporting their involvement in the disease are more likely to be successful in clinical development, using genetic association studies to identify new target repurposing opportunities could be fruitful. Here we investigate the potential of perturbation of the CCL3L1-CCR5 axis as treatment for respiratory disease. Europeans typically carry between 0 and 5 copies of CCL3L1 and this multi-allelic variation is not detected by widely used genome-wide single nucleotide polymorphism studies. Methods: We directly measured the complex structural variation of CCL3L1 using the Paralogue Ratio Test and imputed (with validation) CCR5del32 genotypes in 5,000 individuals from UK Biobank, selected from the extremes of the lung function distribution, and analysed DNA and RNAseq data for CCL3L1 from the 1000 Genomes Project. Results: We confirmed the gene dosage effect of CCL3L1 copy number on CCL3L1 mRNA expression levels. We found no evidence for association of CCL3L1 copy number or CCR5del32 genotype with lung function. Conclusions: These results suggest that repositioning CCR5 antagonists is unlikely to be successful for the treatment of airflow obstruction.

  3. CCL3L gene copy number and survival in an HIV-1 infected Zimbabwean population

    DEFF Research Database (Denmark)

    Larsen, Margit Hørup; Wegner, Lise Thørner; Zinyama, Rutendo

    2012-01-01

    and progression to AIDS, but these results have been inconsistent. We examined a Zimbabwean study population for an association of CCL3L CNV with HIV status, progression (CD4 T-cells and viral load), and survival. Another aim was to investigate the possible effects of CCL3L CNV on CCL3 protein concentration....... A treatment-naïve cohort, which included 153 HIV infected and 159 HIV uninfected individuals, was followed for up to 4.3 years. The CNV of the CCL3L was determined by duplex real-time polymerase chain reaction. We found no association between four CCL3L CNV strata and HIV status (P=0.7), CD4 T-cell count (P=0...

  4. The CCL3L1-CCR5 genotype influences the development of AIDS, but not HIV susceptibility or the response to HAART

    Energy Technology Data Exchange (ETDEWEB)

    Bhattacharya, Tanmoy [Los Alamos National Laboratory]; Stanton, Jennifer [NORTHWESTERN UNIV]; Kim, Eun-Young [NORTHWESTERN UNIV]; Kunstman, Kevin [NORTHWESTERN UNIV]; Phair, John [NORTHWESTERN UNIV]; Jacobson, Lisa P [JOHNS HOPKINS UNIV]; Wolinsky, Steven M [NORTHWESTERN UNIV]

    2008-01-01

    A selective advantage against infectious diseases such as HIV/AIDS is associated with differences in the genes relevant to immunity and virus replication. The CC chemokine receptor 5 (CCR5), the principal coreceptor for HIV, and its chemokine ligands, including CCL3L1, influence the susceptibility of CD4+ target cells to infection. The CCL3L1 gene is in a region of segmental duplication on the q-arm of human chromosome 17. Increased numbers of CCL3L1 gene copies that affect the gene expression phenotype might have substantial protective effects. Here we show that the population-specific CCL3L1 gene copy number and the CCR5 Δ32 protein-inactivating deletion that categorizes the CCL3L1-CCR5 genotype do not influence HIV/AIDS susceptibility or the robustness of immune recovery after the initiation of highly active antiretroviral therapy (HAART).

  5. CCL3L1 copy number, CCR5 genotype and susceptibility to tuberculosis.

    Science.gov (United States)

    Carpenter, Danielle; Taype, Carmen; Goulding, Jon; Levin, Mike; Eley, Brian; Anderson, Suzanne; Shaw, Marie-Anne; Armour, John A L

    2014-01-09

    Tuberculosis is a major infectious disease and functional studies have provided evidence that both the chemokine MIP-1α and its receptor CCR5 play a role in susceptibility to TB. Thus by measuring copy number variation of CCL3L1, one of the genes that encode MIP-1α, and genotyping a functional promoter polymorphism -2459A > G in CCR5 (rs1799987) we investigate the influence of MIP-1α and CCR5, independently and combined, in susceptibility to clinically active TB in three populations, a Peruvian population (n = 1132), a !Xhosa population (n = 605) and a South African Coloured population (n = 221). The three populations include patients with clinically diagnosed pulmonary TB, as well as other, less prevalent forms of extrapulmonary TB. Copy number of CCL3L1 was measured using the paralogue ratio test and exhibited ranges between 0-6 copies per diploid genome (pdg) in Peru, between 0-12 pdg in !Xhosa samples and between 0-10 pdg in South African Coloured samples. The CCR5 promoter polymorphism was observed to differ significantly in allele frequency between populations (*A; Peru f = 0.67, !Xhosa f = 0.38, Coloured f = 0.48). The case-control association studies performed however find, surprisingly, no evidence for an influence of variation in genes coding for MIP-1α or CCR5 individually or together in susceptibility to clinically active TB in these populations.

  6. CCL3L1-CCR5 genotype improves the assessment of AIDS Risk in HIV-1-infected individuals.

    Directory of Open Access Journals (Sweden)

    Hemant Kulkarni

    Full Text Available BACKGROUND: Whether vexing clinical decision-making dilemmas can be partly addressed by recent advances in genomics is unclear. For example, when to initiate highly active antiretroviral therapy (HAART) during HIV-1 infection remains a clinical dilemma. This decision relies heavily on assessing AIDS risk based on the CD4+ T cell count and plasma viral load. However, the trajectories of these two laboratory markers are influenced, in part, by polymorphisms in CCR5, the major HIV coreceptor, and the gene copy number of CCL3L1, a potent CCR5 ligand and HIV-suppressive chemokine. Therefore, we determined whether accounting for both genetic and laboratory markers provided an improved means of assessing AIDS risk. METHODS AND FINDINGS: In a prospective, single-site, ethnically-mixed cohort of 1,132 HIV-positive subjects, we determined the AIDS risk conveyed by the laboratory and genetic markers separately and in combination. Subjects were assigned to a low, moderate or high genetic risk group (GRG) based on variations in CCL3L1 and CCR5. The predictive value of the CCL3L1-CCR5 GRGs, as estimated by likelihood ratios, was equivalent to that of the laboratory markers. GRG status also predicted AIDS development when the laboratory markers conveyed a contrary risk. Additionally, in two separate and large groups of HIV+ subjects from a natural history cohort, the results from additive risk-scoring systems and classification and regression tree (CART) analysis revealed that the laboratory and CCL3L1-CCR5 genetic markers together provided more prognostic information than either marker alone. Furthermore, GRGs independently predicted the time interval from seroconversion to CD4+ cell count thresholds used to guide HAART initiation. CONCLUSIONS: The combination of the laboratory and genetic markers captures a broader spectrum of AIDS risk than either marker alone. By tracking a unique aspect of AIDS risk distinct from that captured by the laboratory parameters

  7. Association of CCR2-CCR5 haplotypes and CCL3L1 copy number with Kawasaki Disease, coronary artery lesions, and IVIG responses in Japanese children.

    Directory of Open Access Journals (Sweden)

    Manju Mamtani

    Full Text Available BACKGROUND: The etiology of Kawasaki Disease (KD) is enigmatic, although an infectious cause is suspected. Polymorphisms in CC chemokine receptor 5 (CCR5) and/or its potent ligand CCL3L1 influence KD susceptibility in US, European and Korean populations. However, the influence of these variations on KD susceptibility, coronary artery lesions (CAL) and response to intravenous immunoglobulin (IVIG) in Japanese children, who have the highest incidence of KD, is unknown. METHODOLOGY/PRINCIPAL FINDINGS: We used unconditional logistic regression analyses to determine the associations of the copy number of the CCL3L1 gene-containing duplication and CCR2-CCR5 haplotypes in 133 Japanese KD cases [33 with CAL and 25 with resistance to IVIG] and 312 Japanese controls without a history of KD. We observed that the deviation from the population average of four CCL3L1 copies (i.e., fewer or more than four copies) was associated with an increased risk of KD and IVIG resistance (adjusted odds ratio (OR) = 2.25, p = 0.004 and OR = 6.26, p = 0.089, respectively). Heterozygosity for the CCR5 HHF*2 haplotype was associated with a reduced risk of both IVIG resistance (OR = 0.21, p = 0.026) and CAL development (OR = 0.44, p = 0.071). CONCLUSIONS/SIGNIFICANCE: The CCL3L1-CCR5 axis may play an important role in KD pathogenesis. In addition to clinical and laboratory parameters, genetic markers may also predict risk of CAL and resistance to IVIG.

  8. Genetic and bibliographic information: CCL3L1 [GenLibi]

    Lifescience Database Archive (English)

    Full Text Available CCL3L1 chemokine (C-C motif) ligand 3-like 1 human HIV Infections (MeSH); HIV-1 Virus Diseases...ctions (C02.782.815.616) > HIV Infections (C02.782.815.616.400) Virus Diseases (C02) > Sexually Transmitted Diseases... (C02.800) > Sexually Transmitted Diseases, Viral (C02.800.801) > HIV Inf...ections (C02.800.801.400) Immune System Diseases (C20) > Immunologic Deficiency Syndromes (C20.673) > HIV Infections (C20.673.480) 05A0287080 ...

  9. Genetic Variations in the Receptor-Ligand Pair CCR5 and CCL3L1 Are Important Determinants of Susceptibility to Kawasaki Disease

    Science.gov (United States)

    Burns, Jane C.; Shimizu, Chisato; Gonzalez, Enrique; Kulkarni, Hemant; Patel, Sukeshi; Shike, Hiroko; Sundel, Robert S.; Newburger, Jane W.; Ahuja, Sunil K.

    2010-01-01

    Kawasaki disease (KD) is an enigmatic, self-limited vasculitis of childhood that is complicated by development of coronary-artery aneurysms. The high incidence of KD in Asian versus European populations prompted a search for genetic polymorphisms that are differentially distributed among these populations and that influence KD susceptibility. Here, we demonstrate a striking, inverse relationship between the worldwide distribution of CCR5-Δ32 allele and the incidence of KD. In 164 KD patient-parent trios, 4 CCR5 haplotypes including the CCR5-Δ32 allele were differentially transmitted from heterozygous parents to affected children. However, the magnitude of the reduced risk of KD associated with the CCR5-Δ32 allele and certain CCR5 haplotypes was significantly greater in individuals who also possessed a high copy number of the gene encoding CCL3L1, the most potent CCR5 ligand. These findings, derived from the largest genetic study of any systemic vasculitis, suggest a central role of CCR5-CCL3L1 gene-gene interactions in KD susceptibility and the importance of gene modifiers in infectious diseases. PMID:15962231

  10. Accuracy and differential bias in copy number measurement of CCL3L1 in association studies with three auto-immune disorders.

    Science.gov (United States)

    Carpenter, Danielle; Walker, Susan; Prescott, Natalie; Schalkwijk, Joost; Armour, John A L

    2011-08-18

    Copy number variation (CNV) contributes to the variation observed between individuals and can influence human disease progression, but the accurate measurement of individual copy numbers is technically challenging. In the work presented here we describe a modification to a previously described paralogue ratio test (PRT) method for genotyping the CCL3L1/CCL4L1 copy variable region, which we use to ascertain CCL3L1/CCL4L1 copy number in 1581 European samples. As the products of CCL3L1 and CCL4L1 potentially play a role in autoimmunity we performed case control association studies with Crohn's disease, rheumatoid arthritis and psoriasis clinical cohorts. We evaluate the PRT methodology used, paying particular attention to accuracy and precision, and highlight the problems of differential bias in copy number measurements. Our PRT methods for measuring copy number were of sufficient precision to detect very slight but systematic differential bias between results from case and control DNA samples in one study. We find no evidence for an association between CCL3L1 copy number and Crohn's disease, rheumatoid arthritis or psoriasis. Differential bias of this small magnitude, but applied systematically across large numbers of samples, would create a serious risk of false positive associations in copy number, if measured using methods of lower precision, or methods relying on single uncorroborated measurements. In this study the small differential bias detected by PRT in one sample set was resolved by a simple pre-treatment by restriction enzyme digestion.

  11. Accuracy and differential bias in copy number measurement of CCL3L1 in association studies with three auto-immune disorders

    Directory of Open Access Journals (Sweden)

    Carpenter Danielle

    2011-08-01

    Full Text Available Abstract Background: Copy number variation (CNV) contributes to the variation observed between individuals and can influence human disease progression, but the accurate measurement of individual copy numbers is technically challenging. In the work presented here we describe a modification to a previously described paralogue ratio test (PRT) method for genotyping the CCL3L1/CCL4L1 copy variable region, which we use to ascertain CCL3L1/CCL4L1 copy number in 1581 European samples. As the products of CCL3L1 and CCL4L1 potentially play a role in autoimmunity we performed case control association studies with Crohn's disease, rheumatoid arthritis and psoriasis clinical cohorts. Results: We evaluate the PRT methodology used, paying particular attention to accuracy and precision, and highlight the problems of differential bias in copy number measurements. Our PRT methods for measuring copy number were of sufficient precision to detect very slight but systematic differential bias between results from case and control DNA samples in one study. We find no evidence for an association between CCL3L1 copy number and Crohn's disease, rheumatoid arthritis or psoriasis. Conclusions: Differential bias of this small magnitude, but applied systematically across large numbers of samples, would create a serious risk of false positive associations in copy number, if measured using methods of lower precision, or methods relying on single uncorroborated measurements. In this study the small differential bias detected by PRT in one sample set was resolved by a simple pre-treatment by restriction enzyme digestion.

  12. Gene cluster statistics with gene families.

    Science.gov (United States)

    Raghupathy, Narayanan; Durand, Dannie

    2009-05-01

    Identifying genomic regions that descended from a common ancestor is important for understanding the function and evolution of genomes. In distantly related genomes, clusters of homologous gene pairs are evidence of candidate homologous regions. Demonstrating the statistical significance of such "gene clusters" is an essential component of comparative genomic analyses. However, currently there are no practical statistical tests for gene clusters that model the influence of the number of homologs in each gene family on cluster significance. In this work, we demonstrate empirically that failure to incorporate gene family size in gene cluster statistics results in overestimation of significance, leading to incorrect conclusions. We further present novel analytical methods for estimating gene cluster significance that take gene family size into account. Our methods do not require complete genome data and are suitable for testing individual clusters found in local regions, such as contigs in an unfinished assembly. We consider pairs of regions drawn from the same genome (paralogous clusters), as well as regions drawn from two different genomes (orthologous clusters). Determining cluster significance under general models of gene family size is computationally intractable. By assuming that all gene families are of equal size, we obtain analytical expressions that allow fast approximation of cluster probabilities. We evaluate the accuracy of this approximation by comparing the resulting gene cluster probabilities with cluster probabilities obtained by simulating a realistic, power-law distributed model of gene family size, with parameters inferred from genomic data. Surprisingly, despite the simplicity of the underlying assumption, our method accurately approximates the true cluster probabilities. It slightly overestimates these probabilities, yielding a conservative test. We present additional simulation results indicating the best choice of parameter values for data

  13. Copy number variation of CCL3-like genes affects rate of progression to simian-AIDS in Rhesus Macaques (Macaca mulatta).

    Directory of Open Access Journals (Sweden)

    Jeremiah D Degenhardt

    2009-01-01

    Full Text Available Variation in genes underlying host immunity can lead to marked differences in susceptibility to HIV infection among humans. Despite heavy reliance on non-human primates as models for HIV/AIDS, little is known about which host factors are shared and which are unique to a given primate lineage. Here, we investigate whether copy number variation (CNV) at CCL3-like genes (CCL3L), a key genetic host factor for HIV/AIDS susceptibility and cell-mediated immune response in humans, is also a determinant of time until onset of simian-AIDS in rhesus macaques. Using a retrospective study of 57 rhesus macaques experimentally infected with SIVmac, we find that CCL3L CNV explains approximately 18% of the variance in time to simian-AIDS (p<0.001), with lower CCL3L copy number associating with more rapid disease course. We also find that CCL3L copy number varies significantly (p<10^-6) among rhesus subpopulations, with Indian-origin macaques having, on average, half as many CCL3L gene copies as Chinese-origin macaques. Lastly, we confirm that CCL3L shows variable copy number in humans and chimpanzees and report on CCL3L CNV within and among three additional primate species. On the basis of our findings we suggest that (1) the difference in population level copy number may explain previously reported observations of longer post-infection survivorship of Chinese-origin rhesus macaques, (2) stratification by CCL3L copy number in rhesus SIV vaccine trials will increase power and reduce noise due to non-vaccine-related differences in survival, and (3) CCL3L CNV is an ancestral component of the primate immune response and, therefore, copy number variation has not been driven by HIV or SIV per se.

  14. FunGeneClusterS

    DEFF Research Database (Denmark)

    Vesth, Tammi Camilla; Brandl, Julian; Andersen, Mikael Rørdam

    2016-01-01

    Secondary metabolites of fungi are receiving an increasing amount of interest due to their prolific bioactivities and the fact that fungal biosynthesis of secondary metabolites often occurs from co-regulated and co-located gene clusters. This makes the gene clusters attractive for synthetic biology and industrial biotechnology applications. We have previously published a method for accurate prediction of clusters from genome and transcriptome data, which could also suggest cross-chemistry, however, this method was limited both in the number of parameters which could be adjusted as well as in user...

  15. Organization of an echinoderm Hox gene cluster

    OpenAIRE

    Martinez, Pedro; Rast, Jonathan P.; Arenas-Mena, César; Davidson, Eric H.

    1999-01-01

    The Strongylocentrotus purpuratus genome contains a single ten-gene Hox complex >0.5 megabase in length. This complex was isolated on overlapping bacterial artificial chromosome and P1 artificial chromosome genomic recombinants by using probes for individual genes and by genomic walking. Echinoderm Hox genes of Paralog Groups (PG) 1 and 2 are reported. The cluster includes genes representing all paralog groups of vertebrate Hox clusters, except that there is a sing...

  16. Association between HLA-DQA1 gene copy number polymorphisms ...

    Indian Academy of Sciences (India)

    2014-04-21

    Apr 21, 2014 ... RESEARCH NOTE. Association between HLA-DQA1 gene copy number polymorphisms and susceptibility to rheumatoid arthritis in. Chinese Han ..... 2009 Combinatorial content of CCL3L and CCL4L gene copy numbers influence HIV-AIDS susceptibility in Ukrainian children. AIDS 23, 679–688. Sirota M.

  17. Fuzzy clustering analysis of osteosarcoma related genes.

    Science.gov (United States)

    Chen, Kai; Wu, Dajiang; Bai, Yushu; Zhu, Xiaodong; Chen, Ziqiang; Wang, Chuanfeng; Zhao, Yingchuan; Li, Ming

    2014-07-01

    Osteosarcoma is the most common malignant bone tumor, with peak incidence during the second and third decades of life. To explore the influence of genetic factors on the mechanism of osteosarcoma, and to provide potential genetic references for its prevention, diagnosis and treatment, we analyzed the interrelationship between osteosarcoma and its related genes. We collected osteosarcoma-related gene sequences from GenBank at the National Center for Biotechnology Information (NCBI) and carried out pairwise local alignment to measure the association among the sequences. A fuzzy clustering method was then used to link unknown genes with known osteosarcoma-related genes falling in the same class. From the fuzzy clustering results, we classified the osteosarcoma-related genes into two groups and deduced that genes clustered into one group have similar functions. On this basis, we found more genes related to the pathogenesis of osteosarcoma, which could exert functions similar to that of Runx2, a confirmed risk factor in osteosarcoma. This study may help to better understand the genetic mechanism of osteosarcoma and provide new molecular markers and therapies.
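
    The record does not say which fuzzy clustering procedure was applied. As a minimal sketch, assuming the classical approach of taking the max-min transitive closure of a pairwise similarity matrix and cutting it at a threshold (a lambda-cut), the idea can be illustrated as follows. The similarity values, the threshold and all function names are hypothetical and are not taken from the study.

```python
def maxmin_compose(r, s):
    """Max-min composition of two square fuzzy relations given as nested lists."""
    n = len(r)
    return [
        [max(min(r[i][k], s[k][j]) for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


def transitive_closure(similarity):
    """Square the relation repeatedly until it stops changing.

    Assumes the input is reflexive (1.0 on the diagonal) and symmetric, so the
    iteration is monotone and reaches a fixed point.
    """
    current = [row[:] for row in similarity]
    while True:
        nxt = maxmin_compose(current, current)
        if nxt == current:
            return current
        current = nxt


def lambda_cut_clusters(closure, lam):
    """Group items whose closure similarity is at least lam."""
    n = len(closure)
    assigned = set()
    clusters = []
    for i in range(n):
        if i in assigned:
            continue
        members = [j for j in range(n) if closure[i][j] >= lam]
        clusters.append(members)
        assigned.update(members)
    return clusters


# Hypothetical similarity scores among four sequences (e.g. scaled alignment scores).
toy_similarity = [
    [1.0, 0.8, 0.3, 0.2],
    [0.8, 1.0, 0.4, 0.2],
    [0.3, 0.4, 1.0, 0.7],
    [0.2, 0.2, 0.7, 1.0],
]

if __name__ == "__main__":
    closure = transitive_closure(toy_similarity)
    print(lambda_cut_clusters(closure, 0.5))
```

    Lowering the threshold merges groups and raising it splits them, so the number of classes obtained depends on the chosen cut.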

  18. Pichia stipitis genomics, transcriptomics, and gene clusters

    Science.gov (United States)

    Thomas W. Jeffries; Jennifer R. Headman Van Vleet

    2009-01-01

    Genome sequencing and subsequent global gene expression studies have advanced our understanding of the lignocellulose-fermenting yeast Pichia stipitis. These studies have provided an insight into its central carbon metabolism, and analysis of its genome has revealed numerous functional gene clusters and tandem repeats. Specialized physiological traits are often the...

  19. Semi-supervised consensus clustering for gene expression data analysis

    OpenAIRE

    Wang, Yunli; Pan, Youlian

    2014-01-01

    Background Simple clustering methods such as hierarchical clustering and k-means are widely used for gene expression data analysis; but they are unable to deal with noise and high dimensionality associated with the microarray gene expression data. Consensus clustering appears to improve the robustness and quality of clustering results. Incorporating prior knowledge in clustering process (semi-supervised clustering) has been shown to improve the consistency between the data partitioning and do...
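
    Consensus clustering combines several partitions of the same genes into one. The sketch below shows one common variant, a co-association matrix followed by thresholding, in plain Python. It is only an illustration under stated assumptions: the label vectors are hypothetical, the 0.5 threshold is arbitrary, and it does not reproduce the semi-supervised method of this record, which also uses prior knowledge.

```python
def coassociation(partitions):
    """Fraction of partitions in which each pair of items shares a cluster."""
    n = len(partitions[0])
    counts = [[0.0] * n for _ in range(n)]
    for labels in partitions:
        if len(labels) != n:
            raise ValueError("all partitions must label the same items")
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1.0
    runs = len(partitions)
    return [[value / runs for value in row] for row in counts]


def consensus_clusters(co, threshold=0.5):
    """Connected components of the graph linking pairs with co-association > threshold."""
    n = len(co)
    seen = set()
    clusters = []
    for start in range(n):
        if start in seen:
            continue
        stack = [start]
        seen.add(start)
        component = []
        while stack:
            i = stack.pop()
            component.append(i)
            for j in range(n):
                if j not in seen and co[i][j] > threshold:
                    seen.add(j)
                    stack.append(j)
        clusters.append(sorted(component))
    return clusters


# Hypothetical cluster labels for five genes from three base clustering runs.
toy_partitions = [
    [0, 0, 1, 1, 1],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 1, 1],
]

if __name__ == "__main__":
    co = coassociation(toy_partitions)
    print(consensus_clusters(co))
```

    Label values need not agree between runs, because only co-membership within each run is counted.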

  20. Gene ordering in partitive clustering using microarray expressions

    Indian Academy of Sciences (India)

    PRAKASH KUMAR

    fibroblast data and showed that our approach improves the result quality of partitive clustering solution, by identifying subclusters within big clusters, grouping functionally correlated genes within clusters, minimization of summation of gene expression distances, and the maximization of biological gene ordering using MIPS ...

  1. Microarray gene cluster identification and annotation through cluster ensemble and EM-based informative textual summarization.

    Science.gov (United States)

    Hu, Xiaohua; Park, E K; Zhang, Xiaodan

    2009-09-01

    Generating high-quality gene clusters and identifying the underlying biological mechanism of the gene clusters are the important goals of clustering gene expression analysis. To get high-quality cluster results, most of the current approaches rely on choosing the best cluster algorithm, in which the design biases and assumptions meet the underlying distribution of the dataset. There are two issues for this approach: 1) usually, the underlying data distribution of the gene expression datasets is unknown and 2) there are so many clustering algorithms available and it is very challenging to choose the proper one. To provide a textual summary of the gene clusters, the most explored approach is the extractive approach that essentially builds upon techniques borrowed from the information retrieval, in which the objective is to provide terms to be used for query expansion, and not to act as a stand-alone summary for the entire document sets. Another drawback is that the clustering quality and cluster interpretation are treated as two isolated research problems and are studied separately. In this paper, we design and develop a unified system Gene Expression Miner to address these challenging issues in a principled and general manner by integrating cluster ensemble, text clustering, and multidocument summarization and provide an environment for comprehensive gene expression data analysis. We present a novel cluster ensemble approach to generate high-quality gene cluster. In our text summarization module, given a gene cluster, our expectation-maximization based algorithm can automatically identify subtopics and extract most probable terms for each topic. Then, the extracted top k topical terms from each subtopic are combined to form the biological explanation of each gene cluster. Experimental results demonstrate that our system can obtain high-quality clusters and provide informative key terms for the gene clusters.

  2. Some statistical properties of gene expression clustering for array data

    DEFF Research Database (Denmark)

    Abreu, G C G; Pinheiro, A; Drummond, R D

    2010-01-01

    DNA arrays have been a rich source of data for the study of genomic expression of a wide variety of biological systems. Gene clustering is one of the widely used paradigms for assessing the significance of a gene (or group of genes). However, most of the gene clustering techniques are applied to cDNA...

  3. Gene ordering in partitive clustering using microarray expressions

    Indian Academy of Sciences (India)

    PRAKASH KUMAR

    the new hybrid approach, finds comparable or sometimes superior biological gene order in less computation time than those obtained by optimal leaf ordering in hierarchical clustering solution. Ray S S, Bandyopadhyay S and Pal S K 2007 Gene ordering in partitive clustering using microarray expressions; J. Biosci.

  4. Unique nucleotide polymorphism of ankyrin gene cluster in ...

    Indian Academy of Sciences (India)

    The ankyrin (ANK) gene cluster is a part of a multigene family encoding ANK transmembrane proteins in Arabidopsis thaliana, and plays an important role in protein–protein interactions and in signal pathways. In contrast to other regions of a genome, the ANK gene cluster exhibits an extremely high level of DNA ...

  5. Bioinformatics Prediction of Polyketide Synthase Gene Clusters from Mycosphaerella fijiensis.

    Directory of Open Access Journals (Sweden)

    Roslyn D Noar

    Full Text Available Mycosphaerella fijiensis, causal agent of black Sigatoka disease of banana, is a Dothideomycete fungus closely related to fungi that produce polyketides important for plant pathogenicity. We utilized the M. fijiensis genome sequence to predict PKS genes and their gene clusters and make bioinformatics predictions about the types of compounds produced by these clusters. Eight PKS gene clusters were identified in the M. fijiensis genome, placing M. fijiensis into the 23rd percentile for the number of PKS genes compared to other Dothideomycetes. Analysis of the PKS domains identified three of the PKS enzymes as non-reducing and two as highly reducing. Gene clusters contained types of genes frequently found in PKS clusters including genes encoding transporters, oxidoreductases, methyltransferases, and non-ribosomal peptide synthases. Phylogenetic analysis identified a putative PKS cluster encoding melanin biosynthesis. None of the other clusters were closely aligned with genes encoding known polyketides, however three of the PKS genes fell into clades with clusters encoding alternapyrone, fumonisin, and solanapyrone produced by Alternaria and Fusarium species. A search for homologs among available genomic sequences from 103 Dothideomycetes identified close homologs (>80% similarity) for six of the PKS sequences. One of the PKS sequences was not similar (< 60% similarity) to sequences in any of the 103 genomes, suggesting that it encodes a unique compound. Comparison of the M. fijiensis PKS sequences with those of two other banana pathogens, M. musicola and M. eumusae, showed that these two species have close homologs to five of the M. fijiensis PKS sequences, but three others were not found in either species. RT-PCR and RNA-Seq analysis showed that the melanin PKS cluster was down-regulated in infected banana as compared to growth in culture. Three other clusters, however were strongly upregulated during disease development in banana, suggesting that

  6. Bioinformatics Prediction of Polyketide Synthase Gene Clusters from Mycosphaerella fijiensis.

    Science.gov (United States)

    Noar, Roslyn D; Daub, Margaret E

    2016-01-01

    Mycosphaerella fijiensis, causal agent of black Sigatoka disease of banana, is a Dothideomycete fungus closely related to fungi that produce polyketides important for plant pathogenicity. We utilized the M. fijiensis genome sequence to predict PKS genes and their gene clusters and make bioinformatics predictions about the types of compounds produced by these clusters. Eight PKS gene clusters were identified in the M. fijiensis genome, placing M. fijiensis into the 23rd percentile for the number of PKS genes compared to other Dothideomycetes. Analysis of the PKS domains identified three of the PKS enzymes as non-reducing and two as highly reducing. Gene clusters contained types of genes frequently found in PKS clusters including genes encoding transporters, oxidoreductases, methyltransferases, and non-ribosomal peptide synthases. Phylogenetic analysis identified a putative PKS cluster encoding melanin biosynthesis. None of the other clusters were closely aligned with genes encoding known polyketides, however three of the PKS genes fell into clades with clusters encoding alternapyrone, fumonisin, and solanapyrone produced by Alternaria and Fusarium species. A search for homologs among available genomic sequences from 103 Dothideomycetes identified close homologs (>80% similarity) for six of the PKS sequences. One of the PKS sequences was not similar (< 60% similarity) to sequences in any of the 103 genomes, suggesting that it encodes a unique compound. Comparison of the M. fijiensis PKS sequences with those of two other banana pathogens, M. musicola and M. eumusae, showed that these two species have close homologs to five of the M. fijiensis PKS sequences, but three others were not found in either species. RT-PCR and RNA-Seq analysis showed that the melanin PKS cluster was down-regulated in infected banana as compared to growth in culture. Three other clusters, however were strongly upregulated during disease development in banana, suggesting that they may encode

  7. Nearest Neighbor Networks: clustering expression data based on gene neighborhoods

    Directory of Open Access Journals (Sweden)

    Olszewski Kellen L

    2007-07-01

    Full Text Available Abstract Background The availability of microarrays measuring thousands of genes simultaneously across hundreds of biological conditions represents an opportunity to understand both individual biological pathways and the integrated workings of the cell. However, translating this amount of data into biological insight remains a daunting task. An important initial step in the analysis of microarray data is clustering of genes with similar behavior. A number of classical techniques are commonly used to perform this task, particularly hierarchical and K-means clustering, and many novel approaches have been suggested recently. While these approaches are useful, they are not without drawbacks; these methods can find clusters in purely random data, and even clusters enriched for biological functions can be skewed towards a small number of processes (e.g. ribosomes). Results We developed Nearest Neighbor Networks (NNN), a graph-based algorithm to generate clusters of genes with similar expression profiles. This method produces clusters based on overlapping cliques within an interaction network generated from mutual nearest neighborhoods. This focus on nearest neighbors rather than on absolute distance measures allows us to capture clusters with high connectivity even when they are spatially separated, and requiring mutual nearest neighbors allows genes with no sufficiently similar partners to remain unclustered. We compared the clusters generated by NNN with those generated by eight other clustering methods. NNN was particularly successful at generating functionally coherent clusters with high precision, and these clusters generally represented a much broader selection of biological processes than those recovered by other methods. Conclusion The Nearest Neighbor Networks algorithm is a valuable clustering method that effectively groups genes that are likely to be functionally related. It is particularly attractive due to its simplicity, its success in the
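
    The mutual-nearest-neighbor idea described in this abstract can be illustrated with a small sketch. This is not the published NNN implementation: the function names, the use of Pearson correlation as the similarity measure, and the toy expression profiles are all assumptions made for illustration, and the clique-finding step that NNN performs on the resulting network is omitted.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def mutual_knn_graph(profiles, k):
    """Link two genes only if each is among the other's k most similar genes.

    `profiles` maps a gene name to a list of expression values (hypothetical
    input format). Returns a dict mapping each gene to its set of mutual
    neighbors; genes without a mutual partner get an empty set.
    """
    names = list(profiles)
    top = {}
    for g in names:
        ranked = sorted(
            (h for h in names if h != g),
            key=lambda h: pearson(profiles[g], profiles[h]),
            reverse=True,
        )
        top[g] = set(ranked[:k])
    return {g: {h for h in top[g] if g in top[h]} for g in names}


# Toy data (made up): two co-expressed pairs and one gene with no mutual partner.
profiles = {
    "a": [1, 2, 3, 4],
    "b": [2, 4, 6, 8.5],
    "c": [4, 3, 2, 1],
    "d": [8, 6, 4.5, 2],
    "e": [1, 5, 1, 5],
}
graph = mutual_knn_graph(profiles, k=1)
```

    In this toy run, gene "e" ranks "b" as its nearest neighbor, but "b" prefers "a", so "e" stays unlinked. That is the property the abstract highlights: genes with no sufficiently similar partners can remain unclustered.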

  8. Differential Retention of Gene Functions in a Secondary Metabolite Cluster.

    Science.gov (United States)

    Reynolds, Hannah T; Slot, Jason C; Divon, Hege H; Lysøe, Erik; Proctor, Robert H; Brown, Daren W

    2017-08-01

    In fungi, distribution of secondary metabolite (SM) gene clusters is often associated with host- or environment-specific benefits provided by SMs. In the plant pathogen Alternaria brassicicola (Dothideomycetes), the DEP cluster confers an ability to synthesize the SM depudecin, a histone deacetylase inhibitor that contributes weakly to virulence. The DEP cluster includes genes encoding enzymes, a transporter, and a transcription regulator. We investigated the distribution and evolution of the DEP cluster in 585 fungal genomes and found a wide but sporadic distribution among Dothideomycetes, Sordariomycetes, and Eurotiomycetes. We confirmed DEP gene expression and depudecin production in one fungus, Fusarium langsethiae. Phylogenetic analyses suggested 6-10 horizontal gene transfers (HGTs) of the cluster, including a transfer that led to the presence of closely related cluster homologs in Alternaria and Fusarium. The analyses also indicated that HGTs were frequently followed by loss/pseudogenization of one or more DEP genes. Independent cluster inactivation was inferred in at least four fungal classes. Analyses of transitions among functional, pseudogenized, and absent states of DEP genes among Fusarium species suggest enzyme-encoding genes are lost at higher rates than the transporter (DEP3) and regulatory (DEP6) genes. The phenotype of an experimentally-induced DEP3 mutant of Fusarium did not support the hypothesis that selective retention of DEP3 and DEP6 protects fungi from exogenous depudecin. Together, the results suggest that HGT and gene loss have contributed significantly to DEP cluster distribution, and that some DEP genes provide a greater fitness benefit possibly due to a differential tendency to form network connections. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2017. This work is written by US Government employees and is in the public domain in the US.

  9. Clustering Algorithms: Their Application to Gene Expression Data.

    Science.gov (United States)

    Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

    2016-01-01

    Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data offers a major opportunity to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpreting the resulting mass of data, which consists of millions of measurements; these data also exhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure.

  10. Gene cluster encoding cholate catabolism in Rhodococcus spp.

    Science.gov (United States)

    Mohn, William W; Wilbrink, Maarten H; Casabon, Israël; Stewart, Gordon R; Liu, Jie; van der Geize, Robert; Eltis, Lindsay D

    2012-12-01

    Bile acids are highly abundant steroids with important functions in vertebrate digestion. Their catabolism by bacteria is an important component of the carbon cycle, contributes to gut ecology, and has potential commercial applications. We found that Rhodococcus jostii RHA1 grows well on cholate, as well as on its conjugates, taurocholate and glycocholate. The transcriptome of RHA1 growing on cholate revealed 39 genes upregulated on cholate, occurring in a single gene cluster. Reverse transcriptase quantitative PCR confirmed that selected genes in the cluster were upregulated 10-fold on cholate versus on cholesterol. One of these genes, kshA3, encoding a putative 3-ketosteroid-9α-hydroxylase, was deleted and found essential for growth on cholate. Two coenzyme A (CoA) synthetases encoded in the cluster, CasG and CasI, were heterologously expressed. CasG was shown to transform cholate to cholyl-CoA, thus initiating side chain degradation. CasI was shown to form CoA derivatives of steroids with isopropanoyl side chains, likely occurring as degradation intermediates. Orthologous gene clusters were identified in all available Rhodococcus genomes, as well as that of Thermomonospora curvata. Moreover, Rhodococcus equi 103S, Rhodococcus ruber Chol-4 and Rhodococcus erythropolis SQ1 each grew on cholate. In contrast, several mycolic acid bacteria lacking the gene cluster were unable to grow on cholate. Our results demonstrate that the above-mentioned gene cluster encodes cholate catabolism and is distinct from a more widely occurring gene cluster encoding cholesterol catabolism.

  11. Application of Gene Shaving and Mixture Models to Cluster Microarray Gene Expression Data

    Directory of Open Access Journals (Sweden)

    S. Wen

    2007-01-01

    Full Text Available Researchers are frequently faced with the analysis of microarray data of a relatively large number of genes using a small number of tissue samples. We examine the application of two statistical methods for clustering such microarray expression data: EMMIX-GENE and GeneClust. EMMIX-GENE is a mixture-model based clustering approach, designed primarily to cluster tissue samples on the basis of the genes. GeneClust is an implementation of the gene shaving methodology, motivated by research to identify distinct sets of genes for which variation in expression could be related to a biological property of the tissue samples. We illustrate the use of these two methods in the analysis of Affymetrix oligonucleotide arrays of well-known data sets from colon tissue samples with and without tumors, and of tumor tissue samples from patients with leukemia. Although the two approaches have been developed from different perspectives, the results demonstrate a clear correspondence between gene clusters produced by GeneClust and EMMIX-GENE for the colon tissue data. It is demonstrated, for the case of ribosomal proteins and smooth muscle genes in the colon data set, that both methods can classify genes into co-regulated families. It is further demonstrated that tissue types (tumor and normal) can be separated on the basis of subtle distributed patterns of genes. Application to the leukemia tissue data produces a division of tissues corresponding closely to the external classification, acute myeloid leukemia (AML) and acute lymphoblastic leukaemia (ALL), for both methods. In addition, we also identify genes specific for the subgroup of ALL-T cell samples. Overall, we find that the gene shaving method produces gene clusters at great speed; allows variable cluster sizes and can incorporate partial or full supervision; and finds clusters of genes in which the gene expression varies greatly over the tissue samples while maintaining a high level of coherence between the

  12. Characterization of the largest effector gene cluster of Ustilago maydis.

    Directory of Open Access Journals (Sweden)

    Thomas Brefort

    2014-07-01

    Full Text Available In the genome of the biotrophic plant pathogen Ustilago maydis, many of the genes coding for secreted protein effectors modulating virulence are arranged in gene clusters. The vast majority of these genes encode novel proteins whose expression is coupled to plant colonization. The largest of these gene clusters, cluster 19A, encodes 24 secreted effectors. Deletion of the entire cluster results in severe attenuation of virulence. Here we present the functional analysis of this genomic region. We show that a 19A deletion mutant behaves like an endophyte, i.e. is still able to colonize plants and complete the infection cycle. However, tumors, the most conspicuous symptoms of maize smut disease, are only rarely formed and fungal biomass in infected tissue is significantly reduced. The generation and analysis of strains carrying sub-deletions identified several genes significantly contributing to tumor formation after seedling infection. Another of the effectors could be linked specifically to anthocyanin induction in the infected tissue. As the individual contributions of these genes to tumor formation were small, we studied the response of maize plants to the whole cluster mutant as well as to several individual mutants by array analysis. This revealed distinct plant responses, demonstrating that the respective effectors have discrete plant targets. We propose that the analysis of plant responses to effector mutant strains that lack a strong virulence phenotype may be a general way to visualize differences in effector function.

  13. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    Directory of Open Access Journals (Sweden)

    Zhimin Dai

    Full Text Available Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. The nifQ gene falls into the same transcription unit as the fixABCX genes, which has not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  14. Calcitonin gene-related peptide antagonism and cluster headache

    DEFF Research Database (Denmark)

    Ashina, Håkan; Newman, Lawrence; Ashina, Sait

    2017-01-01

    Calcitonin gene-related peptide (CGRP) is a key signaling molecule involved in migraine pathophysiology. Efficacy of CGRP monoclonal antibodies and antagonists in migraine treatment has fueled an increasing interest in the prospect of treating cluster headache (CH) with CGRP antagonism. The exact role of CGRP and its mechanism of action in CH have not been fully clarified. A search for original studies and randomized controlled trials (RCTs) published in English was performed in PubMed and in ClinicalTrials.gov. The search terms used were "cluster headache and calcitonin gene related peptide" and "primary headaches and calcitonin gene related peptide." Reference lists of identified articles were also searched for additional relevant papers. Human experimental studies have reported elevated plasma CGRP levels during both spontaneous and glyceryl trinitrate-induced cluster attacks. CGRP may play...

  15. IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering.

    Science.gov (United States)

    Wu, Lingxiang; Chen, Xiujie; Zhang, Denan; Zhang, Wubing; Liu, Lei; Ma, Hongzhe; Yang, Jingbo; Xie, Hongbo; Liu, Bo; Jin, Qing

    2016-01-01

    Analysis of gene sets has been widely applied in various high-throughput biological studies. One weakness in the traditional methods is that they neglect the heterogeneity of gene expression across samples, which may lead to the omission of some specific and important gene sets. It is also difficult for them to reflect the severity of disease and provide expression profiles of gene sets for individuals. We developed an application called IGSA that combines gene set enrichment with sample clustering. IGSA calculates gene set expression scores for each sample and uses an accumulating clustering strategy to let the samples gather into the set according to the progress of disease from mild to severe. We focus on gastric, pancreatic and ovarian cancer data sets to evaluate the performance of IGSA. We also compared the results of IGSA in KEGG pathway enrichment with David, GSEA, SPIA and ssGSEA, and analyzed the results of IGSA clustering and different similarity measurement methods. Notably, IGSA proved to be more sensitive and specific in finding significant pathways, and can indicate related changes in pathways with the severity of disease. In addition, IGSA provides a significant gene set profile for each sample.

  16. Mining Association Rules among Gene Functions in Clusters of Similar Gene Expression Maps.

    Science.gov (United States)

    An, Li; Obradovic, Zoran; Smith, Desmond; Bodenreider, Olivier; Megalooikonomou, Vasileios

    2009-11-01

    Association rules mining methods have been recently applied to gene expression data analysis to reveal relationships between genes and different conditions and features. However, not much effort has focused on detecting the relation between gene expression maps and related gene functions. Here we describe such an approach to mine association rules among gene functions in clusters of similar gene expression maps on mouse brain. The experimental results show that the detected association rules make sense biologically. By inspecting the obtained clusters and the genes having the gene functions of frequent itemsets, interesting clues were discovered that provide valuable insight to biological scientists. Moreover, discovered association rules can be potentially used to predict gene functions based on similarity of gene expression maps.

  17. Gene clustering by latent semantic indexing of MEDLINE abstracts.

    Science.gov (United States)

    Homayouni, Ramin; Heinrich, Kevin; Wei, Lai; Berry, Michael W

    2005-01-01

    A major challenge in the interpretation of high-throughput genomic data is understanding the functional associations between genes. Previously, several approaches have been described to extract gene relationships from various biological databases using term-matching methods. However, more flexible automated methods are needed to identify functional relationships (both explicit and implicit) between genes from the biomedical literature. In this study, we explored the utility of Latent Semantic Indexing (LSI), a vector space model for information retrieval, to automatically identify conceptual gene relationships from titles and abstracts in MEDLINE citations. We found that LSI identified gene-to-gene and keyword-to-gene relationships with high average precision. In addition, LSI identified implicit gene relationships based on word usage patterns in the gene abstract documents. Finally, we demonstrate here that pairwise distances derived from the vector angles of gene abstract documents can be effectively used to functionally group genes by hierarchical clustering. Our results provide proof-of-principle that LSI is a robust automated method to elucidate both known (explicit) and unknown (implicit) gene relationships from the biomedical literature. These features make LSI particularly useful for the analysis of novel associations discovered in genomic experiments. The 50-gene document collection used in this study can be interactively queried at http://shad.cs.utk.edu/sgo/sgo.html.
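
    The angle-based distance mentioned in this abstract can be sketched as follows. This is an illustrative sketch only, not the authors' pipeline: it works on raw term counts and skips the rank-reducing singular value decomposition that defines LSI, and the function names and the three toy "gene documents" are made up for the example.

```python
from collections import Counter
from math import acos, pi, sqrt


def term_vector(text):
    """Bag-of-words term counts for one gene's abstract document."""
    return Counter(text.lower().split())


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(count * v[term] for term, count in u.items() if term in v)
    nu = sqrt(sum(c * c for c in u.values()))
    nv = sqrt(sum(c * c for c in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


def angular_distance(u, v):
    """Angle between the vectors, scaled to [0, 1] for non-negative counts."""
    s = max(-1.0, min(1.0, cosine_similarity(u, v)))  # guard against rounding
    return acos(s) / (pi / 2)


# Hypothetical gene documents; real input would be MEDLINE titles and abstracts.
docs = {
    "geneA": "kinase signaling cell cycle",
    "geneB": "kinase signaling apoptosis",
    "geneC": "ribosome translation rna",
}
vectors = {gene: term_vector(text) for gene, text in docs.items()}
d_ab = angular_distance(vectors["geneA"], vectors["geneB"])
d_ac = angular_distance(vectors["geneA"], vectors["geneC"])
```

    Here geneA and geneB share terms and end up closer than geneA and geneC, which share none. A matrix of such pairwise distances is the kind of input a hierarchical clustering routine would take; in LSI proper, the vectors would first be projected into a lower-dimensional space so that genes can also be related through indirectly shared vocabulary.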

  18. The gentle art of gene arrangement: the meaning of gene clusters

    Science.gov (United States)

    Trowsdale, John

    2002-01-01

    Genome sequence comparisons reveal that some sets of genes are in similar linkage groups in different organisms while other sets are dispersed. Are some linkage groups maintained by chance, or is there an advantage to such an arrangement? Some insights may come from large clusters of genes, such as the major histocompatibility complex which includes many genes involved in immune defense. PMID:11897017

  19. The gentle art of gene arrangement: the meaning of gene clusters

    OpenAIRE

    Trowsdale, John

    2002-01-01

    Genome sequence comparisons reveal that some sets of genes are in similar linkage groups in different organisms while other sets are dispersed. Are some linkage groups maintained by chance, or is there an advantage to such an arrangement? Some insights may come from large clusters of genes, such as the major histocompatibility complex which includes many genes involved in immune defense.

  20. Cloning and Heterologous Expression of the Grecocycline Biosynthetic Gene Cluster.

    Directory of Open Access Journals (Sweden)

    Oksana Bilyk

    Full Text Available Transformation-associated recombination (TAR) in yeast is a rapid and inexpensive method for cloning and assembly of large DNA fragments, which relies on natural homologous recombination. Two vectors, based on p15a and F-factor replicons that can be maintained in yeast, E. coli and streptomycetes have been constructed. These vectors have been successfully employed for assembly of the grecocycline biosynthetic gene cluster from Streptomyces sp. Acta 1362. Fragments of the cluster were obtained by PCR and transformed together with the "capture" vector into the yeast cells, yielding a construct carrying the entire gene cluster. The obtained construct was heterologously expressed in S. albus J1074, yielding several grecocycline congeners. Grecocyclines have unique structural moieties such as a disaccharide side chain, an additional amino sugar at the C-5 position and a thiol group. Enzymes from this pathway may be used for the derivatization of known active angucyclines in order to improve their desired biological properties.

  1. Accurate prediction of secondary metabolite gene clusters in filamentous fungi

    DEFF Research Database (Denmark)

    Andersen, Mikael Rørdam; Nielsen, Jakob Blæsbjerg; Klitgaard, Andreas

    2013-01-01

    supporting enzymes for key synthases one cluster at a time. In this study, we design and apply a DNA expression array for Aspergillus nidulans in combination with legacy data to form a comprehensive gene expression compendium. We apply a guilt-by-association-based analysis to predict the extent...

  2. The fimbrial gene cluster of Haemophilus influenzae type b

    NARCIS (Netherlands)

    van Ham, S. M.; van Alphen, L.; Mooi, F. R.; van Putten, J. P.

    1994-01-01

    Haemophilus influenzae infections are preceded by airway colonization, a process facilitated by fimbriae. Here, we identified the complete fimbrial gene cluster of H. influenzae type b. HifA forms the major subunit. HifB, a periplasmic chaperone, and HifC, an outer membrane usher, are typical

  3. The fimbria gene cluster of nonencapsulated Haemophilus influenzae

    NARCIS (Netherlands)

    Geluk, F.; Eijk, P. P.; van Ham, S. M.; Jansen, H. M.; van Alphen, L.

    1998-01-01

    The occurrence of fimbria gene clusters in nonencapsulated Haemophilus influenzae strains from chronic bronchitis patients (n = 58), patients with acute otitis media (n = 13), and healthy carriers (n = 12) was determined by DNA hybridization and PCR, based on sequences of fimbriate H. influenzae

  4. The ergot alkaloid gene cluster: Functional analyses and evolutionary aspects

    Czech Academy of Sciences Publication Activity Database

    Lorenz, N.; Haarmann, T.; Pažoutová, Sylvie; Jung, M.; Tudzynski, P.

    2009-01-01

    Vol. 70, No. 15-16 (2009), pp. 1822-1832 ISSN 0031-9422 Institutional research plan: CEZ:AV0Z50200510 Keywords: Claviceps purpurea * Ergot fungus * Ergot alkaloid gene cluster Subject RIV: EE - Microbiology, Virology Impact factor: 3.104, year: 2009

  5. Unique nucleotide polymorphism of ankyrin gene cluster in ...

    Indian Academy of Sciences (India)

    Genomics 19, 478–493. Krumlauf R. 1992 Evolution of the vertebrate Hox homeobox genes. BioEssays 14, 245–252. Kuittinen H. and Aguadé M. 2000 Nucleotide variation at the CHALCONE ISOMERASE locus in Arabidopsis thaliana. Genetics 155, 863–872. Lercher M. J., Urrutia A. O. and Hurst L. D. 2002 Clustering of.

  6. Gene ordering in partitive clustering using microarray expressions

    Indian Academy of Sciences (India)

    2007-06-28

    Two existing algorithms for optimally ordering cities in the travelling salesman problem (TSP), namely FRAG_GALK and Concorde, are hybridized individually with a self-organizing map to show the importance of gene ordering in a partitive clustering framework. We validated our hybrid approach using yeast and ...

  7. The Fusarium graminearum genome reveals more secondary metabolite gene clusters and hints of horizontal gene transfer.

    Directory of Open Access Journals (Sweden)

    Christian M K Sieber

    Full Text Available Fungal secondary metabolite biosynthesis genes are of major interest due to the pharmacological properties of their products (like mycotoxins and antibiotics). The genome of the plant pathogenic fungus Fusarium graminearum codes for a large number of candidate enzymes involved in secondary metabolite biosynthesis. However, the chemical nature of most enzymatic products of proteins encoded by putative secondary metabolism biosynthetic genes is largely unknown. Based on our analysis we present 67 gene clusters with significant enrichment of predicted secondary metabolism related enzymatic functions. 20 gene clusters with unknown metabolites exhibit strong gene expression correlation in planta and presumably play a role in virulence. Furthermore, the identification of conserved and over-represented putative transcription factor binding sites serves as additional evidence for cluster co-regulation. Orthologous cluster search provided insight into the evolution of secondary metabolism clusters. Some clusters are characteristic for the Fusarium phylum while others show evidence of horizontal gene transfer as orthologs can be found in representatives of the Botrytis or Cochliobolus lineage. The presented candidate clusters provide valuable targets for experimental examination.

  8. PEACE: Parallel Environment for Assembly and Clustering of Gene Expression.

    Science.gov (United States)

    Rao, D M; Moler, J C; Ozden, M; Zhang, Y; Liang, C; Karro, J E

    2010-07-01

    We present PEACE, a stand-alone tool for high-throughput ab initio clustering of transcript fragment sequences produced by Next Generation or Sanger Sequencing technologies. It is freely available from www.peace-tools.org. Installed and managed through a downloadable user-friendly graphical user interface (GUI), PEACE can process large data sets of transcript fragments of length 50 bases or greater, grouping the fragments by gene associations with a sensitivity comparable to leading clustering tools. Once clustered, the user can employ the GUI's analysis functions, facilitating the easy collection of statistics and allowing them to single out specific clusters for more comprehensive study or assembly. Using a novel minimum spanning tree-based clustering method, PEACE is the equal of leading tools in the literature, with an interface making it accessible to any user. It produces results of quality virtually identical to those of the WCD tool when applied to Sanger sequences, significantly improved results over WCD and TGICL when applied to the products of Next Generation Sequencing Technology and significantly improved results over Cap3 in both cases. In short, PEACE provides an intuitive GUI and a feature-rich, parallel clustering engine that proves to be a valuable addition to the leading cDNA clustering tools.
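
    The general idea behind minimum-spanning-tree clustering, which this abstract names as PEACE's approach, can be sketched as follows. The abstract does not state PEACE's sequence distance measure or its rule for cutting the tree, so both are assumptions here: the sketch takes an arbitrary distance function and a fixed cut-off (`threshold`), and the toy data are plain numbers rather than transcript fragments.

```python
def _find(parent, x):
    """Union-find root lookup with path halving."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def mst_clusters(items, dist, threshold):
    """Cluster by building a minimum spanning tree and cutting long edges.

    `dist` is any symmetric distance function and `threshold` is the longest
    tree edge allowed inside a cluster (both hypothetical parameters).
    Returns clusters as sorted lists of item indices.
    """
    n = len(items)
    edges = sorted(
        (dist(items[i], items[j]), i, j)
        for i in range(n)
        for j in range(i + 1, n)
    )

    # Kruskal's algorithm: keep an edge only if it joins two separate trees.
    parent = list(range(n))
    mst = []
    for w, i, j in edges:
        ri, rj = _find(parent, i), _find(parent, j)
        if ri != rj:
            parent[ri] = rj
            mst.append((w, i, j))

    # Cut tree edges longer than the threshold; what remains are the clusters.
    parent = list(range(n))
    for w, i, j in mst:
        if w <= threshold:
            parent[_find(parent, i)] = _find(parent, j)

    groups = {}
    for i in range(n):
        groups.setdefault(_find(parent, i), []).append(i)
    return sorted(groups.values())


# Toy one-dimensional data standing in for sequence fragments.
values = [1.0, 1.2, 1.1, 5.0, 5.3, 9.0]
clusters = mst_clusters(values, lambda a, b: abs(a - b), threshold=1.0)
# clusters == [[0, 1, 2], [3, 4], [5]]
```

    Because the tree needs only n - 1 edges to connect n items, the cutting step is cheap once the tree exists; the expensive part in practice is computing pairwise distances, which is where a tool for large read sets would need to be selective.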

  9. Comparative genomics of natural killer cell receptor gene clusters.

    Directory of Open Access Journals (Sweden)

    James Kelley

    2005-08-01

    Full Text Available Many receptors on natural killer (NK) cells recognize major histocompatibility complex class I molecules in order to monitor unhealthy tissues, such as cells infected with viruses, and some tumors. Genes encoding families of NK receptors and related sequences are organized into two main clusters in humans: the natural killer complex on Chromosome 12p13.1, which encodes C-type lectin molecules, and the leukocyte receptor complex on Chromosome 19q13.4, which encodes immunoglobulin superfamily molecules. The composition of these gene clusters differs markedly between closely related species, providing evidence for rapid, lineage-specific expansions or contractions of sets of loci. The choice of NK receptor genes is polarized in the two species most studied, mouse and human. In mouse, the C-type lectin-related Ly49 gene family predominates. Conversely, the single Ly49 sequence is a pseudogene in humans, and the immunoglobulin superfamily KIR gene family is extensive. These different gene sets encode proteins that are comparable in function and genetic diversity, even though they have undergone species-specific expansions. Understanding the biological significance of this curious situation may be aided by studying which NK receptor genes are used in other vertebrates, especially in relation to species-specific differences in genes for major histocompatibility complex class I molecules.

  10. Evolution and differential expression of a vertebrate vitellogenin gene cluster

    Directory of Open Access Journals (Sweden)

    Kongshaug Heidi

    2009-01-01

    Full Text Available Abstract Background The multiplicity or loss of the vitellogenin (vtg) gene family in vertebrates has been argued to have broad implications for the mode of reproduction (placental or non-placental), cleavage pattern (meroblastic or holoblastic) and character of the egg (pelagic or benthic). Earlier proposals for the existence of three forms of vertebrate vtgs present conflicting models for their origin and subsequent duplication. Results By integrating phylogenetics of novel vtg transcripts from old and modern teleosts with syntenic analyses of all available genomic variants of non-metatherian vertebrates we identify the gene orthologies between the Sarcopterygii (tetrapod branch) and Actinopterygii (fish branch). We argue that the vertebrate vtg gene cluster originated in proto-chromosome m, but that vtg genes have subsequently duplicated and rearranged following whole genome duplications. Sequencing of a novel fourth vtg transcript in labrid species, and the presence of duplicated paralogs in certain model organisms supports the notion that lineage-specific gene duplications frequently occur in teleosts. The data show that the vtg gene cluster is more conserved between acanthomorph teleosts and tetrapods, than in ostariophysan teleosts such as the zebrafish. The differential expression of the labrid vtg genes is further consistent with the notion that neofunctionalized Aa-type vtgs are important determinants of the pelagic or benthic character of the eggs in acanthomorph teleosts. Conclusion The vertebrate vtg gene cluster existed prior to the separation of Sarcopterygii from Actinopterygii >450 million years ago, a period associated with the second round of whole genome duplication. The presence of higher copy numbers in a more highly expressed subcluster is particularly prevalent in teleosts. The differential expression and latent neofunctionalization of vtg genes in acanthomorph teleosts is an adaptive feature associated with oocyte hydration

  11. Clustered Xenopus keratin genes: A genomic, transcriptomic, and proteomic analysis.

    Science.gov (United States)

    Suzuki, Ken-Ichi T; Suzuki, Miyuki; Shigeta, Mitsuki; Fortriede, Joshua D; Takahashi, Shuji; Mawaribuchi, Shuuji; Yamamoto, Takashi; Taira, Masanori; Fukui, Akimasa

    2017-06-15

    Keratin genes belong to the intermediate filament superfamily and their expression is altered following morphological and physiological changes in vertebrate epithelial cells. Keratin genes are divided into two groups, type I and II, and are clustered on vertebrate genomes, including those of Xenopus species. Various keratin genes have been identified and characterized by their unique expression patterns throughout ontogeny in Xenopus laevis; however, compilation of previously reported and newly identified keratin genes in two Xenopus species is required for our further understanding of keratin gene evolution, not only in amphibians but also in all terrestrial vertebrates. In this study, 120 putative type I and II keratin genes in total were identified based on the genome data from two Xenopus species. We revealed that most of these genes are highly clustered on two homeologous chromosomes, XLA9_10 and XLA2 in X. laevis, and XTR10 and XTR2 in X. tropicalis, which are orthologous to those of human, showing conserved synteny among tetrapods. RNA-Seq data from various embryonic stages and adult tissues highlighted the unique expression profiles of orthologous and homeologous keratin genes in developmental stage- and tissue-specific manners. Moreover, we identified dozens of epidermal keratin proteins from the whole embryo, larval skin, tail, and adult skin using shotgun proteomics. In light of our results, we discuss the radiation, diversification, and unique expression of the clustered keratin genes, which are closely related to epidermal development and terrestrial adaptation during amphibian evolution, including Xenopus speciation. Copyright © 2016 Elsevier Inc. All rights reserved.

  12. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    Energy Technology Data Exchange (ETDEWEB)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Because functional regions tend to be highly conserved, comparisons of DNA sequences among evolutionarily distantly related genomes permit the identification of functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either Aα or Aβ clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  13. Incorporating gene ontology into fuzzy relational clustering of microarray gene expression data.

    Science.gov (United States)

    Paul, Animesh Kumar; Shill, Pintu Chandra

    2018-01-01

    The products of gene expression work together in the cell of every living organism to carry out different biological processes. Many proteins take on different roles depending on the environment of the organism. In this paper, we propose a gene ontology (GO) annotation-based semi-supervised clustering algorithm called GO fuzzy relational clustering (GO-FRC), in which one gene may be assigned to multiple clusters, which is the most biologically relevant behavior of genes. In the clustering process, GO-FRC uses biological knowledge available in the form of a gene ontology as prior knowledge alongside the gene expression data. The prior knowledge helps to improve the coherence of the groups concerning the knowledge field. The proposed GO-FRC has been tested on two yeast (Saccharomyces cerevisiae) expression profile datasets (Eisen and Dream5 yeast datasets) and compared with other state-of-the-art clustering algorithms. Experimental results imply that GO-FRC is able to produce more biologically relevant clusters using a small amount of GO annotations. Copyright © 2017 Elsevier B.V. All rights reserved.
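
    The idea that one gene can belong to several clusters at once can be illustrated with the membership formula of standard fuzzy c-means. This is only a sketch of soft assignment in general, not the GO-FRC algorithm itself, which the abstract does not specify; the function names, the fuzzifier value and the 0.3 threshold are assumptions chosen for illustration.

```python
def fuzzy_memberships(distances, fuzzifier=2.0):
    """Fuzzy c-means membership of one gene, given its distance to each cluster centre.

    Standard FCM formula (an illustration, not the GO-FRC update rule):
    u_i = 1 / sum_k (d_i / d_k) ** (2 / (fuzzifier - 1))
    """
    # A gene lying exactly on one or more centres is shared equally among them.
    zeros = [i for i, d in enumerate(distances) if d == 0]
    if zeros:
        return [1.0 / len(zeros) if i in zeros else 0.0
                for i in range(len(distances))]
    exponent = 2.0 / (fuzzifier - 1.0)
    return [1.0 / sum((d_i / d_k) ** exponent for d_k in distances)
            for d_i in distances]


def assigned_clusters(memberships, threshold=0.3):
    """Indices of every cluster whose membership reaches the (hypothetical) threshold."""
    return [i for i, u in enumerate(memberships) if u >= threshold]


# Hypothetical gene that is equally close to clusters 0 and 1 and far from cluster 2.
memberships = fuzzy_memberships([1.0, 1.0, 4.0])
print(memberships)
print(assigned_clusters(memberships))
```

    Memberships always sum to 1, so a gene that is equidistant from two centres is reported in both clusters instead of being forced into one.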

  14. Identification of Nocobactin NA Biosynthetic Gene Clusters in Nocardia farcinica

    OpenAIRE

    Hoshino, Yasutaka; Chiba, Kazuhiro; Ishino, Keiko; Fukai, Toshio; Igarashi, Yasuhiro; Yazawa, Katsukiyo; Mikami, Yuzuru; Ishikawa, Jun

    2010-01-01

    We identified the biosynthetic gene clusters of the siderophore nocobactin NA. The nbt clusters, which were discovered as genes highly homologous to the mycobactin biosynthesis genes by the genomic sequencing of Nocardia farcinica IFM 10152, consist of 10 genes separately located at two genomic regions. The gene organization of the nbt clusters and the predicted functions of the nbt genes, particularly the cyclization and epimerization domains, were in good agreement with the chemical structu...

  15. Gene duplication, modularity and adaptation in the evolution of the aflatoxin gene cluster

    Directory of Open Access Journals (Sweden)

    Jakobek Judy L

    2007-07-01

    Full Text Available Abstract Background The biosynthesis of aflatoxin (AF) involves over 20 enzymatic reactions in a complex polyketide pathway that converts acetate and malonate to the intermediates sterigmatocystin (ST) and O-methylsterigmatocystin (OMST), the respective penultimate and ultimate precursors of AF. Although these precursors are chemically and structurally very similar, their accumulation differs at the species level for Aspergilli. Notable examples are A. nidulans that synthesizes only ST, A. flavus that makes predominantly AF, and A. parasiticus that generally produces either AF or OMST. Whether these differences are important in the evolutionary/ecological processes of species adaptation and diversification is unknown. Equally unknown are the specific genomic mechanisms responsible for ordering and clustering of genes in the AF pathway of Aspergillus. Results To elucidate the mechanisms that have driven formation of these clusters, we performed systematic searches of aflatoxin cluster homologs across five Aspergillus genomes. We found a high level of gene duplication and identified seven modules consisting of highly correlated gene pairs (aflA/aflB, aflR/aflS, aflX/aflY, aflF/aflE, aflT/aflQ, aflC/aflW, and aflG/aflL). With the exception of A. nomius, contrasts of mean Ka/Ks values across all cluster genes showed significant differences in selective pressure between section Flavi and non-section Flavi species. A. nomius mean Ka/Ks values were more similar to partial clusters in A. fumigatus and A. terreus. Overall, mean Ka/Ks values were significantly higher for section Flavi than for non-section Flavi species. Conclusion Our results implicate several genomic mechanisms in the evolution of ST, OMST and AF cluster genes. Gene modules may arise from duplications of a single gene, whereby the function of the pre-duplication gene is retained in the copy (aflF/aflE) or the copies may partition the ancestral function (aflA/aflB). In some gene modules, the

  16. Clustering gene expression regulators: new approach to disease subtyping.

    Directory of Open Access Journals (Sweden)

    Mikhail Pyatnitskiy

    Full Text Available One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms so as to develop a more personalized approach to therapy. Here we propose a novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on the Sub-Network Enrichment Analysis (SNEA) algorithm, which identifies gene subnetworks with significant concordant changes in expression between two conditions. A subnetwork consists of a central regulator and downstream genes connected by relations extracted from a global literature-extracted regulation database. Regulators found in each patient separately are clustered together and assigned activity scores, which are used for the final patient grouping. We show that our approach performs well compared to other related methods and at the same time provides researchers with a complementary level of understanding of the pathway-level biology behind a disease by identifying significant expression regulators. We observed a reasonable grouping of neuromuscular disorders (triggered by structural damage vs triggered by unknown mechanisms) that was not revealed using standard expression profile clustering. In another experiment we were able to suggest clusters of regulators responsible for colorectal carcinoma vs adenoma discrimination and to identify frequently genetically changed regulators that could be of specific importance for the individual characteristics of cancer development. The proposed approach can be regarded as biologically meaningful feature selection, reducing tens of thousands of genes down to dozens of clusters of regulators. The obtained clusters of regulators make it possible to generate valuable biological hypotheses about molecular mechanisms related to a clinical outcome for an individual patient.

  17. Functional Analysis of the Fusarielin Biosynthetic Gene Cluster

    Directory of Open Access Journals (Sweden)

    Aida Droce

    2016-12-01

    Full Text Available Fusarielins are polyketides with a decalin core produced by various species of Aspergillus and Fusarium. Although the responsible gene cluster has been identified, the biosynthetic pathway remains to be elucidated. In the present study, members of the gene cluster were deleted individually in a Fusarium graminearum strain overexpressing the local transcription factor. The results suggest that a trans-acting enoyl reductase (FSL5) assists the polyketide synthase FSL1 in biosynthesis of a polyketide product, which is released by hydrolysis by a trans-acting thioesterase (FSL2). Deletion of the epimerase (FSL3) resulted in accumulation of an unstable compound, which could be the released product. A novel compound, named prefusarielin, accumulated in the deletion mutant of the cytochrome P450 monooxygenase FSL4. Unlike the known fusarielins from Fusarium, this compound does not contain oxygenized decalin rings, suggesting that FSL4 is responsible for the oxygenation.

  18. Evaluation of clustering algorithms for gene expression data using gene ontology annotations.

    Science.gov (United States)

    Ma, Ning; Zhang, Zheng-Guo

    2012-09-01

    Clustering is a useful exploratory technique for interpreting gene expression data to reveal groups of genes sharing common functional attributes. Biologists frequently face the problem of choosing an appropriate algorithm. We aimed to provide a standalone, easily accessible and biologically oriented criterion for expression data clustering evaluation. An external criterion utilizing annotation-based similarities between genes is proposed in this work. Gene ontology information is employed as the annotation source. Comparisons among six widely used clustering algorithms over various types of gene expression data sets were carried out based on the criterion proposed. The rank of these algorithms given by the criterion coincides with our common knowledge. Single-linkage has significantly poorer performance, even worse than the random algorithm. Ward's method achieves the best performance in most cases. The criterion proposed has a strong ability to distinguish among different clustering algorithms with different distance measurements. It is also demonstrated that analyzing the main contributors of the criterion may offer some guidelines in finding local compact clusters. In addition, we suggest using Ward's algorithm for gene expression data analysis.
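
    An external, annotation-based criterion of this general kind could be sketched as follows. The abstract does not give the actual formula, so this example assumes a simple stand-in: the mean Jaccard similarity of annotation term sets over all gene pairs that share a cluster. The gene names, term labels and function names are hypothetical.

```python
from itertools import combinations


def jaccard(terms_a, terms_b):
    """Jaccard similarity between two sets of annotation terms."""
    union = terms_a | terms_b
    return len(terms_a & terms_b) / len(union) if union else 0.0


def annotation_score(clusters, annotations):
    """Mean annotation similarity over all gene pairs placed in the same cluster.

    Assumed stand-in for an external criterion; higher means the clustering
    groups genes with more similar annotations.
    """
    similarities = [
        jaccard(annotations[g1], annotations[g2])
        for cluster in clusters
        for g1, g2 in combinations(cluster, 2)
    ]
    return sum(similarities) / len(similarities) if similarities else 0.0


# Hypothetical annotations: placeholder labels, not real GO identifiers.
annotations = {
    "g1": {"term_a", "term_b"},
    "g2": {"term_a", "term_b"},
    "g3": {"term_c"},
    "g4": {"term_c", "term_d"},
}
coherent = [["g1", "g2"], ["g3", "g4"]]
scrambled = [["g1", "g3"], ["g2", "g4"]]
print(annotation_score(coherent, annotations))   # 0.75
print(annotation_score(scrambled, annotations))  # 0.0
```

    Because the score depends only on the cluster assignments and the annotations, the same function can rank the output of any clustering algorithm on the same data set.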

  19. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea.

    Science.gov (United States)

    Tudzynski, P; Hölter, K; Correia, T; Arntz, C; Grammel, N; Keller, U

    1999-02-01

    A gene (cpd1) coding for the dimethylallyltryptophan synthase (DMATS) that catalyzes the first specific step in the biosynthesis of ergot alkaloids, was cloned from a strain of Claviceps purpurea that produces alkaloids in axenic culture. The derived gene product (CPD1) shows only 70% similarity to the corresponding gene previously isolated from Claviceps strain ATCC 26245, which is likely to be an isolate of C. fusiformis. Therefore, the related cpd1 most probably represents the first C. purpurea gene coding for an enzymatic step of the alkaloid biosynthetic pathway to be cloned. Analysis of the 3'-flanking region of cpd1 revealed a second, closely linked ergot alkaloid biosynthetic gene named cpps1, which codes for a 356-kDa polypeptide showing significant similarity to fungal modular peptide synthetases. The protein contains three amino acid-activating modules, and in the second module a sequence is found which matches that of an internal peptide (17 amino acids in length) obtained from a tryptic digest of lysergyl peptide synthetase 1 (LPS1) of C. purpurea, thus confirming that cpps1 encodes LPS1. LPS1 activates the three amino acids of the peptide portion of ergot peptide alkaloids during D-lysergyl peptide assembly. Chromosome walking revealed the presence of additional genes upstream of cpd1 which are probably also involved in ergot alkaloid biosynthesis: cpox1 probably codes for an FAD-dependent oxidoreductase (which could represent the chanoclavine cyclase), and a second putative oxidoreductase gene, cpox2, is closely linked to it in inverse orientation. RT-PCR experiments confirm that all four genes are expressed under conditions of peptide alkaloid biosynthesis. These results strongly suggest that at least some genes of ergot alkaloid biosynthesis in C. purpurea are clustered, opening the way for a detailed molecular genetic analysis of the pathway.

  20. Global Analysis of miRNA Gene Clusters and Gene Families Reveals Dynamic and Coordinated Expression

    Directory of Open Access Journals (Sweden)

    Li Guo

    2014-01-01

    Full Text Available To further understand the potential expression relationships of miRNAs in miRNA gene clusters and gene families, a global analysis was performed in 4 paired tumor (breast cancer) and adjacent normal tissue samples using deep sequencing datasets. The compositions of miRNA gene clusters and families are not random, and clustered and homologous miRNAs may have close relationships with overlapped miRNA species. Members in the miRNA group always had various expression levels, and some even showed larger expression divergence. Despite the dynamic expression as well as individual difference, these miRNAs always indicated consistent or similar deregulation patterns. The consistent deregulated expression may contribute to dynamic and coordinated interaction between different miRNAs in the regulatory network. Further, we found that those clustered or homologous miRNAs that were also identified as sense and antisense miRNAs showed larger expression divergence. miRNA gene clusters and families indicated important biological roles, and the specific distribution and expression further enrich and ensure the flexible and robust regulatory network.

  1. Translating biosynthetic gene clusters into fungal armor and weaponry.

    Science.gov (United States)

    Keller, Nancy P

    2015-09-01

    Filamentous fungi are renowned for the production of a diverse array of secondary metabolites (SMs) where the genetic material required for synthesis of a SM is typically arrayed in a biosynthetic gene cluster (BGC). These natural products are valued for their bioactive properties stemming from their functions in fungal biology, key among those protection from abiotic and biotic stress and establishment of a secure niche. The producing fungus must not only avoid self-harm from endogenous SMs but also deliver specific SMs at the right time to the right tissue requiring biochemical aid. This review highlights functions of BGCs beyond the enzymatic assembly of SMs, considering the timing and location of SM production and other proteins in the clusters that control SM activity. Specifically, self-protection is provided by both BGC-encoded mechanisms and non-BGC subcellular containment of toxic SM precursors; delivery and timing is orchestrated through cellular trafficking patterns and stress- and developmental-responsive transcriptional programs.

  2. Gene prioritization and clustering by multi-view text mining.

    Science.gov (United States)

    Yu, Shi; Tranchevent, Leon-Charles; De Moor, Bart; Moreau, Yves

    2010-01-14

    Text mining has become a useful tool for biologists trying to understand the genetics of diseases. In particular, it can help identify the most interesting candidate genes for a disease for further experimental analysis. Many text mining approaches have been introduced, but the effect of disease-gene identification varies in different text mining models. Thus, the idea of incorporating more text mining models may be beneficial to obtain more refined and accurate knowledge. However, how to effectively combine these models still remains a challenging question in machine learning. In particular, it is a non-trivial issue to guarantee that the integrated model performs better than the best individual model. We present a multi-view approach to retrieve biomedical knowledge using different controlled vocabularies. These controlled vocabularies are selected on the basis of nine well-known bio-ontologies and are applied to index the vast amounts of gene-based free-text information available in the MEDLINE repository. The text mining result specified by a vocabulary is considered as a view and the obtained multiple views are integrated by multi-source learning algorithms. We investigate the effect of integration in two fundamental computational disease gene identification tasks: gene prioritization and gene clustering. The performance of the proposed approach is systematically evaluated and compared on real benchmark data sets. In both tasks, the multi-view approach demonstrates significantly better performance than other comparing methods. In practical research, the relevance of specific vocabulary pertaining to the task is usually unknown. In such case, multi-view text mining is a superior and promising strategy for text-based disease gene identification.

  3. Coordinated evolution of co-expressed gene clusters in the Drosophila transcriptome

    Directory of Open Access Journals (Sweden)

    Jones Corbin D

    2008-01-01

    Full Text Available Abstract Background Co-expression of genes that physically cluster together is a common characteristic of eukaryotic transcriptomes. This organization of transcriptomes suggests that coordinated evolution of gene expression for clustered genes may also be common. Clusters where expression evolution of each gene is not independent of their neighbors are important units for understanding transcriptome evolution. Results We used a common microarray platform to measure gene expression in seven closely related species in the Drosophila melanogaster subgroup, accounting for confounding effects of sequence divergence. To summarize the correlation structure among genes in a chromosomal region, we analyzed the fraction of variation along the first principal component of the correlation matrix. We analyzed the correlation for blocks of consecutive genes to assess patterns of correlation that may be manifest at different scales of coordinated expression. We find that expression of physically clustered genes does evolve in a coordinated manner in many locations throughout the genome. Our analysis shows that relatively few of these clusters are near heterochromatin regions and that these clusters tend to be over-dispersed relative to the rest of the genome. This suggests that these clusters are not the byproduct of local gene clustering. We also analyzed the pattern of co-expression among neighboring genes within a single Drosophila species: D. simulans. For the co-expression clusters identified within this species, we find an under-representation of genes displaying a signature of recurrent adaptive amino acid evolution consistent with previous findings. However, clusters displaying co-evolution of expression among species are enriched for adaptively evolving genes. This finding points to a tie between adaptive sequence evolution and evolution of the transcriptome. Conclusion Our results demonstrate that co-evolution of expression in gene clusters is
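
    The summary statistic described in this abstract, the fraction of variation along the first principal component of a correlation matrix, can be sketched in plain Python. This is an illustration under stated assumptions, not the authors' pipeline: the function names and expression values are hypothetical, the leading eigenvalue is found by simple power iteration, and every gene profile is assumed to vary (a constant profile would divide by zero).

```python
import math


def correlation_matrix(profiles):
    """Pearson correlations between gene profiles (one list of measurements per gene)."""
    centered = []
    for profile in profiles:
        mean = sum(profile) / len(profile)
        centered.append([x - mean for x in profile])
    norms = [math.sqrt(sum(x * x for x in row)) for row in centered]
    n = len(profiles)
    return [[sum(a * b for a, b in zip(centered[i], centered[j])) / (norms[i] * norms[j])
             for j in range(n)] for i in range(n)]


def leading_eigenvalue(matrix, iterations=200):
    """Largest eigenvalue of a symmetric positive semi-definite matrix by power iteration."""
    n = len(matrix)
    vec = [1.0 + 0.01 * i for i in range(n)]  # slightly uneven start vector
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in product))
        vec = [x / norm for x in product]
    product = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
    return sum(v * p for v, p in zip(vec, product))  # Rayleigh quotient of a unit vector


def pc1_fraction(profiles):
    """Share of total variation captured by the first principal component.

    The trace of a correlation matrix equals the number of genes, so the
    fraction is the leading eigenvalue divided by that count.
    """
    corr = correlation_matrix(profiles)
    return leading_eigenvalue(corr) / len(corr)


# Hypothetical block of three neighbouring genes whose expression changes together.
coordinated = [[1, 2, 3, 4], [2, 4, 6, 8], [0, 1, 2, 3]]
# Hypothetical block of two genes with uncorrelated expression.
independent = [[1, -1, 1, -1], [1, 1, -1, -1]]
print(pc1_fraction(coordinated))  # close to 1.0
print(pc1_fraction(independent))  # close to 0.5
```

    A value near 1 indicates that a single shared axis explains almost all variation in the block, while a value near 1/n (n genes) indicates no shared structure.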

  4. Gravitation field algorithm and its application in gene cluster

    Directory of Open Access Journals (Sweden)

    Zheng Ming

    2010-09-01

    Full Text Available Abstract Background Searching for optima is one of the most challenging tasks in clustering genes from available experimental data or given functions. SA, GA, PSO and other similar efficient global optimization methods are used by biotechnologists. All these algorithms are based on the imitation of natural phenomena. Results This paper proposes a novel searching optimization algorithm called the Gravitation Field Algorithm (GFA), which is derived from the famous astronomy theory, the Solar Nebular Disk Model (SNDM) of planetary formation. GFA simulates the gravitation field and outperforms GA and SA in some multimodal function optimization problems. GFA can also be applied to unimodal functions. GFA clusters the dataset from the Gene Expression Omnibus well. Conclusions The mathematical proof demonstrates that GFA can converge to the global optimum with probability 1 under three conditions for mass functions of one independent variable. In addition to these results, the fundamental optimization concept in this paper is used to analyze how SA and GA affect the global search and the inherent defects in SA and GA. Some results and source code (in Matlab) are publicly available at http://ccst.jlu.edu.cn/CSBG/GFA.

  5. Adaptive evolution of the FADS gene cluster within Africa.

    Directory of Open Access Journals (Sweden)

    Rasika A Mathias

    Full Text Available Long chain polyunsaturated fatty acids (LC-PUFAs) are essential for brain structure, development, and function, and adequate dietary quantities of LC-PUFAs are thought to have been necessary for both brain expansion and the increase in brain complexity observed during modern human evolution. Previous studies conducted in largely European populations suggest that humans have limited capacity to synthesize brain LC-PUFAs such as docosahexaenoic acid (DHA) from plant-based medium chain (MC) PUFAs due to limited desaturase activity. Population-based differences in LC-PUFA levels and their product-to-substrate ratios can, in part, be explained by polymorphisms in the fatty acid desaturase (FADS) gene cluster, which have been associated with increased conversion of MC-PUFAs to LC-PUFAs. Here, we show evidence that these high efficiency converter alleles in the FADS gene cluster were likely driven to near fixation in African populations by positive selection ∼85 kya. We hypothesize that selection at FADS variants, which increase LC-PUFA synthesis from plant-based MC-PUFAs, played an important role in allowing African populations obligatorily tethered to marine sources for LC-PUFAs in isolated geographic regions, to rapidly expand throughout the African continent 60-80 kya.

  6. Indole-Diterpene Gene Cluster from Aspergillus flavus

    Science.gov (United States)

    Zhang, Shuguang; Monahan, Brendon J.; Tkacz, Jan S.; Scott, Barry

    2004-01-01

    Aflatrem is a potent tremorgenic mycotoxin produced by the soil fungus Aspergillus flavus and is a member of a large structurally diverse group of secondary metabolites known as indole-diterpenes. By using degenerate primers for conserved domains of fungal geranylgeranyl diphosphate synthases, we cloned two genes, atmG and ggsA (an apparent pseudogene), from A. flavus. Adjacent to atmG are two other genes, atmC and atmM. These three genes have 64 to 70% amino acid sequence similarity and conserved synteny with a cluster of orthologous genes, paxG, paxC, and paxM, from Penicillium paxilli which are required for indole-diterpene biosynthesis. atmG, atmC, and atmM are coordinately expressed, with transcript levels dramatically increasing at the onset of aflatrem biosynthesis. A genomic copy of atmM can complement a paxM deletion mutant of P. paxilli, demonstrating that atmM is a functional homolog of paxM. Thus, atmG, atmC, and atmM are necessary, but not sufficient, for aflatrem biosynthesis by A. flavus. This provides the first genetic evidence for the biosynthetic pathway of aflatrem in A. flavus. PMID:15528556

  7. Arrangement of the Clostridium baratii F7 toxin gene cluster with identification of a σ factor that recognizes the botulinum toxin gene cluster promoters.

    Science.gov (United States)

    Dover, Nir; Barash, Jason R; Burke, Julianne N; Hill, Karen K; Detter, John C; Arnon, Stephen S

    2014-01-01

    Botulinum neurotoxin (BoNT) is the most poisonous substance known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bont gene that is part of a toxin gene cluster that includes several accessory genes. We sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. This TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.

  8. Time-series clustering of gene expression in irradiated and bystander fibroblasts: an application of FBPA clustering

    Directory of Open Access Journals (Sweden)

    Markatou Marianthi

    2011-01-01

    Full Text Available Abstract Background The radiation bystander effect is an important component of the overall biological response of tissues and organisms to ionizing radiation, but the signaling mechanisms between irradiated and non-irradiated bystander cells are not fully understood. In this study, we measured a time-series of gene expression after α-particle irradiation and applied the Feature Based Partitioning around medoids Algorithm (FBPA), a new clustering method suitable for sparse time series, to identify signaling modules that act in concert in the response to direct irradiation and bystander signaling. We compared our results with those of an alternate clustering method, Short Time series Expression Miner (STEM). Results While computational evaluations of both clustering results were similar, FBPA provided more biological insight. After irradiation, gene clusters were enriched for signal transduction, cell cycle/cell death and inflammation/immunity processes; but only FBPA separated clusters by function. In bystanders, gene clusters were enriched for cell communication/motility, signal transduction and inflammation processes; but biological functions did not separate as clearly with either clustering method as they did in irradiated samples. Network analysis confirmed p53 and NF-κB transcription factor-regulated gene clusters in irradiated and bystander cells and suggested novel regulators, such as KDM5B/JARID1B (lysine (K)-specific demethylase 5B) and HDACs (histone deacetylases), which could epigenetically coordinate gene expression after irradiation. Conclusions In this study, we have shown that a new time series clustering method, FBPA, can provide new leads to the mechanisms regulating the dynamic cellular response to radiation. The findings implicate epigenetic control of gene expression in addition to transcription factor networks.

  9. Lampreys have a single gene cluster for the fast skeletal myosin heavy chain gene family.

    Directory of Open Access Journals (Sweden)

    Daisuke Ikeda

    Full Text Available Muscle tissues contain the most classic sarcomeric myosin, called myosin II, which consists of 2 heavy chains (MYHs) and 4 light chains. In the case of humans (tetrapod), a total of 6 fast skeletal-type MYH genes (MYHs) are clustered on a single chromosome. In contrast, torafugu (teleost) contains at least 13 fast skeletal MYHs, which are distributed in 5 genomic regions; the MYHs are clustered in 3 of these regions. In the present study, the evolutionary relationship among fast skeletal MYHs is elucidated by comparing the MYHs of teleosts and tetrapods with those of cyclostome lampreys, one of two groups of extant jawless vertebrates (agnathans). We found that lampreys contain at least 3 fast skeletal MYHs, which are clustered in a head-to-tail manner in a single genomic region. Although there was apparent synteny in the corresponding MYH cluster regions between lampreys and tetrapods, phylogenetic analysis indicated that lamprey and tetrapod MYHs have independently duplicated and diversified. Subsequent transgenic approaches showed that the 5'-flanking sequences of Japanese lamprey fast skeletal MYHs function as a regulatory sequence to drive specific reporter gene expression in the fast skeletal muscle of zebrafish embryos. Although zebrafish MYH promoters showed apparent activity to direct reporter gene expression in myogenic cells derived from mice, promoters from Japanese lamprey MYHs had no activity. These results suggest that the muscle-specific regulatory mechanisms are partially conserved between teleosts and tetrapods but not between cyclostomes and tetrapods, despite the conserved synteny.

  10. Gene expression profiling in cluster headache: a pilot microarray study.

    Science.gov (United States)

    Sjöstrand, Christina; Duvefelt, Kristina; Steinberg, Anna; Remahl, Ingela Nilsson; Waldenlind, Elisabet; Hillert, Jan

    2006-01-01

    Cluster headache (CH) is a primary neurovascular headache disorder characterized by attacks of excruciating pain accompanied by ipsilateral autonomic symptoms. CH pathophysiology is presumed to involve an activation of hypothalamic and trigeminovascular systems, but inflammation and immunological mechanisms have also been hypothesized to be of importance. The aim was to identify differentially expressed genes during different clinical phases of CH, assuming that changes of pathophysiological importance would also be seen in peripheral venous blood. Blood samples were drawn on 3 consecutive occasions from 3 episodic CH patients: during attacks, between attacks and in remission, and on 1 occasion from 3 matched controls. Global gene expression was analyzed with microarray technology using the Affymetrix Human Genome U133 2.0 Plus GeneChip Set, covering more than 54,000 gene transcripts, corresponding to almost 22,000 genes. Quantitative RT-PCR on S100P gene expression was analyzed in 6 patients and 14 controls. Overall, quite small differences were seen intraindividually and large differences interindividually. However, pairwise comparisons of signal values showed upregulation of several S100 calcium binding proteins: S100A8 (calgranulin A), S100A12 (calgranulin C), and S100P during active phase of the disease compared to remission. Also, annexin A3 (calcium-binding) and ICAM3 showed upregulation. BIRC1 (neuronal apoptosis inhibitory protein), CREB5, HLA-DQA1, and HLA-DQB1 were upregulated in patients compared to controls. The upregulation of S100P during attack versus remission was confirmed by quantitative RT-PCR analysis. The S100A8 and S100A12 proteins are considered markers of non-infectious inflammatory disease, while the function of S100P is still largely unknown. Furthermore, upregulation of HLA-DQ genes in CH patients may also indicate an inflammatory response. Upregulation of these pro-inflammatory genes during the active phase of CH has not formerly been reported.

  11. The ergot alkaloid gene cluster in Claviceps purpurea: extension of the cluster sequence and intra species evolution.

    Science.gov (United States)

    Haarmann, Thomas; Machado, Caroline; Lübbe, Yvonne; Correia, Telmo; Schardl, Christopher L; Panaccione, Daniel G; Tudzynski, Paul

    2005-06-01

    The genomic region of Claviceps purpurea strain P1 containing the ergot alkaloid gene cluster [Tudzynski, P., Hölter, K., Correia, T., Arntz, C., Grammel, N., Keller, U., 1999. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea. Mol. Gen. Genet. 261, 133-141] was explored by chromosome walking, and additional genes probably involved in the ergot alkaloid biosynthesis have been identified. The putative cluster sequence (extending over 68.5kb) contains 4 different nonribosomal peptide synthetase (NRPS) genes and several putative oxidases. Northern analysis showed that most of the genes were co-regulated (repressed by high phosphate), and identified probable flanking genes by lack of co-regulation. Comparison of the cluster sequences of strain P1, an ergotamine producer, with that of strain ECC93, an ergocristine producer, showed high conservation of most of the cluster genes, but significant variation in the NRPS modules, strongly suggesting that evolution of these chemical races of C. purpurea is determined by evolution of NRPS module specificity.

  12. High presence/absence gene variability in defense-related gene clusters of Cucumis melo.

    Science.gov (United States)

    González, Víctor M; Aventín, Núria; Centeno, Emilio; Puigdomènech, Pere

    2013-11-12

    Changes in the copy number of DNA sequences are one of the main mechanisms generating genome variability in eukaryotes. These changes are often related to phenotypic effects such as genetic disorders or novel pathogen resistance. The increasing availability of genome sequences through the application of next-generation massive sequencing technologies has allowed the study of genomic polymorphisms at both the interspecific and intraspecific levels, thus helping to understand how species adapt to changing environments through genome variability. Data on gene presence/absence variation (PAV) in melon was obtained by resequencing a cultivated accession and an old-relative melon variety, and using previously obtained resequencing data from three other melon cultivars, among them DHL92, on which the current draft melon genome sequence is based. A total of 1,697 PAV events were detected, involving 4.4% of the predicted melon gene complement. In all, an average 1.5% of genes were absent from each analyzed cultivar as compared to the DHL92 reference genome. The most populated functional category among the 304 PAV genes of known function was that of stress response proteins (30% of all classified PAVs). Our results suggest that genes from multi-copy families are five times more likely to be affected by PAV than singleton genes. Also, the chance of genes present in the genome in tandem arrays being affected by PAV is double that of isolated genes, with PAV genes tending to be in longer clusters. The highest concentration of PAV events detected in the melon genome was found in a 1.1 Mb region of linkage group V, which also shows the highest density of melon stress-response genes. In particular, this region contains the longest continuous gene-containing PAV sequence so far identified in melon. The first genome-wide report of PAV variation among several melon cultivars is presented here. Multi-copy and clustered genes, especially those with putative stress-response functions

  13. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    Science.gov (United States)

    2010-01-01

    Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involves decisions on how to handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieve the highest adjusted Rand index. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that background correction is

  14. Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.

    Science.gov (United States)

    Maulik, Ujjwal; Mukhopadhyay, Anirban; Bandyopadhyay, Sanghamitra

    2009-01-20

    The landscape of biological and biomedical research is being changed rapidly with the invention of microarrays, which enable a simultaneous view of the transcription levels of a huge number of genes across different experimental conditions or time points. Using microarray data sets, clustering algorithms have been actively utilized in order to identify groups of co-expressed genes. This article poses the problem of fuzzy clustering in microarray data as a multiobjective optimization problem which simultaneously optimizes two internal fuzzy cluster validity indices to yield a set of Pareto-optimal clustering solutions. Each of these clustering solutions possesses some amount of information regarding the clustering structure of the input data. Motivated by this fact, a novel fuzzy majority voting approach is proposed to combine the clustering information from all the solutions in the resultant Pareto-optimal set. This approach first identifies the genes which are assigned to some particular cluster with high membership degree by most of the Pareto-optimal solutions. Using this set of genes as the training set, the remaining genes are classified by a supervised learning algorithm. In this work, we have used a Support Vector Machine (SVM) classifier for this purpose. The performance of the proposed clustering technique has been demonstrated on five publicly available benchmark microarray data sets, viz., Yeast Sporulation, Yeast Cell Cycle, Arabidopsis Thaliana, Human Fibroblasts Serum and Rat Central Nervous System. Comparative studies of the use of different SVM kernels and several widely used microarray clustering techniques are reported. Moreover, statistical significance tests have been carried out to establish the statistical superiority of the proposed clustering approach. Finally, biological significance tests have been carried out using a web based gene annotation tool to show that the proposed method is able to produce biologically relevant clusters of co

  15. Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes

    Directory of Open Access Journals (Sweden)

    Bandyopadhyay Sanghamitra

    2009-01-01

    Full Text Available Abstract Background The landscape of biological and biomedical research is being changed rapidly with the invention of microarrays, which enable a simultaneous view of the transcription levels of a huge number of genes across different experimental conditions or time points. Using microarray data sets, clustering algorithms have been actively utilized in order to identify groups of co-expressed genes. This article poses the problem of fuzzy clustering in microarray data as a multiobjective optimization problem which simultaneously optimizes two internal fuzzy cluster validity indices to yield a set of Pareto-optimal clustering solutions. Each of these clustering solutions possesses some amount of information regarding the clustering structure of the input data. Motivated by this fact, a novel fuzzy majority voting approach is proposed to combine the clustering information from all the solutions in the resultant Pareto-optimal set. This approach first identifies the genes which are assigned to some particular cluster with high membership degree by most of the Pareto-optimal solutions. Using this set of genes as the training set, the remaining genes are classified by a supervised learning algorithm. In this work, we have used a Support Vector Machine (SVM) classifier for this purpose. Results The performance of the proposed clustering technique has been demonstrated on five publicly available benchmark microarray data sets, viz., Yeast Sporulation, Yeast Cell Cycle, Arabidopsis Thaliana, Human Fibroblasts Serum and Rat Central Nervous System. Comparative studies of the use of different SVM kernels and several widely used microarray clustering techniques are reported. Moreover, statistical significance tests have been carried out to establish the statistical superiority of the proposed clustering approach. Finally, biological significance tests have been carried out using a web based gene annotation tool to show that the proposed method is able to

  16. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    Directory of Open Access Journals (Sweden)

    Landfors Mattias

    2010-10-01

    Full Text Available Abstract Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involves decisions on how to handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieve the highest adjusted Rand index. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that

  17. Physical and genetic map of the major nif gene cluster from Azotobacter vinelandii.

    OpenAIRE

    Jacobson, M R; Brigle, K E; Bennett, L T; Setterquist, R A; Wilson, M S; Cash, V L; Beynon, J; Newton, W E; Dean, D R

    1989-01-01

    Determination of a 28,793-base-pair DNA sequence of a region from the Azotobacter vinelandii genome that includes and flanks the nitrogenase structural gene region was completed. This information was used to revise the previously proposed organization of the major nif cluster. The major nif cluster from A. vinelandii encodes 15 nif-specific genes whose products bear significant structural identity to the corresponding nif-specific gene products from Klebsiella pneumoniae. These genes include ...

  18. A tripartite clustering analysis on microRNA, gene and disease model.

    Science.gov (United States)

    Shen, Chengcheng; Liu, Ying

    2012-02-01

    Alteration of gene expression in response to regulatory molecules or mutations could lead to different diseases. MicroRNAs (miRNAs) have been discovered to be involved in regulation of gene expression and a wide variety of diseases. In a tripartite biological network of human miRNAs, their predicted target genes and the diseases caused by altered expressions of these genes, valuable knowledge about the pathogenicity of miRNAs, involved genes and related disease classes can be revealed by co-clustering miRNAs, target genes and diseases simultaneously. Tripartite co-clustering can lead to more informative results than traditional co-clustering with only two kinds of members and pass the hidden relational information along the relation chain by considering multi-type members. Here we report a spectral co-clustering algorithm for k-partite graph to find clusters with heterogeneous members. We use the method to explore the potential relationships among miRNAs, genes and diseases. The clusters obtained from the algorithm have significantly higher density than randomly selected clusters, which means members in the same cluster are more likely to have common connections. Results also show that miRNAs in the same family based on the hairpin sequences tend to belong to the same cluster. We also validate the clustering results by checking the correlation of enriched gene functions and disease classes in the same cluster. Finally, widely studied miR-17-92 and its paralogs are analyzed as a case study to reveal that genes and diseases co-clustered with the miRNAs are in accordance with current research findings.
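
    The density comparison mentioned in this abstract can be illustrated with a minimal sketch. It assumes density means the fraction of member pairs joined by an edge, which may differ from the definition the authors used for k-partite graphs; the node names and the helpers `cluster_density` and `random_baseline` are hypothetical and not taken from the paper.

```python
import random
from itertools import combinations

# Hypothetical toy network: miRNA-gene and gene-disease links.
toy_edges = [("m1", "g1"), ("m1", "g2"), ("g1", "d1"), ("g2", "d1"), ("m2", "g3")]

def cluster_density(edges, members):
    """Fraction of member pairs that are directly linked (assumed definition)."""
    members = list(set(members))
    if len(members) < 2:
        return 0.0
    edge_set = {frozenset(e) for e in edges}
    pairs = list(combinations(members, 2))
    linked = sum(1 for a, b in pairs if frozenset((a, b)) in edge_set)
    return linked / len(pairs)

def random_baseline(edges, size, trials=100, seed=0):
    """Mean density of randomly drawn node sets of the same size."""
    nodes = sorted({n for e in edges for n in e})
    rng = random.Random(seed)
    total = sum(cluster_density(edges, rng.sample(nodes, size)) for _ in range(trials))
    return total / trials
```

    A cluster would then be judged denser than chance when `cluster_density` for its members clearly exceeds `random_baseline` for sets of the same size; a significance test would still be needed on top of this comparison.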

  19. Recursive Cluster Elimination (RCE) for classification and feature selection from gene expression data

    Directory of Open Access Journals (Sweden)

    Showe Louise C

    2007-05-01

    Full Text Available Abstract Background Classification studies using gene expression datasets are usually based on small numbers of samples and tens of thousands of genes. The selection of those genes that are important for distinguishing the different sample classes being compared poses a challenging problem in high dimensional data analysis. We describe a new procedure for selecting significant genes as recursive cluster elimination (RCE) rather than recursive feature elimination (RFE). We have tested this algorithm on six datasets and compared its performance with that of two related classification procedures with RFE. Results We have developed a novel method for selecting significant genes in comparative gene expression studies. This method, which we refer to as SVM-RCE, combines K-means, a clustering method, to identify correlated gene clusters, and Support Vector Machines (SVMs), a supervised machine learning classification method, to identify and score (rank) those gene clusters for the purpose of classification. K-means is used initially to group genes into clusters. Recursive cluster elimination (RCE) is then applied to iteratively remove those clusters of genes that contribute the least to the classification performance. SVM-RCE identifies the clusters of correlated genes that are most significantly differentially expressed between the sample classes. Utilization of gene clusters, rather than individual genes, enhances the supervised classification accuracy of the same data as compared to the accuracy when either SVM or Penalized Discriminant Analysis (PDA) with recursive feature elimination (SVM-RFE and PDA-RFE) are used to remove genes based on their individual discriminant weights. Conclusion SVM-RCE provides improved classification accuracy with complex microarray data sets when it is compared to the classification accuracy of the same datasets using either SVM-RFE or PDA-RFE. SVM-RCE identifies clusters of correlated genes that when considered together

  20. The complete coenzyme B12 biosynthesis gene cluster of Lactobacillus reuteri CRL1098

    NARCIS (Netherlands)

    Santos, F.; Vera, J.L.; van der Heijden, R.; Valdez, G.; de Vos, W.M.; Sesma, F.; Hugenholtz, J.

    2008-01-01

    The coenzyme B12 production pathway in Lactobacillus reuteri has been deduced using a combination of genetic, biochemical and bioinformatics approaches. The coenzyme B12 gene cluster of Lb. reuteri CRL1098 has the unique feature of clustering together the cbi, cob and hem genes. It consists of 29

  1. The complete coenzyme B12 biosynthesis gene cluster of Lactobacillus reuteri CRL 1098

    NARCIS (Netherlands)

    Santos, dos F.; Vera, J.L.; Heijden, van der R.; Valdez, G.F.; Vos, de W.M.; Sesma, F.; Hugenholtz, J.

    2008-01-01

    The coenzyme B12 production pathway in Lactobacillus reuteri has been deduced using a combination of genetic, biochemical and bioinformatics approaches. The coenzyme B12 gene cluster of Lb. reuteri CRL1098 has the unique feature of clustering together the cbi, cob and hem genes. It consists of 29

  2. Rough-fuzzy clustering for grouping functionally similar genes from microarray data.

    Science.gov (United States)

    Maji, Pradipta; Paul, Sushmita

    2013-01-01

    Gene expression data clustering is one of the important tasks of functional genomics as it provides a powerful tool for studying functional relationships of genes in a biological process. Identifying coexpressed groups of genes represents the basic challenge in gene clustering problem. In this regard, a gene clustering algorithm, termed as robust rough-fuzzy c-means, is proposed judiciously integrating the merits of rough sets and fuzzy sets. While the concept of lower and upper approximations of rough sets deals with uncertainty, vagueness, and incompleteness in cluster definition, the integration of probabilistic and possibilistic memberships of fuzzy sets enables efficient handling of overlapping partitions in noisy environment. The concept of possibilistic lower bound and probabilistic boundary of a cluster, introduced in robust rough-fuzzy c-means, enables efficient selection of gene clusters. An efficient method is proposed to select initial prototypes of different gene clusters, which enables the proposed c-means algorithm to converge to an optimum or near optimum solutions and helps to discover coexpressed gene clusters. The effectiveness of the algorithm, along with a comparison with other algorithms, is demonstrated both qualitatively and quantitatively on 14 yeast microarray data sets.
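
    For orientation, the probabilistic membership that fuzzy c-means style methods build on can be sketched as follows. This is the standard fuzzy c-means membership update, not the robust rough-fuzzy c-means described in the abstract; the function name `fcm_memberships` and the fuzzifier default `m=2.0` are assumptions made for illustration.

```python
import math

def fcm_memberships(point, centers, m=2.0):
    """Standard fuzzy c-means memberships of one point to each center.

    u_i = 1 / sum_k (d_i / d_k) ** (2 / (m - 1)), with m > 1.
    """
    dists = [math.dist(point, c) for c in centers]
    # A point lying exactly on one or more centers belongs fully to them.
    if any(d == 0.0 for d in dists):
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_i / d_k) ** power for d_k in dists) for d_i in dists]
```

    The memberships of a point sum to one, which is what allows a gene lying between two expression prototypes to be shared by overlapping clusters instead of being forced into a single one.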

  3. Dominant control region of the human β-like globin gene cluster

    NARCIS (Netherlands)

    Blom van Assendelft, Margaretha van

    1989-01-01

    The structure and regulation of the human β-like globin gene cluster has been studied extensively. Genetic disorders connected with this gene cluster are responsible for human diseases associated with high levels of morbidity and mortality, such as β-thalassaemia and sickle cell anaemia. The work

  4. A phylogenomic gene cluster resource: The phylogenetically inferred groups (PhIGs) database

    Energy Technology Data Exchange (ETDEWEB)

    Dehal, Paramvir S.; Boore, Jeffrey L.

    2005-08-25

    We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community.

  5. Organization and differential regulation of a cluster of lignin peroxidase genes of Phanerochaete chrysosporium

    Science.gov (United States)

    Philip Stewart; Daniel Cullen

    1999-06-01

    The lignin peroxidases of Phanerochaete chrysosporium are encoded by a minimum of 10 closely related genes. Physical and genetic mapping of a cluster of eight lip genes revealed six genes occurring in pairs and transcriptionally convergent, suggesting that portions of the lip family arose by gene duplication events. The completed sequence of lipG and lipJ, together...

  6. An effective fuzzy kernel clustering analysis approach for gene expression data.

    Science.gov (United States)

    Sun, Lin; Xu, Jiucheng; Yin, Jiaojiao

    2015-01-01

    Fuzzy clustering is an important tool for analyzing microarray data. A major problem in applying fuzzy clustering method to microarray gene expression data is the choice of parameters with cluster number and centers. This paper proposes a new approach to fuzzy kernel clustering analysis (FKCA) that identifies desired cluster number and obtains more steady results for gene expression data. First of all, to optimize characteristic differences and estimate optimal cluster number, Gaussian kernel function is introduced to improve spectrum analysis method (SAM). By combining subtractive clustering with max-min distance mean, maximum distance method (MDM) is proposed to determine cluster centers. Then, the corresponding steps of improved SAM (ISAM) and MDM are given respectively, whose superiority and stability are illustrated through performing experimental comparisons on gene expression data. Finally, by introducing ISAM and MDM into FKCA, an effective improved FKCA algorithm is proposed. Experimental results from public gene expression data and UCI database show that the proposed algorithms are feasible for cluster analysis, and the clustering accuracy is higher than the other related clustering algorithms.
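
    Two of the ingredients named in this abstract, a Gaussian kernel and a max-min distance rule for choosing cluster centers, can be sketched in their textbook forms. This is not the authors' ISAM or MDM procedure; the names `gaussian_kernel` and `max_min_centers`, the bandwidth `sigma`, and the choice of the first point as the initial center are assumptions for illustration.

```python
import math

def gaussian_kernel(x, y, sigma=1.0):
    """Gaussian (RBF) similarity: exp(-||x - y||^2 / (2 * sigma^2))."""
    return math.exp(-math.dist(x, y) ** 2 / (2.0 * sigma ** 2))

def max_min_centers(points, k):
    """Pick k centers greedily: each new center is the point whose
    distance to its nearest already chosen center is largest."""
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    centers = [points[0]]  # assumed starting point; other choices are possible
    while len(centers) < k:
        farthest = max(points, key=lambda p: min(math.dist(p, c) for c in centers))
        centers.append(farthest)
    return centers
```

    Spreading the initial centers apart in this way tends to make the subsequent fuzzy clustering less sensitive to initialization, which is in line with the steadier results the abstract reports.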

  7. A robust approach based on Weibull distribution for clustering gene expression data

    Directory of Open Access Journals (Sweden)

    Gong Binsheng

    2011-05-01

    Full Text Available Abstract Background Clustering is a widely used technique for analysis of gene expression data. Most clustering methods group genes based on the distances, while few methods group genes according to the similarities of the distributions of the gene expression levels. Furthermore, as the biological annotation resources accumulated, an increasing number of genes have been annotated into functional categories. As a result, evaluating the performance of clustering methods in terms of the functional consistency of the resulting clusters is of great interest. Results In this paper, we proposed the WDCM (Weibull Distribution-based Clustering Method), a robust approach for clustering gene expression data, in which the gene expressions of individual genes are considered as the random variables following unique Weibull distributions. Our WDCM is based on the concept that the genes with similar expression profiles have similar distribution parameters, and thus the genes are clustered via the Weibull distribution parameters. We used the WDCM to cluster three cancer gene expression data sets from the lung cancer, B-cell follicular lymphoma and bladder carcinoma and obtained well-clustered results. We compared the performance of WDCM with k-means and Self Organizing Map (SOM) using functional annotation information given by the Gene Ontology (GO). The results showed that the functional annotation ratios of WDCM are higher than those of the other methods. We also utilized the external measure Adjusted Rand Index to validate the performance of the WDCM. The comparative results demonstrate that the WDCM provides the better clustering performance compared to k-means and SOM algorithms. The merit of the proposed WDCM is that it can be applied to cluster incomplete gene expression data without imputing the missing values. Moreover, the robustness of WDCM is also evaluated on the incomplete data sets. Conclusions The results demonstrate that our WDCM produces clusters

  8. CAR gene cluster and transcript levels of carotenogenic genes in Rhodotorula mucilaginosa.

    Science.gov (United States)

    Landolfo, Sara; Ianiri, Giuseppe; Camiolo, Salvatore; Porceddu, Andrea; Mulas, Giuliana; Chessa, Rossella; Zara, Giacomo; Mannazzu, Ilaria

    2018-01-01

    A molecular approach was applied to the study of the carotenoid biosynthetic pathway of Rhodotorula mucilaginosa. At first, functional annotation of the genome of R. mucilaginosa C2.5t1 was carried out and gene ontology categories were assigned to 4033 predicted proteins. Then, a set of genes involved in different steps of carotenogenesis was identified and those coding for phytoene desaturase, phytoene synthase/lycopene cyclase and carotenoid dioxygenase (CAR genes) proved to be clustered within a region of ~10 kb. Quantitative PCR of the genes involved in carotenoid biosynthesis showed that genes coding for 3-hydroxy-3-methylglutaryl-CoA reductase and mevalonate kinase are induced during exponential phase while no clear trend of induction was observed for phytoene synthase/lycopene cyclase and phytoene dehydrogenase encoding genes. Thus, in R. mucilaginosa the induction of genes involved in the early steps of carotenoid biosynthesis is transient and accompanies the onset of carotenoid production, while that of CAR genes does not correlate with the amount of carotenoids produced. The transcript levels of genes coding for carotenoid dioxygenase, superoxide dismutase and catalase A increased during the accumulation of carotenoids, thus suggesting the activation of a mechanism aimed at the protection of cell structures from oxidative stress during carotenoid biosynthesis. The data presented herein, besides being suitable for the elucidation of the mechanisms that underlie carotenoid biosynthesis, will contribute to boosting the biotechnological potential of this yeast by improving the outcome of further research efforts aimed at also exploring other features of interest.

  9. Effects of gene disruptions in the nisin gene cluster of Lactococcus lactis on nisin production and producer immunity

    NARCIS (Netherlands)

    Ra, Runar; Beerthuyzen, Marke M.; Vos, Willem M. de; Saris, Per E.J.; Kuipers, Oscar P.

    1999-01-01

    The lantibiotic nisin is produced by several strains of Lactococcus lactis subsp. lactis. The chromosomally located gene cluster nisABTCIPRKFEG is required for biosynthesis, development of immunity, and regulation of gene expression. In-frame deletions in the nisB and nisT genes, and disruption of

  10. Clustering of Drosophila melanogaster immune genes in interplay with recombination rate.

    Directory of Open Access Journals (Sweden)

    K Mathias Wegner

    Full Text Available BACKGROUND: Gene order in eukaryotic chromosomes is not random and has been linked to coordination of gene expression, chromatin structure and also recombination rate. The evolution of recombination rate is especially relevant for genes involved in immunity because host-parasite co-evolution could select for increased recombination rate (Red Queen hypothesis). To identify patterns left by the intimate interaction between hosts and parasites, I analysed the genomic parameters of the immune genes from 24 gene families/groups of Drosophila melanogaster. PRINCIPAL FINDINGS: Immune genes that directly interact with the pathogen (i.e. recognition and effector genes) clustered in regions of higher recombination rates. Out of these, clustered effector genes were transcribed fastest indicating that transcriptional control might be one major cause for cluster formation. The relative position of clusters to each other, on the other hand, cannot be explained by transcriptional control per se. Drosophila immune genes that show epistatic interactions can be found at an average distance of 15.44 ± 2.98 cM, which is considerably closer than genes that do not interact (30.64 ± 1.95 cM). CONCLUSIONS: Epistatically interacting genes rarely belong to the same cluster, which supports recent models of optimal recombination rates between interacting genes in antagonistic host-parasite co-evolution. These patterns suggest that formation of local clusters might be a result of transcriptional control, but that in the condensed genome of D. melanogaster relative position of these clusters may be a result of selection for optimal rather than maximal recombination rates between these clusters.

  11. DNACLUST: accurate and efficient clustering of phylogenetic marker genes

    Directory of Open Access Journals (Sweden)

    Liu Bo

    2011-06-01

    Full Text Available Abstract Background Clustering is a fundamental operation in the analysis of biological sequence data. New DNA sequencing technologies have dramatically increased the rate at which we can generate data, resulting in datasets that cannot be efficiently analyzed by traditional clustering methods. This is particularly true in the context of taxonomic profiling of microbial communities through direct sequencing of phylogenetic markers (e.g. 16S rRNA) - the domain that motivated the work described in this paper. Many analysis approaches rely on an initial clustering step aimed at identifying sequences that belong to the same operational taxonomic unit (OTU). When defining OTUs (which have no universally accepted definition), scientists must balance a trade-off between computational efficiency and biological accuracy, as accurately estimating an environment's phylogenetic composition requires computationally-intensive analyses. We propose that efficient and mathematically well defined clustering methods can benefit existing taxonomic profiling approaches in two ways: (i) the resulting clusters can be substituted for OTUs in certain applications; and (ii) the clustering effectively reduces the size of the data-sets that need to be analyzed by complex phylogenetic pipelines (e.g., only one sequence per cluster needs to be provided to downstream analyses). Results To address the challenges outlined above, we developed DNACLUST, a fast clustering tool specifically designed for clustering highly-similar DNA sequences. Given a set of sequences and a sequence similarity threshold, DNACLUST creates clusters whose radius is guaranteed not to exceed the specified threshold. Underlying DNACLUST is a greedy clustering strategy that owes its performance to novel sequence alignment and k-mer based filtering algorithms. DNACLUST can also produce multiple sequence alignments for every cluster, allowing users to manually inspect clustering results, and enabling more
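
    The greedy, radius-bounded strategy described in this abstract can be illustrated with a minimal sketch. It is not the DNACLUST implementation: the tool's alignment and k-mer filtering are replaced here by a simple position-wise identity on equal-length strings, and the function names `similarity` and `greedy_cluster` are hypothetical. Each sequence joins the first cluster whose center it matches at or above the threshold; otherwise it becomes a new center, so no member is farther from its center than the threshold allows.

```python
def similarity(a, b):
    """Fraction of matching positions (stand-in for an alignment-based identity)."""
    if len(a) != len(b) or not a:
        raise ValueError("this sketch only compares non-empty, equal-length sequences")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_cluster(seqs, threshold):
    """Greedy clustering: every member is within `threshold` similarity of its center."""
    clusters = []  # list of (center, members) pairs, in order of creation
    for seq in seqs:
        for center, members in clusters:
            if similarity(center, seq) >= threshold:
                members.append(seq)  # close enough to an existing center
                break
        else:
            clusters.append((seq, [seq]))  # no center matched: start a new cluster
    return clusters


if __name__ == "__main__":
    reads = ["ACGTACGT", "ACGTACGA", "TTTTTTTT", "TTTTTTTA", "ACGAACGT"]
    for center, members in greedy_cluster(reads, threshold=0.75):
        print(center, members)
```

    Only the center of each cluster would need to be passed to a downstream pipeline, which is the data-reduction use the abstract mentions.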

  12. The Paraoxonase Gene Cluster Protects Against Abdominal Aortic Aneurysm Formation.

    Science.gov (United States)

    Yan, Yun-Fei; Pei, Jian-Fei; Zhang, Yang; Zhang, Ran; Wang, Fang; Gao, Peng; Zhang, Zhu-Qin; Wang, Ting-Ting; She, Zhi-Gang; Chen, Hou-Zao; Liu, De-Pei

    2017-02-01

    Abdominal aortic aneurysm (AAA) is a life-threatening vascular pathology, the pathogenesis of which is closely related to oxidative stress. However, an effective pharmaceutical treatment is lacking because the exact cause of AAA remains unknown. Here, we aimed at delineating the role of the paraoxonases (PONs) gene cluster (PC), which prevents atherosclerosis through the detoxification of oxidized substrates, in AAA formation. PC transgenic (Tg) mice were crossed to an Apoe -/- background, and an angiotensin II-induced AAA mouse model was used to analyze the effect of the PC on AAA formation. Four weeks after angiotensin II infusion, PC-Tg Apoe -/- mice had a lower AAA incidence, smaller maximal abdominal aortic external diameter, and less medial elastin degradation than Apoe -/- mice. Importantly, PC-Tg Apoe -/- mice exhibited lower aortic reactive oxidative species production and oxidative stress than did the Apoe -/- control mice. As a consequence, the PC transgene alleviated angiotensin II-induced arterial inflammation and suppressed arterial extracellular matrix degradation. Specifically, on angiotensin II stimulation, PC-Tg vascular smooth muscle cells exhibited lower levels of reactive oxidative species production and a decrease in the activities and expression levels of matrix metalloproteinase-2 and matrix metalloproteinase-9. Moreover, PC-Tg serum also enhanced vascular smooth muscle cell oxidative stress resistance and further decreased the expression levels of matrix metalloproteinase-2 and matrix metalloproteinase-9, indicating that circulatory and vascular smooth muscle cell PC members suppress oxidative stress in a synergistic manner. Our findings reveal, for the first time, a protective role of the PC in AAA formation and suggest PONs as promising targets for AAA prevention. © 2016 American Heart Association, Inc.

  13. An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information

    Directory of Open Access Journals (Sweden)

    Ao Li

    2009-04-01

    Full Text Available Motivation: Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However, direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression and gene regulation information is desirable for clustering and analyzing. Methods: By incorporating gene regulatory information with gene expression data, we define regulated expression values (REV) as indicators of how a gene is regulated by a specific factor. Existing bi-clustering methods are extended to a three dimensional data space by developing a heuristic TRI-Clustering algorithm. An additional approach named Automatic Boundary Searching algorithm (ABS) is introduced to automatically determine the boundary threshold. Results: Results based on incorporating ChIP-chip data representing transcription factor-gene interactions show that the algorithms are efficient and robust for detecting tri-clusters. Detailed analysis of the tri-cluster extracted from yeast sporulation REV data shows genes in this cluster exhibited significant differences during the middle and late stages. The implicated regulatory network was then reconstructed for further study of defined regulatory mechanisms. Topological and statistical analysis of this network demonstrated evidence of significant changes of TF activities during the different stages of yeast sporulation, and suggests this approach might be a general way to study regulatory networks undergoing transformations.

  14. Lichen Biosynthetic Gene Clusters Part II: Homology Mapping Suggests a Functional Diversity.

    Science.gov (United States)

    Bertrand, Robert L; Abdel-Hameed, Mona; Sorensen, John L

    2018-02-27

    Lichens are renowned for their diverse natural products though little is known of the genetic programming dictating lichen natural product biosynthesis. We sequenced the genome of Cladonia uncialis and profiled its secondary metabolite biosynthetic gene clusters. Through a homology searching approach, we can now propose specific functions for gene products as well as the biosynthetic pathways that are encoded in several of these gene clusters. This analysis revealed that the lichen genome encodes the required enzymes for patulin and betaenones A-C biosynthesis, fungal toxins not known to be produced by lichens. Within several gene clusters, some (but not all) genes are genetically similar to genes devoted to secondary metabolite biosynthesis in Fungi. These lichen clusters also contain accessory tailoring genes without such genetic similarity, suggesting that the encoded tailoring enzymes perform distinct chemical transformations. We hypothesize that C. uncialis gene clusters have evolved by shuffling components of ancestral fungal clusters to create new series of chemical steps, leading to the production of hitherto undiscovered derivatives of fungal secondary metabolites.

  15. Leveraging long sequencing reads to investigate R-gene clustering and variation in sugar beet

    Science.gov (United States)

    Host-pathogen interactions are of prime importance to modern agriculture. Plants utilize various types of resistance genes to mitigate pathogen damage. Identification of the specific gene responsible for a specific resistance can be difficult due to duplication and clustering within R-gene families....

  16. Horizontal transfer of a nitrate assimilation gene cluster and ecological transitions in fungi: a phylogenetic study.

    Directory of Open Access Journals (Sweden)

    Jason C Slot

    Full Text Available High affinity nitrate assimilation genes in fungi occur in a cluster (fHANT-AC) that can be coordinately regulated. The clustered genes include nrt2, which codes for a high affinity nitrate transporter; euknr, which codes for nitrate reductase; and NAD(P)H-nir, which codes for nitrite reductase. Homologs of genes in the fHANT-AC occur in other eukaryotes and prokaryotes, but they have only been found clustered in the oomycete Phytophthora (heterokonts). We performed independent and concatenated phylogenetic analyses of homologs of all three genes in the fHANT-AC. Phylogenetic analyses limited to fungal sequences suggest that the fHANT-AC has been transferred horizontally from a basidiomycete (mushrooms and smuts) to an ancestor of the ascomycetous mold Trichoderma reesei. Phylogenetic analyses of sequences from diverse eukaryotes and eubacteria, and cluster structure, are consistent with a hypothesis that the fHANT-AC was assembled in a lineage leading to the oomycetes and was subsequently transferred to the Dikarya (Ascomycota+Basidiomycota), which is a derived fungal clade that includes the vast majority of terrestrial fungi. We propose that the acquisition of high affinity nitrate assimilation contributed to the success of Dikarya on land by allowing exploitation of nitrate in aerobic soils, and the subsequent transfer of a complete assimilation cluster improved the fitness of T. reesei in a new niche. Horizontal transmission of this cluster of functionally integrated genes supports the "selfish operon" hypothesis for maintenance of gene clusters.

  17. High or low correlation between co-occuring gene clusters and 16S rRNA gene phylogeny.

    Science.gov (United States)

    Rudi, Knut; Sekelja, Monika

    2013-02-01

    Ribosomal RNA (rRNA) genes are universal for all living organisms. Yet, the correspondence between genome composition and rRNA phylogeny remains poorly known. The aim of this study was to use the information from genome sequence databases to address the correlation between rRNA gene phylogeny and total gene composition in bacteria. This was done by analysing 327 genomes with TIGRFAM functional gene annotations. Our approach consisted of two steps. First, we searched for discriminatory clusters of co-occurring genes. Using a multivariate statistical approach, we identified 11 such clusters which contain genes that were co-occurring only in a subset of genomes and contributed to explain the gene content differences between genome subsets. Second, we mapped the discovered clusters to 16S rRNA-based phylogeny and calculated the correlation between co-occuring genes and phylogeny. Six of the 11 clusters exhibited significant correlation with 16S rRNA gene phylogeny. The most distinct phylogenetic finding was a high correlation between iron-sulfur oxidoreductases in combination with carbon nitrogen ligases and Chlorobium. The other correlations identified covered relatively large phylogroups: Actinobacteria were positively associated with kinases, while Gammaproteobacteria were positively associated with methylases and acyltransferases. The suggested functional differences between higher phylogroups, however, need experimental verification. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  18. Activation and clustering of a Plasmodium falciparum var gene are affected by subtelomeric sequences.

    Science.gov (United States)

    Duffy, Michael F; Tang, Jingyi; Sumardy, Fransisca; Nguyen, Hanh H T; Selvarajah, Shamista A; Josling, Gabrielle A; Day, Karen P; Petter, Michaela; Brown, Graham V

    2017-01-01

    The Plasmodium falciparum var multigene family encodes the cytoadhesive, variant antigen PfEMP1. P. falciparum antigenic variation and cytoadhesion specificity are controlled by epigenetic switching between the single, or few, simultaneously expressed var genes. Most var genes are maintained in perinuclear clusters of heterochromatic telomeres. The active var gene(s) occupy a single, perinuclear var expression site. It is unresolved whether the var expression site forms in situ at a telomeric cluster or whether it is an extant compartment to which single chromosomes travel, thus controlling var switching. Here we show that transcription of a var gene did not require decreased colocalisation with clusters of telomeres, supporting var expression site formation in situ. However following recombination within adjacent subtelomeric sequences, the same var gene was persistently activated and did colocalise less with telomeric clusters. Thus, participation in stable, heterochromatic, telomere clusters and var switching are independent but are both affected by subtelomeric sequences. The var expression site colocalised with the euchromatic mark H3K27ac to a greater extent than it did with heterochromatic H3K9me3. H3K27ac was enriched within the active var gene promoter even when the var gene was transiently repressed in mature parasites and thus H3K27ac may contribute to var gene epigenetic memory. © 2016 Federation of European Biochemical Societies.

  19. AntiSMASH 4.0 - improvements in chemistry prediction and gene cluster boundary identification

    NARCIS (Netherlands)

    Blin, Kai; Wolf, Thomas; Chevrette, Marc G.; Lu, Xiaowen; Schwalen, Christopher J.; Kautsar, Satria A.; Suarez Duran, Hernando G.; Los Santos, De Emmanuel L.C.; Kim, Hyun Uk; Nave, Mariana; Dickschat, Jeroen S.; Mitchell, Douglas A.; Shelest, Ekaterina; Breitling, Rainer; Takano, Eriko; Lee, Sang Yup; Weber, Tilmann; Medema, Marnix H.

    2017-01-01

    Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the production

  20. CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes.

    Science.gov (United States)

    Wolf, Thomas; Shelest, Vladimir; Nath, Neetika; Shelest, Ekaterina

    2016-04-15

    Secondary metabolites (SM) are structurally diverse natural products of high pharmaceutical importance. Genes involved in their biosynthesis are often organized in clusters, i.e., are co-localized and co-expressed. In silico cluster prediction in eukaryotic genomes remains problematic mainly due to the high variability of the clusters' content and lack of other distinguishing sequence features. We present Cluster Assignment by Islands of Sites (CASSIS), a method for SM cluster prediction in eukaryotic genomes, and Secondary Metabolites by InterProScan (SMIPS), a tool for genome-wide detection of SM key enzymes ('anchor' genes): polyketide synthases, non-ribosomal peptide synthetases and dimethylallyl tryptophan synthases. Unlike other tools based on protein similarity, CASSIS exploits the idea of co-regulation of the cluster genes, which assumes the existence of common regulatory patterns in the cluster promoters. The method searches for 'islands' of enriched cluster-specific motifs in the vicinity of anchor genes. It was validated in a series of cross-validation experiments and showed high sensitivity and specificity. CASSIS and SMIPS are freely available at https://sbi.hki-jena.de/cassis. Contact: thomas.wolf@leibniz-hki.de or ekaterina.shelest@leibniz-hki.de. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  1. Characterization of the fumonisin B2 biosynthetic gene cluster in Aspergillus niger and A. awamori.

    Science.gov (United States)

    Aspergillus niger and A. awamori strains isolated from grapes cultivated in the Mediterranean basin were examined for fumonisin B2 (FB2) production and presence/absence of sequences within the fumonisin biosynthetic gene (fum) cluster. Presence of 13 regions in the fum cluster was evaluated by PCR assay...

  2. Molecular population genetics of the β-esterase gene cluster of ...

    Indian Academy of Sciences (India)

    However, there are some 'footprints' of directional and balancing selection shaping the specific distribution of nucleotide polymorphism within the cluster. Intergenic epistatic selection between Est-6 and ψEst-6 may play an important role in the evolution of the β-esterase gene cluster, preserving the putative pseudogene from ...

  3. Intranuclear and higher-order chromatin organization of the major histone gene cluster in breast cancer.

    Science.gov (United States)

    Fritz, Andrew J; Ghule, Prachi N; Boyd, Joseph R; Tye, Coralee E; Page, Natalie A; Hong, Deli; Shirley, David J; Weinheimer, Adam S; Barutcu, Ahmet R; Gerrard, Diana L; Frietze, Seth; van Wijnen, Andre J; Zaidi, Sayyed K; Imbalzano, Anthony N; Lian, Jane B; Stein, Janet L; Stein, Gary S

    2018-02-01

    Alterations in nuclear morphology are common in cancer progression. However, the degree to which gross morphological abnormalities translate into compromised higher-order chromatin organization is poorly understood. To explore the functional links between gene expression and chromatin structure in breast cancer, we performed RNA-seq gene expression analysis on the basal breast cancer progression model based on human MCF10A cells. Positional gene enrichment identified the major histone gene cluster at chromosome 6p22 as one of the most significantly upregulated (and not amplified) clusters of genes from the normal-like MCF10A to premalignant MCF10AT1 and metastatic MCF10CA1a cells. This cluster is subdivided into three sub-clusters of histone genes that are organized into hierarchical topologically associating domains (TADs). Interestingly, the sub-clusters of histone genes are located at TAD boundaries and interact more frequently with each other than the regions in-between them, suggesting that the histone sub-clusters form an active chromatin hub. The anchor sites of loops within this hub are occupied by CTCF, a known chromatin organizer. These histone genes are transcribed and processed at a specific sub-nuclear microenvironment termed the major histone locus body (HLB). While the overall chromatin structure of the major HLB is maintained across breast cancer progression, we detected alterations in its structure that may relate to gene expression. Importantly, breast tumor specimens also exhibit a coordinate pattern of upregulation across the major histone gene cluster. Our results provide a novel insight into the connection between the higher-order chromatin organization of the major HLB and its regulation during breast cancer progression. © 2017 Wiley Periodicals, Inc.

  4. Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture.

    Science.gov (United States)

    Johnson, Timothy A; Stedtfeld, Robert D; Wang, Qiong; Cole, James R; Hashsham, Syed A; Looft, Torey; Zhu, Yong-Guan; Tiedje, James M

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance genes if

  5. Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

    Energy Technology Data Exchange (ETDEWEB)

    Institute for Data Analysis and Visualization (IDAV) and the Department of Computer Science, University of California, Davis, One Shields Avenue, Davis, CA 95616, USA; International Research Training Group "Visualization of Large and Unstructured Data Sets," University of Kaiserslautern, Germany; Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA; Genomics Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA; Life Sciences Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA; Computer Science Division, University of California, Berkeley, CA, USA; Computer Science Department, University of California, Irvine, CA, USA; All authors are with the Berkeley Drosophila Transcription Network Project, Lawrence Berkeley National Laboratory; Rubel, Oliver; Weber, Gunther H.; Huang, Min-Yu; Bethel, E. Wes; Biggin, Mark D.; Fowlkes, Charless C.; Hendriks, Cris L. Luengo; Keranen, Soile V. E.; Eisen, Michael B.; Knowles, David W.; Malik, Jitendra; Hagen, Hans; Hamann, Bernd

    2008-05-12

    The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii) evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors.

  6. A Single Gene Cluster for Chalcomycins and Aldgamycins: Genetic Basis for Bifurcation of Their Biosynthesis.

    Science.gov (United States)

    Tang, Xiao-Long; Dai, Ping; Gao, Hao; Wang, Chuan-Xi; Chen, Guo-Dong; Hong, Kui; Hu, Dan; Yao, Xin-Sheng

    2016-07-01

    Aldgamycins are 16-membered macrolide antibiotics with a rare branched-chain sugar d-aldgarose or decarboxylated d-aldgarose at C-5. In our efforts to clone the gene cluster for aldgamycins from a marine-derived Streptomyces sp. HK-2006-1 capable of producing both aldgamycins and chalcomycins, we found that both are biosynthesized from a single gene cluster. Whole-genome sequencing combined with gene disruption established the entire gene cluster of aldgamycins: nine new genes are incorporated with the previously identified chalcomycin gene cluster. Functional analysis of these genes revealed that almDI/almDII (encoding α/β subunits of pyruvate dehydrogenase) triggers the biosynthesis of aldgamycins, whereas almCI (encoding an oxidoreductase) initiates chalcomycin biosynthesis. This is the first report that aldgamycins and chalcomycins are derived from a single gene cluster and of the genetic basis for bifurcation in their biosynthesis. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  7. A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database

    Directory of Open Access Journals (Sweden)

    Boore Jeffrey L

    2006-04-01

    Full Text Available Abstract Background We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community. Discussion The PhIGs database currently contains 23 completely sequenced genomes of fungi and metazoans, containing 409,653 genes that have been grouped into 42,645 gene clusters. Each gene cluster is built such that the gene sequence distances are consistent with the known organismal relationships and in so doing, maximizing the likelihood for the clusters to represent truly orthologous genes. The PhIGs website contains tools that allow the study of genes within their phylogenetic framework through keyword searches on annotations, such as GO and InterPro assignments, and sequence similarity searches by BLAST and HMM. In addition to displaying the evolutionary relationships of the genes in each cluster, the website also allows users to view the relative physical positions of homologous genes in specified sets of genomes. Summary Accurate analyses of genes and genomes can only be done within their full phylogenetic context. The PhIGs database and

  8. Unusual Gene Order and Organization of the Sea Urchin Hox Cluster

    Energy Technology Data Exchange (ETDEWEB)

    Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew; Rowen, Lee; Nesbitt, Ryan; Bloom, Scott; Rast, Jonathan P.; Berney, Kevin; Arenas-Mena, Cesar; Martinez, Pedro; Davidson, Eric H.; Peterson, Kevin J.; Hood, Leroy

    2005-05-10

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is: 5'-Hox1, 2, 3, 11/13c, 11/13b, 11/13a, 9/10, 8, 7, 6, 5-3'.) The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.

  9. Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes

    Directory of Open Access Journals (Sweden)

    Datta Somnath

    2006-08-01

    Full Text Available Abstract Background A cluster analysis is the most commonly performed procedure (often regarded as a first step) on a set of gene expression profiles. In most cases, a post hoc analysis is done to see if the genes in the same clusters can be functionally correlated. While past successes of such analyses have often been reported in a number of microarray studies (most of which used the standard hierarchical clustering, UPGMA, with one minus the Pearson's correlation coefficient as a measure of dissimilarity), oftentimes such groupings could be misleading. More importantly, a systematic evaluation of the entire set of clusters produced by such unsupervised procedures is necessary since they also contain genes that are seemingly unrelated or may have more than one common function. Here we quantify the performance of a given unsupervised clustering algorithm applied to a given microarray study in terms of its ability to produce biologically meaningful clusters using a reference set of functional classes. Such a reference set may come from prior biological knowledge specific to a microarray study or may be formed using the growing databases of gene ontologies (GO) for the annotated genes of the relevant species. Results In this paper, we introduce two performance measures for evaluating the results of a clustering algorithm in its ability to produce biologically meaningful clusters. The first measure is a biological homogeneity index (BHI). As the name suggests, it is a measure of how biologically homogeneous the clusters are. This can be used to quantify the performance of a given clustering algorithm such as UPGMA in grouping genes for a particular data set and also for comparing the performance of a number of competing clustering algorithms applied to the same data set. The second performance measure is called a biological stability index (BSI). For a given clustering algorithm and an expression data set, it measures the consistency of the clustering
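
    The abstract does not give a formula for the homogeneity index, so the following is only one plausible way to make the idea concrete, under stated assumptions: within each cluster, take the fraction of annotated gene pairs that share at least one functional class, then average over clusters. The function name `homogeneity_index`, the input layout, and the choice to skip clusters with fewer than two annotated genes are all hypothetical.

```python
from itertools import combinations


def homogeneity_index(clusters, annotations):
    """Average, over clusters, of the share of annotated gene pairs with a common class.

    clusters:    list of lists of gene identifiers
    annotations: dict mapping gene identifier -> set of functional classes
    """
    scores = []
    for genes in clusters:
        # Unannotated genes carry no information about functional agreement.
        annotated = [g for g in genes if annotations.get(g)]
        pairs = list(combinations(annotated, 2))
        if not pairs:
            continue  # assumption: skip clusters with fewer than two annotated genes
        shared = sum(1 for a, b in pairs if annotations[a] & annotations[b])
        scores.append(shared / len(pairs))
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    clusters = [["g1", "g2", "g3"], ["g4", "g5"]]
    annotations = {"g1": {"A"}, "g2": {"A", "B"}, "g3": {"C"}, "g4": {"D"}, "g5": {"D"}}
    print(homogeneity_index(clusters, annotations))
```

    A score near 1 means that genes grouped together mostly share a functional class; scores from competing clustering algorithms on the same data set can then be compared directly.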

  10. A Link-Based Cluster Ensemble Approach For Improved Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    P. Balaji

    2015-01-01

    Full Text Available Abstract Selecting the most suitable clustering algorithm for a given set of gene expression data is difficult because a huge number of algorithms and a huge number of gene expression datasets exist. At present, many researchers prefer hierarchical clustering in its various forms, which is not always optimal. Cluster ensemble research can address this problem by automatically merging multiple data partitions from a wide range of different clusterings, of any dimension, to improve both the quality and the robustness of the clustering result. However, many existing ensemble approaches use an association matrix to condense sample-cluster and co-occurrence statistics, so relations within the ensemble are captured only at a raw level, while the relations existing among clusters are completely neglected. Finding these missing associations can greatly expand the capability of those ensemble methodologies for microarray data clustering. We propose a general K-means cluster ensemble approach for clustering general categorical data into the required number of partitions.
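
    The association matrix that the abstract attributes to existing ensemble approaches can be sketched as a co-association matrix: entry (i, j) is the fraction of base clusterings that place samples i and j in the same cluster. This is a minimal illustration of that baseline only, not the link-based method the record proposes; the function name `co_association` and the label-list input format are assumptions.

```python
def co_association(partitions):
    """Return an n x n matrix of co-clustering frequencies.

    partitions: list of label lists, one per base clustering, all of length n.
    """
    if not partitions:
        raise ValueError("at least one partition is required")
    n = len(partitions[0])
    if any(len(labels) != n for labels in partitions):
        raise ValueError("all partitions must label the same n samples")

    counts = [[0] * n for _ in range(n)]
    for labels in partitions:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1  # samples i and j co-occur in this clustering

    total = len(partitions)
    return [[c / total for c in row] for row in counts]


if __name__ == "__main__":
    base_clusterings = [[0, 0, 1, 1], [0, 0, 0, 1], [1, 1, 0, 0]]
    for row in co_association(base_clusterings):
        print(row)
```

    Label values only matter within a single partition, so the first and third clusterings above count as identical groupings. A consensus partition is typically obtained by clustering this matrix as a similarity matrix.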

  11. Salicin regulates the expression of functional 'youth gene clusters' to reflect a more youthful gene expression profile.

    Science.gov (United States)

    Gopaul, R; Knaggs, H E; Lephart, J

    2011-10-01

    There are a variety of biological mechanisms that contribute to specific characteristics of ageing skin; for example, the loss of skin structure proteins, increased susceptibility to UV-induced pigmentation and/or loss of hydration. Each of these biological processes is influenced by specific groups of genes. In this research, we have identified groups of genes associated with specific clinical signs of skin ageing and refer to these as functional 'youth gene clusters'. In this study, quantitative real-time polymerase chain reaction (qPCR) was used to investigate the effects of topical application of salicin in regulating the expression of functional 'youth gene clusters' to reflect a more youthful skin profile and reduce the appearance of attributes associated with skin ageing. Results showed that salicin significantly influences the gene expression profiles of treated human equivalent full-thickness skin, by regulating the expression of genes associated with various biological processes involving skin structure, skin hydration, pigmentation and cellular differentiation. Based on the findings from this experiment, salicin was identified as a key ingredient that may regulate functional 'youth gene clusters' to reflect a more youthful gene expression profile by increasing the expression of genes responsible for youthful skin and decreasing the expression of genes responsible for the appearance of aged skin. © 2011 The Authors. ICS © 2011 Society of Cosmetic Scientists and the Société Française de Cosmétologie.

  12. Sequencing rare marine actinomycete genomes reveals high density of unique natural product biosynthetic gene clusters

    Science.gov (United States)

    Schorn, Michelle A.; Alanjary, Mohammad M.; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R.; Ziemert, Nadine

    2016-01-01

    Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites. PMID:27902408

  13. Yeast homologous recombination-based promoter engineering for the activation of silent natural product biosynthetic gene clusters.

    Science.gov (United States)

    Montiel, Daniel; Kang, Hahk-Soo; Chang, Fang-Yuan; Charlop-Powers, Zachary; Brady, Sean F

    2015-07-21

    Large-scale sequencing of prokaryotic (meta)genomic DNA suggests that most bacterial natural product gene clusters are not expressed under common laboratory culture conditions. Silent gene clusters represent a promising resource for natural product discovery and the development of a new generation of therapeutics. Unfortunately, the characterization of molecules encoded by these clusters is hampered owing to our inability to express these gene clusters in the laboratory. To address this bottleneck, we have developed a promoter-engineering platform to transcriptionally activate silent gene clusters in a model heterologous host. Our approach uses yeast homologous recombination, an auxotrophy complementation-based yeast selection system and sequence orthogonal promoter cassettes to exchange all native promoters in silent gene clusters with constitutively active promoters. As part of this platform, we constructed and validated a set of bidirectional promoter cassettes consisting of orthogonal promoter sequences, Streptomyces ribosome binding sites, and yeast selectable marker genes. Using these tools we demonstrate the ability to simultaneously insert multiple promoter cassettes into a gene cluster, thereby expediting the reengineering process. We apply this method to model active and silent gene clusters (rebeccamycin and tetarimycin) and to the silent, cryptic pseudogene-containing, environmental DNA-derived Lzr gene cluster. Complete promoter refactoring and targeted gene exchange in this "dead" cluster led to the discovery of potent indolotryptoline antiproliferative agents, lazarimides A and B. This potentially scalable and cost-effective promoter reengineering platform should streamline the discovery of natural products from silent natural product biosynthetic gene clusters.

  14. Comparative phylogenomic analyses of teleost fish Hox gene clusters: lessons from the cichlid fish Astatotilapia burtoni

    Directory of Open Access Journals (Sweden)

    Kuehl Jennifer V

    2007-09-01

    Full Text Available Abstract Background Teleost fish have seven paralogous clusters of Hox genes stemming from two complete genome duplications early in vertebrate evolution, and an additional genome duplication during the evolution of ray-finned fish, followed by the secondary loss of one cluster. Gene duplications on the one hand, and the evolution of regulatory sequences on the other, are thought to be among the most important mechanisms for the evolution of new gene functions. Cichlid fish, the largest family of vertebrates with about 2500 species, are famous examples of speciation and morphological diversity. Since this diversity could be based on regulatory changes, we chose to study the coding as well as putative regulatory regions of their Hox clusters within a comparative genomic framework. Results We sequenced and characterized all seven Hox clusters of Astatotilapia burtoni, a haplochromine cichlid fish. Comparative analyses with data from other teleost fish such as zebrafish, two species of pufferfish, stickleback and medaka were performed. We traced losses of genes and microRNAs of Hox clusters; the medaka lineage seems to have lost more microRNAs than the other fish lineages. We found that each teleost genome studied so far has a unique set of Hox genes. The hoxb7a gene was lost independently several times during teleost evolution, the most recent event being within the radiation of East African cichlid fish. The conserved non-coding sequences (CNS) encompass a surprisingly large part of the clusters, especially in the HoxAa, HoxCa, and HoxDa clusters. Across all clusters, we observe a trend towards an increased content of CNS towards the anterior end. Conclusion The gene content of Hox clusters in teleost fishes is more variable than expected, with each species studied so far having a different set. Although the highest loss rate of Hox genes occurred immediately after whole genome duplications, our analyses showed that gene loss continued and is

  15. Method of Selection of Bacteria Antibiotic Resistance Genes Based on Clustering of Similar Nucleotide Sequences.

    Science.gov (United States)

    Balashov, I S; Naumov, V A; Borovikov, P I; Gordeev, A B; Dubodelov, D V; Lyubasovskaya, L A; Rodchenko, Yu V; Bystritskii, A A; Aleksandrova, N V; Trofimov, D Yu; Priputnevich, T V

    2017-10-01

    A new method for selection of bacterium antibiotic resistance genes is proposed and tested for solving the problems related to selection of primers for PCR assay. The method implies clustering of similar nucleotide sequences and selection of group primers for all genes of each cluster. Clustering of resistance genes for six groups of antibiotics (aminoglycosides, β-lactams, fluoroquinolones, glycopeptides, macrolides and lincosamides, and fusidic acid) was performed. The method was tested for 81 strains of bacteria of different genera isolated from patients (K. pneumoniae, Staphylococcus spp., S. agalactiae, E. faecalis, E. coli, and G. vaginalis). The results obtained by us are comparable to those in the selection of individual genes; this allows reducing the number of primers necessary for maximum coverage of the known antibiotic resistance genes during PCR analysis.
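
    The clustering step described in this abstract can be sketched as greedy grouping of sequences around a representative. The abstract does not state which similarity measure or threshold was used, so the `difflib` ratio used here is only a stand-in for an alignment-based identity score, and the 0.8 cutoff, gene names and sequences are hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Stand-in similarity in [0, 1]; a real pipeline would more likely
    use an alignment-based percent identity."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def greedy_cluster(sequences, min_similarity=0.8):
    """Assign each sequence to the first cluster whose representative
    (its first member) is similar enough, else start a new cluster."""
    clusters = []
    for name, seq in sequences.items():
        for cluster in clusters:
            representative = sequences[cluster[0]]
            if similarity(representative, seq) >= min_similarity:
                cluster.append(name)
                break
        else:
            clusters.append([name])
    return clusters


# Hypothetical resistance-gene fragments.
sequences = {
    "geneA": "ATGGCGTACG",
    "geneB": "ATGGCGTACC",
    "geneC": "CCCCCCCCCC",
}
clusters = greedy_cluster(sequences)
```

    Each resulting cluster is then a candidate for one shared ("group") primer pair, which is how the number of primers can fall below one pair per gene.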

  16. Clustering based gene expression feature selection method: A computational approach to enrich the classifier efficiency of differentially expressed genes

    KAUST Repository

    Abusamra, Heba

    2016-07-20

    The inherent high-dimension, low-sample-size nature of gene expression data makes the classification task more challenging. Therefore, feature (gene) selection becomes an apparent need. Selecting meaningful and relevant genes for a classifier not only decreases the computational time and cost, but also improves the classification performance. Among the different approaches to feature selection, however, most suffer from several problems such as lack of robustness and validation issues. Here, we present a new feature selection technique that takes advantage of clustering both samples and genes. Materials and methods We used a leukemia gene expression dataset [1]. The effectiveness of the selected features was evaluated by four different classification methods: support vector machines, k-nearest neighbor, random forest, and linear discriminant analysis. The method evaluates the importance and relevance of each gene cluster by summing the expression levels of the genes belonging to that cluster. A gene cluster is considered important if it satisfies conditions that depend on thresholds and a percentage; otherwise it is eliminated. Results Initial analysis identified 7120 differentially expressed genes of leukemia (Fig. 15a); after applying our feature selection methodology we end up with 1117 specific genes discriminating two classes of leukemia (Fig. 15b). Further applying the same method with more stringent higher positive and lower negative threshold conditions reduced the number to 58 genes, which were tested to evaluate the effectiveness of the method (Fig. 15c). The results of the four classification methods are summarized in Table 11. Conclusions The feature selection method gave good results with minimum classification error. Our heat-map result shows a distinct pattern of refined genes discriminating between two classes of leukemia.

  17. The genome of Tolypocladium inflatum: evolution, organization, and expression of the cyclosporin biosynthetic gene cluster.

    Directory of Open Access Journals (Sweden)

    Kathryn E Bushley

    2013-06-01

    Full Text Available The ascomycete fungus Tolypocladium inflatum, a pathogen of beetle larvae, is best known as the producer of the immunosuppressant drug cyclosporin. The draft genome of T. inflatum strain NRRL 8044 (ATCC 34921), the isolate from which cyclosporin was first isolated, is presented along with comparative analyses of the biosynthesis of cyclosporin and other secondary metabolites in T. inflatum and related taxa. Phylogenomic analyses reveal previously undetected and complex patterns of homology between the nonribosomal peptide synthetase (NRPS) that encodes cyclosporin synthetase (simA) and those of other secondary metabolites with activities against insects (e.g., beauvericin, destruxins, etc.), and demonstrate the roles of module duplication and gene fusion in diversification of NRPSs. The secondary metabolite gene cluster responsible for cyclosporin biosynthesis is described. In addition to genes necessary for cyclosporin biosynthesis, it harbors a gene for a cyclophilin, which is a member of a family of immunophilins known to bind cyclosporin. Comparative analyses support a lineage specific origin of the cyclosporin gene cluster rather than horizontal gene transfer from bacteria or other fungi. RNA-Seq transcriptome analyses in a cyclosporin-inducing medium delineate the boundaries of the cyclosporin cluster and reveal high levels of expression of the gene cluster cyclophilin. In medium containing insect hemolymph, weaker but significant upregulation of several genes within the cyclosporin cluster, including the highly expressed cyclophilin gene, was observed. T. inflatum also represents the first reference draft genome of Ophiocordycipitaceae, a third family of insect pathogenic fungi within the fungal order Hypocreales, and supports parallel and qualitatively distinct radiations of insect pathogens. The T. inflatum genome provides additional insight into the evolution and biosynthesis of cyclosporin and lays a foundation for further

  18. The Local Maximum Clustering Method and Its Application in Microarray Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    Chen Yidong

    2004-01-01

    Full Text Available An unsupervised data clustering method, called the local maximum clustering (LMC) method, is proposed for identifying clusters in experiment data sets based on research interest. A magnitude property is defined according to research purposes, and data sets are clustered around each local maximum of the magnitude property. By properly defining a magnitude property, this method can overcome many difficulties in microarray data clustering such as reduced projection in similarities, noise, and arbitrary gene distribution. To critically evaluate the performance of this clustering method in comparison with other methods, we designed three model data sets with known cluster distributions and applied the LMC method as well as the hierarchical clustering method, the k-means clustering method, and the self-organized map method to these model data sets. The results show that the LMC method produces the most accurate clustering results. As an example of application, we applied the method to cluster the leukemia samples reported in the microarray study of Golub et al. (1999).
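
    The idea of clustering around local maxima of a magnitude property can be sketched as a hill-climbing assignment. The abstract does not specify the neighbourhood rule, so the fixed-radius neighbourhood, the Euclidean distance and the function name below are assumptions, and the points and magnitudes are invented for illustration.

```python
import math


def local_maximum_clustering(points, magnitude, radius):
    """Label each point with the index of the local maximum it climbs to.

    Each point links to the highest-magnitude point within `radius`
    (itself if none is higher); following the links ends at a local maximum.
    """
    n = len(points)
    parent = []
    for i in range(n):
        best = i
        for j in range(n):
            close = math.dist(points[i], points[j]) <= radius
            if close and magnitude[j] > magnitude[best]:
                best = j
        parent.append(best)

    labels = []
    for i in range(n):
        node = i
        # Magnitude strictly increases along the chain, so this terminates.
        while parent[node] != node:
            node = parent[node]
        labels.append(node)
    return labels


# Hypothetical one-dimensional data with two peaks (indices 1 and 4).
points = [(0.0,), (1.0,), (2.0,), (5.0,), (6.0,)]
magnitude = [1, 3, 2, 2, 4]
labels = local_maximum_clustering(points, magnitude, radius=1.5)
```

    The number of clusters is not supplied in advance; it follows from how many local maxima the chosen magnitude property and radius produce.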

  19. Structural and functional characterization of three polyketide synthase gene clusters in Bacillus amyloliquefaciens FZB 42.

    Science.gov (United States)

    Chen, Xiao-Hua; Vater, Joachim; Piel, Jörn; Franke, Peter; Scholz, Romy; Schneider, Kathrin; Koumoutsi, Alexandra; Hitzeroth, Gabriele; Grammel, Nicolas; Strittmatter, Axel W; Gottschalk, Gerhard; Süssmuth, Roderich D; Borriss, Rainer

    2006-06-01

    Although bacterial polyketides are of considerable biomedical interest, the molecular biology of polyketide biosynthesis in Bacillus spp., one of the richest bacterial sources of bioactive natural products, remains largely unexplored. Here we assign for the first time complete polyketide synthase (PKS) gene clusters to Bacillus antibiotics. Three giant modular PKS systems of the trans-acyltransferase type were identified in Bacillus amyloliquefaciens FZB 42. One of them, pks1, is an ortholog of the pksX operon with a previously unknown function in the sequenced model strain Bacillus subtilis 168, while the pks2 and pks3 clusters are novel gene clusters. Cassette mutagenesis combined with advanced mass spectrometric techniques such as matrix-assisted laser desorption ionization-time of flight mass spectrometry and liquid chromatography-electrospray ionization mass spectrometry revealed that the pks1 (bae) and pks3 (dif) gene clusters encode the biosynthesis of the polyene antibiotics bacillaene and difficidin or oxydifficidin, respectively. In addition, B. subtilis OKB105 (pheA sfp(0)), a transformant of the B. subtilis 168 derivative JH642, was shown to produce bacillaene, demonstrating that the pksX gene cluster directs the synthesis of that polyketide. The GenBank accession numbers for gene clusters pks1(bae), pks2, and pks3(dif) are AJ 634060.2, AJ 6340601.2, and AJ 6340602.2, respectively.

  20. Shared gene structures and clusters of mutually exclusive spliced exons within the metazoan muscle myosin heavy chain genes.

    Directory of Open Access Journals (Sweden)

    Martin Kollmar

    Full Text Available Multicellular animals possess two to three different types of muscle tissues. Striated muscles have considerable ultrastructural similarity and contain a core set of proteins including the muscle myosin heavy chain (Mhc) protein. The ATPase activity of this myosin motor protein largely dictates muscle performance at the molecular level. Two different solutions to adjusting myosin properties to different muscle subtypes have been identified so far: Vertebrates and nematodes contain many independent differentially expressed Mhc genes while arthropods have single Mhc genes with clusters of mutually exclusive spliced exons (MXEs). The availability of hundreds of metazoan genomes now allowed us to study whether the ancient bilateria already contained MXEs, how MXE complexity subsequently evolved, and whether additional scenarios to control contractile properties in different muscles could be proposed. By reconstructing the Mhc genes from 116 metazoans we showed that all intron positions within the motor domain coding regions are conserved in all bilateria analysed. The last common ancestor of the bilateria already contained a cluster of MXEs coding for part of the loop-2 actin-binding sequence. Subsequently the protostomes and later the arthropods gained many further clusters while MXEs got completely lost independently in several branches (vertebrates and nematodes) and species (for example the annelid Helobdella robusta and the salmon louse Lepeophtheirus salmonis). Several bilateria have been found to encode multiple Mhc genes that might all or in part contain clusters of MXEs. Notable examples are a cluster of six tandemly arrayed Mhc genes, of which two contain MXEs, in the owl limpet Lottia gigantea and four Mhc genes with three encoding MXEs in the predatory mite Metaseiulus occidentalis. Our analysis showed that similar solutions to provide different myosin isoforms (multiple genes or clusters of MXEs or both) have independently been developed

  1. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    Science.gov (United States)

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes is therefore a crucial step in revealing the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and also provide novel insight into the evolutionary aspects of acquiring certain biological functions. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of E ... Operon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.
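
    The comparison of neighboring genes with randomly selected gene pairs can be sketched as follows. This is a simplification under stated assumptions: a fixed correlation cutoff replaces the FDR-based procedure of the abstract, the random pairs only provide a background sample from which such a cutoff could be chosen, and the gene names and expression values are hypothetical.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length profiles
    (assumes neither profile is constant)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def coexpressed_neighbors(ordered_genes, expression, cutoff):
    """Adjacent gene pairs (in chromosomal order) with correlation above cutoff."""
    hits = []
    for a, b in zip(ordered_genes, ordered_genes[1:]):
        r = pearson(expression[a], expression[b])
        if r > cutoff:
            hits.append((a, b, r))
    return hits


def random_pair_correlations(expression, n_pairs, seed=0):
    """Background sample: correlations of randomly drawn gene pairs."""
    rng = random.Random(seed)
    genes = sorted(expression)
    background = []
    for _ in range(n_pairs):
        a, b = rng.sample(genes, 2)
        background.append(pearson(expression[a], expression[b]))
    return background


# Hypothetical expression profiles across four arrays.
expression = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.0, 6.0, 8.0],
    "g3": [4.0, 3.0, 2.0, 1.0],
    "g4": [1.0, 3.0, 2.0, 4.0],
}
order = ["g1", "g2", "g3", "g4"]
hits = coexpressed_neighbors(order, expression, cutoff=0.9)
background = random_pair_correlations(expression, n_pairs=5)
```

    Runs of consecutive hits would then be merged into candidate clusters; that merging step is omitted here.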

  2. Mycobiota and identification of aflatoxin gene cluster in marketed spices in West Africa

    DEFF Research Database (Denmark)

    Gnonlonfin, G. J. B.; Adjovi, Y. C.; Tokpo, A. F.

    2013-01-01

    of Aspergillus were dominant on all marketed dried and milled spices irrespective of country. Gene characterization and amplification analysis showed that most of the Aspergillus flavus isolates possess the cluster genes for aflatoxin production. Aflatoxin B1 assessment by Thin Layer Chromatography showed...

  3. iBBiG: iterative binary bi-clustering of gene sets.

    Science.gov (United States)

    Gusenleitner, Daniel; Howe, Eleanor A; Bentink, Stefan; Quackenbush, John; Culhane, Aedín C

    2012-10-01

    Meta-analysis of genomics data seeks to identify genes associated with a biological phenotype across multiple datasets; however, merging data from different platforms by their features (genes) is challenging. Meta-analysis using functionally or biologically characterized gene sets simplifies data integration, is biologically intuitive and is seen as having great potential, but is an emerging field with few established statistical methods. We transform gene expression profiles into binary gene set profiles by discretizing results of gene set enrichment analyses and apply a new iterative bi-clustering algorithm (iBBiG) to identify groups of gene sets that are coordinately associated with groups of phenotypes across multiple studies. iBBiG is optimized for meta-analysis of large numbers of diverse genomics data that may have unmatched samples. It does not require prior knowledge of the number or size of clusters. When applied to simulated data, it outperforms commonly used clustering methods, discovers overlapping clusters of diverse sizes and is robust in the presence of noise. We apply it to meta-analysis of breast cancer studies, where iBBiG extracted novel gene set-phenotype associations that predicted tumor metastases within tumor subtypes. Implemented in the Bioconductor package iBBiG. Contact: aedin@jimmy.harvard.edu.
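
    The discretization step described in this abstract can be sketched as thresholding enrichment p-values into a binary matrix. The significance level of 0.05, the gene set names and the p-values are assumptions for illustration, and the bi-clustering itself (done by the iBBiG package) is not reproduced here.

```python
def binarize_enrichment(p_values, alpha=0.05):
    """Map {gene_set: [p-value per phenotype comparison]} to 0/1 profiles,
    where 1 marks an enrichment result below the chosen significance level."""
    return {
        gene_set: [1 if p < alpha else 0 for p in row]
        for gene_set, row in p_values.items()
    }


# Hypothetical enrichment results: two gene sets, three phenotype comparisons.
p_values = {
    "cell_cycle": [0.001, 0.20, 0.03],
    "lipid_metabolism": [0.50, 0.04, 0.90],
}
binary_profiles = binarize_enrichment(p_values)
```

    A binary matrix of this kind is platform-independent, which is the property the abstract relies on when combining studies whose gene-level features do not match.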

  4. Nonlinear dimensionality reduction of gene expression data for visualization and clustering analysis of cancer tissue samples.

    Science.gov (United States)

    Shi, Jinlong; Luo, Zhigang

    2010-08-01

    Gene expression data are the representation of nonlinear interactions among genes and environmental factors. Computational analysis of these data is expected to yield knowledge of gene functions and disease mechanisms. Clustering is a classical exploratory technique for discovering similar expression patterns and function modules. However, gene expression data usually have high dimensionality and relatively few samples, which is the main difficulty in applying clustering algorithms. Principal component analysis (PCA) is usually used to reduce the data dimensions for further clustering analysis. However, PCA estimates the similarity between expression profiles based on the Euclidean distance, which cannot reveal the nonlinear connections between genes. This paper uses nonlinear dimensionality reduction (NDR) as a preprocessing strategy for feature selection and visualization, and then applies clustering algorithms to the reduced feature spaces. In order to estimate the effectiveness of NDR for capturing biologically relevant structures, a comparative analysis between NDR and PCA is applied to five real cancer expression datasets. Results show that NDR can perform better than PCA in visualization and clustering analysis of complex gene expression data. Copyright 2010 Elsevier Ltd. All rights reserved.

  5. Combining affinity propagation clustering and mutual information network to investigate key genes in fibroid.

    Science.gov (United States)

    Chen, Qian-Song; Wang, Dan; Liu, Bao-Lian; Gao, Shu-Feng; Gao, Dan-Li; Li, Gui-Rong

    2017-07-01

    The aim of the present study was to investigate key genes in fibroids based on the multiple affinity propagation-Krzanowski and Lai (mAP-KL) method, which included the maxT multiple hypothesis, Krzanowski and Lai (KL) cluster quality index, affinity propagation (AP) clustering algorithm and mutual information network (MIN) constructed by the context likelihood of relatedness (CLR) algorithm. In order to achieve this goal, mAP-KL was initially implemented to investigate exemplars in fibroid, and the maxT function was employed to rank the genes of training and test sets, and the top 200 genes were obtained for further study. In addition, the KL cluster index was applied to determine the quantity of clusters and the AP clustering algorithm was conducted to identify the clusters and their exemplars. Subsequently, the support vector machine (SVM) model was selected to evaluate the classification performance of mAP-KL. Finally, topological properties (degree, closeness, betweenness and transitivity) of exemplars in MIN constructed according to the CLR algorithm were assessed to investigate key genes in fibroid. The SVM model validated that the classification between normal controls and fibroid patients by mAP-KL had a good performance. A total of 9 clusters and exemplars were identified based on mAP-KL, which were comprised of CALCOCO2, COL4A2, COPS8, SNCG, PA2G4, C17orf70, MARK3, BTNL3 and TBC1D13. By accessing the topological analysis for exemplars in MIN, SNCG and COL4A2 were identified as the two most significant genes of four types of methods, and they were denoted as key genes in the progress of fibroid. In conclusion, two key genes (SNCG and COL4A2) and 9 exemplars were successfully investigated, and these may be potential biomarkers for the detection and treatment of fibroid.

  6. Structural variation of the ribosomal gene cluster within the class Insecta

    Energy Technology Data Exchange (ETDEWEB)

    Mukha, D.V.; Sidorenko, A.P.; Lazebnaya, I.V. [Vavilov Institute of General Genetics, Moscow (Russian Federation)] [and others

    1995-09-01

    General estimation of ribosomal DNA variation within the class Insecta is presented. It is shown that, using blot-hybridization, one can detect differences in the structure of the ribosomal gene cluster not only between genera within an order, but also between species within a genus, including sibling species. Structure of the ribosomal gene cluster of the Coccinellidae family (ladybirds) is analyzed. It is shown that cloned highly conservative regions of ribosomal DNA of Tetrahymena pyriformis can be used as probes for analyzing ribosomal genes in insects. 24 refs., 4 figs.

  7. Identification and manipulation of the pleuromutilin gene cluster from Clitopilus passeckerianus for increased rapid antibiotic production

    Science.gov (United States)

    Bailey, Andy M.; Alberti, Fabrizio; Kilaru, Sreedhar; Collins, Catherine M.; de Mattos-Shipley, Kate; Hartley, Amanda J.; Hayes, Patrick; Griffin, Alison; Lazarus, Colin M.; Cox, Russell J.; Willis, Christine L.; O'Dwyer, Karen; Spence, David W.; Foster, Gary D.

    2016-05-01

    Semi-synthetic derivatives of the tricyclic diterpene antibiotic pleuromutilin from the basidiomycete Clitopilus passeckerianus are important in combatting bacterial infections in human and veterinary medicine. These compounds belong to the only new class of antibiotics for human applications, with novel mode of action and lack of cross-resistance, representing a class with great potential. Basidiomycete fungi, being dikaryotic, are not generally amenable to strain improvement. We report identification of the seven-gene pleuromutilin gene cluster and verify that using various targeted approaches aimed at increasing antibiotic production in C. passeckerianus, no improvement in yield was achieved. The seven-gene pleuromutilin cluster was reconstructed within Aspergillus oryzae giving production of pleuromutilin in an ascomycete, with a significant increase (2106%) in production. This is the first gene cluster from a basidiomycete to be successfully expressed in an ascomycete, and paves the way for the exploitation of a metabolically rich but traditionally overlooked group of fungi.

  8. Fine Mapping of Two Wheat Powdery Mildew Resistance Genes Located at the Pm1 Cluster

    Directory of Open Access Journals (Sweden)

    Junchao Liang

    2016-07-01

    Full Text Available Powdery mildew caused by Blumeria graminis (DC.) f. sp. tritici is a globally devastating foliar disease of wheat (Triticum aestivum L.). More than a dozen genes against this disease, identified from wheat germplasms of different ploidy levels, have been mapped to the region surrounding the locus on the long arm of chromosome 7A, which forms a resistance (R)-gene cluster. and from einkorn wheat (Triticum monococcum L.) were two of the genes belonging to this cluster. This study was initiated to fine map these two genes toward map-based cloning. Comparative genomics study showed that macrocolinearity exists between Brachypodium distachyon L. chromosome 1 (Bd1) and the – region, which allowed us to develop markers based on the wheat sequences orthologous to genes contained in the Bd1 region. With these and other newly developed and published markers, high-resolution maps were constructed for both and using large F populations. Moreover, a physical map of was constructed through chromosome walking with bacterial artificial chromosome (BAC) clones and comparative mapping. Eventually, and were restricted to a 0.12- and 0.86-cM interval, respectively. Based on the closely linked common markers, , , and (another powdery mildew resistance gene in the cluster) were not allelic to one another. Severe recombination suppression and disruption of synteny were noted in the region encompassing . These results provided useful information for map-based cloning of the genes in the cluster and interpretation of their evolution.

  9. Discovery of Unusual Biaryl Polyketides by Activation of a Silent Streptomyces venezuelae Biosynthetic Gene Cluster.

    Science.gov (United States)

    Thanapipatsiri, Anyarat; Gomez-Escribano, Juan Pablo; Song, Lijiang; Bibb, Maureen J; Al-Bassam, Mahmoud; Chandra, Govind; Thamchaipenet, Arinthip; Challis, Gregory L; Bibb, Mervyn J

    2016-11-17

    Comparative transcriptional profiling of a ΔbldM mutant of Streptomyces venezuelae with its unmodified progenitor revealed that the expression of a cryptic biosynthetic gene cluster containing both type I and type III polyketide synthase genes is activated in the mutant. The 29.5 kb gene cluster, which was predicted to encode an unusual biaryl metabolite, which we named venemycin, and potentially halogenated derivatives, contains 16 genes including one, vemR, that encodes a transcriptional activator of the large ATP-binding LuxR-like (LAL) family. Constitutive expression of vemR in the ΔbldM mutant led to the production of sufficient venemycin for structural characterisation, confirming its unusual biaryl structure. Co-expression of the venemycin biosynthetic gene cluster and vemR in the heterologous host Streptomyces coelicolor also resulted in venemycin production. Although the gene cluster encodes two halogenases and a flavin reductase, constitutive expression of all three genes led to the accumulation only of a monohalogenated venemycin derivative, both in the native producer and the heterologous host. A competition experiment in which equimolar quantities of sodium chloride and sodium bromide were fed to the venemycin-producing strains resulted in the preferential incorporation of bromine, thus suggesting that bromide is the preferred substrate for one or both halogenases. © 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.

  10. A CLUSTERING OF DJA STOCKS - THE APPLICATION IN FINANCE OF A METHOD FIRST USED IN GENE TRAJECTORY STUDY

    Directory of Open Access Journals (Sweden)

    Silaghi Gheorghe Cosmin

    2009-05-01

    Full Text Available Previously we employed the Gene Trajectory Clustering methodology to search for different associations of the stocks composing the DJA index, with the aim of finding different, logic clusters, supported by economic reasons, preferably different than the

  11. MS/MS networking guided analysis of molecule and gene cluster families

    Science.gov (United States)

    Nguyen, Don Duy; Wu, Cheng-Hsuan; Moree, Wilna J.; Lamsa, Anne; Medema, Marnix H.; Zhao, Xiling; Gavilan, Ronnie G.; Aparicio, Marystella; Atencio, Librada; Jackson, Chanaye; Ballesteros, Javier; Sanchez, Joel; Watrous, Jeramie D.; Phelan, Vanessa V.; van de Wiel, Corine; Kersten, Roland D.; Mehnaz, Samina; De Mot, René; Shank, Elizabeth A.; Charusanti, Pep; Nagarajan, Harish; Duggan, Brendan M.; Moore, Bradley S.; Bandeira, Nuno; Palsson, Bernhard Ø.; Pogliano, Kit; Gutiérrez, Marcelino; Dorrestein, Pieter C.

    2013-01-01

    The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations for 60 microbes at one time instead of connecting one molecule to one organism at a time, as is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779T. The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family–gene cluster families of hundreds or more diverse organisms in one single MS/MS network. PMID:23798442

  12. A genomics based discovery of secondary metabolite biosynthetic gene clusters in Aspergillus ustus.

    Directory of Open Access Journals (Sweden)

    Borui Pi

    Full Text Available Secondary metabolites (SMs) produced by Aspergillus have been extensively studied for their crucial roles in human health, medicine and industrial production. However, the resulting information is almost exclusively derived from a few model organisms, including A. nidulans and A. fumigatus, and little is known about rare pathogens. In this study, we performed a genomics based discovery of SM biosynthetic gene clusters in Aspergillus ustus, a rare human pathogen. A total of 52 gene clusters were identified in the draft genome of A. ustus 3.3904, such as the sterigmatocystin biosynthesis pathway that is commonly found in Aspergillus species. In addition, several SM biosynthetic gene clusters that were possibly acquired by horizontal gene transfer were identified in Aspergillus for the first time, including the vrt cluster that is responsible for viridicatumtoxin production. Comparative genomics revealed that A. ustus shared the largest number of SM biosynthetic gene clusters with A. nidulans, but far fewer with other Aspergilli such as A. niger and A. oryzae. These findings should help to understand the diversity and evolution of SM biosynthesis pathways in the genus Aspergillus, and we hope they will also promote the development of fungal identification methodology in the clinic.

  13. Regulation of Three Nitrogenase Gene Clusters in the Cyanobacterium Anabaena variabilis ATCC 29413

    Directory of Open Access Journals (Sweden)

    Teresa Thiel

    2014-12-01

    Full Text Available The filamentous cyanobacterium Anabaena variabilis ATCC 29413 fixes nitrogen under aerobic conditions in specialized cells called heterocysts that form in response to an environmental deficiency in combined nitrogen. Nitrogen fixation is mediated by the enzyme nitrogenase, which is very sensitive to oxygen. Heterocysts are microxic cells that allow nitrogenase to function in a filament comprised primarily of vegetative cells that produce oxygen by photosynthesis. A. variabilis is unique among well-characterized cyanobacteria in that it has three nitrogenase gene clusters that encode different nitrogenases, which function under different environmental conditions. The nif1 genes encode a Mo-nitrogenase that functions only in heterocysts, even in filaments grown anaerobically. The nif2 genes encode a different Mo-nitrogenase that functions in vegetative cells, but only in filaments grown under anoxic conditions. An alternative V-nitrogenase is encoded by vnf genes that are expressed only in heterocysts in an environment that is deficient in Mo. Thus, these three nitrogenases are expressed differentially in response to environmental conditions. The entire nif1 gene cluster, comprising at least 15 genes, is primarily under the control of the promoter for the first gene, nifB1. Transcriptional control of many of the downstream nif1 genes occurs by a combination of weak promoters within the coding regions of some downstream genes and by RNA processing, which is associated with increased transcript stability. The vnf genes show a similar pattern of transcriptional and post-transcriptional control of expression suggesting that the complex pattern of regulation of the nif1 cluster is conserved in other cyanobacterial nitrogenase gene clusters.

  14. Identification of biosynthetic gene clusters from metagenomic libraries using PPTase complementation in a Streptomyces host.

    Science.gov (United States)

    Bitok, J Kipchirchir; Lemetre, Christophe; Ternei, Melinda A; Brady, Sean F

    2017-09-01

    The majority of environmental bacteria are not readily cultured in the lab, leaving the natural products they make inaccessible using culture-dependent discovery methods. Cloning and heterologous expression of DNA extracted from environmental samples (environmental DNA, eDNA) provides a means of circumventing this discovery bottleneck. To facilitate the identification of clones containing biosynthetic gene clusters, we developed a model heterologous expression reporter strain Streptomyces albus::bpsA ΔPPTase. This strain carries a 4′-phosphopantetheinyl transferase (PPTase)-dependent blue pigment synthase A gene, bpsA, in a PPTase deletion background. eDNA clones that express a functional PPTase restore production of the blue pigment, indigoidine. As PPTase genes often occur in biosynthetic gene clusters (BGCs), indigoidine production can be used to identify eDNA clones containing BGCs. We screened a soil eDNA library hosted in S. albus::bpsA ΔPPTase and identified clones containing non-ribosomal peptide synthetase (NRPS), polyketide synthase (PKS) and mixed NRPS/PKS biosynthetic gene clusters. One NRPS gene cluster was shown to confer the production of myxochelin A to S. albus::bpsA ΔPPTase. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Methods for simultaneously identifying coherent local clusters with smooth global patterns in gene expression profiles

    Directory of Open Access Journals (Sweden)

    Lee Yun-Shien

    2008-03-01

    Full Text Available Abstract Background The hierarchical clustering tree (HCT) with a dendrogram [1] and the singular value decomposition (SVD) with a dimension-reduced representative map [2] are popular methods for two-way sorting the gene-by-array matrix map employed in gene expression profiling. While HCT dendrograms tend to optimize local coherent clustering patterns, SVD leading eigenvectors usually identify better global grouping and transitional structures. Results This study proposes a flipping mechanism for a conventional agglomerative HCT using a rank-two ellipse (R2E), an improved SVD algorithm for sorting-purpose seriation by Chen [3], as an external reference. While HCTs always produce permutations with good local behaviour, the rank-two ellipse seriation gives the best global grouping patterns and smooth transitional trends. The resulting algorithm automatically integrates the desirable properties of each method so that users have access to a clustering and visualization environment for gene expression profiles that preserves coherent local clusters and identifies global grouping trends. Conclusion We demonstrate, through four examples, that the proposed method not only possesses better numerical and statistical properties, but also provides more meaningful biomedical insights than other sorting algorithms. We suggest that sorted proximity matrices for genes and arrays, in addition to the gene-by-array expression matrix, can greatly aid in the search for comprehensive understanding of gene expression structures. Software for the proposed methods can be obtained at http://gap.stat.sinica.edu.tw/Software/GAP.

  16. Sequencing and transcriptional analysis of the biosynthesis gene cluster of abscisic acid-producing Botrytis cinerea.

    Science.gov (United States)

    Gong, Tao; Shu, Dan; Yang, Jie; Ding, Zhong-Tao; Tan, Hong

    2014-09-29

    Botrytis cinerea is a model species with great importance as a pathogen of plants and has come to be used for the biotechnological production of ABA. The ABA cluster of B. cinerea is composed of an open reading frame without significant similarities (bcaba3), followed by the genes (bcaba1 and bcaba2) encoding P450 monooxygenases and a gene probably coding for a short-chain dehydrogenase/reductase (bcaba4). In B. cinerea ATCC58025, targeted inactivation of the genes in the cluster suggested that at least three genes are responsible for the hydroxylation at carbon atoms C-1' and C-4' or oxidation at C-4' of ABA. Our group has identified an ABA-overproducing strain, B. cinerea TB-3-H8. To differentiate TB-3-H8 from other B. cinerea strains with the functional ABA cluster, the DNA sequence of the 12.11-kb region containing the cluster of B. cinerea TB-3-H8 was determined. Full-length cDNAs were also isolated for bcaba1, bcaba2, bcaba3 and bcaba4 from B. cinerea TB-3-H8. Sequence comparison of the four genes and their flanking regions derived from B. cinerea TB-3-H8, B05.10 and T4, respectively, revealed that major variations were located in intergenic sequences. In B. cinerea TB-3-H8, the expression profiles of the four functional genes under ABA high-yield conditions were also analyzed by real-time PCR.

  17. Hessian regularization based non-negative matrix factorization for gene expression data clustering.

    Science.gov (United States)

    Liu, Xiao; Shi, Jun; Wang, Congzhi

    2015-01-01

    Since a key step in the analysis of gene expression data is to detect groups of genes that have similar expression patterns, clustering techniques are commonly used to analyze gene expression data. Data representation plays an important role in clustering analysis. Non-negative matrix factorization (NMF) is a widely used data representation method with great success in machine learning. Although the traditional manifold regularization method, Laplacian regularization (LR), can improve the performance of NMF, LR still suffers from weak extrapolating power. Hessian regularization (HR) is a newly developed manifold regularization method whose natural properties give it stronger extrapolating power, especially for small sample data. In this work, we propose the HR-based NMF (HR-NMF) algorithm, and then apply it to represent gene expression data for further clustering. The clustering experiments are conducted on five commonly used gene datasets, and the results indicate that the proposed HR-NMF outperforms LR-based NMF and the original NMF, which suggests the potential application of HR-NMF to gene expression data.
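
    As a rough illustration of the baseline this record builds on, the sketch below factorizes a small non-negative gene-by-sample matrix with the standard multiplicative-update rules for plain NMF. It does not include the Hessian (or Laplacian) regularization term described in the abstract; the toy matrix, the function names and the parameter defaults are all assumptions made for the example.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def frobenius_error(v, w, h):
    """Squared Frobenius norm of V - WH."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))

def nmf(v, rank, iters=200, seed=0, eps=1e-9):
    """Plain NMF via multiplicative updates (no manifold regularization)."""
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    # Strictly positive random initialisation keeps the updates well defined.
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    for _ in range(iters):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[k][j] * num[k][j] / (den[k][j] + eps) for j in range(m)]
             for k in range(rank)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][k] * num[i][k] / (den[i][k] + eps) for k in range(rank)]
             for i in range(n)]
    return w, h

# Hypothetical 4 genes x 4 samples matrix with two expression blocks.
V = [[5.0, 5.0, 0.0, 0.0],
     [4.0, 4.0, 0.0, 0.0],
     [0.0, 0.0, 3.0, 3.0],
     [0.0, 0.0, 6.0, 6.0]]

W0, H0 = nmf(V, rank=2, iters=0)   # the random starting point
W, H = nmf(V, rank=2, iters=200)   # after 200 update sweeps
err_start = frobenius_error(V, W0, H0)
err_end = frobenius_error(V, W, H)
```

    In a clustering setting, each gene (row of W) would typically be assigned to the factor with the largest coefficient; a regularized variant such as HR-NMF adds a penalty term to the objective and changes the update rules accordingly.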

  18. A remarkably stable TipE gene cluster: evolution of insect Para sodium channel auxiliary subunits

    Directory of Open Access Journals (Sweden)

    Li Jia

    2011-11-01

    Full Text Available Abstract Background First identified in fruit flies with temperature-sensitive paralysis phenotypes, the Drosophila melanogaster TipE locus encodes four voltage-gated sodium (NaV) channel auxiliary subunits. This cluster of TipE-like genes on chromosome 3L, and a fifth family member on chromosome 3R, are important for the optimal expression and functionality of the Para NaV channel but appear quite distinct from auxiliary subunits in vertebrates. Here, we exploited available arthropod genomic resources to trace the origin of TipE-like genes by mapping their evolutionary histories and examining their genomic architectures. Results We identified a remarkably conserved synteny block of TipE-like orthologues with well-maintained local gene arrangements from 21 insect species. Homologues in the water flea, Daphnia pulex, suggest an ancestral pancrustacean repertoire of four TipE-like genes; a subsequent gene duplication may have generated functional redundancy allowing gene losses in the silk moth and mosquitoes. Intronic nesting of the insect TipE gene cluster probably occurred following the divergence from crustaceans, but in the flour beetle and silk moth genomes the clusters apparently escaped from nesting. Across Pancrustacea, TipE gene family members have experienced intronic nesting, escape from nesting, retrotransposition, translocation, and gene loss events while generally maintaining their local gene neighbourhoods. D. melanogaster TipE-like genes exhibit coordinated spatial and temporal regulation of expression distinct from their host gene but well-correlated with their regulatory target, the Para NaV channel, suggesting that functional constraints may preserve the TipE gene cluster. We identified homology between TipE-like NaV channel regulators and vertebrate Slo-beta auxiliary subunits of big-conductance calcium-activated potassium (BKCa) channels, which suggests that ion channel regulatory partners have evolved distinct lineage

  19. plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters

    DEFF Research Database (Denmark)

    Kautsar, Satria A.; Suarez Duran, Hernando G.; Blin, Kai

    2017-01-01

    Plant specialized metabolites are chemically highly diverse, play key roles in host-microbe interactions, have important nutritional value in crops and are frequently applied as medicines. It has recently become clear that plant biosynthetic pathway-encoding genes are sometimes densely clustered in specific genomic loci: biosynthetic gene clusters (BGCs). Here, we introduce plantiSMASH, a versatile online analysis platform that automates the identification of candidate plant BGCs. Moreover, it allows integration of transcriptomic data to prioritize candidate BGCs based on the coexpression patterns of predicted biosynthetic enzyme-coding genes, and facilitates comparative genomic analysis to study the evolutionary conservation of each cluster. Applied on 48 high-quality plant genomes, plantiSMASH identifies a rich diversity of candidate plant BGCs. These results will guide further experimental...

  20. The CHRNA5-A3-B4 Gene Cluster and Smoking: From Discovery to Therapeutics.

    Science.gov (United States)

    Lassi, Glenda; Taylor, Amy E; Timpson, Nicholas J; Kenny, Paul J; Mather, Robert J; Eisen, Tim; Munafò, Marcus R

    2016-12-01

    Genome-wide association studies (GWASs) have identified associations between the CHRNA5-CHRNA3-CHRNB4 gene cluster and smoking heaviness and nicotine dependence. Studies in rodents have described the anatomical localisation and function of the nicotinic acetylcholine receptors (nAChRs) formed by the subunits encoded by this gene cluster. Further investigations that complemented these studies highlighted the variability of individuals' smoking behaviours and their ability to adjust nicotine intake. GWASs of smoking-related health outcomes have also identified this signal in the CHRNA5-CHRNA3-CHRNB4 gene cluster. This insight underpins approaches to strengthen causal inference in observational data. Combining genetic and mechanistic studies of nicotine dependence and smoking heaviness may reveal novel targets for medication development. Validated targets can inform genetic therapeutic interventions for smoking cessation and tobacco-related diseases. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.

  1. Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives

    Directory of Open Access Journals (Sweden)

    Martin PGP

    2007-01-01

    Full Text Available Microarray data acquired during time-course experiments allow the temporal variations in gene expression to be monitored. An original postprandial fasting experiment was conducted in the mouse and the expression of 200 genes was monitored with a dedicated macroarray at 11 time points between 0 and 72 hours of fasting. The aim of this study was to provide a relevant clustering of gene expression temporal profiles. This was achieved by focusing on the shapes of the curves rather than on the absolute level of expression. Actually, we combined spline smoothing and first derivative computation with hierarchical and partitioning clustering. A heuristic approach was proposed to tune the spline smoothing parameter using both statistical and biological considerations. Clusters are illustrated a posteriori through principal component analysis and heatmap visualization. Most results were found to be in agreement with the literature on the effects of fasting on the mouse liver and provide promising directions for future biological investigations.
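
    The idea of comparing curve shapes rather than absolute expression levels can be sketched as follows. This is a simplified stand-in, not the method of the record: plain finite differences replace the smoothing-spline derivatives, and the time points and expression profiles are hypothetical values chosen for the example.

```python
import math

def first_derivative(times, values):
    """Finite-difference slope between consecutive time points.

    A crude substitute for the derivative of a fitted smoothing spline.
    """
    return [(values[i + 1] - values[i]) / (times[i + 1] - times[i])
            for i in range(len(values) - 1)]

def euclidean(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def shape_distance(times, profile_a, profile_b):
    """Distance between two profiles computed on their slopes only."""
    return euclidean(first_derivative(times, profile_a),
                     first_derivative(times, profile_b))

# Hypothetical, unevenly spaced sampling times (hours) and three profiles.
times = [0, 3, 6, 12]
gene_a = [1.0, 2.0, 4.0, 3.0]
gene_b = [11.0, 12.0, 14.0, 13.0]   # same shape as gene_a, shifted up by 10
gene_c = [4.0, 3.0, 1.0, 2.0]       # mirrored shape at a similar level

raw_ab = euclidean(gene_a, gene_b)                # large: levels differ
shape_ab = shape_distance(times, gene_a, gene_b)  # zero: shapes match
shape_ac = shape_distance(times, gene_a, gene_c)  # positive: shapes differ
```

    Feeding a matrix of such shape distances into a hierarchical or partitioning algorithm groups gene_a with gene_b despite their different expression levels, which is the behaviour the abstract describes.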

  2. Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives

    Directory of Open Access Journals (Sweden)

    S. Déjean

    2007-06-01

    Full Text Available Microarray data acquired during time-course experiments allow the temporal variations in gene expression to be monitored. An original postprandial fasting experiment was conducted in the mouse and the expression of 200 genes was monitored with a dedicated macroarray at 11 time points between 0 and 72 hours of fasting. The aim of this study was to provide a relevant clustering of gene expression temporal profiles. This was achieved by focusing on the shapes of the curves rather than on the absolute level of expression. Actually, we combined spline smoothing and first derivative computation with hierarchical and partitioning clustering. A heuristic approach was proposed to tune the spline smoothing parameter using both statistical and biological considerations. Clusters are illustrated a posteriori through principal component analysis and heatmap visualization. Most results were found to be in agreement with the literature on the effects of fasting on the mouse liver and provide promising directions for future biological investigations.

  3. Form gene clustering method about pan-ethnic-group products based on emotional semantic

    Science.gov (United States)

    Chen, Dengkai; Ding, Jingjing; Gao, Minzhuo; Ma, Danping; Liu, Donghui

    2016-09-01

    The use of pan-ethnic-group products form knowledge primarily depends on a designer's subjective experience without user participation. The majority of studies primarily focus on the detection of the perceptual demands of consumers from the target product category. A pan-ethnic-group products form gene clustering method based on emotional semantic is constructed. Consumers' perceptual images of the pan-ethnic-group products are obtained by means of product form gene extraction and coding and computer aided product form clustering technology. A case of form gene clustering about the typical pan-ethnic-group products is investigated which indicates that the method is feasible. This paper opens up a new direction for the future development of product form design which improves the agility of product design process in the era of Industry 4.0.

  4. Evaluation of gene-expression clustering via mutual information distance measure

    Directory of Open Access Journals (Sweden)

    Maimon Oded

    2007-03-01

    Full Text Available Abstract Background The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare different clustering solutions when using the Mutual Information (MI) measure versus the use of the well known Euclidean distance and Pearson correlation coefficient. Results Relying on several public gene expression datasets, we evaluate the homogeneity and separation scores of different clustering solutions. It was found that the use of the MI measure yields a more significant differentiation among erroneous clustering solutions. The proposed measure was also used to analyze the performance of several known clustering algorithms. A comparative study of these algorithms reveals that their "best solutions" are ranked almost oppositely when using different distance measures, despite the found correspondence between these measures when analysing the averaged scores of groups of solutions. Conclusion In view of the results, further attention should be paid to the selection of a proper distance measure for analyzing the clustering of gene expression data.
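
    To make the contrast between these measures concrete, the sketch below compares a simple plug-in mutual information estimate with the Pearson correlation on a pair of profiles related by a non-linear (quadratic) dependence. The equal-width binning, the number of bins and the toy profiles are assumptions for illustration and are not taken from the study.

```python
import math
from collections import Counter

def discretize(values, bins=3):
    """Map each value to an equal-width bin index in [0, bins - 1]."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0   # avoid a zero width for constant input
    return [min(int((v - lo) / width), bins - 1) for v in values]

def mutual_information(x, y, bins=3):
    """Plug-in MI estimate (in bits) between two discretized profiles."""
    dx, dy = discretize(x, bins), discretize(y, bins)
    n = len(dx)
    px, py, pxy = Counter(dx), Counter(dy), Counter(zip(dx, dy))
    return sum((c / n) * math.log2((c / n) / ((px[a] / n) * (py[b] / n)))
               for (a, b), c in pxy.items())

def pearson(x, y):
    """Pearson correlation; assumes neither profile is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Hypothetical profiles: the second is a deterministic but non-linear
# function of the first.
profile_x = [-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0]
profile_y = [v * v for v in profile_x]

r = pearson(profile_x, profile_y)               # no linear association
mi = mutual_information(profile_x, profile_y)   # dependence is still detected
```

    Here the correlation vanishes because the relationship is symmetric about zero, while the MI estimate stays clearly positive. The estimate depends on the binning, so in practice the number of bins would need to be chosen with the sample size in mind.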

  5. Gene microarray data analysis using parallel point-symmetry-based clustering.

    Science.gov (United States)

    Sarkar, Anasua; Maulik, Ujjwal

    2015-01-01

    Identification of co-expressed genes is the central goal in microarray gene expression analysis. Point-symmetry-based clustering is an important unsupervised learning technique for recognising symmetrical convex- or non-convex-shaped clusters. To enable fast clustering of large microarray data, we propose a distributed time-efficient scalable approach for point-symmetry-based K-Means algorithm. A natural basis for analysing gene expression data using symmetry-based algorithm is to group together genes with similar symmetrical expression patterns. This new parallel implementation also satisfies linear speedup in timing without sacrificing the quality of clustering solution on large microarray data sets. The parallel point-symmetry-based K-Means algorithm is compared with another new parallel symmetry-based K-Means and existing parallel K-Means over eight artificial and benchmark microarray data sets, to demonstrate its superiority, in both timing and validity. The statistical analysis is also performed to establish the significance of this message-passing-interface based point-symmetry K-Means implementation. We also analysed the biological relevance of clustering solutions.

  6. Cloning of the biosynthetic gene cluster for naphthoxanthene antibiotic FD-594 from Streptomyces sp. TA-0256.

    Science.gov (United States)

    Kudo, Fumitaka; Yonezawa, Takanori; Komatsubara, Akiko; Mizoue, Kazutoshi; Eguchi, Tadashi

    2011-01-01

    FD-594 is a unique pyrano[4',3':6,7]naphtho[1,2-b]xanthene polyketide with a trisaccharide of 2,6-dideoxysugars. In this study, we cloned the FD-594 biosynthetic gene cluster from the producer strain Streptomyces sp. TA-0256 to investigate its biosynthesis. The identified pnx gene cluster was 38143 bp, consisting of 40 open reading frames, including a minimal PKS gene, TDP-olivose biosynthetic genes, two glycosyltransferase genes, two methyltransferase genes and many oxygenase/reductase genes. Most of the enzymes encoded in the pnx cluster were reasonably assigned to a plausible biosynthetic pathway for FD-594, in which a unique ring opening process via Baeyer-Villiger-type oxidation, catalyzed by a putative flavin adenine dinucleotide (FAD)-dependent monooxygenase, is speculated to lead to the unique xanthene structure. To clarify the involvement of pnx genes in FD-594 biosynthesis, a glycosyltransferase, PnxGT2, and a methyltransferase, PnxMT2, were characterized enzymatically with the recombinant proteins expressed in Escherichia coli. As a result, PnxGT2 catalyzed the triple olivose transfers to the FD-594 aglycon with TDP-olivose as the glycosyl donor to afford triolivoside. Surprisingly, in the PnxGT2 enzymatic reaction, tetraolivoside and pentaolivoside were significantly detected along with the expected triolivoside. To our knowledge, PnxGT2 is the first contiguous oligosaccharide-forming glycosyltransferase in secondary metabolism. Furthermore, addition of PnxMT2 and S-adenosyl-L-methionine to the PnxGT2 reaction mixture afforded natural FD-594, confirming that the PnxGT2 reaction product was the expected regiospecifically glycosylated compound. Consequently, the identified pnx gene cluster appears to be involved in FD-594 biosynthesis.

  7. Rearrangements of the beta-globin gene cluster in apparently typical betaS haplotypes.

    Science.gov (United States)

    Zago, M A; Silva, W A; Gualandro, S; Yokomizu, I K; Araujo, A G; Tavela, M H; Gerard, N; Krishnamoorthy, R; Elion, J

    2001-02-01

    The majority of the chromosomes with the betaS gene have one of the five common haplotypes, designated as Benin, Bantu, Senegal, Cameroon, and Arab-Indian haplotypes. However, 5-10% of the chromosomes have less common haplotypes, usually referred to as atypical haplotypes. We have demonstrated that most atypical haplotypes are generated by recombinations. The present study was carried out in order to explore whether recombination also occurs in chromosomes with the common (or typical) haplotypes. We screened the HS-2 region of the beta-globin gene locus control region (LCR) in 244 sickle cell patients who had typical restriction fragment length polymorphism (RFLP)-defined haplotypes of the betaS-gene cluster. For 14 cases in which the expected and the observed LCR repeat-sequence sizes were discrepant, the analysis was extended to other unexplored polymorphic markers of the betaS-globin gene cluster, i.e., pre-Ggamma framework, pre-Ggamma 6-bp deletion, HS-2 LCR (AT)xR(AT)y and pre-beta(AT)xTy repeats, and the intragenic beta-globin gene framework. In all 14 cases (15 chromosomes) in which the LCR repeat-sequence sizes were discrepant, a recombination involving a typical 3' segment of the betaS globin gene cluster was demonstrated. In most of the cases, the recombination site was located between the beta-globin gene and the betaLCR. Nine cases involving recombination were detected among 156 Brazilian HbS homozygotes and five among 88 African patients homozygous for the Benin haplotype. INTERPRETATION AND CONCLUSIONS. Thus, 3.1% of apparently typical haplotypes linked to the sickle cell gene involve recombinations similar to those that generate the atypical haplotypes, a finding that reinforces the picture of the beta-globin gene cluster as highly dynamic.

  8. A genome-wide analysis of nonribosomal peptide synthetase gene clusters and their peptides in a Planktothrix rubescens strain

    Directory of Open Access Journals (Sweden)

    Nederbragt Alexander J

    2009-08-01

    Full Text Available Abstract Background Cyanobacteria often produce several different oligopeptides, with unknown biological functions, by nonribosomal peptide synthetases (NRPS). Although some cyanobacterial NRPS gene cluster types are well described, the entire NRPS genomic content within a single cyanobacterial strain has never been investigated. Here we have combined a genome-wide analysis using massive parallel pyrosequencing ("454") and mass spectrometry screening of oligopeptides produced in the strain Planktothrix rubescens NIVA CYA 98 in order to identify all putative gene clusters for oligopeptides. Results Thirteen types of oligopeptides were uncovered by mass spectrometry (MS) analyses. Microcystin, cyanopeptolin and aeruginosin synthetases, highly similar to already characterized NRPS, were present in the genome. Two novel NRPS gene clusters were associated with production of anabaenopeptins and microginins, respectively. Sequence-depth of the genome and real-time PCR data revealed three copies of the microginin gene cluster. Since NRPS gene cluster candidates for microviridin and oscillatorin synthesis could not be found, putative (gene-encoded) precursor peptide sequences to microviridin and oscillatorin were found in the genes mdnA and oscA, respectively. The genes flanking the microviridin and oscillatorin precursor genes encode putative modifying enzymes of the precursor oligopeptides. We therefore propose ribosomal pathways involving modifications and cyclisation for microviridin and oscillatorin. The microviridin, anabaenopeptin and cyanopeptolin gene clusters are situated in close proximity to each other, constituting an oligopeptide island. Conclusion Altogether seven nonribosomal peptide synthetase (NRPS) gene clusters and two gene clusters putatively encoding ribosomal oligopeptide biosynthetic pathways were revealed. Our results demonstrate that whole genome shotgun sequencing combined with MS-directed determination of oligopeptides successfully

  9. A phase synchronization clustering algorithm for identifying interesting groups of genes from cell cycle expression data

    Directory of Open Access Journals (Sweden)

    Tcha Hong

    2008-01-01

    Full Text Available Abstract Background Previous studies of genome-wide expression patterns show that a certain percentage of genes are cell cycle regulated. The expression data have been analyzed in a number of different ways to identify cell cycle dependent genes. In this study, we pose the hypothesis that cell cycle dependent genes can be considered as oscillating systems with a rhythm, i.e. systems producing response signals with period and frequency. Therefore, we are motivated to apply the theory of multivariate phase synchronization for clustering cell cycle specific genome-wide expression data. Results We propose a strategy to find groups of genes according to the specific biological process by analyzing cell cycle specific gene expression data. To evaluate the proposed method, we use the modified Kuramoto model, which is a phase governing equation that provides the long-term dynamics of globally coupled oscillators. With this equation, we simulate two groups of expression signals, and the simulated signals from each group share their own common rhythm. Then, the simulated expression data are mixed with randomly generated expression data to be used as the input data set for the algorithm. Using these simulated expression data, it is shown that the algorithm is able to identify expression signals that are involved in the same oscillating process. We also evaluate the method with yeast cell cycle expression data. It is shown that the output clusters of the proposed algorithm include genes which are closely associated with each other by sharing significant Gene Ontology terms of biological process and/or having relatively many known biological interactions. Therefore, the evaluation analysis indicates that the method is able to identify expression signals according to the specific biological process. Our evaluation analysis also indicates that some portion of output by the proposed algorithm is not obtainable by the traditional clustering algorithm with

  10. Identification of a gene cluster associated with triclosan catabolism.

    Science.gov (United States)

    Kagle, Jeanne M; Paxson, Clayton; Johnstone, Precious; Hay, Anthony G

    2015-06-01

    Aerobic degradation of bis-aryl ethers like the antimicrobial triclosan typically proceeds through oxygenase-dependent catabolic pathways. Although several studies have reported on bacteria capable of degrading triclosan aerobically, there are no reports describing the genes responsible for this process. In this study, a gene encoding the large subunit of a putative triclosan oxygenase, designated tcsA was identified in a triclosan-degrading fosmid clone from a DNA library of Sphingomonas sp. RD1. Consistent with tcsA's similarity to two-part dioxygenases, a putative FMN-dependent ferredoxin reductase, designated tcsB was found immediately downstream of tcsA. Both tcsAB were found in the midst of a putative chlorocatechol degradation operon. We show that RD1 produces hydroxytriclosan and chlorocatechols during triclosan degradation and that tcsA is induced by triclosan. This is the first study to report on the genetics of triclosan degradation.

  11. Identification of certain cancer-mediating genes using Gaussian fuzzy cluster validity index.

    Science.gov (United States)

    Ghosh, Anupam; De, Rajat K

    2015-10-01

    In this article, we have used an index, called Gaussian fuzzy index (GFI), recently developed by the authors, based on the notion of fuzzy set theory, for validating the clusters obtained by a clustering algorithm applied on cancer gene expression data. GFI is then used for the identification of genes that have altered quite significantly from normal state to carcinogenic state with respect to their mRNA expression patterns. The effectiveness of the methodology has been demonstrated on three gene expression cancer datasets dealing with human lung, colon and leukemia. The performance of GFI is compared with 19 existing cluster validity indices. The results are appropriately validated biologically and statistically. In this context, we have used biochemical pathways, p-value statistics of GO attributes, t-test and z-score for the validation of the results. It has been reported that GFI is capable of identifying high-quality enriched clusters of genes, and thereby is able to select more cancer-mediating genes.

  12. Intact cluster and chordate-like expression of ParaHox genes in a sea star.

    Science.gov (United States)

    Annunziata, Rossella; Martinez, Pedro; Arnone, Maria Ina

    2013-06-27

    The ParaHox genes are thought to be major players in patterning the gut of several bilaterian taxa. Though this is a fundamental role that these transcription factors play, their activities are not limited to the endoderm and extend to both ectodermal and mesodermal tissues. Three genes compose the ParaHox group: Gsx, Xlox and Cdx. In some taxa (mostly chordates but to some degree also in protostomes) the three genes are arranged into a genomic cluster, in a similar fashion to what has been shown for the better-known Hox genes. Sea urchins possess the full complement of ParaHox genes but they are all dispersed throughout the genome, an arrangement that, perhaps, represented the primitive condition for all echinoderms. In order to understand the evolutionary history of this group of genes we cloned and characterized all ParaHox genes, studied their expression patterns and identified their genomic loci in a member of an earlier branching group of echinoderms, the asteroid Patiria miniata. We identified the three ParaHox orthologs in the genome of P. miniata. While one of them, PmGsx is provided as maternal message, with no zygotic activation afterwards, the other two, PmLox and PmCdx are expressed during embryogenesis, within restricted domains of both endoderm and ectoderm. Screening of a Patiria bacterial artificial chromosome (BAC) library led to the identification of a clone containing the three genes. The transcriptional directions of PmGsx and PmLox are opposed to that of the PmCdx gene within the cluster. The identification of P. miniata ParaHox genes has revealed the fact that these genes are clustered in the genome, in contrast to what has been reported for echinoids. Since the presence of an intact cluster, or at least a partial cluster, has been reported in chordates and polychaetes respectively, it becomes clear that within echinoderms, sea urchins have modified the original bilaterian arrangement. 
Moreover, the sea star ParaHox domains of expression show

  13. Mapping of the α4 subunit gene (GABRA4) to human chromosome 4 defines an α2-α4-β1-γ1 gene cluster: Further evidence that modern GABA_A receptor gene clusters are derived from an ancestral cluster

    Energy Technology Data Exchange (ETDEWEB)

    McLean, P.J.; Farb, D.H.; Russek, S.J. [Boston Univ. School of Medicine, MA (United States)] [and others]

    1995-04-10

    We demonstrated previously that an α1-β2-γ2 gene cluster of the γ-aminobutyric acid (GABA_A) receptor is located on human chromosome 5q34-q35 and that an ancestral α-β-γ gene cluster probably spawned clusters on chromosomes 4, 5, and 15. Here, we report that the α4 gene (GABRA4) maps to human chromosome 4p14-q12, defining a cluster comprising the α2, α4, β1, and γ1 genes. The existence of an α2-α4-β1-γ1 cluster on chromosome 4 and an α1-α6-β2-γ2 cluster on chromosome 5 provides further evidence that the number of ancestral GABA_A receptor subunit genes has been expanded by duplication within an ancestral gene cluster. Moreover, if duplication of the α gene occurred before duplication of the ancestral gene cluster, then a heretofore undiscovered subtype of a subunit should be located on human chromosome 15q11-q13 within an α5-αx-β3-γ3 gene cluster at the locus for Angelman and Prader-Willi syndromes. 34 refs., 6 figs., 1 tab.

  14. clusters

    Indian Academy of Sciences (India)

    2017-09-27

    Sep 27, 2017 ... while CuCoNO, Co3NO, Cu3CoNO, Cu2Co3NO, Cu3Co3NO and Cu6CoNO clusters display stronger chemical stability. Magnetic and electronic properties are also discussed. The magnetic moment is affected by charge transfer and the spd hybridization. Keywords. CumConNO (m + n = 2–7) clusters; ...

  15. Teaching Gene Technology in an Outreach Lab: Students' Assigned Cognitive Load Clusters and the Clusters' Relationships to Learner Characteristics, Laboratory Variables, and Cognitive Achievement

    Science.gov (United States)

    Scharfenberg, Franz-Josef; Bogner, Franz X.

    2013-01-01

    This study classified students into different cognitive load (CL) groups by means of cluster analysis based on their experienced CL in a gene technology outreach lab which has instructionally been designed with regard to CL theory. The relationships of the identified student CL clusters to learner characteristics, laboratory variables, and…

  16. Identification of the Viridicatumtoxin and Griseofulvin Gene Clusters from Penicillium aethiopicum

    Science.gov (United States)

    Chooi, Yit-Heng; Cacho, Ralph; Tang, Yi

    2010-01-01

    SUMMARY Penicillium aethiopicum produces two structurally interesting and biologically active polyketides: the tetracycline-like viridicatumtoxin 1 and the classic antifungal agent griseofulvin 2. Here, we report the concurrent discovery of the two corresponding biosynthetic gene clusters (vrt and gsf) by 454 shotgun sequencing. Gene deletions confirmed that two nonreducing PKSs (NRPKS), vrtA and gsfA, are required for the biosynthesis of 1 and 2, respectively. Both PKSs share similar domain architectures and lack a C-terminal thioesterase domain. We identified gsfI as the chlorinase involved in the biosynthesis of 2, as deletion of gsfI resulted in the accumulation of dechlorogriseofulvin 3. Comparative analysis with the P. chrysogenum genome revealed that both clusters are embedded within conserved syntenic regions of P. aethiopicum chromosomes. Discovery of the vrt and gsf clusters provided the basis for genetic and biochemical studies of the pathways. PMID:20534346

  17. Evolutionary history of the phl gene cluster in the plant-associated bacterium Pseudomonas fluorescens

    NARCIS (Netherlands)

    Moynihan, J.A.; Morrissey, J.P.; Coppoolse, E.; Stiekema, W.J.; O'Gara, F.; Boyd, E.F.

    2009-01-01

    Pseudomonas fluorescens is of agricultural and economic importance as a biological control agent largely because of its plant-association and production of secondary metabolites, in particular 2,4-diacetylphloroglucinol (2,4-DAPG). This polyketide, which is encoded by the eight-gene phl cluster,

  18. Insights into secondary metabolism from a global analysis of prokaryotic biosynthetic gene clusters

    NARCIS (Netherlands)

    Cimermancic, P.; Medema, Marnix; Claesen, J.; Kurika, K.; Wieland Brown, L.C.; Mavrommatis, K.; Pati, A.; Godfrey, P.A.; Koehrsen, M.; Clardy, J.; Birren, B. W.; Takano, Eriko; Sali, A.; Linington, R.G.; Fischbach, M.A.

    2014-01-01

    Although biosynthetic gene clusters (BGCs) have been discovered for hundreds of bacterial metabolites, our knowledge of their diversity remains limited. Here, we used a novel algorithm to systematically identify BGCs in the extensive extant microbial sequencing data. Network analysis of the

  19. Molecular population genetics of the β-esterase gene cluster of ...

    Indian Academy of Sciences (India)

    We suggest that the demographic history (bottleneck and admixture of genetically differentiated populations) is the major factor shaping the pattern of nucleotide polymorphism in the β-esterase gene cluster. However, there are some 'footprints' of directional and balancing selection shaping specific distribution of nucleotide ...

  20. Molecular population genetics of the β-esterase gene cluster of ...

    Indian Academy of Sciences (India)

    Unknown

    neutrality with recombination are significant for the β-esterase gene cluster in the non-African samples but not significant in the African one. We suggest ...... I. Viability studies. Genetics 102, 467–483. Selva E. M., New L., Crouse G. F. and Lahue R. S. 1995 Mismatch correction acts as a barrier to homologous recombina-.

  1. GenClust: A genetic algorithm for clustering gene expression data

    Directory of Open Access Journals (Sweden)

    Raimondi Alessandra

    2005-12-01

    Full Text Available Abstract Background Clustering is a key step in the analysis of gene expression data, and in fact, many classical clustering algorithms are used, or more innovative ones have been designed and validated for the task. Despite the widespread use of artificial intelligence techniques in bioinformatics and, more generally, data analysis, there are very few clustering algorithms based on the genetic paradigm, yet that paradigm has great potential in finding good heuristic solutions to a difficult optimization problem such as clustering. Results GenClust is a new genetic algorithm for clustering gene expression data. It has two key features: (a) a novel coding of the search space that is simple, compact and easy to update; (b) it can be used naturally in conjunction with data-driven internal validation methods. We have experimented with the FOM methodology, specifically conceived for validating clusters of gene expression data. The validity of GenClust has been assessed experimentally on real data sets, both with the use of validation measures and in comparison with other algorithms, i.e., Average Link, Cast, Click and K-means. Conclusion Experiments show that none of the algorithms we have used is markedly superior to the others across data sets and validation measures; i.e., in many cases the observed differences between the worst and best performing algorithm may be statistically insignificant and they could be considered equivalent. However, there are cases in which an algorithm may be better than others and therefore worthwhile. In particular, experiments for GenClust show that, although simple in its data representation, it converges very rapidly to a local optimum and that its ability to identify meaningful clusters is comparable, and sometimes superior, to that of more sophisticated algorithms. In addition, it is well suited for use in conjunction with data-driven internal validation measures and, in particular, the FOM methodology.
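
    The FOM (figure of merit) validation mentioned in this record can be illustrated with a leave-one-condition-out check: cluster the genes on all conditions but one, then measure how tightly the held-out condition groups within each cluster. The sketch below is an assumption-laden illustration, not GenClust or the exact FOM variant used in the study. The tiny k-means, the toy expression matrix and every function name are hypothetical.

```python
import math


def kmeans(rows, k, iters=20):
    """Tiny deterministic k-means; the first k rows serve as initial centers."""
    centers = [list(r) for r in rows[:k]]
    labels = [0] * len(rows)
    for _ in range(iters):
        # Assign each row to its nearest center (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(row, centers[c])))
            for row in rows
        ]
        # Recompute each center as the mean of its members.
        for c in range(k):
            members = [row for row, lab in zip(rows, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def figure_of_merit(data, k):
    """Aggregate FOM: sum over held-out conditions of the within-cluster RMS."""
    n_genes, n_cond = len(data), len(data[0])
    total = 0.0
    for e in range(n_cond):
        reduced = [row[:e] + row[e + 1:] for row in data]  # drop condition e
        labels = kmeans(reduced, k)
        squared = 0.0
        for c in range(k):
            held_out = [row[e] for row, lab in zip(data, labels) if lab == c]
            if held_out:
                mean = sum(held_out) / len(held_out)
                squared += sum((v - mean) ** 2 for v in held_out)
        total += math.sqrt(squared / n_genes)
    return total


# Toy matrix (genes x conditions) with two obvious expression groups.
data = [
    [1.0, 1.1, 0.9, 1.0],
    [5.0, 5.1, 4.9, 5.0],
    [1.1, 0.9, 1.0, 1.1],
    [4.9, 5.0, 5.1, 4.9],
    [0.9, 1.0, 1.1, 0.9],
    [5.1, 4.9, 5.0, 5.1],
]
fom_one = figure_of_merit(data, 1)
fom_two = figure_of_merit(data, 2)
print(f"FOM with k=1: {fom_one:.3f}, with k=2: {fom_two:.3f}")
```

    A lower value means the clustering predicts the held-out condition better, so on this toy matrix two clusters score better than one. Because the measure needs only the data and the cluster labels, it can be paired with any clustering algorithm, which is what makes it a data-driven internal validation method.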

  2. Genomic and expression analysis of the vanG-like gene cluster of Clostridium difficile.

    Science.gov (United States)

    Peltier, Johann; Courtin, Pascal; El Meouche, Imane; Catel-Ferreira, Manuella; Chapot-Chartier, Marie-Pierre; Lemée, Ludovic; Pons, Jean-Louis

    2013-07-01

    Primary antibiotic treatment of Clostridium difficile intestinal diseases requires metronidazole or vancomycin therapy. A cluster of genes homologous to enterococcal glycopeptide resistance vanG genes was found in the genome of C. difficile 630, although this strain remains sensitive to vancomycin. This vanG-like gene cluster was found to consist of five ORFs: the regulatory region consisting of vanR and vanS and the effector region consisting of vanG, vanXY and vanT. We found that 57 out of 83 C. difficile strains, representative of the main lineages of the species, harbour this vanG-like cluster. The cluster is expressed as an operon and, when present, is found at the same genomic location in all strains. The vanG, vanXY and vanT homologues in C. difficile 630 are co-transcribed and expressed to a low level throughout the growth phases in the absence of vancomycin. Conversely, the expression of these genes is strongly induced in the presence of subinhibitory concentrations of vancomycin, indicating that the vanG-like operon is functional at the transcriptional level in C. difficile. Hydrophilic interaction liquid chromatography (HILIC-HPLC) and MS analysis of cytoplasmic peptidoglycan precursors of C. difficile 630 grown without vancomycin revealed the exclusive presence of a UDP-MurNAc-pentapeptide with an alanine at the C terminus. UDP-MurNAc-pentapeptide [d-Ala] was also the only peptidoglycan precursor detected in C. difficile grown in the presence of vancomycin, corroborating the lack of vancomycin resistance. Peptidoglycan structures of a vanG-like mutant strain and of a strain lacking the vanG-like cluster did not differ from the C. difficile 630 strain, indicating that the vanG-like cluster also has no impact on cell-wall composition.

  3. Identification, characterization and metagenome analysis of oocyte-specific genes organized in clusters in the mouse genome

    Directory of Open Access Journals (Sweden)

    Vaiman Daniel

    2005-05-01

    Full Text Available Abstract Background Genes specifically expressed in the oocyte play key roles in oogenesis, ovarian folliculogenesis, fertilization and/or early embryonic development. In an attempt to identify novel oocyte-specific genes in the mouse, we have used an in silico subtraction methodology, and we have focused our attention on genes that are organized in genomic clusters. Results In the present work, five clusters have been studied: a cluster of thirteen genes characterized by an F-box domain localized on chromosome 9, a cluster of six genes related to T-cell leukaemia/lymphoma protein 1 (Tcl1) on chromosome 12, a cluster composed of a SPErm-associated glutamate (E)-Rich (Speer) protein expressed in the oocyte in the vicinity of four unknown genes specifically expressed in the testis on chromosome 14, a cluster composed of the oocyte secreted protein-1 (Oosp-1) gene and two Oosp-related genes on chromosome 19, all three being characterized by a partial N-terminal zona pellucida-like domain, and another small cluster of two genes on chromosome 19 as well, composed of a TWIK-Related spinal cord K+ channel encoding-gene, and an unknown gene predicted in silico to be testis-specific. The specificity of expression was confirmed by RT-PCR and in situ hybridization for eight and five of them, respectively. Finally, we showed by comparing all of the isolated and clustered oocyte-specific genes identified so far in the mouse genome, that the oocyte-specific clusters are significantly closer to telomeres than isolated oocyte-specific genes are. Conclusion We have studied five clusters of genes specifically expressed in female, some of them being also expressed in male germ-cells. Moreover, contrary to non-clustered oocyte-specific genes, those that are organized in clusters tend to map near chromosome ends, suggesting that this specific near-telomere position of oocyte-clusters in rodents could constitute an evolutionary advantage. 
Understanding the biological

  4. Multiplexed CRISPR/Cas9- and TAR-Mediated Promoter Engineering of Natural Product Biosynthetic Gene Clusters in Yeast.

    Science.gov (United States)

    Kang, Hahk-Soo; Charlop-Powers, Zachary; Brady, Sean F

    2016-09-16

    The use of DNA sequencing to guide the discovery of natural products has emerged as a new paradigm for revealing chemistries encoded in bacterial genomes. A major obstacle to implementing this approach to natural product discovery is the transcriptional silence of biosynthetic gene clusters under laboratory growth conditions. Here we describe an improved yeast-based promoter engineering platform (mCRISTAR) that combines CRISPR/Cas9 and TAR to enable single-marker multiplexed promoter engineering of large gene clusters. mCRISTAR highlights the first application of the CRISPR/Cas9 system to multiplexed promoter engineering of natural product biosynthetic gene clusters. In this method, CRISPR/Cas9 is used to induce DNA double-strand breaks in promoter regions of biosynthetic gene clusters, and the resulting operon fragments are reassembled by TAR using synthetic gene-cluster-specific promoter cassettes. mCRISTAR uses a CRISPR array to simplify the construction of a CRISPR plasmid for multiplex CRISPR and a single auxotrophic selection to improve the inefficiency of using a CRISPR array for multiplex gene cluster refactoring. mCRISTAR is a simple and generic method for multiplexed replacement of promoters in biosynthetic gene clusters that will facilitate the discovery of natural products from the rapidly growing collection of gene clusters found in microbial genome and metagenome sequencing projects.

  5. Clustering of two genes putatively involved in cyanate detoxification evolved recently and independently in multiple fungal lineages

    Science.gov (United States)

    Fungi that have the enzymes cyanase and carbonic anhydrase show a limited capacity to detoxify cyanate, a fungicide employed by both plants and humans. Here, we describe a novel two-gene cluster that comprises duplicated cyanase and carbonic anhydrase copies, which we name the CCA gene cluster, trac...

  6. Gene clustering analysis in human osteoporosis disease and modifications of the jawbone.

    Science.gov (United States)

    Toti, Paolo; Sbordone, Carolina; Martuscelli, Ranieri; Califano, Luigi; Ramaglia, Luca; Sbordone, Ludovico

    2013-08-01

    This study analysed the genes involved in both osteoporosis and modifications of the jawbone through text mining, with a web search tool, of information regarding gene/protein interaction. The final set of genes involved in the present phenomenon was obtained by an expansion-filtering loop. Using web-available software (STRING), interactions among all genes were searched for, and a clustering procedure was performed in which only high-confidence predicted associations were considered. Two hundred forty-two genes potentially involved in osteoporosis and in modifications of the jawbone were recorded. Seven "leader genes" were identified (CTNNB1, IL1B, IL6, JUN, RUNX2, SPP1, TGFB1), while another 10 genes formed the cluster B group (BMP2, BMP7, COL1A1, ICAM1, IGF1, IL10, MMP9, NFKB1, TNFSF11, VEGFA). Ninety-eight genes had no interactions, and were defined as "orphan genes". The expansion of knowledge regarding the molecular basis causing osteoporotic traits has been brought about with the help of a de novo identification, based on the data mining of genes involved in osteoporosis and in modification of the jawbone. A comparison of the present data, in which no role was verified for 98 genes that had been previously supposed to have a role, with that of the literature, in which another 81 genes, as obtained from GWAS reviews and meta-analyses, appeared to be strongly associated with osteoporosis, probably attests to a lack of information on osteoporotic disease. Copyright © 2013 Elsevier Ltd. All rights reserved.

  7. Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury

    DEFF Research Database (Denmark)

    Ryge, J.; Winther, Ole; Wienecke, J.

    2010-01-01

    Background: Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence...... of modulatory inputs from the brain correlates with the development of spasticity. Results: Here we examine the dynamic transcriptional response of motor neurons to spinal cord injury as it evolves over time to unravel common gene expression patterns and their underlying regulatory mechanisms. For this we use...... a rat-tail-model with complete spinal cord transection causing injury-induced spasticity, where gene expression profiles are obtained from labeled motor neurons extracted with laser microdissection 0, 2, 7, 21 and 60 days post injury. Consensus clustering identifies 12 gene clusters with distinct time...

  8. cluster

    Indian Academy of Sciences (India)

    has been investigated electrochemically in positive and negative microenvironments, both in solution and in film. Charge nature around the active centre ... in plants, bacteria and also in mammals. This cluster is also an important constituent of a ..... selection of non-cysteine amino acid in the active centre of Rieske proteins.

  9. Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5

    Directory of Open Access Journals (Sweden)

    Neilan Brett A

    2009-03-01

    Full Text Available Abstract Background Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. Results We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. Conclusion The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. 
The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved

  10. Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5.

    Science.gov (United States)

    Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A

    2009-03-30

    Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. 
The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of

  11. A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data

    Directory of Open Access Journals (Sweden)

    Scherer Stephen W

    2011-05-01

    Full Text Available Abstract Background Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. Results We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. Conclusions The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.
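
    The scan-statistic idea in this record (slide a window, score each candidate cluster, keep the best one) can be sketched as follows. The abstract does not state the authors' test statistic or how pathway topology defines the windows, so this example makes two loud assumptions: genes are laid out in a simple linear order, and each window is scored with a Bernoulli-model log-likelihood ratio of the kind used in spatial scan statistics. The counts and all names are hypothetical.

```python
import math


def _xlogy(x, y):
    """x * log(y) with the convention 0 * log(0) = 0."""
    return x * math.log(y) if x > 0 else 0.0


def _loglik(cases, total, p):
    """Bernoulli log-likelihood of `cases` successes among `total` trials."""
    return _xlogy(cases, p) + _xlogy(total - cases, 1.0 - p)


def log_likelihood_ratio(c_in, n_in, c_tot, n_tot):
    """Score a window holding c_in case hits among n_in CNV hits overall."""
    c_out, n_out = c_tot - c_in, n_tot - n_in
    if n_in == 0 or n_out == 0:
        return 0.0
    p_in, p_out = c_in / n_in, c_out / n_out
    if p_in <= p_out:  # only windows enriched for cases are of interest
        return 0.0
    alt = _loglik(c_in, n_in, p_in) + _loglik(c_out, n_out, p_out)
    null = _loglik(c_tot, n_tot, c_tot / n_tot)
    return alt - null


def best_window(cases, controls, max_width=3):
    """Return (LLR, (start, end)) for the highest-scoring run of genes."""
    totals = [a + b for a, b in zip(cases, controls)]
    c_tot, n_tot = sum(cases), sum(totals)
    best = (0.0, None)
    for start in range(len(cases)):
        for end in range(start + 1, min(start + max_width, len(cases)) + 1):
            score = log_likelihood_ratio(
                sum(cases[start:end]), sum(totals[start:end]), c_tot, n_tot)
            if score > best[0]:
                best = (score, (start, end))
    return best


# Hypothetical per-gene counts of rare CNV hits in cases and in controls.
cases = [1, 0, 6, 7, 1, 0]
controls = [1, 1, 0, 1, 1, 1]
llr, window = best_window(cases, controls)
print(f"best window: genes {window}, LLR = {llr:.2f}")
```

    Because only the single best window is reported, its significance would normally be judged against the distribution of the maximum score under permuted case/control labels rather than per-window p-values; that permutation step is omitted here for brevity.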

  12. Two Gene Clusters Coordinate Galactose and Lactose Metabolism in Streptococcus gordonii

    Science.gov (United States)

    Zeng, Lin; Martino, Nicole C.

    2012-01-01

    Streptococcus gordonii is an early colonizer of the human oral cavity and an abundant constituent of oral biofilms. Two tandemly arranged gene clusters, designated lac and gal, were identified in the S. gordonii DL1 genome, which encode genes of the tagatose pathway (lacABCD) and sugar phosphotransferase system (PTS) enzyme II permeases. Genes encoding a predicted phospho-β-galactosidase (LacG), a DeoR family transcriptional regulator (LacR), and a transcriptional antiterminator (LacT) were also present in the clusters. Growth and PTS assays supported that the permease designated EIILac transports lactose and galactose, whereas EIIGal transports galactose. The expression of the gene for EIIGal was markedly upregulated in cells growing on galactose. Using promoter-cat fusions, a role for LacR in the regulation of the expressions of both gene clusters was demonstrated, and the gal cluster was also shown to be sensitive to repression by CcpA. The deletion of lacT caused an inability to grow on lactose, apparently because of its role in the regulation of the expression of the genes for EIILac, but had little effect on galactose utilization. S. gordonii maintained a selective advantage over Streptococcus mutans in a mixed-species competition assay, associated with its possession of a high-affinity galactose PTS, although S. mutans could persist better at low pHs. Collectively, these results support the concept that the galactose and lactose systems of S. gordonii are subject to complex regulation and that a high-affinity galactose PTS may be advantageous when S. gordonii is competing against the caries pathogen S. mutans in oral biofilms. PMID:22660715

  13. DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions.

    Science.gov (United States)

    El-Mogharbel, Nisrine; Wakefield, Matthew; Deakin, Janine E; Tsend-Ayush, Enkhjargal; Grützner, Frank; Alsop, Amber; Ezaz, Tariq; Marshall Graves, Jennifer A

    2007-01-01

    We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.

  14. A novel method incorporating gene ontology information for unsupervised clustering and feature selection.

    Directory of Open Access Journals (Sweden)

    Shireesh Srivastava

    Full Text Available Among the primary goals of microarray analysis is the identification of genes that could distinguish between different phenotypes (feature selection). Previous studies indicate that incorporating prior information of the genes' function could help identify physiologically relevant features. However, current methods that incorporate prior functional information do not provide a relative estimate of the effect of different genes on the biological processes of interest. Here, we present a method that integrates gene ontology (GO) information and expression data using Bayesian regression mixture models to perform unsupervised clustering of the samples and identify physiologically relevant discriminating features. As a model application, the method was applied to identify the genes that play a role in the cytotoxic responses of human hepatoblastoma cell line (HepG2) to saturated fatty acid (SFA) and tumor necrosis factor (TNF)-alpha, as compared to the non-toxic response to the unsaturated FFAs (UFA) and TNF-alpha. Incorporation of prior knowledge led to a better discrimination of the toxic phenotypes from the others. The model identified roles of lysosomal ATPases and adenylate cyclase (AC9) in the toxicity of palmitate. To validate the role of AC in palmitate-treated cells, we measured the intracellular levels of cyclic AMP (cAMP). The cAMP levels were found to be significantly reduced by palmitate treatment and not by the other FFAs, in accordance with the model selection of AC9. A framework is presented that incorporates prior ontology information, which helped to (a) perform unsupervised clustering of the phenotypes, and (b) identify the genes relevant to each cluster of phenotypes. We demonstrate the proposed framework by applying it to identify physiologically-relevant feature genes that conferred differential toxicity to saturated vs. unsaturated FFAs. The framework can be applied to other problems to efficiently integrate ontology information and

  15. An enhanced deterministic K-Means clustering algorithm for cancer subtype prediction from gene expression data.

    Science.gov (United States)

    Nidheesh, N; Abdul Nazeer, K A; Ameer, P M

    2017-12-01

    Clustering algorithms with steps involving randomness usually give different results on different executions for the same dataset. This non-deterministic nature of algorithms such as the K-Means clustering algorithm limits their applicability in areas such as cancer subtype prediction using gene expression data. It is hard to sensibly compare the results of such algorithms with those of other algorithms. The non-deterministic nature of K-Means is due to its random selection of data points as initial centroids. We propose an improved, density based version of K-Means, which involves a novel and systematic method for selecting initial centroids. The key idea of the algorithm is to select data points which belong to dense regions and which are adequately separated in feature space as the initial centroids. We compared the proposed algorithm to a set of eleven widely used single clustering algorithms and a prominent ensemble clustering algorithm which is being used for cancer data classification, based on the performances on a set of datasets comprising ten cancer gene expression datasets. The proposed algorithm has shown better overall performance than the others. There is a pressing need in the Biomedical domain for simple, easy-to-use and more accurate Machine Learning tools for cancer subtype prediction. The proposed algorithm is simple, easy-to-use and gives stable results. Moreover, it provides comparatively better predictions of cancer subtypes from gene expression data. Copyright © 2017 Elsevier Ltd. All rights reserved.
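    The seeding idea described in this abstract (choose initial centroids from dense regions that are well separated) can be sketched as follows. This is only an illustration, not the authors' algorithm: the density proxy (inverse mean distance to the nearest neighbours), the separation threshold (mean pairwise distance), and the names `density_separated_seeds` and `deterministic_kmeans` are all assumptions made for the example.

```python
import math


def density_separated_seeds(points, k, n_neighbors=3):
    """Pick k seeds deterministically: densest points first, kept only if far
    enough from seeds already chosen. Illustrative rule, assumed for this sketch."""
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    m = min(n_neighbors, n - 1)
    # Density proxy (assumption): inverse of the mean distance to the m nearest points.
    density = []
    for i in range(n):
        nearest = sorted(dist[i][j] for j in range(n) if j != i)[:m]
        density.append(1.0 / (sum(nearest) / m + 1e-12))
    # Separation threshold (assumption): mean of all pairwise distances.
    min_sep = sum(map(sum, dist)) / (n * n)
    # Ties in density are broken by index, so the order never depends on chance.
    order = sorted(range(n), key=lambda i: (-density[i], i))
    chosen = []
    for i in order:
        if all(dist[i][j] >= min_sep for j in chosen):
            chosen.append(i)
        if len(chosen) == k:
            break
    # Fallback: if too few separated points exist, fill up with the densest remaining.
    for i in order:
        if len(chosen) == k:
            break
        if i not in chosen:
            chosen.append(i)
    return [list(points[i]) for i in chosen]


def deterministic_kmeans(points, k, n_iter=100):
    """Standard Lloyd iterations started from the deterministic seeds above."""
    centroids = density_separated_seeds(points, k)
    labels = None
    for _ in range(n_iter):
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids
```

    Because nothing in the sketch draws random numbers, repeated runs on the same data return the same labels, which is the property the abstract emphasises.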

  16. Genetic variations and haplotype diversity of the UGT1 gene cluster in the Chinese population.

    Directory of Open Access Journals (Sweden)

    Jing Yang

    Full Text Available Vertebrates require tremendous molecular diversity to defend against numerous small hydrophobic chemicals. UDP-glucuronosyltransferases (UGTs) are a large family of detoxification enzymes that glucuronidate xenobiotics and endobiotics, facilitating their excretion from the body. The UGT1 gene cluster contains a tandem array of variable first exons, each preceded by a specific promoter, and a common set of downstream constant exons, similar to the genomic organization of the protocadherin (Pcdh), immunoglobulin, and T-cell receptor gene clusters. To assist pharmacogenomics studies in Chinese, we sequenced nine first exons, promoter and intronic regions, and five common exons of the UGT1 gene cluster in a population sample of 253 unrelated Chinese individuals. We identified 101 polymorphisms and found 15 novel SNPs. We then computed allele frequencies for each polymorphism and reconstructed their linkage disequilibrium (LD) map. The UGT1 cluster can be divided into five linkage blocks: Block 9 (UGT1A9), Block 9/7/6 (UGT1A9, UGT1A7, and UGT1A6), Block 5 (UGT1A5), Block 4/3 (UGT1A4 and UGT1A3), and Block 3' UTR. Furthermore, we inferred haplotypes and selected their tagSNPs. Finally, comparing our data with those of three other populations of the HapMap project revealed ethnic specificity of the UGT1 genetic diversity in Chinese. These findings have important implications for future molecular genetic studies of the UGT1 gene cluster as well as for personalized medical therapies in Chinese.
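    The two quantities mentioned in this abstract, allele frequency and pairwise linkage disequilibrium, have standard textbook definitions that can be sketched briefly. The code below is a minimal illustration using toy inputs, not the study's pipeline: it assumes biallelic sites and already-phased haplotypes coded as 0/1 pairs (the study itself had to infer haplotypes), and the function names are hypothetical.

```python
def allele_frequency(n_hom_alt, n_het, n_hom_ref):
    """Frequency of the alternate allele from diploid genotype counts."""
    n_individuals = n_hom_alt + n_het + n_hom_ref
    return (2 * n_hom_alt + n_het) / (2 * n_individuals)


def ld_r2(haplotypes):
    """r^2 between two biallelic sites.

    `haplotypes` is a list of (a, b) pairs, each 0 or 1, one pair per phased
    chromosome (an assumption of this sketch).
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n
    p_b = sum(b for _, b in haplotypes) / n
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b  # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    # r^2 is undefined when either site is monomorphic in the sample.
    return d * d / denom if denom > 0 else float("nan")
```

    Sites whose pairwise r^2 stays high across a contiguous stretch would fall into the same linkage block; how the block boundaries were actually drawn in the study is not stated in the abstract.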

  17. Sequencing and transcriptional analysis of the Streptococcus thermophilus histamine biosynthesis gene cluster: factors that affect differential hdcA expression

    DEFF Research Database (Denmark)

    Calles-Enríquez, Marina; Hjort, Benjamin Benn; Andersen, Pia Skov

    2010-01-01

    to produce histamine. The hdc clusters of S. thermophilus CHCC1524 and CHCC6483 were sequenced, and the factors that affect histamine biosynthesis and histidine-decarboxylating gene (hdcA) expression were studied. The hdc cluster began with the hdcA gene, was followed by a transporter (hdcP), and ended...... with the hdcB gene, which is of unknown function. The three genes were orientated in the same direction. The genetic organization of the hdc cluster showed a unique organization among the lactic acid bacterial group and resembled those of Staphylococcus and Clostridium species, thus indicating possible...... acquisition through a horizontal transfer mechanism. Transcriptional analysis of the hdc cluster revealed the existence of a polycistronic mRNA covering the three genes. The histidine-decarboxylating gene (hdcA) of S. thermophilus demonstrated maximum expression during the stationary growth phase, with high...

  18. MeSH key terms for validation and annotation of gene expression clusters

    Energy Technology Data Exchange (ETDEWEB)

    Rechtsteiner, A. (Andreas); Rocha, L. M. (Luis Mateus)

    2004-01-01

    Integration of different sources of information is a great challenge for the analysis of gene expression data, and for the field of Functional Genomics in general. As the availability of numerical data from high-throughput methods increases, so does the need for technologies that assist in the validation and evaluation of the biological significance of results extracted from these data. In mRNA assaying with microarrays, for example, numerical analysis often attempts to identify clusters of co-expressed genes. The important task to find the biological significance of the results and validate them has so far mostly fallen to the biological expert who had to perform this task manually. One of the most promising avenues to develop automated and integrative technology for such tasks lies in the application of modern Information Retrieval (IR) and Knowledge Management (KM) algorithms to databases with biomedical publications and data. Examples of databases available for the field are bibliographic databases containing scientific publications (e.g. MEDLINE/PUBMED), databases containing sequence data (e.g. GenBank) and databases of semantic annotations (e.g. the Gene Ontology Consortium and Medical Subject Headings (MeSH)). We present here an approach that uses the MeSH terms and their concept hierarchies to validate and obtain functional information for gene expression clusters. The controlled and hierarchical MeSH vocabulary is used by the National Library of Medicine (NLM) to index all the articles cited in MEDLINE. Such indexing with a controlled vocabulary eliminates some of the ambiguity due to polysemy (terms that have multiple meanings) and synonymy (multiple terms have similar meaning) that would be encountered if terms were extracted directly from the articles due to differing article contexts or author preferences and background. Further, the hierarchical organization of the MeSH terms can illustrate the conceptual/functional relationships of genes

  19. Rearranged Biosynthetic Gene Cluster and Synthesis of Hassallidin E in Planktothrix serta PCC 8927.

    Science.gov (United States)

    Pancrace, Claire; Jokela, Jouni; Sassoon, Nathalie; Ganneau, Christelle; Desnos-Ollivier, Marie; Wahlsten, Matti; Humisto, Anu; Calteau, Alexandra; Bay, Sylvie; Fewer, David P; Sivonen, Kaarina; Gugger, Muriel

    2017-07-21

    Cyanobacteria produce a wide range of natural products with antifungal bioactivity. The cyclic glycosylated lipopeptides of the hassallidin family have potent antifungal activity and display a great degree of chemical diversity. Here, we report the discovery of a hassallidin biosynthetic gene cluster from the filamentous cyanobacterium Planktothrix serta PCC 8927. The hassallidin gene cluster showed heavy rearrangement and marks of genomic plasticity. Nucleotide bias, differences in GC content, and phylogenetic incongruence suggested the acquisition of the hassallidin biosynthetic gene cluster in Planktothrix serta PCC 8927 by horizontal gene transfer. Chemical analyses by liquid chromatography and mass spectrometry demonstrated that this strain produced hassallidin E, a new glycosylated hassallidin variant. Hassallidin E was the only structural variant produced by Planktothrix serta PCC 8927 in all tested conditions. Further evaluated on human pathogenic fungi, hassallidin E showed an antifungal bioactivity. Hassallidin production levels correlated with nitrogen availability, in the only nitrogen-fixing Planktothrix described so far. Our results provide insights into the distribution and chemical diversity of cyanobacterial antifungal compounds as well as raise questions on their ecological relevance.

  20. Genomic organization, tissue distribution and functional characterization of the rat Pate gene cluster.

    Directory of Open Access Journals (Sweden)

    Angireddy Rajesh

    Full Text Available The cysteine-rich prostate and testis expressed (Pate) proteins identified to date are thought to resemble the three fingered protein/urokinase-type plasminogen activator receptor proteins. In this study, for the first time, we report the identification, cloning and characterization of the rat Pate gene cluster and also determine the expression pattern. The rat Pate genes are clustered on chromosome 8 and their predicted proteins retained the ten-cysteine signature characteristic of the TFP/Ly-6 protein family. The three-dimensional protein structure of PATE and PATE-F was found to be similar to that of the toxin bucandin. Though Pate gene expression is thought to be prostate and testis specific, we observed that rat Pate genes are also expressed in seminal vesicle and epididymis and in tissues beyond the male reproductive tract. In the developing rats (20-60 day old), expression of Pate genes seems to be androgen dependent in the epididymis and testis. In the adult rat, androgen ablation resulted in down regulation of the majority of Pate genes in the epididymides. PATE and PATE-F proteins were found to be expressed abundantly in the male reproductive tract of rats and on the sperm. Recombinant PATE protein exhibited potent antibacterial activity, whereas PATE-F did not exhibit any antibacterial activity. Pate expression was induced in the epididymides when challenged with LPS. Based on our results, we conclude that rat PATE proteins may contribute to the reproductive and defense functions.

  1. Conserved gene clusters in bacterial genomes provide further support for the primacy of RNA

    Science.gov (United States)

    Siefert, J. L.; Martin, K. A.; Abdi, F.; Widger, W. R.; Fox, G. E.

    1997-01-01

    Five complete bacterial genome sequences have been released to the scientific community. These include four (eu)Bacteria, Haemophilus influenzae, Mycoplasma genitalium, M. pneumoniae, and Synechocystis PCC 6803, as well as one Archaeon, Methanococcus jannaschii. Features of organization shared by these genomes are likely to have arisen very early in the history of the bacteria and thus can be expected to provide further insight into the nature of early ancestors. Results of a genome comparison of these five organisms confirm earlier observations that gene order is remarkably unpreserved. There are, nevertheless, at least 16 clusters of two or more genes whose order remains the same among the four (eu)Bacteria and these are presumed to reflect conserved elements of coordinated gene expression that require gene proximity. Eight of these gene orders are essentially conserved in the Archaea as well. Many of these clusters are known to be regulated by RNA-level mechanisms in Escherichia coli, which supports the earlier suggestion that this type of regulation of gene expression may have arisen very early. We conclude that although the last common ancestor may have had a DNA genome, it likely was preceded by progenotes with an RNA genome.

  2. QServer: a biclustering server for prediction and assessment of co-expressed gene clusters.

    Directory of Open Access Journals (Sweden)

    Fengfeng Zhou

    Full Text Available BACKGROUND: Biclustering is a powerful technique for identification of co-expressed gene groups under any (unspecified) substantial subset of given experimental conditions, which can be used for elucidation of transcriptionally co-regulated genes. RESULTS: We have previously developed a biclustering algorithm, QUBIC, which can solve more general biclustering problems than previous biclustering algorithms. To fully utilize the analysis power the algorithm provides, we have developed a web server, QServer, for prediction, computational validation and analyses of co-expressed gene clusters. Specifically, the QServer has the following capabilities in addition to biclustering by QUBIC: (i) prediction and assessment of conserved cis regulatory motifs in promoter sequences of the predicted co-expressed genes; (ii) functional enrichment analyses of the predicted co-expressed gene clusters using Gene Ontology (GO) terms, and (iii) visualization capabilities in support of interactive biclustering analyses. QServer supports the biclustering and functional analysis for a wide range of organisms, including human, mouse, Arabidopsis, bacteria and archaea, whose underlying genome database will be continuously updated. CONCLUSION: We believe that QServer provides an easy-to-use and highly effective platform useful for hypothesis formulation and testing related to transcription co-regulation.

  3. QServer: a biclustering server for prediction and assessment of co-expressed gene clusters.

    Science.gov (United States)

    Zhou, Fengfeng; Ma, Qin; Li, Guojun; Xu, Ying

    2012-01-01

    Biclustering is a powerful technique for identification of co-expressed gene groups under any (unspecified) substantial subset of given experimental conditions, which can be used for elucidation of transcriptionally co-regulated genes. We have previously developed a biclustering algorithm, QUBIC, which can solve more general biclustering problems than previous biclustering algorithms. To fully utilize the analysis power the algorithm provides, we have developed a web server, QServer, for prediction, computational validation and analyses of co-expressed gene clusters. Specifically, the QServer has the following capabilities in addition to biclustering by QUBIC: (i) prediction and assessment of conserved cis regulatory motifs in promoter sequences of the predicted co-expressed genes; (ii) functional enrichment analyses of the predicted co-expressed gene clusters using Gene Ontology (GO) terms, and (iii) visualization capabilities in support of interactive biclustering analyses. QServer supports the biclustering and functional analysis for a wide range of organisms, including human, mouse, Arabidopsis, bacteria and archaea, whose underlying genome database will be continuously updated. We believe that QServer provides an easy-to-use and highly effective platform useful for hypothesis formulation and testing related to transcription co-regulation.
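    Capability (ii), functional enrichment of a gene cluster for a GO term, is commonly assessed with a one-sided hypergeometric test. The abstract does not say which statistic QServer uses, so the sketch below is an assumption chosen for illustration, and the function and parameter names are hypothetical.

```python
from math import comb


def go_enrichment_pvalue(n_genome, n_annotated, n_cluster, n_hits):
    """P(at least n_hits annotated genes in a cluster of n_cluster genes),
    when n_annotated of the n_genome genes carry the GO term and the cluster
    is treated as a random draw without replacement (hypergeometric tail)."""
    upper = min(n_annotated, n_cluster)
    total = comb(n_genome, n_cluster)
    tail = sum(
        comb(n_annotated, i) * comb(n_genome - n_annotated, n_cluster - i)
        for i in range(n_hits, upper + 1)
    )
    return tail / total
```

    A small p-value suggests the term is over-represented in the cluster; in practice many terms are tested at once, so a multiple-testing correction would normally follow.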

  4. Spatial expression of Hox cluster genes in the ontogeny of a sea urchin

    Science.gov (United States)

    Arenas-Mena, C.; Cameron, A. R.; Davidson, E. H.

    2000-01-01

    The Hox cluster of the sea urchin Strongylocentrotus purpuratus contains ten genes in a 500 kb span of the genome. Only two of these genes are expressed during embryogenesis, while all of eight genes tested are expressed during development of the adult body plan in the larval stage. We report the spatial expression during larval development of the five 'posterior' genes of the cluster: SpHox7, SpHox8, SpHox9/10, SpHox11/13a and SpHox11/13b. The five genes exhibit a dynamic, largely mesodermal program of expression. Only SpHox7 displays extensive expression within the pentameral rudiment itself. A spatially sequential and colinear arrangement of expression domains is found in the somatocoels, the paired posterior mesodermal structures that will become the adult perivisceral coeloms. No such sequential expression pattern is observed in endodermal, epidermal or neural tissues of either the larva or the presumptive juvenile sea urchin. The spatial expression patterns of the Hox genes illuminate the evolutionary process by which the pentameral echinoderm body plan emerged from a bilateral ancestor.

  5. Multi-class clustering of cancer subtypes through SVM based ensemble of pareto-optimal solutions for gene marker identification.

    Science.gov (United States)

    Mukhopadhyay, Anirban; Bandyopadhyay, Sanghamitra; Maulik, Ujjwal

    2010-11-12

    With the advancement of microarray technology, it is now possible to study the expression profiles of thousands of genes across different experimental conditions or tissue samples simultaneously. Microarray cancer datasets, organized in a samples-versus-genes fashion, are being used for classification of tissue samples into benign and malignant or their subtypes. They are also useful for identifying potential gene markers for each cancer subtype, which helps in successful diagnosis of particular cancer types. In this article, we have presented an unsupervised cancer classification technique based on multiobjective genetic clustering of the tissue samples. In this regard, a real-coded encoding of the cluster centers is used and cluster compactness and separation are simultaneously optimized. The resultant set of near-Pareto-optimal solutions contains a number of non-dominated solutions. A novel approach to combine the clustering information possessed by the non-dominated solutions through Support Vector Machine (SVM) classifier has been proposed. Final clustering is obtained by consensus among the clusterings yielded by different kernel functions. The performance of the proposed multiobjective clustering method has been compared with that of several other microarray clustering algorithms for three publicly available benchmark cancer datasets. Moreover, statistical significance tests have been conducted to establish the statistical superiority of the proposed clustering method. Furthermore, relevant gene markers have been identified using the clustering result produced by the proposed clustering method and demonstrated visually. Biological relationships among the gene markers are also studied based on gene ontology. The results obtained are found to be promising and may have an important impact in the area of unsupervised cancer classification as well as gene marker identification for multiple cancer subtypes.
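    The notion of non-dominated solutions used in this abstract can be made concrete with a small filter. The sketch assumes each candidate clustering is summarised by a pair (compactness, separation), with compactness to be minimised and separation to be maximised; that encoding and the function names are illustrative assumptions, not the paper's implementation.

```python
def dominates(a, b):
    """True if solution a is no worse than b on both objectives and strictly
    better on at least one. Each solution is (compactness, separation):
    lower compactness is better, higher separation is better (assumed encoding)."""
    no_worse = a[0] <= b[0] and a[1] >= b[1]
    strictly_better = a[0] < b[0] or a[1] > b[1]
    return no_worse and strictly_better


def non_dominated(solutions):
    """Keep the solutions that no other solution dominates (the Pareto front)."""
    return [s for s in solutions if not any(dominates(t, s) for t in solutions)]
```

    Every clustering that survives this filter represents a different trade-off between the two objectives, which is why the abstract goes on to combine them rather than pick one.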

  6. Natural product proteomining, a quantitative proteomics platform, allows rapid discovery of biosynthetic gene clusters for different classes of natural products.

    Science.gov (United States)

    Gubbens, Jacob; Zhu, Hua; Girard, Geneviève; Song, Lijiang; Florea, Bogdan I; Aston, Philip; Ichinose, Koji; Filippov, Dmitri V; Choi, Young H; Overkleeft, Herman S; Challis, Gregory L; van Wezel, Gilles P

    2014-06-19

    Information on gene clusters for natural product biosynthesis is accumulating rapidly because of the current boom of available genome sequencing data. However, linking a natural product to a specific gene cluster remains challenging. Here, we present a widely applicable strategy for the identification of gene clusters for specific natural products, which we name natural product proteomining. The method is based on using fluctuating growth conditions that ensure differential biosynthesis of the bioactivity of interest. Subsequent combination of metabolomics and quantitative proteomics establishes correlations between abundance of natural products and concomitant changes in the protein pool, which allows identification of the relevant biosynthetic gene cluster. We used this approach to elucidate gene clusters for different natural products in Bacillus and Streptomyces, including a novel juglomycin-type antibiotic. Natural product proteomining does not require prior knowledge of the gene cluster or secondary metabolite and therefore represents a general strategy for identification of all types of gene clusters. Copyright © 2014 Elsevier Ltd. All rights reserved.

  7. Lichen Biosynthetic Gene Clusters. Part I. Genome Sequencing Reveals a Rich Biosynthetic Potential.

    Science.gov (United States)

    Bertrand, Robert L; Abdel-Hameed, Mona; Sorensen, John L

    2018-02-27

    Lichens are symbionts of fungi and algae that produce diverse secondary metabolites with useful properties. Little is known of lichen natural product biosynthesis because of the challenges of working with lichenizing fungi. We describe the first attempt to comprehensively profile the genetic secondary metabolome of a lichenizing fungus. An Illumina platform combined with the Antibiotics and Secondary Metabolites Analysis Shell (FungiSMASH, version 4.0) was used to sequence and annotate assembled contigs of the fungal partner of Cladonia uncialis. Up to 48 putative gene clusters are described comprising type I and type III polyketide synthases (PKS), nonribosomal peptide synthetases (NRPS), hybrid PKS-NRPS, and terpene synthases. The number of gene clusters revealed by this work dwarfs the number of known secondary metabolites from C. uncialis, suggesting that lichenizing fungi have an unexplored biosynthetic potential.

  8. Hierarchical clustering of breast cancer methylomes revealed differentially methylated and expressed breast cancer genes.

    Directory of Open Access Journals (Sweden)

    I-Hsuan Lin

    Full Text Available Oncogenic transformation of normal cells often involves epigenetic alterations, including histone modification and DNA methylation. We conducted whole-genome bisulfite sequencing to determine the DNA methylomes of normal breast, fibroadenoma, invasive ductal carcinomas and MCF7. The emergence, disappearance, expansion and contraction of kilobase-sized hypomethylated regions (HMRs) and the hypomethylation of the megabase-sized partially methylated domains (PMDs) are the major forms of methylation changes observed in breast tumor samples. Hierarchical clustering of HMR revealed tumor-specific hypermethylated clusters and differential methylated enhancers specific to normal or breast cancer cell lines. Joint analysis of gene expression and DNA methylation data of normal breast and breast cancer cells identified differentially methylated and expressed genes associated with breast and/or ovarian cancers in cancer-specific HMR clusters. Furthermore, aberrant patterns of X-chromosome inactivation (XCI) were found in breast cancer cell lines as well as breast tumor samples in the TCGA BRCA (breast invasive carcinoma) dataset. They were characterized by a differentially hypermethylated XIST promoter, reduced expression of XIST, and over-expression of hypomethylated X-linked genes. High expression of these genes was significantly associated with lower survival rates in breast cancer patients. Comprehensive analysis of the normal and breast tumor methylomes suggests selective targeting of DNA methylation changes during breast cancer progression. The weak causal relationship between DNA methylation and gene expression observed in this study is evidence of a more complex role of DNA methylation in the regulation of gene expression in human epigenetics that deserves further investigation.

  9. Molecular analysis of an inactive aflatoxin biosynthesis gene cluster in Aspergillus oryzae RIB strains.

    Science.gov (United States)

    Tominaga, Mihoko; Lee, Yun-Hae; Hayashi, Risa; Suzuki, Yuji; Yamada, Osamu; Sakamoto, Kazutoshi; Gotoh, Kuniyasu; Akita, Osamu

    2006-01-01

    To help assess the potential for aflatoxin production by Aspergillus oryzae, the structure of an aflatoxin biosynthesis gene homolog cluster in A. oryzae RIB 40 was analyzed. Although most genes in the corresponding cluster exhibited from 97 to 99% similarity to those of Aspergillus flavus, three genes shared 93% similarity or less. A 257-bp deletion in the aflT region, a frameshift mutation in norA, and a base pair substitution in verA were found in A. oryzae RIB 40. In the aflR promoter, two substitutions were found in one of the three putative AreA binding sites and in the FacB binding site. PCR primers were designed to amplify homologs of aflT, nor-1, aflR, norA, avnA, verB, and vbs and were used to detect these genes in 210 A. oryzae strains. Based on the PCR results, the A. oryzae RIB strains were classified into three groups, although most of them fell into two of the groups. Group 1, in which amplification of all seven genes was confirmed, contained 122 RIB strains (58.1% of examined strains), including RIB 40. Seventy-seven strains (36.7%) belonged to group 2, characterized by having only vbs, verB, and avnA in half of the cluster. Although slight expression of aflR was detected by reverse transcription-PCR in some group 1 strains, including RIB 40, other genes (avnA, vbs, verB, and omtA) related to aflatoxin production were not detected. aflR was not detected in group 2 strains by Southern analysis.

  10. Genetic clusters and sex-biased gene flow in a unicolonial Formica ant

    Directory of Open Access Journals (Sweden)

    Chapuisat Michel

    2009-03-01

    Full Text Available Abstract Background Animal societies are diverse, ranging from small family-based groups to extraordinarily large social networks in which many unrelated individuals interact. At the extreme of this continuum, some ant species form unicolonial populations in which workers and queens can move among multiple interconnected nests without eliciting aggression. Although unicoloniality has been mostly studied in invasive ants, it also occurs in some native non-invasive species. Unicoloniality is commonly associated with very high queen number, which may result in levels of relatedness among nestmates being so low as to raise the question of the maintenance of altruism by kin selection in such systems. However, the actual relatedness among cooperating individuals critically depends on effective dispersal and the ensuing pattern of genetic structuring. In order to better understand the evolution of unicoloniality in native non-invasive ants, we investigated the fine-scale population genetic structure and gene flow in three unicolonial populations of the wood ant F. paralugubris. Results The analysis of geo-referenced microsatellite genotypes and mitochondrial haplotypes revealed the presence of cryptic clusters of genetically-differentiated nests in the three populations of F. paralugubris. Because of this spatial genetic heterogeneity, members of the same clusters were moderately but significantly related. The comparison of nuclear (microsatellite) and mitochondrial differentiation indicated that effective gene flow was male-biased in all populations. Conclusion The three unicolonial populations exhibited male-biased and mostly local gene flow. The high number of queens per nest, exchanges among neighbouring nests and restricted long-distance gene flow resulted in large clusters of genetically similar nests. The positive relatedness among clustermates suggests that kin selection may still contribute to the maintenance of altruism in unicolonial

  11. Gene cluster analysis for the biosynthesis of elgicins, novel lantibiotics produced by Paenibacillus elgii B69

    Directory of Open Access Journals (Sweden)

    Teng Yi

    2012-03-01

    Full Text Available Abstract Background The recent increase in bacterial resistance to antibiotics has promoted the exploration of novel antibacterial materials. As a result, many researchers are undertaking work to identify new lantibiotics because of their potent antimicrobial activities. The objective of this study was to provide details of a lantibiotic-like gene cluster in Paenibacillus elgii B69 and to produce the antibacterial substances coded by this gene cluster based on culture screening. Results Analysis of the P. elgii B69 genome sequence revealed the presence of a lantibiotic-like gene cluster composed of five open reading frames (elgT1, elgC, elgT2, elgB, and elgA). Screening of culture extracts for active substances possessing the predicted properties of the encoded product led to the isolation of four novel peptides (elgicins AI, AII, B, and C) with a broad inhibitory spectrum. The molecular weights of these peptides were 4536, 4593, 4706, and 4820 Da, respectively. The N-terminal sequence of elgicin B was Leu-Gly-Asp-Tyr, which corresponded to the partial sequence of the peptide ElgA encoded by elgA. Edman degradation suggested that the product elgicin B is derived from ElgA. By correlating the results of electrospray ionization-mass spectrometry analyses of elgicins AI, AII, and C, these peptides are deduced to have originated from the same precursor, ElgA. Conclusions A novel lantibiotic-like gene cluster was shown to be present in P. elgii B69. Four new lantibiotics with a broad inhibitory spectrum were isolated, and these appear to be promising antibacterial agents.

  12. Gene Clusters for Insecticidal Loline Alkaloids in the Grass-Endophytic Fungus Neotyphodium uncinatum

    OpenAIRE

    Spiering, Martin J.; Moon, Christina D.; Wilkinson, Heather H.; Schardl, Christopher L.

    2005-01-01

    Loline alkaloids are produced by mutualistic fungi symbiotic with grasses, and they protect the host plants from insects. Here we identify in the fungal symbiont, Neotyphodium uncinatum, two homologous gene clusters (LOL-1 and LOL-2) associated with loline-alkaloid production. Nine genes were identified in a 25-kb region of LOL-1 and designated (in order) lolF-1, lolC-1, lolD-1, lolO-1, lolA-1, lolU-1, lolP-1, lolT-1, and lolE-1. LOL-2 contained the homologs lolC-2 through lolE-2 in the same ...

  13. Comparison of Expression of Secondary Metabolite Biosynthesis Cluster Genes in Aspergillus flavus, A. parasiticus, and A. oryzae

    Directory of Open Access Journals (Sweden)

    Kenneth C. Ehrlich

    2014-06-01

    Full Text Available Fifty six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity.

  14. Comparison of Expression of Secondary Metabolite Biosynthesis Cluster Genes in Aspergillus flavus, A. parasiticus, and A. oryzae

    Science.gov (United States)

    Ehrlich, Kenneth C.; Mack, Brian M.

    2014-01-01

    Fifty six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity. PMID:24960201

  15. Characterization of the biosynthetic gene cluster for cryptic phthoxazolin A in Streptomyces avermitilis.

    Directory of Open Access Journals (Sweden)

    Dian Anggraini Suroto

    Full Text Available Phthoxazolin A, an oxazole-containing polyketide, has a broad spectrum of anti-oomycete activity and herbicidal activity. We recently identified phthoxazolin A as a cryptic metabolite of Streptomyces avermitilis that produces the important anthelmintic agent avermectin. Even though genome data of S. avermitilis is publicly available, no plausible biosynthetic gene cluster for phthoxazolin A is apparent in the sequence data. Here, we identified and characterized the phthoxazolin A (ptx) biosynthetic gene cluster through genome sequencing, comparative genomic analysis, and gene disruption. Sequence analysis uncovered that the putative ptx biosynthetic genes are located on an extra genomic region that is not found in the public database, and 8 open reading frames in the extra genomic region could be assigned roles in the biosynthesis of the oxazole ring, triene polyketide and carbamoyl moieties. Disruption of the ptxA gene encoding a discrete acyltransferase resulted in a complete loss of phthoxazolin A production, confirming that the trans-AT type I PKS system is responsible for phthoxazolin A biosynthesis. Based on the predicted functional domains in the ptx assembly line, we propose the biosynthetic pathway of phthoxazolin A.

  16. Clustering gene expression time series data using an infinite Gaussian process mixture model.

    Science.gov (United States)

    McDowell, Ian C; Manandhar, Dinesh; Vockley, Christopher M; Schmid, Amy K; Reddy, Timothy E; Engelhardt, Barbara E

    2018-01-01

    Transcriptome-wide time series expression profiling is used to characterize the cellular response to environmental perturbations. The first step to analyzing transcriptional response data is often to cluster genes with similar responses. Here, we present a nonparametric model-based method, Dirichlet process Gaussian process mixture model (DPGP), which jointly models data clusters with a Dirichlet process and temporal dependencies with Gaussian processes. We demonstrate the accuracy of DPGP in comparison to state-of-the-art approaches using hundreds of simulated data sets. To further test our method, we apply DPGP to published microarray data from a microbial model organism exposed to stress and to novel RNA-seq data from a human cell line exposed to the glucocorticoid dexamethasone. We validate our clusters by examining local transcription factor binding and histone modifications. Our results demonstrate that jointly modeling cluster number and temporal dependencies can reveal shared regulatory mechanisms. DPGP software is freely available online at https://github.com/PrincetonUniversity/DP_GP_cluster.
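    The abstract above describes an "infinite" mixture in which the number of clusters is inferred rather than fixed. As a rough illustration of that idea only, the sketch below samples cluster labels from a Chinese restaurant process, a standard way of drawing partitions from a Dirichlet process prior. It is not the DPGP software or its inference procedure, and the function name `crp_assignments` and the concentration value `alpha=1.0` are assumptions made for this example.

```python
import random


def crp_assignments(n_items, alpha, seed=0):
    """Sample cluster labels from a Chinese restaurant process prior.

    Hypothetical helper for illustration; not part of DP_GP_cluster.
    """
    rng = random.Random(seed)
    labels = []
    counts = []  # counts[k] = number of items already placed in cluster k
    for _ in range(n_items):
        # An existing cluster k is chosen with weight counts[k];
        # a brand-new cluster is opened with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)
        else:
            counts[k] += 1
        labels.append(k)
    return labels


labels = crp_assignments(n_items=200, alpha=1.0)
print("clusters used:", len(set(labels)))
```

    Larger values of `alpha` tend to open more clusters; a full model such as the one described would combine a prior of this kind with a likelihood for each gene's time series.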

  17. Clustering gene expression time series data using an infinite Gaussian process mixture model.

    Directory of Open Access Journals (Sweden)

    Ian C McDowell

    2018-01-01

    Full Text Available Transcriptome-wide time series expression profiling is used to characterize the cellular response to environmental perturbations. The first step to analyzing transcriptional response data is often to cluster genes with similar responses. Here, we present a nonparametric model-based method, Dirichlet process Gaussian process mixture model (DPGP), which jointly models data clusters with a Dirichlet process and temporal dependencies with Gaussian processes. We demonstrate the accuracy of DPGP in comparison to state-of-the-art approaches using hundreds of simulated data sets. To further test our method, we apply DPGP to published microarray data from a microbial model organism exposed to stress and to novel RNA-seq data from a human cell line exposed to the glucocorticoid dexamethasone. We validate our clusters by examining local transcription factor binding and histone modifications. Our results demonstrate that jointly modeling cluster number and temporal dependencies can reveal shared regulatory mechanisms. DPGP software is freely available online at https://github.com/PrincetonUniversity/DP_GP_cluster.

  18. Burkholderia thailandensis harbors two identical rhl gene clusters responsible for the biosynthesis of rhamnolipids

    Directory of Open Access Journals (Sweden)

    Woods Donald E

    2009-12-01

    Full Text Available Abstract Background Rhamnolipids are surface active molecules composed of rhamnose and β-hydroxydecanoic acid. These biosurfactants are produced mainly by Pseudomonas aeruginosa and have been thoroughly investigated since their early discovery. Recently, they have attracted renewed attention because of their involvement in various multicellular behaviors. Despite this high interest, only very few studies have focused on the production of rhamnolipids by Burkholderia species. Results Orthologs of rhlA, rhlB and rhlC, which are responsible for the biosynthesis of rhamnolipids in P. aeruginosa, have been found in the non-infectious Burkholderia thailandensis, as well as in the genetically similar important pathogen B. pseudomallei. In contrast to P. aeruginosa, both Burkholderia species contain these three genes necessary for rhamnolipid production within a single gene cluster. Furthermore, two identical, paralogous copies of this gene cluster are found on the second chromosome of these bacteria. Both Burkholderia spp. produce rhamnolipids containing 3-hydroxy fatty acid moieties with longer side chains than those described for P. aeruginosa. Additionally, the rhamnolipids produced by B. thailandensis contain a much larger proportion of dirhamnolipids versus monorhamnolipids when compared to P. aeruginosa. The rhamnolipids produced by B. thailandensis reduce the surface tension of water to 42 mN/m while displaying a critical micelle concentration value of 225 mg/L. Separate mutations in both rhlA alleles, which are responsible for the synthesis of the rhamnolipid precursor 3-(3-hydroxyalkanoyloxy)alkanoic acid, prove that both copies of the rhl gene cluster are functional, but one contributes more to the total production than the other. Finally, a double ΔrhlA mutant that is completely devoid of rhamnolipid production is incapable of swarming motility, showing that both gene clusters contribute to this phenotype. Conclusions Collectively, these

  19. Loss of Major DNase I Hypersensitive Sites in Duplicated β-globin Gene Cluster Incompletely Silences HBB Gene Expression

    Czech Academy of Sciences Publication Activity Database

    Reading, N. S.; Shooter, C.; Song, J.; Miller, R.; Agarwal, A.; Láníková, Lucie; Clark, B.; Thein, S.L.; Divoký, V.; Prchal, J.T.

    2016-01-01

    Vol. 37, No. 11 (2016), pp. 1153-1156 ISSN 1059-7794 R&D Projects: GA MŠk(CZ) LH15223 Institutional support: RVO:68378050 Keywords: globin genes * regulation * sickle cell disease * HBB duplication Subject RIV: EB - Genetics; Molecular Biology Impact factor: 4.601, year: 2016

  20. Functional dissection of HOXD cluster genes in regulation of neuroblastoma cell proliferation and differentiation.

    Directory of Open Access Journals (Sweden)

    Yunhong Zha

    Full Text Available Retinoic acid (RA) can induce growth arrest and neuronal differentiation of neuroblastoma cells and has been used in the clinic for treatment of neuroblastoma. It has been reported that RA induces the expression of several HOXD genes in human neuroblastoma cell lines, but their roles in RA action are largely unknown. The HOXD cluster contains nine genes (HOXD1, HOXD3, HOXD4, and HOXD8-13) that are positioned sequentially from 3' to 5', with HOXD1 at the 3' end and HOXD13 at the 5' end. Here we show that all HOXD genes are induced by RA in the human neuroblastoma BE(2)-C cells, with the genes located at the 3' end being activated generally earlier than those positioned more 5' within the cluster. Individual induction of HOXD8, HOXD9, HOXD10 or HOXD12 is sufficient to induce both growth arrest and neuronal differentiation, which is associated with downregulation of cell cycle-promoting genes and upregulation of neuronal differentiation genes. However, induction of other HOXD genes either has no effect (HOXD1) or has partial effects (HOXD3, HOXD4, HOXD11 and HOXD13) on BE(2)-C cell proliferation or differentiation. We further show that knockdown of HOXD8 expression, but not that of HOXD9 expression, significantly inhibits the differentiation-inducing activity of RA. HOXD8 directly activates the transcription of HOXC9, a key effector of RA action in neuroblastoma cells. These findings highlight the distinct functions of HOXD genes in RA induction of neuroblastoma cell differentiation.

  1. Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea

    Directory of Open Access Journals (Sweden)

    Wolf Yuri I

    2007-11-01

    Full Text Available Abstract Background An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt at such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. Results New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover ~88% of the genes in a genome compared to a ~76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; ~40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genomes) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile

  2. Using SNP genetic markers to elucidate the linkage of the Co-34/Phg-3 anthracnose and angular leaf spot resistance gene cluster with the Ur-14 resistance gene

    Science.gov (United States)

    The Ouro Negro common bean cultivar contains the Co-34/Phg-3 gene cluster that confers resistance to the anthracnose (ANT) and angular leaf spot (ALS) pathogens. These genes are tightly linked on chromosome 4. Ouro Negro also has the Ur-14 rust resistance gene, reportedly in the vicinity of Co-34; ...

  3. Differential expression of TIR-like genes embedded in the Mi-1 gene cluster in nematode-resistant and -susceptible tomato roots

    NARCIS (Netherlands)

    Seifi Abdolabad, A.R.; Visser, R.G.F.; Bai, Y.

    2011-01-01

    Transport inhibitor response 1 (TIR1) is an auxin receptor that plays a pivotal role in auxin signaling. It has been reported that TIR-like genes are present in a gene cluster carrying the Mi-1 gene which confers resistance to nematodes, aphids and whiteflies. Since auxin is involved in the pathogenicity of

  4. Identification of suitable genes contributes to lung adenocarcinoma clustering by multiple meta-analysis methods.

    Science.gov (United States)

    Yang, Ze-Hui; Zheng, Rui; Gao, Yuan; Zhang, Qiang

    2016-09-01

    With the widespread application of high-throughput technology, numerous meta-analysis methods have been proposed for differential expression profiling across multiple studies. We identified the suitable differentially expressed (DE) genes that contributed to lung adenocarcinoma (ADC) clustering based on seven popular multiple meta-analysis methods. Seven microarray expression profiles of ADC and normal controls were extracted from the ArrayExpress database. Bioconductor was used for preliminary data preprocessing. Then, DE genes across multiple studies were identified. Hierarchical clustering was applied to compare the classification performance for microarray data samples. The classification efficiency was compared based on accuracy, sensitivity and specificity. Across seven datasets, 573 ADC cases and 222 normal controls were collected. After filtering out unexpressed and noninformative genes, 3688 genes remained for further analysis. The classification efficiency analysis showed that DE genes identified by the sum of ranks method separated ADC from normal controls with the best accuracy, sensitivity and specificity of 0.953, 0.969 and 0.932, respectively. The gene set with the highest classification accuracy mainly participated in the regulation of response to external stimulus (P = 7.97E-04), cyclic nucleotide-mediated signaling (P = 0.01), regulation of cell morphogenesis (P = 0.01) and regulation of cell proliferation (P = 0.01). Evaluation of DE genes identified by different meta-analysis methods in classification efficiency provided a new perspective on the choice of the suitable method in a given application. Different meta-analysis methods have different strengths, so a combined consideration is needed when choosing meta-analysis methods for a particular study. © 2015 John Wiley & Sons Ltd.
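    The two quantities this abstract relies on, a sum-of-ranks score across studies and the accuracy, sensitivity and specificity of a two-class split, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the p-values and labels are made-up toy data, ties are broken by input order, and the function names are hypothetical.

```python
def rank_within_study(pvalues):
    """Return 1-based ranks; the smallest p-value gets rank 1 (ties by order)."""
    order = sorted(range(len(pvalues)), key=lambda i: pvalues[i])
    ranks = [0] * len(pvalues)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return ranks


def sum_of_ranks(pvalues_by_study):
    """Sum each gene's rank over all studies; lower means more consistently DE."""
    per_study = [rank_within_study(p) for p in pvalues_by_study]
    return [sum(gene_ranks) for gene_ranks in zip(*per_study)]


def classification_metrics(truth, predicted):
    """Accuracy, sensitivity and specificity for labels coded 1 (case) / 0 (control)."""
    tp = sum(1 for t, p in zip(truth, predicted) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(truth, predicted) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(truth, predicted) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(truth, predicted) if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(truth),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Toy data (assumed): three studies, four genes, one p-value per gene per study.
toy_pvalues = [
    [0.001, 0.20, 0.03, 0.50],
    [0.010, 0.40, 0.002, 0.30],
    [0.005, 0.60, 0.04, 0.70],
]
scores = sum_of_ranks(toy_pvalues)
metrics = classification_metrics([1, 1, 1, 0, 0], [1, 1, 0, 0, 1])
print(scores, metrics)
```

    In a real analysis the predicted labels would come from cutting the hierarchical clustering tree into two groups, which is not shown here.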

  5. Identification and functional analysis of gene cluster involvement in biosynthesis of the cyclic lipopeptide antibiotic pelgipeptin produced by Paenibacillus elgii

    Directory of Open Access Journals (Sweden)

    Qian Chao-Dong

    2012-09-01

    Full Text Available Abstract Background Pelgipeptin, a potent antibacterial and antifungal agent, is a non-ribosomally synthesised lipopeptide antibiotic. This compound consists of a β-hydroxy fatty acid and nine amino acids. To date, there is no information about its biosynthetic pathway. Results A potential pelgipeptin synthetase gene cluster (plp) was identified from Paenibacillus elgii B69 through genome analysis. The gene cluster spans 40.8 kb with eight open reading frames. Among the genes in this cluster, three large genes, plpD, plpE, and plpF, were shown to encode non-ribosomal peptide synthetases (NRPSs), with one, seven, and one module(s), respectively. Bioinformatic analysis of the substrate specificity of all nine adenylation domains indicated that the sequence of the NRPS modules is well collinear with the order of amino acids in pelgipeptin. Additional biochemical analysis of four recombinant adenylation domains (PlpD A1, PlpE A1, PlpE A3, and PlpF A1) provided further evidence that the plp gene cluster is involved in pelgipeptin biosynthesis. Conclusions In this study, a gene cluster (plp) responsible for the biosynthesis of pelgipeptin was identified from the genome sequence of Paenibacillus elgii B69. The identification of the plp gene cluster provides an opportunity to develop novel lipopeptide antibiotics by genetic engineering.

  6. Localization and physical mapping of a plasmid-borne 23-kb nif gene cluster from Enterobacter agglomerans showing homology to the entire nif gene cluster of Klebsiella pneumoniae M5a1.

    Science.gov (United States)

    Singh, M; Kreutzer, R; Acker, G; Klingmüller, W

    1988-01-01

    A physical and genetic map of the plasmid pEA3 indigenous to Enterobacter agglomerans is presented. pEA3 is a 111-kb plasmid containing a 23-kb cluster of nif genes which shows extensive homology (Southern hybridization and heteroduplex analysis) to the entire nif gene cluster of Klebsiella pneumoniae (Kp) M5a1. All the nif genes on pEA3 are organized in the same manner as in K. pneumoniae, except nifJ, which is located on the left end of the pEA3 nif gene cluster (near nifQB). A BamHI restriction map of pEA3 and a detailed restriction map of the 23-kb nif region on pEA3 are also presented. The nif genes of pEA3 showed a low level of acetylene reduction in Escherichia coli, demonstrating that these genes are functional and contain the whole genetic information required to fix nitrogen. The origin of vegetative replication (OriV) of pEA3 was localized about 5.5 kb from the right end of the nif gene cluster. In addition to pEA3, large plasmids from four other strains of E. agglomerans showed homology to all the Kp nif genes tested, indicating that in diazotrophic strains of E. agglomerans nif genes are usually located on plasmids. In contrast, in most of the free-living, nitrogen-fixing bacteria the nif genes are on the chromosome.

  7. A conserved cluster of three PRD-class homeobox genes (homeobrain, rx and orthopedia) in the Cnidaria and Protostomia

    Directory of Open Access Journals (Sweden)

    Mazza Maureen E

    2010-07-01

    Full Text Available Abstract Background Homeobox genes are a superclass of transcription factors with diverse developmental regulatory functions, which are found in plants, fungi and animals. In animals, several Antennapedia (ANTP)-class homeobox genes reside in extremely ancient gene clusters (for example, the Hox, ParaHox, and NKL clusters) and the evolution of these clusters has been implicated in the morphological diversification of animal bodyplans. By contrast, similarly ancient gene clusters have not been reported among the other classes of homeobox genes (that is, the LIM, POU, PRD and SIX classes). Results Using a combination of in silico queries and phylogenetic analyses, we found that a cluster of three PRD-class homeobox genes (Homeobrain (hbn), Rax (rx) and Orthopedia (otp)) is present in cnidarians, insects and mollusks (a partial cluster comprising hbn and rx is present in the placozoan Trichoplax adhaerens). We failed to identify this 'HRO' cluster in deuterostomes; in fact, the Homeobrain gene appears to be missing from the chordate genomes we examined, although it is present in hemichordates and echinoderms. To illuminate the ancestral organization and function of this ancient cluster, we mapped the constituent genes against the assembled genome of a model cnidarian, the sea anemone Nematostella vectensis, and characterized their spatiotemporal expression using in situ hybridization. In N. vectensis, these genes reside in a span of 33 kb with the same gene order as previously reported in insects. Comparisons of genomic sequences and expressed sequence tags revealed the presence of alternative transcripts of Nv-otp and two highly unusual protein-coding polymorphisms in the terminal helix of the Nv-rx homeodomain. A population genetic survey revealed the Rx polymorphisms to be widespread in natural populations. During larval development, all three genes are expressed in the ectoderm, in non-overlapping territories along the oral-aboral axis, with distinct

  8. Gene clusters for insecticidal loline alkaloids in the grass-endophytic fungus Neotyphodium uncinatum.

    Science.gov (United States)

    Spiering, Martin J; Moon, Christina D; Wilkinson, Heather H; Schardl, Christopher L

    2005-03-01

    Loline alkaloids are produced by mutualistic fungi symbiotic with grasses, and they protect the host plants from insects. Here we identify in the fungal symbiont, Neotyphodium uncinatum, two homologous gene clusters (LOL-1 and LOL-2) associated with loline-alkaloid production. Nine genes were identified in a 25-kb region of LOL-1 and designated (in order) lolF-1, lolC-1, lolD-1, lolO-1, lolA-1, lolU-1, lolP-1, lolT-1, and lolE-1. LOL-2 contained the homologs lolC-2 through lolE-2 in the same order and orientation. Also identified was lolF-2, but its possible linkage with either cluster was undetermined. Most lol genes were regulated in N. uncinatum and N. coenophialum, and all were expressed concomitantly with loline-alkaloid biosynthesis. A lolC-2 RNA-interference (RNAi) construct was introduced into N. uncinatum, and in two independent transformants, RNAi significantly decreased lolC expression (P lol-gene products indicate that the pathway has evolved from various different primary and secondary biosynthesis pathways.

  9. The Human Paraoxonase Gene Cluster As a Target in the Treatment of Atherosclerosis

    Science.gov (United States)

    She, Zhi-Gang; Chen, Hou-Zao; Yan, Yunfei; Li, Hongliang

    2012-01-01

    Abstract The paraoxonase (PON) gene cluster contains three adjacent gene members, PON1, PON2, and PON3. Originating from the same fungus lactonase precursor, all of the three PON genes share high sequence identity and a similar β propeller protein structure. PON1 and PON3 are primarily expressed in the liver and secreted into the serum upon expression, whereas PON2 is ubiquitously expressed and remains inside the cell. Each PON member has high catalytic activity toward corresponding artificial organophosphate, and all exhibit activities to lactones. Therefore, all three members of the family are regarded as lactonases. Under physiological conditions, they act to degrade metabolites of polyunsaturated fatty acids and homocysteine (Hcy) thiolactone, among other compounds. By detoxifying both oxidized low-density lipoprotein and Hcy thiolactone, PONs protect against atherosclerosis and coronary artery diseases, as has been illustrated by many types of in vitro and in vivo experimental evidence. Clinical observations focusing on gene polymorphisms also indicate that PON1, PON2, and PON3 are protective against coronary artery disease. Many other conditions, such as diabetes, metabolic syndrome, and aging, have been shown to relate to PONs. The abundance and/or activity of PONs can be regulated by lipoproteins and their metabolites, biological macromolecules, pharmacological treatments, dietary factors, and lifestyle. In conclusion, both previous results and ongoing studies provide evidence, making the PON cluster a prospective target for the treatment of atherosclerosis. Antioxid. Redox Signal. 16, 597–632. PMID:21867409

  10. Human paraoxonase gene cluster overexpression alleviates angiotensin II-induced cardiac hypertrophy in mice.

    Science.gov (United States)

    Pei, Jian-Fei; Yan, Yun-Fei; Tang, Xiaoqiang; Zhang, Yang; Cui, Shen-Shen; Zhang, Zhu-Qin; Chen, Hou-Zao; Liu, De-Pei

    2016-11-01

    Cardiac hypertrophy is the strongest predictor of the development of heart failure, and anti-hypertrophic treatment holds the key to improving the clinical syndrome and increasing the survival rates for heart failure. The paraoxonase (PON) gene cluster (PC) protects against atherosclerosis and coronary artery diseases. However, the role of PC in the heart is largely unknown. To evaluate the roles of PC in cardiac hypertrophy, transgenic mice carrying the intact human PON1, PON2, and PON3 genes and their flanking sequences were studied. We demonstrated that the PC transgene (PC-Tg) protected mice from cardiac hypertrophy induced by Ang II; these mice had reduced heart weight/body weight ratios, decreased left ventricular wall thicknesses and increased fractional shortening compared with wild-type (WT) control. The same protective tendency was also observed with an Apoe -/- background. Mechanically, PC-Tg normalized the disequilibrium of matrix metalloproteinases (MMPs)/tissue inhibitors of MMPs (TIMPs) in hypertrophic hearts, which might contribute to the protective role of PC-Tg in cardiac fibrosis and, thus, protect against cardiac remodeling. Taken together, our results identify a novel anti-hypertrophic role for the PON gene cluster, suggesting a possible strategy for the treatment of cardiac hypertrophy through elevating the levels of the PON gene family.

  11. Acquisition and evolution of plant pathogenesis-associated gene clusters and candidate determinants of tissue-specificity in xanthomonas.

    Directory of Open Access Journals (Sweden)

    Hong Lu

    Full Text Available Xanthomonas is a large genus of plant-associated and plant-pathogenic bacteria. Collectively, members cause diseases on over 392 plant species. Individually, they exhibit marked host- and tissue-specificity. The determinants of this specificity are unknown. To assess potential contributions to host- and tissue-specificity, pathogenesis-associated gene clusters were compared across genomes of eight Xanthomonas strains representing vascular or non-vascular pathogens of rice, brassicas, pepper and tomato, and citrus. The gum cluster for extracellular polysaccharide is conserved except for gumN and sequences downstream. The xcs and xps clusters for type II secretion are conserved, except in the rice pathogens, in which xcs is missing. In the otherwise conserved hrp cluster, sequences flanking the core genes for type III secretion vary with respect to insertion sequence element and putative effector gene content. Variation at the rpf (regulation of pathogenicity factors) cluster is more pronounced, though genes with established functional relevance are conserved. A cluster for synthesis of lipopolysaccharide varies highly, suggesting multiple horizontal gene transfers and reassortments, but this variation does not correlate with host- or tissue-specificity. Phylogenetic trees based on amino acid alignments of gum, xps, xcs, hrp, and rpf cluster products generally reflect strain phylogeny. However, amino acid residues at four positions correlate with tissue specificity, revealing hpaA and xpsD as candidate determinants. Examination of genome sequences of xanthomonads Xylella fastidiosa and Stenotrophomonas maltophilia revealed that the hrp, gum, and xcs clusters are recent acquisitions in the Xanthomonas lineage. Our results provide insight into the ancestral Xanthomonas genome and indicate that differentiation with respect to host- and tissue-specificity involved not major modifications or wholesale exchange of clusters, but subtle changes in a small

  12. Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer

    Directory of Open Access Journals (Sweden)

    Wolf Yuri I

    2012-12-01

    Full Text Available Abstract Background Collections of Clusters of Orthologous Genes (COGs) provide indispensable tools for comparative genomic analysis, evolutionary reconstruction and functional annotation of new genomes. Initially, COGs were made for all complete genomes of cellular life forms that were available at the time. However, with the accumulation of thousands of complete genomes, construction of a comprehensive COG set has become extremely computationally demanding and prone to error propagation, necessitating the switch to taxon-specific COG collections. Previously, we reported the collection of COGs for 41 genomes of Archaea (arCOGs). Here we present a major update of the arCOGs and describe evolutionary reconstructions to reveal general trends in the evolution of Archaea. Results The updated version of the arCOG database incorporates 91% of the pangenome of 120 archaea (251,032 protein-coding genes altogether) into 10,335 arCOGs. Using this new set of arCOGs, we performed maximum likelihood reconstruction of the genome content of archaeal ancestral forms and gene gain and loss events in archaeal evolution. This reconstruction shows that the last Common Ancestor of the extant Archaea was an organism of greater complexity than most of the extant archaea, probably with over 2,500 protein-coding genes. The subsequent evolution of almost all archaeal lineages was apparently dominated by gene loss resulting in genome streamlining. Overall, in the evolution of Archaea as well as a representative set of bacteria that was similarly analyzed for comparison, gene losses are estimated to outnumber gene gains at least 4 to 1. Analysis of specific patterns of gene gain in Archaea shows that, although some groups, in particular Halobacteria, acquire substantially more genes than others, on the whole, gene exchange between major groups of Archaea appears to be largely random, with no major ‘highways’ of horizontal gene transfer. Conclusions The updated collection

  13. ATNT: an enhanced system for expression of polycistronic secondary metabolite gene clusters in Aspergillus niger.

    Science.gov (United States)

    Geib, Elena; Brock, Matthias

    2017-01-01

    Fungi are treasure chests for yet unexplored natural products. However, exploitation of their real potential remains difficult as a significant proportion of biosynthetic gene clusters appears silent under standard laboratory conditions. Therefore, elucidation of novel products requires gene activation or heterologous expression. For heterologous gene expression, we previously developed an expression platform in Aspergillus niger that is based on the transcriptional regulator TerR and its target promoter PterA. In this study, we extended this system by regulating expression of terR by the doxycycline inducible Tet-on system. Reporter genes cloned under the control of the target promoter PterA remained silent in the absence of doxycycline, but were strongly expressed when doxycycline was added. Reporter quantification revealed that the coupled system results in about five times higher expression rates compared to gene expression under direct control of the Tet-on system. As production of secondary metabolites generally requires the expression of several biosynthetic genes, the suitability of the self-cleaving viral peptide sequence P2A was tested in this optimised expression system. P2A allowed polycistronic expression of genes required for Asp-melanin formation in combination with the gene coding for the red fluorescent protein tdTomato. Gene expression and Asp-melanin formation were prevented in the absence of doxycycline and strongly induced by addition of doxycycline. Fluorescence studies confirmed the correct subcellular localisation of the respective enzymes. This tightly regulated but strongly inducible expression system enables high level production of secondary metabolites, most likely even those with toxic potential. Furthermore, this system is compatible with polycistronic gene expression and, thus, suitable for the discovery of novel natural products.

  14. Heterologous Reconstitution of the Intact Geodin Gene Cluster in Aspergillus nidulans through a Simple and Versatile PCR Based Approach

    DEFF Research Database (Denmark)

    Nielsen, Morten Thrane; Nielsen, Jakob Blæsbjerg; Anyaogu, Dianna Chinyere

    2013-01-01

    was transferred in a two step procedure to an expression platform in A. nidulans. The individual cluster fragments were generated by PCR and assembled via efficient USER fusion prior to transformation and integration via re-iterative gene targeting. A total of 13 open reading frames contained in 25 kb of DNA were...... of solid methodology for genetic manipulation of most species severely hampers pathway characterization. Here we present a simple PCR based approach for heterologous reconstitution of intact gene clusters. Specifically, the putative gene cluster responsible for geodin production from Aspergillus terreus...... successfully transferred between the two species enabling geodin synthesis in A. nidulans. Subsequently, functions of three genes in the cluster were validated by genetic and chemical analyses. Specifically, ATEG_08451 (gedC) encodes a polyketide synthase, ATEG_08453 (gedR) encodes a transcription factor...

  15. Meta-analysis of cell-specific transcriptomic data using fuzzy c-means clustering discovers versatile viral responsive genes.

    Science.gov (United States)

    Khan, Atif; Katanic, Dejan; Thakar, Juilee

    2017-06-06

    Despite advances in gene-set enrichment analysis methods, inadequate definitions of gene-sets remain a major limitation in the discovery of novel biological processes from transcriptomic datasets. Typically, gene-sets are obtained from publicly available pathway databases, which contain generalized definitions frequently derived by manual curation. Recently, unsupervised clustering algorithms have been proposed to identify gene-sets from transcriptomics datasets deposited in the public domain. These data-driven definitions of the gene-sets can be context-specific, revealing novel biological mechanisms. However, the previously proposed algorithms for identification of data-driven gene-sets are based on hard clustering, which does not allow overlap across clusters, a characteristic that is predominantly observed across biological pathways. We developed a pipeline using a fuzzy-C-means (FCM) soft clustering approach to identify gene-sets that recapitulate topological characteristics of biological pathways. Specifically, we apply our pipeline to derive gene-sets from transcriptomic data measuring the response of monocyte derived dendritic cells and A549 epithelial cells to influenza infections. Our approach applies Ward's method for the selection of initial conditions, optimizes parameters of the FCM algorithm for human cell-specific transcriptomic data and identifies robust gene-sets along with versatile viral responsive genes. We validate our gene-sets and demonstrate that, by identifying genes associated with multiple gene-sets, the FCM clustering algorithm significantly improves interpretation of transcriptomic data, facilitating investigation of novel biological processes by leveraging transcriptomic data available in the public domain. We develop an interactive 'Fuzzy Inference of Gene-sets (FIGS)' package (GitHub: https://github.com/Thakar-Lab/FIGS ) to facilitate use of the pipeline. Future extension of FIGS across different immune cell-types will improve mechanistic
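
    The soft-clustering idea in this abstract can be illustrated with a minimal sketch of the textbook fuzzy c-means update rules. It is not the FIGS pipeline: the function name, the toy points, and the caller-supplied starting centers are assumptions made for illustration (the abstract states that initial conditions come from Ward's method, which is not reproduced here).

```python
import math


def fuzzy_c_means(points, centers, m=2.0, n_iter=100, tol=1e-6):
    """Textbook fuzzy c-means; returns (centers, memberships).

    points  -- list of equal-length numeric tuples
    centers -- initial cluster centers (hypothetical; supplied by the caller)
    m       -- fuzzifier, must be > 1; larger values give softer clusters
    """
    centers = [list(c) for c in centers]
    dim = len(points[0])
    memberships = []
    for _ in range(n_iter):
        # Membership update: each row holds one point's degrees of
        # membership in every cluster, and each row sums to 1.
        memberships = []
        for p in points:
            dists = [math.dist(p, c) for c in centers]
            if any(d == 0.0 for d in dists):
                # A point lying exactly on a center belongs fully to it.
                hits = [1.0 if d == 0.0 else 0.0 for d in dists]
                total = sum(hits)
                memberships.append([h / total for h in hits])
                continue
            inv = [d ** (-2.0 / (m - 1.0)) for d in dists]
            total = sum(inv)
            memberships.append([v / total for v in inv])
        # Center update: mean of the points weighted by membership ** m.
        new_centers = []
        for i in range(len(centers)):
            weights = [row[i] ** m for row in memberships]
            wsum = sum(weights)
            new_centers.append(
                [sum(w * p[d] for w, p in zip(weights, points)) / wsum
                 for d in range(dim)]
            )
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships


# Toy data (hypothetical): two tight groups plus one point between them.
toy_points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0), (2.5, 3.0)]
toy_centers, toy_u = fuzzy_c_means(toy_points, [(0.0, 0.0), (5.0, 5.0)])
```

    Unlike hard clustering, the point placed between the two groups keeps a substantial membership in both clusters, which is the property the abstract relies on to let one gene belong to several gene-sets.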

  16. Comparison of loline alkaloid gene clusters across fungal endophytes: predicting the co-regulatory sequence motifs and the evolutionary history.

    Science.gov (United States)

    Kutil, Brandi L; Greenwald, Charles; Liu, Gang; Spiering, Martin J; Schardl, Christopher L; Wilkinson, Heather H

    2007-10-01

    LOL, a fungal secondary metabolite gene cluster found in Epichloë and Neotyphodium species, is responsible for production of insecticidal loline alkaloids. To analyze the genetic architecture and to predict the evolutionary history of LOL, we compared five clusters from four fungal species (single clusters from Epichloë festucae, Neotyphodium sp. PauTG-1, Neotyphodium coenophialum, and two clusters we previously characterized in Neotyphodium uncinatum). Using PhyloCon to compare putative lol gene promoter regions, we have identified four motifs conserved across the lol genes in all five clusters. Each motif has significant similarity to known fungal transcription factor binding sites in the TRANSFAC database. Conservation of these motifs is further support for the hypothesis that the lol genes are co-regulated. Interestingly, the history of asexual Neotyphodium spp. includes multiple interspecific hybridization events. Comparing clusters from three Neotyphodium species and E. festucae allowed us to determine which Epichloë ancestors are the most likely contributors of LOL in these asexual species. For example, while no present day Epichloë typhina isolates are known to produce lolines, our data support the hypothesis that the E. typhina ancestor(s) of three asexual endophyte species contained a LOL gene cluster. Thus, these data support a model of evolution in which the polymorphism in loline alkaloid production phenotypes among endophyte species is likely due to the loss of the trait over time.

  17. Noise Resistant Generalized Parametric Validity Index of Clustering for Gene Expression Data.

    Science.gov (United States)

    Fa, Rui; Nandi, Asoke K

    2014-01-01

    Validity indices have been investigated for decades. However, since there is no study of noise-resistance performance of these indices in the literature, there is no guideline for determining the best clustering in noisy data sets, especially microarray data sets. In this paper, we propose a generalized parametric validity (GPV) index which employs two tunable parameters α and β to control the proportions of objects being considered to calculate the dissimilarities. The greatest advantage of the proposed GPV index is its noise-resistance ability, which results from the flexibility of tuning the parameters. Several rules are set to guide the selection of parameter values. To illustrate the noise-resistance performance of the proposed index, we evaluate the GPV index for assessing five clustering algorithms in two gene expression data simulation models with different noise levels and compare the ability of determining the number of clusters with eight existing indices. We also test the GPV in three groups of real gene expression data sets. The experimental results suggest that the proposed GPV index has superior noise-resistance ability and provides fairly accurate judgements.

  18. Characterization of the biosynthetic gene cluster of rebeccamycin from Lechevalieria aerocolonigenes ATCC 39243.

    Science.gov (United States)

    Onaka, Hiroyasu; Taniguchi, Shin-ichi; Igarashi, Yasuhiro; Furumai, Tamotsu

    2003-01-01

    The biosynthetic gene cluster for rebeccamycin, an indolocarbazole antibiotic, from Lechevalieria aerocolonigenes ATCC 39243 has 11 ORFs. To clarify their functions, mutants with rebG, rebD, rebC, rebP, rebM, rebR, rebH, rebT, or orfD2 disrupted were constructed, and the gene products were examined. rebP disruptants produced 11,11'-dichlorochromopyrrolic acid, found to be a biosynthetic intermediate by a bioconversion experiment. Other genes encoded N-glycosyltransferase (rebG), monooxygenase (rebC), methyltransferase (rebM), a transcriptional activator (rebR), and halogenase (rebH). rebT disruptants produced as much rebeccamycin as the wild strain, so rebT was probably not involved in rebeccamycin production. Biosynthetic genes of staurosporine, another indolocarbazole antibiotic, were cloned from Streptomyces sp. TP-A0274. staO, staD, and staP were similar to rebO, rebD, and rebP, respectively, all of which are responsible for indolocarbazole biosynthesis, but a rebC homolog, encoding a putative enzyme oxidizing the C-7 site of pyrrole rings, was not found in the staurosporine biosynthetic gene cluster. These results suggest that indolocarbazole is constructed by oxidative decarboxylation of chromopyrrolic acid (11,11'-dichlorochromopyrrolic acid in rebeccamycin) generated from two molecules of tryptophan by coupling and that the oxidation state at the C-7 position depends on the additional enzyme(s) encoded by the biosynthetic genes.

  19. Pathogen corruption and site-directed recombination at a plant disease resistance gene cluster.

    Science.gov (United States)

    Nagy, Ervin D; Bennetzen, Jeffrey L

    2008-12-01

    The Pc locus of sorghum (Sorghum bicolor) determines dominant sensitivity to a host-selective toxin produced by the fungal pathogen Periconia circinata. The Pc region was cloned by a map-based approach and found to contain three tandemly repeated genes with the structures of nucleotide binding site-leucine-rich repeat (NBS-LRR) disease resistance genes. Thirteen independent Pc-to-pc mutations were analyzed, and each was found to remove all or part of the central gene of the threesome. Hence, this central gene is Pc. Most Pc-to-pc mutations were associated with unequal recombination. Eight recombination events were localized to different sites in a 560-bp region within the approximately 3.7-kb NBS-LRR genes. Because any unequal recombination located within the flanking NBS-LRR genes would have removed Pc, the clustering of cross-over events within a 560-bp segment indicates that a site-directed recombination process exists that specifically targets unequal events to generate LRR diversity in NBS-LRR loci.

  20. Isoeugenol monooxygenase and its putative regulatory gene are located in the eugenol metabolic gene cluster in Pseudomonas nitroreducens Jin1.

    Science.gov (United States)

    Ryu, Ji-Young; Seo, Jiyoung; Unno, Tatsuya; Ahn, Joong-Hoon; Yan, Tao; Sadowsky, Michael J; Hur, Hor-Gil

    2010-03-01

    The plant-derived phenylpropanoids eugenol and isoeugenol have been proposed as useful precursors for the production of natural vanillin. Genes involved in the metabolism of eugenol and isoeugenol were clustered in a region of about 30 kb in Pseudomonas nitroreducens Jin1. Two of the 23 ORFs in this region, ORFs 26 (iemR) and 27 (iem), were predicted to be involved in the conversion of isoeugenol to vanillin. The deduced amino acid sequence of isoeugenol monooxygenase (Iem) of strain Jin1 had 81.4% identity to isoeugenol monooxygenase from Pseudomonas putida IE27, which also transforms isoeugenol to vanillin. Iem was expressed in E. coli BL21(DE3) and was found to transform isoeugenol to vanillin. Deletion and cloning analyses indicated that the gene iemR, located upstream of iem, is required for expression of iem in the presence of isoeugenol, suggesting it to be the iem regulatory gene. Reverse transcription real-time PCR analyses indicated that the genes involved in the metabolism of eugenol and isoeugenol were differently induced by isoeugenol, eugenol, and vanillin.

  1. Functional classification of genes using semantic distance and fuzzy clustering approach: evaluation with reference sets and overlap analysis.

    Science.gov (United States)

    Devignes, Marie-Dominique; Benabderrahmane, Sidahmed; Smaïl-Tabbone, Malika; Napoli, Amedeo; Poch, Olivier

    2012-01-01

    Functional classification aims at grouping genes according to their molecular function or the biological process they participate in. Evaluating the validity of such unsupervised gene classification remains a challenge given the variety of distance measures and classification algorithms that can be used. We evaluate here functional classification of genes with the help of reference sets: KEGG (Kyoto Encyclopaedia of Genes and Genomes) pathways and Pfam clans. These sets represent ground truth for any distance based on GO (Gene Ontology) biological process and molecular function annotations respectively. Overlaps between clusters and reference sets are estimated by the F-score method. We test our previously described IntelliGO semantic distance with hierarchical and fuzzy C-means clustering and we compare results with the state-of-the-art DAVID (Database for Annotation Visualisation and Integrated Discovery) functional classification method. Finally, study of best matching clusters to reference sets leads us to propose a set-difference method for discovering missing information.
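
    The F-score overlap between a cluster and a reference set, as mentioned in this abstract, can be sketched as follows. This is a minimal illustration of the standard F-score (harmonic mean of precision and recall) applied to gene sets, not the authors' implementation; the function names and the gene and set labels are hypothetical.

```python
def f_score(cluster, reference):
    """Harmonic mean of precision and recall for two collections of genes."""
    cluster, reference = set(cluster), set(reference)
    overlap = len(cluster & reference)
    if overlap == 0:
        return 0.0
    precision = overlap / len(cluster)    # fraction of the cluster found in the reference
    recall = overlap / len(reference)     # fraction of the reference recovered by the cluster
    return 2 * precision * recall / (precision + recall)


def best_matches(clusters, reference_sets):
    """For each reference set, return (best cluster name, its F-score)."""
    result = {}
    for ref_name, ref_genes in reference_sets.items():
        best = max(clusters, key=lambda name: f_score(clusters[name], ref_genes))
        result[ref_name] = (best, f_score(clusters[best], ref_genes))
    return result


# Hypothetical labels, used only to exercise the functions.
example_clusters = {"c1": {"g1", "g2", "g3"}, "c2": {"g4", "g5"}}
example_refs = {"pathwayA": {"g2", "g3", "g6", "g7"}, "pathwayB": {"g4", "g5"}}
example_best = best_matches(example_clusters, example_refs)
```

    Here cluster c1 shares two of its three genes with the four-gene pathwayA, giving precision 2/3, recall 1/2 and an F-score of 4/7.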

  2. Identifying driving gene clusters in complex diseases through critical transition theory

    Science.gov (United States)

    Wolanyk, Nathaniel; Wang, Xujing; Hessner, Martin; Gao, Shouguo; Chen, Ye; Jia, Shuang

    A novel approach of looking at the human body using critical transition theory has yielded positive results: clusters of genes that act in tandem to drive complex disease progression. Such a cluster of genes can be thought of as the first part of a large genetic force that pushes the body from a curable, but sick, point to an incurable diseased point through a catastrophic bifurcation. The data analyzed are time course microarray blood assay data of 7 high risk individuals for Type 1 Diabetes who progressed to clinical onset, with an additional larger study requested to be presented at the conference. The normalized data comprise 25,000 genes, which were narrowed down based on statistical metrics, and finally a machine learning algorithm using critical transition metrics found the driving network. This approach was created to be repeatable across multiple complex diseases with only progression time course data needed, so that it would be applicable to identifying when an individual is at risk of developing a complex disease. Thus, preventative measures can be enacted, and in the longer term the approach offers a possible solution to prevent all Type 1 Diabetes.

  3. The Serratia gene cluster encoding biosynthesis of the red antibiotic, prodigiosin, shows species- and strain-dependent genome context variation

    DEFF Research Database (Denmark)

    Harris, Abigail K P; Williamson, Neil R; Slater, Holly

    2004-01-01

    The prodigiosin biosynthesis gene cluster (pig cluster) from two strains of Serratia (S. marcescens ATCC 274 and Serratia sp. ATCC 39006) has been cloned, sequenced and expressed in heterologous hosts. Sequence analysis of the respective pig clusters revealed 14 ORFs in S. marcescens ATCC 274...... from Str. coelicolor A3(2) revealed some important differences. A modified scheme for the biosynthesis of prodigiosin, based on the pathway recently suggested for the synthesis of undecylprodigiosin, is proposed. The distribution of the pig cluster within several Serratia sp. isolates is demonstrated...

  4. Generating in vivo cloning vectors for parallel cloning of large gene clusters by homologous recombination.

    Directory of Open Access Journals (Sweden)

    Jeongmin Lee

    Full Text Available A robust method for the in vivo cloning of large gene clusters was developed based on homologous recombination (HR), requiring only the transformation of PCR products into Escherichia coli cells harboring a receiver plasmid. Positive clones were selected by an acquired antibiotic resistance, which was activated by the recruitment of a short ribosome-binding site plus start codon sequence from the PCR products to the upstream position of a silent antibiotic resistance gene in receiver plasmids. This selection was highly stringent and thus the cloning efficiency of the GFPuv gene (size: 0.7 kb) was comparable to that of the conventional restriction-ligation method, reaching up to 4.3 × 10^4 positive clones per μg of DNA. When we attempted parallel cloning of GFPuv fusion genes (size: 2.0 kb) and carotenoid biosynthesis pathway clusters (sizes: 4 kb, 6 kb, and 10 kb), the cloning efficiency was similarly high regardless of the DNA size, demonstrating that this would be useful for the cloning of large DNA sequences carrying multiple open reading frames. However, restriction analyses of the obtained plasmids showed that the selected cells may contain significant amounts of receiver plasmids without the inserts. To minimize the amount of empty plasmid in the positive selections, the sacB gene encoding a levansucrase was introduced as a counter selection marker in receiver plasmid as it converts sucrose to a toxic levan in the E. coli cells. Consequently, this method yielded completely homogeneous plasmids containing the inserts via the direct transformation of PCR products into E. coli cells.

  5. antiSMASH 3.0-a comprehensive resource for the genome mining of biosynthetic gene clusters.

    Science.gov (United States)

    Weber, Tilmann; Blin, Kai; Duddela, Srikanth; Krug, Daniel; Kim, Hyun Uk; Bruccoleri, Robert; Lee, Sang Yup; Fischbach, Michael A; Müller, Rolf; Wohlleben, Wolfgang; Breitling, Rainer; Takano, Eriko; Medema, Marnix H

    2015-07-01

    Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. New tools for reconstruction and heterologous expression of natural product biosynthetic gene clusters.

    Science.gov (United States)

    Luo, Yunzi; Enghiad, Behnam; Zhao, Huimin

    2016-02-01

    Natural product scaffolds remain a major source and inspiration for human therapeutics. However, generation of a natural product in the post-genomic era often requires reconstruction of the corresponding biosynthetic gene cluster in a heterologous host. In the burgeoning fields of synthetic biology and metabolic engineering, significant effort has been devoted to developing DNA assembly techniques with higher efficiency, fidelity, and modularity, and heterologous expression systems with higher productivity and yield. Here we describe recent advances in DNA assembly and host engineering and highlight their applications in natural product discovery and engineering.

  7. Genome-wide identification of physically clustered genes suggests chromatin-level co-regulation in male reproductive development in Arabidopsis thaliana.

    Science.gov (United States)

    Reimegård, Johan; Kundu, Snehangshu; Pendle, Ali; Irish, Vivian F; Shaw, Peter; Nakayama, Naomi; Sundström, Jens F; Emanuelsson, Olof

    2017-04-07

    Co-expression of physically linked genes occurs surprisingly frequently in eukaryotes. Such chromosomal clustering may confer a selective advantage as it enables coordinated gene regulation at the chromatin level. We studied the chromosomal organization of genes involved in male reproductive development in Arabidopsis thaliana. We developed an in-silico tool to identify physical clusters of co-regulated genes from gene expression data. We identified 17 clusters (96 genes) involved in stamen development and acting downstream of the transcriptional activator MS1 (MALE STERILITY 1), which contains a PHD domain associated with chromatin re-organization. The clusters exhibited little gene homology or promoter element similarity, and largely overlapped with reported repressive histone marks. Experiments on a subset of the clusters suggested a link between expression activation and chromatin conformation: qRT-PCR and mRNA in situ hybridization showed that the clustered genes were up-regulated within 48 h after MS1 induction; out of 14 chromatin-remodeling mutants studied, expression of clustered genes was consistently down-regulated only in hta9/hta11, previously associated with metabolic cluster activation; DNA fluorescence in situ hybridization confirmed that transcriptional activation of the clustered genes was correlated with open chromatin conformation. Stamen development thus appears to involve transcriptional activation of physically clustered genes through chromatin de-condensation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. Genetic interrelations in the actinomycin biosynthetic gene clusters of Streptomyces antibioticus IMRU 3720 and Streptomyces chrysomallus ATCC11523, producers of actinomycin X and actinomycin C

    Science.gov (United States)

    Crnovčić, Ivana; Rückert, Christian; Semsary, Siamak; Lang, Manuel; Kalinowski, Jörn; Keller, Ullrich

    2017-01-01

    Sequencing the actinomycin (acm) biosynthetic gene cluster of Streptomyces antibioticus IMRU 3720, which produces actinomycin X (Acm X), revealed 20 genes organized into a highly similar framework as in the bi-armed acm C biosynthetic gene cluster of Streptomyces chrysomallus but without an attached additional extra arm of orthologues as in the latter. Curiously, the extra arm of the S. chrysomallus gene cluster turned out to perfectly match the single arm of the S. antibioticus gene cluster in the same order of orthologues including the presence of two pseudogenes, scacmM and scacmN, encoding a cytochrome P450 and its ferredoxin, respectively. Orthologues of the latter genes were both missing in the principal arm of the S. chrysomallus acm C gene cluster. All orthologues of the extra arm showed a G+C content different from that of their counterparts in the principal arm. Moreover, the similarities of translation products from the extra arm were all higher to the corresponding translation products of orthologue genes from the S. antibioticus acm X gene cluster than to those encoded by the principal arm of their own gene cluster. This suggests that the duplicated structure of the S. chrysomallus acm C biosynthetic gene cluster evolved from previous fusion between two one-armed acm gene clusters each from a different genetic background. However, while scacmM and scacmN in the extra arm of the S. chrysomallus acm C gene cluster are mutated and therefore are non-functional, their orthologues saacmM and saacmN in the S. antibioticus acm C gene cluster show no defects seemingly encoding active enzymes with functions specific for Acm X biosynthesis. Both acm biosynthetic gene clusters lack a kynurenine-3-monooxygenase gene necessary for biosynthesis of 3-hydroxy-4-methylanthranilic acid, the building block of the Acm chromophore, which suggests participation of a genome-encoded relevant monooxygenase during Acm biosynthesis in both S. chrysomallus and S

  9. The entire β-globin gene cluster is deleted in a form of γδβ-thalassemia.

    NARCIS (Netherlands)

    E.R. Fearon; H.H.Jr. Kazazian; P.G. Waber (Pamela); J.I. Lee (Joseph); S.E. Antonarakis; S.H. Orkin (Stuart); E.F. Vanin; P.S. Henthorn; F.G. Grosveld (Frank); A.F. Scott; G.R. Buchanan

    1983-01-01

    We have used restriction endonuclease mapping to study a deletion involving the beta-globin gene cluster in a Mexican-American family with gamma delta beta-thalassemia. Analysis of DNA polymorphisms demonstrated deletion of the beta-globin gene from the affected chromosome. Using a DNA

  10. Ladder-like amplification of the type I interferon gene cluster in the human osteosarcoma cell line MG63

    Czech Academy of Sciences Publication Activity Database

    Marella, N.V.; Zeitz, M.J.; Malyavantham, K.S.; Pliss, A.; Matsui, S.; Goetze, S.; Bode, J.; Raška, Ivan; Berezney, R.

    2008-01-01

    Vol. 16, No. 8 (2008), pp. 1177-1192. ISSN 0967-3849. Grant - others: GA MŠk(CZ) LC535. Program: LC. Institutional research plan: CEZ:AV0Z50110509. Keywords: array comparative genomic hybridization * human osteosarcoma cells * interferon gene cluster. Subject RIV: EB - Genetics; Molecular Biology. Impact factor: 3.405, year: 2008

  11. Dispersed Benzoxazinone Gene Cluster: Molecular Characterization and Chromosomal Localization of Glucosyltransferase and Glucosidase Genes in Wheat and Rye

    Science.gov (United States)

    Sue, Masayuki; Nakamura, Chihiro; Nomura, Taiji

    2011-01-01

    Benzoxazinones (Bxs) are major defensive secondary metabolites in wheat (Triticum aestivum), rye (Secale cereale), and maize (Zea mays). Here, we identified full sets of homeologous and paralogous genes encoding Bx glucosyltransferase (GT) and Bx-glucoside glucosidase (Glu) in hexaploid wheat (2n = 6x = 42; AABBDD). Four GT loci (TaGTa–TaGTd) were mapped on chromosomes 7A, 7B (two loci), and 7D, whereas four glu1 loci (Taglu1a–Taglu1d) were on chromosomes 2A, 2B (two loci), and 2D. Transcript levels differed greatly among the four loci; B-genome loci of both TaGT and Taglu1 genes were preferentially transcribed. Catalytic properties of the enzyme encoded by each homeolog/paralog also differed despite high levels of identity among amino acid sequences. The predominant contribution of the B genome to GT and Glu reactions was revealed, as observed previously for the five Bx biosynthetic genes, TaBx1 to TaBx5, which are separately located on homeologous groups 4 and 5 chromosomes. In rye, where the ScBx1 to ScBx5 genes are dispersed to chromosomes 7R and 5R, ScGT and Scglu were located separately on chromosomes 4R and 2R, respectively. The dispersal of Bx-pathway loci to four distinct chromosomes in hexaploid wheat and rye suggests that the clustering of Bx-pathway genes, as found in maize, is not essential for coordinated transcription. On the other hand, barley (Hordeum vulgare) was found to lack the orthologous GT and glu loci like the Bx1 to Bx5 loci despite its close phylogenetic relationship with wheat and rye. These results contribute to our understanding of the evolutionary processes that the Bx-pathway loci have undergone in grasses. PMID:21875895

  12. Dispersed benzoxazinone gene cluster: molecular characterization and chromosomal localization of glucosyltransferase and glucosidase genes in wheat and rye.

    Science.gov (United States)

    Sue, Masayuki; Nakamura, Chihiro; Nomura, Taiji

    2011-11-01

    Benzoxazinones (Bxs) are major defensive secondary metabolites in wheat (Triticum aestivum), rye (Secale cereale), and maize (Zea mays). Here, we identified full sets of homeologous and paralogous genes encoding Bx glucosyltransferase (GT) and Bx-glucoside glucosidase (Glu) in hexaploid wheat (2n = 6x = 42; AABBDD). Four GT loci (TaGTa-TaGTd) were mapped on chromosomes 7A, 7B (two loci), and 7D, whereas four glu1 loci (Taglu1a-Taglu1d) were on chromosomes 2A, 2B (two loci), and 2D. Transcript levels differed greatly among the four loci; B-genome loci of both TaGT and Taglu1 genes were preferentially transcribed. Catalytic properties of the enzyme encoded by each homeolog/paralog also differed despite high levels of identity among amino acid sequences. The predominant contribution of the B genome to GT and Glu reactions was revealed, as observed previously for the five Bx biosynthetic genes, TaBx1 to TaBx5, which are separately located on homeologous groups 4 and 5 chromosomes. In rye, where the ScBx1 to ScBx5 genes are dispersed to chromosomes 7R and 5R, ScGT and Scglu were located separately on chromosomes 4R and 2R, respectively. The dispersal of Bx-pathway loci to four distinct chromosomes in hexaploid wheat and rye suggests that the clustering of Bx-pathway genes, as found in maize, is not essential for coordinated transcription. On the other hand, barley (Hordeum vulgare) was found to lack the orthologous GT and glu loci like the Bx1 to Bx5 loci despite its close phylogenetic relationship with wheat and rye. These results contribute to our understanding of the evolutionary processes that the Bx-pathway loci have undergone in grasses.

  13. Linking secondary metabolites to gene clusters through genome sequencing of six diverse Aspergillus species

    DEFF Research Database (Denmark)

    Kjærbølling, Inge; Vesth, Tammi C.; Frisvad, Jens C.

    2018-01-01

    to determine phylogeny and genetic diversity, showing that each presented genome contains 15–27% genes not found in other sequenced Aspergilli. In particular, A. novofumigatus was compared with the pathogenic species A. fumigatus. This suggests that A. novofumigatus can produce most of the same allergens......, virulence, and pathogenicity factors as A. fumigatus, suggesting that A. novofumigatus could be as pathogenic as A. fumigatus. Furthermore, SMs were linked to gene clusters based on biological and chemical knowledge and analysis, genome sequences, and predictive algorithms. We thus identify putative SM....... campestris, A. novofumigatus, A. ochraceoroseus, and A. steynii) have been whole-genome PacBio sequenced to provide genetic references in three Aspergillus sections. A. taichungensis and A. candidus also were sequenced for SM elucidation. Thirteen Aspergillus genomes were analyzed with comparative genomics...

  14. Characteristic beta-globin gene cluster haplotypes of Evenkis and Oroqens in north China.

    Science.gov (United States)

    Shimizu, Koji; Marubayashi, Azusa; Tokimasa, Kozue; Harihara, Shinji; Omoto, Keiichi; Imanishi, Tadashi; Hao, Luping; Jin, Feng

    2004-10-01

    Haplotype frequencies of the beta-globin gene cluster were estimated for 114 Evenkis and 81 Oroqens from northeast China, and their characteristics were compared with those in Japanese, Koreans, and three Colombian Amerindian groups of South America (Wayuu, Kamsa, and Inga tribes). A major 5' subhaplotype (5' to the delta-globin gene) was + - - - - in Evenkis, whereas + - - - -, - + + - +, and - + - + + were the major subhaplotypes in Oroqens. One possible candidate for an ancestral 5' subhaplotype, - - - - -, was found in one Evenki (0.5%) and three Oroqen chromosomes (2.0%). They were observed as heterozygous forms for + - - - - and - - - - -. Major haplotypes were +-----+, + -----+-, and + - - - - + + in Evenkis, whereas they were +-----+,-++-+-+, +----+-, and -+-++-+ in Oroqens. The lowest Nei's genetic distance values of Evenkis or Oroqens based on the 5' subhaplotype frequency distributions were observed in relation to the Wayuu or Koreans, respectively, but those of Evenkis and Oroqens based on the haplotype frequency distributions were found in relation to Koreans.
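
    A genetic distance between two populations can be computed from haplotype frequency distributions along the following lines. The abstract does not say which of Nei's distances was used; this sketch assumes Nei's standard genetic distance (D = -ln I, with I the normalized identity) for a single locus, and the function name and frequencies are hypothetical, not values from the study.

```python
import math


def nei_standard_distance(freq_x, freq_y):
    """Nei's standard genetic distance for one locus (assumed variant).

    freq_x, freq_y -- dicts mapping haplotype label -> frequency in each population
    """
    haplotypes = set(freq_x) | set(freq_y)
    j_xy = sum(freq_x.get(h, 0.0) * freq_y.get(h, 0.0) for h in haplotypes)
    j_x = sum(v * v for v in freq_x.values())
    j_y = sum(v * v for v in freq_y.values())
    if j_xy == 0.0:
        # No shared haplotypes: identity is zero and the distance is unbounded.
        return math.inf
    identity = j_xy / math.sqrt(j_x * j_y)
    return -math.log(identity)


# Hypothetical frequencies for two populations.
pop_a = {"+----": 0.5, "-++-+": 0.5}
pop_b = {"+----": 1.0}
example_distance = nei_standard_distance(pop_a, pop_b)
```

    Populations with identical frequency distributions give a distance of zero, and the value grows as the distributions share less of their haplotype mass.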

  15. Heterologous reconstitution of the intact geodin gene cluster in Aspergillus nidulans through a simple and versatile PCR based approach.

    Directory of Open Access Journals (Sweden)

    Morten Thrane Nielsen

    Full Text Available Fungal natural products are a rich resource for bioactive molecules. To fully exploit this potential it is necessary to link genes to metabolites. Genetic information for numerous putative biosynthetic pathways has become available in recent years through genome sequencing. However, the lack of solid methodology for genetic manipulation of most species severely hampers pathway characterization. Here we present a simple PCR based approach for heterologous reconstitution of intact gene clusters. Specifically, the putative gene cluster responsible for geodin production from Aspergillus terreus was transferred in a two step procedure to an expression platform in A. nidulans. The individual cluster fragments were generated by PCR and assembled via efficient USER fusion prior to transformation and integration via re-iterative gene targeting. A total of 13 open reading frames contained in 25 kb of DNA were successfully transferred between the two species enabling geodin synthesis in A. nidulans. Subsequently, functions of three genes in the cluster were validated by genetic and chemical analyses. Specifically, ATEG_08451 (gedC) encodes a polyketide synthase, ATEG_08453 (gedR) encodes a transcription factor responsible for activation of the geodin gene cluster and ATEG_08460 (gedL) encodes a halogenase that catalyzes conversion of sulochrin to dihydrogeodin. We expect that our approach for transferring intact biosynthetic pathways to a fungus with a well developed genetic toolbox will be instrumental in characterizing the many exciting pathways for secondary metabolite production that are currently being uncovered by the fungal genome sequencing projects.

  16. Expression-based clustering of CAZyme-encoding genes of Aspergillus niger.

    Science.gov (United States)

    Gruben, Birgit S; Mäkelä, Miia R; Kowalczyk, Joanna E; Zhou, Miaomiao; Benoit-Gelber, Isabelle; De Vries, Ronald P

    2017-11-23

    The Aspergillus niger genome contains a large repertoire of genes encoding carbohydrate active enzymes (CAZymes) that are targeted to plant polysaccharide degradation enabling A. niger to grow on a wide range of plant biomass substrates. Which genes need to be activated in certain environmental conditions depends on the composition of the available substrate. Previous studies have demonstrated the involvement of a number of transcriptional regulators in plant biomass degradation and have identified sets of target genes for each regulator. In this study, a broad transcriptional analysis was performed of the A. niger genes encoding (putative) plant polysaccharide degrading enzymes. Microarray data focusing on the initial response of A. niger to the presence of plant biomass related carbon sources were analyzed of a wild-type strain N402 that was grown on a large range of carbon sources and of the regulatory mutant strains ΔxlnR, ΔaraR, ΔamyR, ΔrhaR and ΔgalX that were grown on their specific inducing compounds. The cluster analysis of the expression data revealed several groups of co-regulated genes, which goes beyond the traditionally described co-regulated gene sets. Additional putative target genes of the selected regulators were identified, based on their expression profile. Notably, in several cases the expression profile puts questions on the function assignment of uncharacterized genes that was based on homology searches, highlighting the need for more extensive biochemical studies into the substrate specificity of enzymes encoded by these non-characterized genes. The data also revealed sets of genes that were upregulated in the regulatory mutants, suggesting interaction between the regulatory systems and a therefore even more complex overall regulatory network than has been reported so far. Expression profiling on a large number of substrates provides better insight in the complex regulatory systems that drive the conversion of plant biomass by fungi. In

  17. Inactivation of the indole-diterpene biosynthetic gene cluster of Claviceps paspali by Agrobacterium-mediated gene replacement.

    Science.gov (United States)

    Kozák, László; Szilágyi, Zoltán; Vágó, Barbara; Kakuk, Annamária; Tóth, László; Molnár, István; Pócsi, István

    2018-04-01

    The hypocrealean fungus Claviceps paspali is a parasite of wild grasses. This fungus is widely utilized in the pharmaceutical industry for the manufacture of ergot alkaloids, but also produces tremorgenic and neurotoxic indole-diterpene (IDT) secondary metabolites such as paspalitrems A and B. IDTs cause significant losses in agriculture and represent health hazards that threaten food security. Conversely, IDTs may also be utilized as lead compounds for pharmaceutical drug discovery. Current protoplast-mediated transformation protocols of C. paspali are inadequate as they suffer from inefficiencies in protoplast regeneration, a low frequency of DNA integration, and a low mitotic stability of the nascent transformants. We adapted and optimized Agrobacterium tumefaciens-mediated transformation (ATMT) for C. paspali and validated this method with the straightforward creation of a mutant strain of this fungus featuring a targeted replacement of key genes in the putative IDT biosynthetic gene cluster. Complete abrogation of IDT production in isolates of the mutant strain proved the predicted involvement of the target genes in the biosynthesis of IDTs. The mutant isolates continued to produce ergot alkaloids undisturbed, indicating that equivalent mutants generated in industrial ergot producers may have a better safety profile as they are devoid of IDT-type mycotoxins. Meanwhile, ATMT optimized for Claviceps spp. may open the door for the facile genetic engineering of these industrially and ecologically important organisms.

  18. Hessian regularization based symmetric nonnegative matrix factorization for clustering gene expression and microbiome data.

    Science.gov (United States)

    Ma, Yuanyuan; Hu, Xiaohua; He, Tingting; Jiang, Xingpeng

    2016-12-01

    Nonnegative matrix factorization (NMF) has received considerable attention due to its interpretation of observed samples as combinations of different components, and has been successfully used as a clustering method. As an extension of NMF, Symmetric NMF (SNMF) inherits the advantages of NMF. Unlike NMF, however, SNMF takes a nonnegative similarity matrix as an input, and two lower rank nonnegative matrices (H, H^T) are computed as an output to approximate the original similarity matrix. Laplacian regularization has improved the clustering performance of NMF and SNMF. However, Laplacian regularization (LR), as a classic manifold regularization method, suffers some problems because of its weak extrapolating ability. In this paper, we propose a novel variant of SNMF, called Hessian regularization based symmetric nonnegative matrix factorization (HSNMF), for this purpose. In contrast to Laplacian regularization, Hessian regularization fits the data perfectly and extrapolates nicely to unseen data. We conduct extensive experiments on several datasets including text data, gene expression data and HMP (Human Microbiome Project) data. The results show that the proposed method outperforms other methods, which suggests the potential application of HSNMF in biological data clustering. Copyright © 2016. Published by Elsevier Inc.
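
    The factorization described above, a nonnegative similarity matrix A approximated by H times its transpose, can be sketched as follows. This is plain SNMF only, using a damped multiplicative update that is commonly applied to the symmetric problem; the Hessian regularization term of HSNMF is not included, and this is not the authors' implementation. The function names, the toy matrix and all parameter values are assumptions made for illustration.

```python
import numpy as np


def symmetric_nmf(A, k, n_iter=200, beta=0.5, seed=0, eps=1e-10):
    """Approximate a nonnegative symmetric matrix A by H @ H.T with H >= 0.

    Uses the damped multiplicative update
        H <- H * ((1 - beta) + beta * (A @ H) / (H @ H.T @ H)).
    With n_iter=0 the random starting point is returned unchanged.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((A.shape[0], k))
    for _ in range(n_iter):
        numer = A @ H
        denom = H @ (H.T @ H) + eps  # eps guards against division by zero
        H *= (1.0 - beta) + beta * numer / denom
    return H


def reconstruction_error(A, H):
    """Frobenius norm of the residual A - H @ H.T."""
    return float(np.linalg.norm(A - H @ H.T, "fro"))


# Hypothetical similarity matrix: two groups of three samples each,
# high similarity within a group and low similarity between groups.
A = np.full((6, 6), 0.1)
A[:3, :3] = 0.9
A[3:, 3:] = 0.9
np.fill_diagonal(A, 1.0)

H = symmetric_nmf(A, k=2)
labels = H.argmax(axis=1)  # assign each sample to its dominant component
print(labels, reconstruction_error(A, H))
```

    Reading cluster labels from the largest entry in each row of H is one common convention; a regularized variant such as HSNMF would add a penalty term to the objective and change the update accordingly.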

  19. Human major histocompatibility complex contains a minimum of 19 genes between the complement cluster and HLA-B

    International Nuclear Information System (INIS)

    Spies, T.; Bresnahan, M.; Strominger, J.L.

    1989-01-01

    A 600-kilobase (kb) DNA segment from the human major histocompatibility complex (MHC) class III region was isolated by extension of a previous 435-kb chromosome walk. The contiguous series of cloned overlapping cosmids contains the entire 555-kb interval between C2 in the complement gene cluster and HLA-B. This region is known to encode the tumor necrosis factors (TNFs) α and β, B144, and the major heat shock protein HSP70. Moreover, a cluster of genes, BAT1-BAT5 (HLA-B-associated transcripts), has been localized in the vicinity of the genes for TNFα and TNFβ. An additional four genes were identified by isolation of corresponding cDNA clones with cosmid DNA probes. These genes for BAT6-BAT9 were mapped near the gene for C2 within a 120-kb region that includes a HSP70 gene pair. These results, together with complementary data from a similar recent study, indicated the presence of a minimum of 19 genes within the C2-HLA-B interval of the MHC class III region. Although the functional properties of most of these genes are as yet unknown, they may be involved in some aspects of immunity. This idea is supported by the genetic mapping of the hematopoietic histocompatibility locus-1 (Hh-1) in recombinant mice between TNFα and H-2S, which is homologous to the complement gene cluster in humans.

  20. Microbial communication leading to the activation of silent fungal secondary metabolite gene clusters

    Directory of Open Access Journals (Sweden)

    Tina Netzker

    2015-04-01

    Full Text Available Microorganisms form diverse multispecies communities in various ecosystems. The high abundance of fungal and bacterial species in these consortia results in specific communication between the microorganisms. A key role in this communication is played by secondary metabolites (SMs), which are also called natural products. Recently, it was shown that interspecies ‘talk’ between microorganisms represents a physiological trigger to activate silent gene clusters leading to the formation of novel SMs by the involved species. This review focuses on mixed microbial cultivation, mainly between bacteria and fungi, with a special emphasis on the induced formation of fungal SMs in co-cultures. In addition, the role of chromatin remodeling in the induction is examined, and methodical perspectives for the analysis of natural products are presented. As an example for an intermicrobial interaction elucidated at the molecular level, we discuss the specific interaction between the filamentous fungi Aspergillus nidulans and Aspergillus fumigatus with the soil bacterium Streptomyces rapamycinicus, which provides an excellent model system to enlighten molecular concepts behind regulatory mechanisms and will pave the way to a novel avenue of drug discovery through targeted activation of silent SM gene clusters through co-cultivations of microorganisms.

  1. Directed natural product biosynthesis gene cluster capture and expression in the model bacterium Bacillus subtilis

    Science.gov (United States)

    Li, Yongxin; Li, Zhongrui; Yamanaka, Kazuya; Xu, Ying; Zhang, Weipeng; Vlamakis, Hera; Kolter, Roberto; Moore, Bradley S.; Qian, Pei-Yuan

    2015-03-01

    Bacilli are ubiquitous low G+C environmental Gram-positive bacteria that produce a wide assortment of specialized small molecules. Although their natural product biosynthetic potential is high, robust molecular tools to support the heterologous expression of large biosynthetic gene clusters in Bacillus hosts are rare. Herein we adapt transformation-associated recombination (TAR) in yeast to design a single genomic capture and expression vector for antibiotic production in Bacillus subtilis. After validating this direct cloning "plug-and-play" approach with surfactin, we genetically interrogated amicoumacin biosynthetic gene cluster from the marine isolate Bacillus subtilis 1779. Its heterologous expression allowed us to explore an unusual maturation process involving the N-acyl-asparagine pro-drug intermediates preamicoumacins, which are hydrolyzed by the asparagine-specific peptidase into the active component amicoumacin A. This work represents the first direct cloning based heterologous expression of natural products in the model organism B. subtilis and paves the way to the development of future genome mining efforts in this genus.

  2. Heterologous expression of the Halothiobacillus neapolitanus carboxysomal gene cluster in Corynebacterium glutamicum.

    Science.gov (United States)

    Baumgart, Meike; Huber, Isabel; Abdollahzadeh, Iman; Gensch, Thomas; Frunzke, Julia

    2017-09-20

    Compartmentalization represents a ubiquitous principle used by living organisms to optimize metabolic flux and to avoid detrimental interactions within the cytoplasm. Proteinaceous bacterial microcompartments (BMCs) have therefore created strong interest for the encapsulation of heterologous pathways in microbial model organisms. However, attempts were so far mostly restricted to Escherichia coli. Here, we introduced the carboxysomal gene cluster of Halothiobacillus neapolitanus into the biotechnological platform species Corynebacterium glutamicum. Transmission electron microscopy, fluorescence microscopy and single molecule localization microscopy suggested the formation of BMC-like structures in cells expressing the complete carboxysome operon or only the shell proteins. Purified carboxysomes consisted of the expected protein components as verified by mass spectrometry. Enzymatic assays revealed the functional production of RuBisCO in C. glutamicum both in the presence and absence of carboxysomal shell proteins. Furthermore, we could show that eYFP is targeted to the carboxysomes by fusion to the large RuBisCO subunit. Overall, this study represents the first transfer of an α-carboxysomal gene cluster into a Gram-positive model species supporting the modularity and orthogonality of these microcompartments, but also identified important challenges which need to be addressed on the way towards biotechnological application. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Directed natural product biosynthesis gene cluster capture and expression in the model bacterium Bacillus subtilis

    KAUST Repository

    Li, Yongxin

    2015-03-24

    Bacilli are ubiquitous low G+C environmental Gram-positive bacteria that produce a wide assortment of specialized small molecules. Although their natural product biosynthetic potential is high, robust molecular tools to support the heterologous expression of large biosynthetic gene clusters in Bacillus hosts are rare. Herein we adapt transformation-associated recombination (TAR) in yeast to design a single genomic capture and expression vector for antibiotic production in Bacillus subtilis. After validating this direct cloning "plug-and-play" approach with surfactin, we genetically interrogated amicoumacin biosynthetic gene cluster from the marine isolate Bacillus subtilis 1779. Its heterologous expression allowed us to explore an unusual maturation process involving the N-acyl-asparagine pro-drug intermediates preamicoumacins, which are hydrolyzed by the asparagine-specific peptidase into the active component amicoumacin A. This work represents the first direct cloning based heterologous expression of natural products in the model organism B. subtilis and paves the way to the development of future genome mining efforts in this genus.

  4. Microbial communication leading to the activation of silent fungal secondary metabolite gene clusters.

    Science.gov (United States)

    Netzker, Tina; Fischer, Juliane; Weber, Jakob; Mattern, Derek J; König, Claudia C; Valiante, Vito; Schroeckh, Volker; Brakhage, Axel A

    2015-01-01

    Microorganisms form diverse multispecies communities in various ecosystems. The high abundance of fungal and bacterial species in these consortia results in specific communication between the microorganisms. A key role in this communication is played by secondary metabolites (SMs), which are also called natural products. Recently, it was shown that interspecies "talk" between microorganisms represents a physiological trigger to activate silent gene clusters leading to the formation of novel SMs by the involved species. This review focuses on mixed microbial cultivation, mainly between bacteria and fungi, with a special emphasis on the induced formation of fungal SMs in co-cultures. In addition, the role of chromatin remodeling in the induction is examined, and methodical perspectives for the analysis of natural products are presented. As an example for an intermicrobial interaction elucidated at the molecular level, we discuss the specific interaction between the filamentous fungi Aspergillus nidulans and Aspergillus fumigatus with the soil bacterium Streptomyces rapamycinicus, which provides an excellent model system to enlighten molecular concepts behind regulatory mechanisms and will pave the way to a novel avenue of drug discovery through targeted activation of silent SM gene clusters through co-cultivations of microorganisms.

  5. Global analysis of biosynthetic gene clusters reveals vast potential of secondary metabolite production in Penicillium species.

    Science.gov (United States)

    Nielsen, Jens Christian; Grijseels, Sietske; Prigent, Sylvain; Ji, Boyang; Dainat, Jacques; Nielsen, Kristian Fog; Frisvad, Jens Christian; Workman, Mhairi; Nielsen, Jens

    2017-04-03

    Filamentous fungi produce a wide range of bioactive compounds with important pharmaceutical applications, such as antibiotic penicillins and cholesterol-lowering statins. However, less attention has been paid to fungal secondary metabolites compared to those from bacteria. In this study, we sequenced the genomes of 9 Penicillium species and, together with 15 published genomes, we investigated the secondary metabolism of Penicillium and identified an immense, unexploited potential for producing secondary metabolites by this genus. A total of 1,317 putative biosynthetic gene clusters (BGCs) were identified, and polyketide synthase and non-ribosomal peptide synthetase based BGCs were grouped into gene cluster families and mapped to known pathways. The grouping of BGCs allowed us to study the evolutionary trajectory of pathways based on 6-methylsalicylic acid (6-MSA) synthases. Finally, we cross-referenced the predicted pathways with published data on the production of secondary metabolites and experimentally validated the production of antibiotic yanuthones in Penicillia and identified a previously undescribed compound from the yanuthone pathway. This study is the first genus-wide analysis of the genomic diversity of Penicillia and highlights the potential of these species as a source of new antibiotics and other pharmaceuticals.

  6. Nonblack patients with sickle cell disease have African β^s gene cluster haplotypes

    Energy Technology Data Exchange (ETDEWEB)

    Rogers, Z.R.; Powars, D.R.; Williams, W.D. (Univ. of Southern California School of Medicine, Los Angeles (USA)); Kinney, T.R. (Duke Univ., Durham, NC (USA)); Schroeder, W.A. (California Institute of Technology, Pasadena (USA))

    1989-05-26

    Of 18 nonblack patients with sickle cell disease, 14 had sickle cell anemia, 2 had hemoglobin SC disease, and 2 had hemoglobin S-β^0-thalassemia. The β^s gene cluster haplotypes that were determined in 7 patients were of African origin and were identified as Central African Republic, Central African Republic minor II, Benin, and Senegal. The haplotype Central African Republic minor II was present on the β^0-thalassemia chromosome in 2 patients. None of 10 patients whose α-gene status was determined had α-thalassemia-2. These data strongly support the concept that the β^s gene on chromosome 11 of these individuals is of African origin and that the α-gene locus on chromosome 16 is of white or native American origin. The clinical severity of the disease in these nonblack patients is appropriate to their haplotype without α-thalassemia-2 and is comparable with that of black patients. All persons with congenital hemolytic anemia should be examined for the presence of sickle cell disease regardless of physical appearance or ethnic background.

  7. Fungal metabolic gene clusters – caravans traveling across genomes and environments

    Directory of Open Access Journals (Sweden)

    Jennifer Hughes Wisecaver

    2015-03-01

    Full Text Available Metabolic gene clusters (MGCs), physically co-localized genes participating in the same metabolic pathway, are signature features of fungal genomes. MGCs are most often observed in specialized metabolism, having evolved in individual fungal lineages in response to specific ecological needs, such as the utilization of uncommon nutrients (e.g., galactose and allantoin) or the production of secondary metabolic antimicrobial compounds and virulence factors (e.g., aflatoxin and melanin). A flurry of recent studies has shown that several MGCs, whose functions are often associated with fungal virulence as well as with the evolutionary arms race between fungi and their competitors, have experienced horizontal gene transfer (HGT). In this minireview, after briefly introducing HGT as a source of gene innovation, we examine the evidence for HGT’s involvement in the evolution of MGCs and, more generally, of fungal metabolism, enumerate the molecular mechanisms that mediate such transfers and the ecological circumstances that favor them, and discuss the types of evidence required for inferring the presence of HGT in MGCs. The currently available examples indicate that transfers of entire MGCs have taken place between closely related fungal species as well as distant ones and that they sometimes involve large chromosomal segments. These results suggest that the HGT-mediated acquisition of novel metabolism is an ongoing and successful ecological strategy for many fungal species.

  8. Plasmid Complement of Lactococcus lactis NCDO712 Reveals a Novel Pilus Gene Cluster.

    Science.gov (United States)

    Tarazanova, Mariya; Beerthuyzen, Marke; Siezen, Roland; Fernandez-Gutierrez, Marcela M; de Jong, Anne; van der Meulen, Sjoerd; Kok, Jan; Bachmann, Herwig

    2016-01-01

    Lactococcus lactis MG1363 is an important gram-positive model organism. It is a plasmid-free and phage-cured derivative of strain NCDO712. Plasmid-cured strains facilitate studies on molecular biological aspects, but many properties which make L. lactis an important organism in the dairy industry are plasmid encoded. We sequenced the total DNA of strain NCDO712 and, contrary to earlier reports, revealed that the strain carries 6 rather than 5 plasmids. A new 50-kb plasmid, designated pNZ712, encodes functional nisin immunity (nisCIP) and copper resistance (lcoRSABC). The copper resistance could be used as a marker for the conjugation of pNZ712 to L. lactis MG1614. A genome comparison with the plasmid-cured daughter strain MG1363 showed that the number of single nucleotide polymorphisms that accumulated in the laboratory since the strains diverged more than 30 years ago is limited to 11, of which only 5 lead to amino acid changes. The 16-kb plasmid pSH74 was found to contain a novel 8-kb pilus gene cluster spaCB-spaA-srtC1-srtC2, which is predicted to encode a pilin tip protein SpaC, a pilus basal subunit SpaB, and a pilus backbone protein SpaA. The sortases SrtC1/SrtC2 are most likely involved in pilus polymerization while the chromosomally encoded SrtA could act to anchor the pilus to peptidoglycan in the cell wall. Overexpression of the pilus gene cluster from a multi-copy plasmid in L. lactis MG1363 resulted in cell chaining, aggregation, rapid sedimentation and increased conjugation efficiency of the cells. Electron microscopy showed that the overexpression of the pilus gene cluster leads to appendages on the cell surfaces. A deletion of the gene encoding the putative basal protein spaB, by truncating spaCB, led to more pilus-like structures on the cell surface, but cell aggregation and cell chaining were no longer observed. This is consistent with the prediction that spaB is involved in the anchoring of the pili to the cell.

  9. The biosynthetic gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor contains its co-expressed vacuolar MATE transporter

    DEFF Research Database (Denmark)

    Darbani Shirvanehdeh, Behrooz; Motawie, Mohammed Saddik; Olsen, Carl Erik

    2016-01-01

    for the cyanogenic glucoside dhurrin in Sorghum bicolor additionally contains a gene, SbMATE2, encoding a transporter of the multidrug and toxic compound extrusion (MATE) family, which is co-expressed with the biosynthetic genes. The predicted localisation of SbMATE2 to the vacuolar membrane was demonstrated......-glucoside or the glucosinolate indol-3-yl-methyl glucosinolate. The genomic co-localisation of a transporter gene with the biosynthetic genes producing the transported compound is discussed in relation to the role self-toxicity of chemical defence compounds may play in the formation of gene clusters....

  10. Genomic characterization of a new endophytic Streptomyces kebangsaanensis identifies biosynthetic pathway gene clusters for novel phenazine antibiotic production

    Directory of Open Access Journals (Sweden)

    Juwairiah Remali

    2017-11-01

    Full Text Available Background Streptomyces are well known for their capability to produce many bioactive secondary metabolites with medical and industrial importance. Here we report a novel bioactive phenazine compound, 6-((2-hydroxy-4-methoxyphenoxy)carbonyl)phenazine-1-carboxylic acid (HCPCA), extracted from Streptomyces kebangsaanensis, an endophyte isolated from the ethnomedicinal Portulaca oleracea. Methods The HCPCA chemical structure was determined using nuclear magnetic resonance spectroscopy. We conducted whole genome sequencing for the identification of the gene cluster(s) believed to be responsible for phenazine biosynthesis in order to map its corresponding pathway, in addition to bioinformatics analysis to assess the potential of S. kebangsaanensis in producing other useful secondary metabolites. Results The S. kebangsaanensis genome comprises an 8,328,719 bp linear chromosome with high GC content (71.35%) consisting of 12 rRNA operons, 81 tRNA, and 7,558 protein coding genes. We identified 24 gene clusters involved in polyketide, nonribosomal peptide, terpene, bacteriocin, and siderophore biosynthesis, as well as a gene cluster predicted to be responsible for phenazine biosynthesis. Discussion The HCPCA phenazine structure was hypothesized to derive from the combination of two biosynthetic pathways, phenazine-1,6-dicarboxylic acid and 4-methoxybenzene-1,2-diol, originated from the shikimic acid pathway. The identification of a biosynthesis pathway gene cluster for phenazine antibiotics might facilitate future genetic engineering design of new synthetic phenazine antibiotics. Additionally, these findings confirm the potential of S. kebangsaanensis for producing various antibiotics and secondary metabolites.

  11. Degradation of Benzene by Pseudomonas veronii 1YdBTEX2 and 1YB2 Is Catalyzed by Enzymes Encoded in Distinct Catabolism Gene Clusters.

    Science.gov (United States)

    de Lima-Morales, Daiana; Chaves-Moreno, Diego; Wos-Oxley, Melissa L; Jáuregui, Ruy; Vilchez-Vargas, Ramiro; Pieper, Dietmar H

    2016-01-01

    Pseudomonas veronii 1YdBTEX2, a benzene and toluene degrader, and Pseudomonas veronii 1YB2, a benzene degrader, have previously been shown to be key players in a benzene-contaminated site. These strains harbor unique catabolic pathways for the degradation of benzene comprising a gene cluster encoding an isopropylbenzene dioxygenase where genes encoding downstream enzymes were interrupted by stop codons. Extradiol dioxygenases were recruited from gene clusters comprising genes encoding a 2-hydroxymuconic semialdehyde dehydrogenase necessary for benzene degradation but typically absent from isopropylbenzene dioxygenase-encoding gene clusters. The benzene dihydrodiol dehydrogenase-encoding gene was not clustered with any other aromatic degradation genes, and the encoded protein was only distantly related to dehydrogenases of aromatic degradation pathways. The involvement of the different gene clusters in the degradation pathways was suggested by real-time quantitative reverse transcription PCR. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  12. Identification of the Regulator Gene Responsible for the Acetone-Responsive Expression of the Binuclear Iron Monooxygenase Gene Cluster in Mycobacteria

    Science.gov (United States)

    Furuya, Toshiki; Hirose, Satomi; Semba, Hisashi; Kino, Kuniki

    2011-01-01

    The mimABCD gene cluster encodes the binuclear iron monooxygenase that oxidizes propane and phenol in Mycobacterium smegmatis strain MC2 155 and Mycobacterium goodii strain 12523. Interestingly, expression of the mimABCD gene cluster is induced by acetone. In this study, we investigated the regulator gene responsible for this acetone-responsive expression. In the genome sequence of M. smegmatis strain MC2 155, the mimABCD gene cluster is preceded by a gene designated mimR, which is divergently transcribed. Sequence analysis revealed that MimR exhibits amino acid similarity with the NtrC family of transcriptional activators, including AcxR and AcoR, which are involved in acetone and acetoin metabolism, respectively. Unexpectedly, many homologs of the mimR gene were also found in the sequenced genomes of actinomycetes. A plasmid carrying a transcriptional fusion of the intergenic region between the mimR and mimA genes with a promoterless green fluorescent protein (GFP) gene was constructed and introduced into M. smegmatis strain MC2 155. Using a GFP reporter system, we confirmed by deletion and complementation analyses that the mimR gene product is the positive regulator of the mimABCD gene cluster expression that is responsive to acetone. M. goodii strain 12523 also utilized the same regulatory system as M. smegmatis strain MC2 155. Although transcriptional activators of the NtrC family generally control transcription using the σ54 factor, a gene encoding the σ54 factor was absent from the genome sequence of M. smegmatis strain MC2 155. These results suggest the presence of a novel regulatory system in actinomycetes, including mycobacteria. PMID:21856847

  13. Regulatory role of tetR gene in a novel gene cluster of Acidovorax avenae subsp. avenae RS-1 under oxidative stress

    Directory of Open Access Journals (Sweden)

    He Liu

    2014-10-01

    Full Text Available Acidovorax avenae subsp. avenae is the causal agent of bacterial brown stripe disease in rice. In this study, we characterized a novel horizontal transfer of a gene cluster, including tetR, on the chromosome of A. avenae subsp. avenae RS-1 by genome-wide analysis. TetR acted as a repressor in this gene cluster and the oxidative stress resistance was enhanced in tetR-deletion mutant strain. Electrophoretic mobility shift assay (EMSA) demonstrated that TetR regulator bound directly to the promoter of this gene cluster. Consistently, the results of quantitative real-time PCR also showed alterations in expression of associated genes. Moreover, the proteins affected by TetR under oxidative stress were revealed by comparing proteomic profiles of wild-type and mutant strains via 1D SDS-PAGE and LC-MS/MS analyses. Taken together, our results demonstrated that tetR gene in this novel gene cluster contributed to cell survival under oxidative stress, and TetR protein played an important regulatory role in growth kinetics, biofilm-forming capability, SOD and catalase activity, and oxide detoxicating ability.

  14. Regulatory role of tetR gene in a novel gene cluster of Acidovorax avenae subsp. avenae RS-1 under oxidative stress.

    Science.gov (United States)

    Liu, He; Yang, Chun-Lan; Ge, Meng-Yu; Ibrahim, Muhammad; Li, Bin; Zhao, Wen-Jun; Chen, Gong-You; Zhu, Bo; Xie, Guan-Lin

    2014-01-01

    Acidovorax avenae subsp. avenae is the causal agent of bacterial brown stripe disease in rice. In this study, we characterized a novel horizontal transfer of a gene cluster, including tetR, on the chromosome of A. avenae subsp. avenae RS-1 by genome-wide analysis. TetR acted as a repressor in this gene cluster and the oxidative stress resistance was enhanced in tetR-deletion mutant strain. Electrophoretic mobility shift assay demonstrated that TetR regulator bound directly to the promoter of this gene cluster. Consistently, the results of quantitative real-time PCR also showed alterations in expression of associated genes. Moreover, the proteins affected by TetR under oxidative stress were revealed by comparing proteomic profiles of wild-type and mutant strains via 1D SDS-PAGE and LC-MS/MS analyses. Taken together, our results demonstrated that tetR gene in this novel gene cluster contributed to cell survival under oxidative stress, and TetR protein played an important regulatory role in growth kinetics, biofilm-forming capability, superoxide dismutase and catalase activity, and oxide detoxicating ability.

  15. A gene cluster for the biosynthesis of moenomycin family antibiotics in the genome of teicoplanin producer Actinoplanes teichomyceticus.

    Science.gov (United States)

    Horbal, Liliya; Ostash, Bohdan; Luzhetskyy, Andriy; Walker, Suzanne; Kalinowski, Jorn; Fedorenko, Victor

    2016-09-01

    Moenomycins are phosphoglycolipid antibiotics notable for their extreme potency, unique mode of action, and proven record of use in animal nutrition without selection for resistant microflora. There is a keen interest in manipulation of structures of moenomycins in order to better understand their structure-activity relationships and to generate improved analogs. Only two almost identical moenomycin biosynthetic gene clusters are known, limiting our knowledge of the evolution of moenomycin pathways and our ability to genetically diversify them. Here, we report a novel gene cluster (tchm) that directs production of the phosphoglycolipid teichomycin in Actinoplanes teichomyceticus. Its overall genetic architecture is significantly different from that of the moenomycin biosynthesis (moe) gene clusters of Streptomyces ghanaensis and Streptomyces clavuligerus, featuring multiple gene rearrangements and two novel structural genes. Involvement of the tchm cluster in teichomycin biosynthesis was confirmed via heterologous co-expression of amidotransferase tchmH5 and moe genes. Our work sets the background for further engineering of moenomycins and for deeper inquiries into the evolution of this fascinating biosynthetic pathway.

  16. Ancient Expansion of the Hox Cluster in Lepidoptera Generated Four Homeobox Genes Implicated in Extra-Embryonic Tissue Formation

    Science.gov (United States)

    Taylor, William R.; Gibbs, Melanie; Breuker, Casper J.; Holland, Peter W. H.

    2014-01-01

    Gene duplications within the conserved Hox cluster are rare in animal evolution, but in Lepidoptera an array of divergent Hox-related genes (Shx genes) has been reported between pb and zen. Here, we use genome sequencing of five lepidopteran species (Polygonia c-album, Pararge aegeria, Callimorpha dominula, Cameraria ohridella, Hepialus sylvina) plus a caddisfly outgroup (Glyphotaelius pellucidus) to trace the evolution of the lepidopteran Shx genes. We demonstrate that Shx genes originated by tandem duplication of zen early in the evolution of large clade Ditrysia; Shx are not found in a caddisfly and a member of the basally diverging Hepialidae (swift moths). Four distinct Shx genes were generated early in ditrysian evolution, and were stably retained in all descendent Lepidoptera except the silkmoth which has additional duplications. Despite extensive sequence divergence, molecular modelling indicates that all four Shx genes have the potential to encode stable homeodomains. The four Shx genes have distinct spatiotemporal expression patterns in early development of the Speckled Wood butterfly (Pararge aegeria), with ShxC demarcating the future sites of extraembryonic tissue formation via strikingly localised maternal RNA in the oocyte. All four genes are also expressed in presumptive serosal cells, prior to the onset of zen expression. Lepidopteran Shx genes represent an unusual example of Hox cluster expansion and integration of novel genes into ancient developmental regulatory networks. PMID:25340822

  17. Evolution of C2H2-zinc finger genes and subfamilies in mammals: Species-specific duplication and loss of clusters, genes and effector domains

    Directory of Open Access Journals (Sweden)

    Aubry Muriel

    2008-06-01

    Full Text Available Abstract Background C2H2 zinc finger genes (C2H2-ZNF) constitute the largest class of transcription factors in humans and one of the largest gene families in mammals. Often arranged in clusters in the genome, these genes are thought to have undergone a massive expansion in vertebrates, primarily by tandem duplication. However, this view is based on limited datasets restricted to a single chromosome or a specific subset of genes belonging to the large KRAB domain-containing C2H2-ZNF subfamily. Results Here, we present the first comprehensive study of the evolution of the C2H2-ZNF family in mammals. We assembled the complete repertoire of human C2H2-ZNF genes (718 in total), about 70% of which are organized into 81 clusters across all chromosomes. Based on an analysis of their N-terminal effector domains, we identified two new C2H2-ZNF subfamilies encoding genes with a SET or a HOMEO domain. We searched for the syntenic counterparts of the human clusters in other mammals for which complete gene data are available: chimpanzee, mouse, rat and dog. Cross-species comparisons show a large variation in the numbers of C2H2-ZNF genes within homologous mammalian clusters, suggesting differential patterns of evolution. Phylogenetic analysis of selected clusters reveals that the disparity in C2H2-ZNF gene repertoires across mammals not only originates from differential gene duplication but also from gene loss. Further, we discovered variations among orthologs in the number of zinc finger motifs and association of the effector domains, the latter often undergoing sequence degeneration. Combined with phylogenetic studies, physical maps and an analysis of the exon-intron organization of genes from the SCAN and KRAB domains-containing subfamilies, this result suggests that the SCAN subfamily emerged first, followed by the SCAN-KRAB and finally by the KRAB subfamily. Conclusion Our results are in agreement with the "birth and death hypothesis" for the evolution of

  18. MicroRNAs located in the Hox gene clusters are implicated in Huntington's disease pathogenesis.

    Directory of Open Access Journals (Sweden)

    Andrew G Hoss

    2014-02-01

    Full Text Available Transcriptional dysregulation has long been recognized as central to the pathogenesis of Huntington's disease (HD). MicroRNAs (miRNAs) represent a major system of post-transcriptional regulation, by either preventing translational initiation or by targeting transcripts for storage or for degradation. Using next-generation miRNA sequencing in prefrontal cortex (Brodmann Area 9) of twelve HD and nine controls, we identified five miRNAs (miR-10b-5p, miR-196a-5p, miR-196b-5p, miR-615-3p and miR-1247-5p) up-regulated in HD at genome-wide significance (FDR q-value<0.05). Three of these, miR-196a-5p, miR-196b-5p and miR-615-3p, were expressed at near zero levels in control brains. Expression was verified for all five miRNAs using reverse transcription quantitative PCR and all but miR-1247-5p were replicated in an independent sample (8HD/8C). Ectopic miR-10b-5p expression in PC12 HTT-Q73 cells increased survival by MTT assay and cell viability staining suggesting increased expression may be a protective response. All of the miRNAs but miR-1247-5p are located in intergenic regions of Hox clusters. Total mRNA sequencing in the same samples identified fifteen of 55 genes within the Hox cluster gene regions as differentially expressed in HD, and the Hox genes immediately adjacent to the four Hox cluster miRNAs as up-regulated. Pathway analysis of mRNA targets of these miRNAs implicated functions for neuronal differentiation, neurite outgrowth, cell death and survival. In regression models among the HD brains, huntingtin CAG repeat size, onset age and age at death were independently found to be inversely related to miR-10b-5p levels. CAG repeat size and onset age were independently inversely related to miR-196a-5p, onset age was inversely related to miR-196b-5p and age at death was inversely related to miR-615-3p expression. These results suggest these Hox-related miRNAs may be involved in neuroprotective response in HD. Recently, miRNAs have shown promise as

  19. Unsupervised clustering of gene expression data points at hypoxia as possible trigger for metabolic syndrome

    Directory of Open Access Journals (Sweden)

    York David

    2006-12-01

    Full Text Available Abstract Background Classification of large volumes of data produced in a microarray experiment allows for the extraction of important clues as to the nature of a disease. Results Using the multi-dimensional unsupervised FOREL (FORmal ELement) algorithm we have re-analyzed three public datasets of skeletal muscle gene expression in connection with insulin resistance and type 2 diabetes (DM2). Our analysis revealed the major line of variation between expression profiles of normal, insulin resistant, and diabetic skeletal muscle. A cluster of most "metabolically sound" samples occupied one end of this line. The distance along this line coincided with the classic markers of diabetes risk, namely obesity and insulin resistance, but did not follow the accepted clinical diagnosis of DM2 as defined by the presence or absence of hyperglycemia. Genes implicated in this expression pattern are those controlling skeletal muscle fiber type and glycolytic metabolism. Additionally myoglobin and hemoglobin were upregulated and ribosomal genes deregulated in insulin resistant patients. Conclusion Our findings are concordant with the changes seen in skeletal muscle with altitude hypoxia. This suggests that hypoxia and shift to glycolytic metabolism may also drive insulin resistance.
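    The FOREL procedure named in this abstract can be illustrated with a minimal sketch of the textbook variant: a sphere of fixed radius is moved to the centroid of the points it contains until it stops moving, those points are taken out as one cluster, and the process repeats on what is left. The Euclidean distance, the function name `forel`, the `max_iter` guard and the toy points are assumptions for illustration; the abstract does not describe the authors' multi-dimensional implementation.

```python
from math import dist


def forel(points, radius, max_iter=1000):
    """Textbook FOREL sketch: returns a list of clusters (lists of point tuples).

    Assumes Euclidean distance and a fixed sphere radius; both are
    illustrative choices, not details taken from the abstract.
    """
    remaining = list(points)
    clusters = []
    while remaining:
        # Start the sphere at an arbitrary unclustered point (here: the first).
        center = remaining[0]
        for _ in range(max_iter):
            # Points currently inside the sphere.
            members = [p for p in remaining if dist(p, center) <= radius]
            # Move the sphere to the centroid of those points.
            new_center = tuple(sum(c) / len(members) for c in zip(*members))
            if new_center == center:
                break  # the sphere has stabilised
            center = new_center
        clusters.append(members)
        # Remove the clustered points and continue with the rest.
        remaining = [p for p in remaining if p not in members]
    return clusters


if __name__ == "__main__":
    toy = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11)]  # hypothetical data
    for cluster in forel(toy, radius=2.0):
        print(cluster)
```

    The number of clusters is not fixed in advance; it follows from the chosen radius, which is why the method is usually run over a range of radii.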

  20. Sexuality Generates Diversity in the Aflatoxin Gene Cluster: Evidence on a Global Scale

    Science.gov (United States)

    Moore, Geromy G.; Elliott, Jacalyn L.; Singh, Rakhi; Horn, Bruce W.; Dorner, Joe W.; Stone, Eric A.; Chulze, Sofia N.; Barros, German G.; Naik, Manjunath K.; Wright, Graeme C.; Hell, Kerstin; Carbone, Ignazio

    2013-01-01

    Aflatoxins are produced by Aspergillus flavus and A. parasiticus in oil-rich seed and grain crops and are a serious problem in agriculture, with aflatoxin B1 being the most carcinogenic natural compound known. Sexual reproduction in these species occurs between individuals belonging to different vegetative compatibility groups (VCGs). We examined natural genetic variation in 758 isolates of A. flavus, A. parasiticus and A. minisclerotigenes sampled from single peanut fields in the United States (Georgia), Africa (Benin), Argentina (Córdoba), Australia (Queensland) and India (Karnataka). Analysis of DNA sequence variation across multiple intergenic regions in the aflatoxin gene clusters of A. flavus, A. parasiticus and A. minisclerotigenes revealed significant linkage disequilibrium (LD) organized into distinct blocks that are conserved across different localities, suggesting that genetic recombination is nonrandom and a global occurrence. To assess the contributions of asexual and sexual reproduction to fixation and maintenance of toxin chemotype diversity in populations from each locality/species, we tested the null hypothesis of an equal number of MAT1-1 and MAT1-2 mating-type individuals, which is indicative of a sexually recombining population. All samples were clone-corrected using multi-locus sequence typing which associates closely with VCG. For both A. flavus and A. parasiticus, when the proportions of MAT1-1 and MAT1-2 were significantly different, there was more extensive LD in the aflatoxin cluster and populations were fixed for specific toxin chemotype classes, either the non-aflatoxigenic class in A. flavus or the B1-dominant and G1-dominant classes in A. parasiticus. A mating type ratio close to 1∶1 in A. flavus, A. parasiticus and A. minisclerotigenes was associated with higher recombination rates in the aflatoxin cluster and less pronounced chemotype differences in populations. This work shows that the reproductive nature of the population (more
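    The null hypothesis of equal numbers of MAT1-1 and MAT1-2 individuals mentioned in this abstract can be checked with an exact two-sided binomial test. The sketch below is an assumption about how such a test might be run, not the authors' procedure; the function name `mating_type_binomial_test` and the isolate counts in the usage lines are hypothetical.

```python
from math import comb


def mating_type_binomial_test(n_mat1_1: int, n_mat1_2: int) -> float:
    """Two-sided exact binomial p-value for H0: both mating types equally frequent."""
    n = n_mat1_1 + n_mat1_2
    if n == 0:
        raise ValueError("need at least one isolate")
    # Under H0 (p = 0.5) an outcome with k MAT1-1 isolates has probability C(n, k) / 2^n.
    probs = [comb(n, k) / 2**n for k in range(n + 1)]
    observed = probs[n_mat1_1]
    # Sum the probabilities of all outcomes no more likely than the observed one.
    p_value = sum(p for p in probs if p <= observed * (1 + 1e-9))
    return min(1.0, p_value)


if __name__ == "__main__":
    # Hypothetical clone-corrected counts, for illustration only.
    print(mating_type_binomial_test(18, 22))  # roughly balanced sample
    print(mating_type_binomial_test(35, 5))   # strongly skewed sample
```

    A small p-value argues against a 1:1 ratio, which the abstract associates with more extensive LD and fixed chemotype classes.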

  1. Sexuality generates diversity in the aflatoxin gene cluster: evidence on a global scale.

    Directory of Open Access Journals (Sweden)

    Geromy G Moore

    Full Text Available Aflatoxins are produced by Aspergillus flavus and A. parasiticus in oil-rich seed and grain crops and are a serious problem in agriculture, with aflatoxin B₁ being the most carcinogenic natural compound known. Sexual reproduction in these species occurs between individuals belonging to different vegetative compatibility groups (VCGs). We examined natural genetic variation in 758 isolates of A. flavus, A. parasiticus and A. minisclerotigenes sampled from single peanut fields in the United States (Georgia), Africa (Benin), Argentina (Córdoba), Australia (Queensland) and India (Karnataka). Analysis of DNA sequence variation across multiple intergenic regions in the aflatoxin gene clusters of A. flavus, A. parasiticus and A. minisclerotigenes revealed significant linkage disequilibrium (LD) organized into distinct blocks that are conserved across different localities, suggesting that genetic recombination is nonrandom and a global occurrence. To assess the contributions of asexual and sexual reproduction to fixation and maintenance of toxin chemotype diversity in populations from each locality/species, we tested the null hypothesis of an equal number of MAT1-1 and MAT1-2 mating-type individuals, which is indicative of a sexually recombining population. All samples were clone-corrected using multi-locus sequence typing which associates closely with VCG. For both A. flavus and A. parasiticus, when the proportions of MAT1-1 and MAT1-2 were significantly different, there was more extensive LD in the aflatoxin cluster and populations were fixed for specific toxin chemotype classes, either the non-aflatoxigenic class in A. flavus or the B₁-dominant and G₁-dominant classes in A. parasiticus. A mating type ratio close to 1∶1 in A. flavus, A. parasiticus and A. minisclerotigenes was associated with higher recombination rates in the aflatoxin cluster and less pronounced chemotype differences in populations. This work shows that the reproductive nature of

  2. Regulatory role of tetR gene in a novel gene cluster of Acidovorax avenae subsp. avenae RS-1 under oxidative stress

    OpenAIRE

    Liu, He; Yang, Chun-Lan; Ge, Meng-Yu; Ibrahim, Muhammad; Li, Bin; Zhao, Wen-Jun; Chen, Gong-You; Zhu, Bo; Xie, Guan-Lin

    2014-01-01

    Acidovorax avenae subsp. avenae is the causal agent of bacterial brown stripe disease in rice. In this study, we characterized a novel horizontal transfer of a gene cluster, including tetR, on the chromosome of A. avenae subsp. avenae RS-1 by genome-wide analysis. TetR acted as a repressor in this gene cluster and the oxidative stress resistance was enhanced in tetR-deletion mutant strain. Electrophoretic mobility shift assay demonstrated that TetR regulator bound directly to the promoter of ...

  3. antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters

    DEFF Research Database (Denmark)

    Weber, Tilmann; Blin, Kai; Duddela, Srikanth

    2015-01-01

    Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration...

  4. Global analysis of biosynthetic gene clusters reveals vast potential of secondary metabolite production in Penicillium species

    DEFF Research Database (Denmark)

    Nielsen, Jens Christian; Grijseels, Sietske; Prigent, Sylvain

    2017-01-01

    Filamentous fungi produce a wide range of bioactive compounds with important pharmaceutical applications, such as antibiotic penicillins and cholesterol-lowering statins. However, less attention has been paid to fungal secondary metabolites compared to those from bacteria. In this study, we sequenced the genomes of 9 Penicillium species and, together with 15 published genomes, we investigated the secondary metabolism of Penicillium and identified an immense, unexploited potential for producing secondary metabolites by this genus. A total of 1,317 putative biosynthetic gene clusters (BGCs) were... -referenced the predicted pathways with published data on the production of secondary metabolites and experimentally validated the production of antibiotic yanuthones in Penicillia and identified a previously undescribed compound from the yanuthone pathway. This study is the first genus-wide analysis of the genomic...

  5. antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification

    DEFF Research Database (Denmark)

    Blin, Kai; Wolf, Thomas; Chevrette, Marc G.

    2017-01-01

    Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the production of such compounds. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' has assisted researchers in efficiently performing this, both as a web server and a standalone tool. Here, we present the thoroughly updated antiSMASH version 4, which adds several novel features... architectures. Additionally, several usability features have been updated and improved. Together, these improvements make antiSMASH up-to-date with the latest developments in natural product research and will further facilitate computational genome mining for the discovery of novel bioactive molecules.

  6. Biosynthesis of antinutritional alkaloids in solanaceous crops is mediated by clustered genes.

    Science.gov (United States)

    Itkin, M; Heinig, U; Tzfadia, O; Bhide, A J; Shinde, B; Cardenas, P D; Bocobza, S E; Unger, T; Malitsky, S; Finkers, R; Tikunov, Y; Bovy, A; Chikate, Y; Singh, P; Rogachev, I; Beekwilder, J; Giri, A P; Aharoni, A

    2013-07-12

    Steroidal glycoalkaloids (SGAs) such as α-solanine found in solanaceous food plants--as, for example, potato--are antinutritional factors for humans. Comparative coexpression analysis between tomato and potato coupled with chemical profiling revealed an array of 10 genes that partake in SGA biosynthesis. We discovered that six of them exist as a cluster on chromosome 7, whereas an additional two are adjacent in a duplicated genomic region on chromosome 12. Following systematic functional analysis, we suggest a revised SGA biosynthetic pathway starting from cholesterol up to the tetrasaccharide moiety linked to the tomato SGA aglycone. Silencing GLYCOALKALOID METABOLISM 4 prevented accumulation of SGAs in potato tubers and tomato fruit. This may provide a means for removal of unsafe, antinutritional substances present in these widely used food crops.

  7. A systematic computational analysis of biosynthetic gene cluster evolution: lessons for engineering biosynthesis.

    Directory of Open Access Journals (Sweden)

    Marnix H Medema

    2014-12-01

    Full Text Available Bacterial secondary metabolites are widely used as antibiotics, anticancer drugs, insecticides and food additives. Attempts to engineer their biosynthetic gene clusters (BGCs) to produce unnatural metabolites with improved properties are often frustrated by the unpredictability and complexity of the enzymes that synthesize these molecules, suggesting that genetic changes within BGCs are limited by specific constraints. Here, by performing a systematic computational analysis of BGC evolution, we derive evidence for three findings that shed light on the ways in which, despite these constraints, nature successfully invents new molecules: 1) BGCs for complex molecules often evolve through the successive merger of smaller sub-clusters, which function as independent evolutionary entities. 2) An important subset of polyketide synthases and nonribosomal peptide synthetases evolve by concerted evolution, which generates sets of sequence-homogenized domains that may hold promise for engineering efforts since they exhibit a high degree of functional interoperability. 3) Individual BGC families evolve in distinct ways, suggesting that design strategies should take into account family-specific functional constraints. These findings suggest novel strategies for using synthetic biology to rationally engineer biosynthetic pathways.

  8. Cancer Transcriptome Dataset Analysis: Comparing Methods of Pathway and Gene Regulatory Network-Based Cluster Identification.

    Science.gov (United States)

    Nam, Seungyoon

    2017-04-01

    Cancer transcriptome analysis is one of the leading areas of Big Data science, biomarker, and pharmaceutical discovery, not to forget personalized medicine. Yet, cancer transcriptomics and postgenomic medicine require innovation in bioinformatics as well as comparison of the performance of available algorithms. In this data analytics context, the value of network generation and algorithms has been widely underscored for addressing the salient questions in cancer pathogenesis. Analysis of cancer transcriptome often results in complicated networks where identification of network modularity remains critical, for example, in delineating the "druggable" molecular targets. Network clustering is useful, but depends on the network topology in and of itself. Notably, the performance of different network-generating tools for network cluster (NC) identification has been little investigated to date. Hence, using gastric cancer (GC) transcriptomic datasets, we compared two algorithms for generating pathway versus gene regulatory network-based NCs, showing that the pathway-based approach better agrees with a reference set of cancer-functional contexts. Finally, by applying pathway-based NC identification to GC transcriptome datasets, we describe cancer NCs that associate with candidate therapeutic targets and biomarkers in GC. These observations collectively inform future research on cancer transcriptomics, drug discovery, and rational development of new analysis tools for optimal harnessing of omics data.

  9. Clustered Mutation Signatures Reveal that Error-Prone DNA Repair Targets Mutations to Active Genes.

    Science.gov (United States)

    Supek, Fran; Lehner, Ben

    2017-07-27

    Many processes can cause the same nucleotide change in a genome, making the identification of the mechanisms causing mutations a difficult challenge. Here, we show that clustered mutations provide a more precise fingerprint of mutagenic processes. Of nine clustered mutation signatures identified from >1,000 tumor genomes, three relate to variable APOBEC activity and three are associated with tobacco smoking. An additional signature matches the spectrum of translesion DNA polymerase eta (POLH). In lymphoid cells, these mutations target promoters, consistent with AID-initiated somatic hypermutation. In solid tumors, however, they are associated with UV exposure and alcohol consumption and target the H3K36me3 chromatin of active genes in a mismatch repair (MMR)-dependent manner. These regions normally have a low mutation rate because error-free MMR also targets H3K36me3 chromatin. Carcinogens and error-prone repair therefore redistribute mutations to the more important regions of the genome, contributing a substantial mutation load in many tumors, including driver mutations. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Biosynthetic gene clusters for relevant secondary metabolites produced by Penicillium roqueforti in blue cheeses.

    Science.gov (United States)

    García-Estrada, Carlos; Martín, Juan-Francisco

    2016-10-01

    Ripening of blue-veined cheeses, such as the French Bleu and Roquefort, the Italian Gorgonzola, the English Stilton, the Danish Danablu or the Spanish Cabrales, Picón Bejes-Tresviso, and Valdeón, requires the growth and enzymatic activity of the mold Penicillium roqueforti, which is responsible for the characteristic texture, blue-green spots, and aroma of these types of cheeses. This filamentous fungus is able to synthesize different secondary metabolites, including andrastins, mycophenolic acid, and several mycotoxins, such as roquefortines C and D, PR-toxin and eremofortins, isofumigaclavines A and B, and festuclavine. This review provides a detailed description of the main secondary metabolites produced by P. roqueforti in blue cheese, giving a special emphasis to roquefortine, PR-toxin and mycophenolic acid, and their biosynthetic gene clusters and pathways. The knowledge of these clusters and secondary metabolism pathways, together with the ability of P. roqueforti to produce beneficial secondary metabolites, is of interest for commercial purposes.

  11. Insights into variability of actinorhodopsin genes of the LG1 cluster in two different freshwater habitats.

    Directory of Open Access Journals (Sweden)

    Jitka Jezberová

    Full Text Available Actinorhodopsins (ActRs) are recently discovered proteorhodopsins present in Actinobacteria, enabling them to adapt to a wider spectrum of environmental conditions. Frequently, a large fraction of freshwater bacterioplankton belongs to the acI lineage of Actinobacteria and codes the LG1 type of ActRs. In this paper we studied the genotype variability of the LG1 ActRs. We have constructed two clone libraries originating from two environmentally different habitats located in Central Europe: the large alkaline lake Mondsee (Austria) and the small humic reservoir Jiřická (the Czech Republic). The 75 yielded clones were phylogenetically analyzed together with all ActR sequences currently available in public databases. Altogether 156 sequences were analyzed and 13 clusters of ActRs were distinguished. Newly obtained clones are distributed over all three LG1 subgroups--LG1-A, B and C. Eighty percent of the sequences belonged to the acI lineage (LG1-A ActR gene bearers) further divided into LG1-A1 and LG1-A2 subgroups. Interestingly, the two habitats markedly differed in genotype composition with no identical sequence found in both samples of clones. Moreover, Jiřická reservoir contained three so far not reported clusters, one of them LG1-C related, presenting thus completely new, so far undescribed, genotypes of Actinobacteria in freshwaters.

  12. Ancestral Variations of the PCDHG Gene Cluster Predispose to Dyslexia in a Multiplex Family

    Directory of Open Access Journals (Sweden)

    Teesta Naskar

    2018-02-01

    Full Text Available Dyslexia is a heritable neurodevelopmental disorder characterized by difficulties in reading and writing. In this study, we describe the identification of a set of 17 polymorphisms located across a 1.9 Mb region on chromosome 5q31.3, encompassing genes of the PCDHG cluster, TAF7, PCDH1 and ARHGAP26, dominantly inherited with dyslexia in a multi-incident family. Strikingly, the non-risk forms of seven variations of the PCDHG cluster are preponderant in the human lineage, while risk alleles are ancestral and conserved across Neanderthals to non-human primates. Four of these seven ancestral variations (c.460A > C [p.Ile154Leu], c.541G > A [p.Ala181Thr], c.2036G > C [p.Arg679Pro] and c.2059A > G [p.Lys687Glu]) result in amino acid alterations. p.Ile154Leu and p.Ala181Thr are present at the EC2:EC3 interacting interface of γA3-PCDH and γA4-PCDH respectively, and might affect trans-homophilic interaction and hence neuronal connectivity. p.Arg679Pro and p.Lys687Glu are present within the linker region connecting the trans-membrane to the extracellular domain. Sequence analysis indicated the importance of p.Ile154, p.Arg679 and p.Lys687 in maintaining class specificity. Thus the observed association of PCDHG genes encoding neural adhesion proteins reinforces the hypothesis of aberrant neuronal connectivity in the pathophysiology of dyslexia. Additionally, the striking conservation of the identified variants indicates a role of PCDHG in the evolution of highly specialized cognitive skills critical to reading.

  13. The Sound of Silence: Activating Silent Biosynthetic Gene Clusters in Marine Microorganisms

    Directory of Open Access Journals (Sweden)

    F. Jerry Reen

    2015-07-01

    Full Text Available Unlocking the rich harvest of marine microbial ecosystems has the potential to both safeguard the existence of our species for the future, while also presenting significant lifestyle benefits for commercial gain. However, while significant advances have been made in the field of marine biodiscovery, leading to the introduction of new classes of therapeutics for clinical medicine, cosmetics and industrial products, much of what this natural ecosystem has to offer is locked in, and essentially hidden from our screening methods. Releasing this silent potential represents a significant technological challenge, the key to which is a comprehensive understanding of what controls these systems. Heterologous expression systems have been successful in awakening a number of these cryptic marine biosynthetic gene clusters (BGCs). However, this approach is limited by the typically large size of the encoding sequences. More recently, focus has shifted to the regulatory proteins associated with each BGC, many of which are signal responsive raising the possibility of exogenous activation. Abundant among these are the LysR-type family of transcriptional regulators, which are known to control production of microbial aromatic systems. Although the environmental signals that activate these regulatory systems remain unknown, it offers the exciting possibility of evoking mimic molecules and synthetic expression systems to drive production of potentially novel natural products in microorganisms. Success in this field has the potential to provide a quantum leap forward in medical and industrial bio-product development. To achieve these new endpoints, it is clear that the integrated efforts of bioinformaticians and natural product chemists will be required as we strive to uncover new and potentially unique structures from silent or cryptic marine gene clusters.

  14. Ancestral Variations of the PCDHG Gene Cluster Predispose to Dyslexia in a Multiplex Family.

    Science.gov (United States)

    Naskar, Teesta; Faruq, Mohammed; Banerjee, Priyajit; Khan, Massarat; Midha, Rashi; Kumari, Renu; Devasenapathy, Subhashree; Prajapati, Bharat; Sengupta, Sanghamitra; Jain, Deepti; Mukerji, Mitali; Singh, Nandini Chatterjee; Sinha, Subrata

    2018-02-01

    Dyslexia is a heritable neurodevelopmental disorder characterized by difficulties in reading and writing. In this study, we describe the identification of a set of 17 polymorphisms located across a 1.9 Mb region on chromosome 5q31.3, encompassing genes of the PCDHG cluster, TAF7, PCDH1 and ARHGAP26, dominantly inherited with dyslexia in a multi-incident family. Strikingly, the non-risk forms of seven variations of the PCDHG cluster are preponderant in the human lineage, while risk alleles are ancestral and conserved across Neanderthals to non-human primates. Four of these seven ancestral variations (c.460A>C [p.Ile154Leu], c.541G>A [p.Ala181Thr], c.2036G>C [p.Arg679Pro] and c.2059A>G [p.Lys687Glu]) result in amino acid alterations. p.Ile154Leu and p.Ala181Thr are present at the EC2:EC3 interacting interface of γA3-PCDH and γA4-PCDH respectively, and might affect trans-homophilic interaction and hence neuronal connectivity. p.Arg679Pro and p.Lys687Glu are present within the linker region connecting the trans-membrane to the extracellular domain. Sequence analysis indicated the importance of p.Ile154, p.Arg679 and p.Lys687 in maintaining class specificity. Thus the observed association of PCDHG genes encoding neural adhesion proteins reinforces the hypothesis of aberrant neuronal connectivity in the pathophysiology of dyslexia. Additionally, the striking conservation of the identified variants indicates a role of PCDHG in the evolution of highly specialized cognitive skills critical to reading. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

  15. Analysis of healthy cohorts for single nucleotide polymorphisms in C1q gene cluster

    Directory of Open Access Journals (Sweden)

    MARIA A. RADANOVA

    2015-12-01

    Full Text Available C1q is the first component of the classical pathway of complement activation. The coding region for C1q is localized on chromosome 1p34.1–36.3. Mutations or single nucleotide polymorphisms (SNPs) in the C1q gene cluster can cause development of systemic lupus erythematosus (SLE) because of C1q deficiency or for other unknown reasons. We selected five SNPs located in a 7.121 kbp region on chromosome 1, which were previously associated with SLE and/or low C1q level but do not cause C1q deficiency, and analyzed them in terms of allele frequencies and genotype distribution in comparison with Hispanic, Asian, African and other Caucasian cohorts. These SNPs were: rs587585, rs292001, rs172378, rs294179 and rs631090. One hundred eighty-five healthy Bulgarian volunteers were genotyped for the selected five C1q SNPs by quantitative real-time PCR methods. The International HapMap Project was used for information about genotype distribution and allele frequencies of the five SNPs in Hispanic, Asian, African and other Caucasian cohorts. Bulgarian healthy volunteers and another pooled Caucasian cohort had similar frequencies of genotypes and alleles of the rs587585, rs292001, rs294179 and rs631090 SNPs. Nevertheless, genotype AA of rs172378 was significantly overrepresented in Bulgarians when compared to other healthy Caucasians from the USA and UK (60% vs 31%). Genotype distribution of rs172378 in Bulgarians was similar to that in Greek-Cypriot Caucasians. For all Caucasians the major allele of rs172378 was A. This is the first study analyzing the allele frequencies and genotype distribution of C1q gene cluster SNPs in the Bulgarian healthy population.
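    The allele frequencies and genotype distributions compared in this abstract follow from genotype counts by simple counting: each homozygote contributes two copies of its allele and each heterozygote one copy of each. The sketch below shows that arithmetic for a biallelic SNP. The function name `allele_frequencies` and the counts in the usage lines are hypothetical and are not data from the study.

```python
def allele_frequencies(n_hom_major: int, n_het: int, n_hom_minor: int):
    """Return (major allele frequency, minor allele frequency) for a biallelic SNP.

    Inputs are genotype counts: homozygous major, heterozygous, homozygous minor.
    """
    n_individuals = n_hom_major + n_het + n_hom_minor
    if n_individuals == 0:
        raise ValueError("need at least one genotyped individual")
    # Each individual carries two alleles.
    n_alleles = 2 * n_individuals
    freq_major = (2 * n_hom_major + n_het) / n_alleles
    return freq_major, 1.0 - freq_major


def genotype_distribution(n_hom_major: int, n_het: int, n_hom_minor: int):
    """Return the proportion of each genotype in the cohort."""
    total = n_hom_major + n_het + n_hom_minor
    if total == 0:
        raise ValueError("need at least one genotyped individual")
    return n_hom_major / total, n_het / total, n_hom_minor / total


if __name__ == "__main__":
    # Hypothetical counts for a cohort of 100 genotyped individuals.
    print(allele_frequencies(60, 30, 10))
    print(genotype_distribution(60, 30, 10))
```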

  16. Myf-6, a new member of the human gene family of myogenic determination factors: evidence for a gene cluster on chromosome 12.

    OpenAIRE

    Braun, T; Bober, E; Winter, B; Rosenthal, N; Arnold, H H

    1990-01-01

    The Myf-6 gene, a novel member of the human gene family of muscle determination factors has been detected by its highly conserved sequence coding for a putative helix-loop-helix domain. This sequence motif is a common feature of all Myf factors and other regulatory proteins. The new Myf gene is located on human chromosome 12, approximately 6.5 Kb upstream of the Myf-5 locus in a closely linked cluster of myogenic determination genes. Myf-6 cDNAs were isolated from human and mouse skeletal mus...

  17. An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

    Science.gov (United States)

    Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

    2017-04-12

    Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs); (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.
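
    The idea of a k-means pass over an existing module partition can be sketched roughly as follows. This is only an illustration and not the km2gcn implementation: the function names are hypothetical, each module centroid is taken as the mean of its standardized gene profiles (a simple stand-in for a module eigengene), and genes are reassigned to the centroid with which they have the highest Pearson correlation.

```python
from statistics import fmean, pstdev

def standardize(profile):
    """Scale one expression profile to zero mean and unit variance."""
    mu, sd = fmean(profile), pstdev(profile)
    return [(x - mu) / sd for x in profile]

def pearson(u, v):
    """Pearson correlation of two equally long profiles."""
    zu, zv = standardize(u), standardize(v)
    return sum(x * y for x, y in zip(zu, zv)) / len(zu)

def centroid(profiles):
    """Mean of standardized profiles (assumed stand-in for an eigengene)."""
    z = [standardize(p) for p in profiles]
    return [sum(col) / len(z) for col in zip(*z)]

def kmeans_refine(expr, labels, max_iter=30):
    """Reassign genes to the best-correlated module centroid until stable.

    expr   : list of per-gene expression profiles (one value per sample)
    labels : initial module label for each gene, e.g. from WGCNA
    """
    labels = list(labels)
    for _ in range(max_iter):
        modules = sorted(set(labels))
        cents = {m: centroid([g for g, l in zip(expr, labels) if l == m])
                 for m in modules}
        new = [max(modules, key=lambda m: pearson(g, cents[m])) for g in expr]
        if new == labels:  # converged: no gene changed module
            break
        labels = new
    return labels
```

    Starting from the WGCNA partition rather than from random centroids keeps the number of modules fixed by the initial clustering; profiles with zero variance would need to be filtered out before calling this sketch.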

  18. Identification of a Gene Cluster Enabling Lactobacillus casei BL23 To Utilize myo-Inositol

    Science.gov (United States)

    Yebra, María Jesús; Zúñiga, Manuel; Beaufils, Sophie; Pérez-Martínez, Gaspar; Deutscher, Josef; Monedero, Vicente

    2007-01-01

    Genome analysis of Lactobacillus casei BL23 revealed that, compared to L. casei ATCC 334, it carries a 12.8-kb DNA insertion containing genes involved in the catabolism of the cyclic polyol myo-inositol (MI). Indeed, L. casei ATCC 334 does not ferment MI, whereas strain BL23 is able to utilize this carbon source. The inserted DNA consists of an iolR gene encoding a DeoR family transcriptional repressor and a divergently transcribed iolTABCDG1G2EJK operon, encoding a complete MI catabolic pathway, in which the iolK gene probably codes for a malonate semialdehyde decarboxylase. The presence of iolK suggests that L. casei has two alternative pathways for the metabolism of malonic semialdehyde: (i) the classical MI catabolic pathway in which IolA (malonate semialdehyde dehydrogenase) catalyzes the formation of acetyl-coenzyme A from malonic semialdehyde and (ii) the conversion of malonic semialdehyde to acetaldehyde catalyzed by the product of iolK. The function of the iol genes was verified by the disruption of iolA, iolT, and iolD, which provided MI-negative strains. By contrast, the disruption of iolK resulted in a strain with no obvious defect in MI utilization. Transcriptional analyses conducted with different mutant strains showed that the iolTABCDG1G2EJK cluster is regulated by substrate-specific induction mediated by the inactivation of the transcriptional repressor IolR and by carbon catabolite repression mediated by the catabolite control protein A (CcpA). This is the first example of an operon for MI utilization in lactic acid bacteria and illustrates the versatility of carbohydrate utilization in L. casei BL23. PMID:17449687

  19. Genetic interrelations in the actinomycin biosynthetic gene clusters of Streptomyces antibioticus IMRU 3720 and Streptomyces chrysomallus ATCC11523, producers of actinomycin X and actinomycin C

    Directory of Open Access Journals (Sweden)

    Crnovčić I

    2017-04-01

    Full Text Available Ivana Crnovčić,1 Christian Rückert,2 Siamak Semsary,1 Manuel Lang,1 Jörn Kalinowski,2 Ullrich Keller1 1Institut für Chemie, Technische Universität Berlin, Berlin-Charlottenburg, 2Technology Platform Genomics, Center for Biotechnology, Bielefeld University, Bielefeld, Germany Abstract: Sequencing the actinomycin (acm) biosynthetic gene cluster of Streptomyces antibioticus IMRU 3720, which produces actinomycin X (Acm X), revealed 20 genes organized into a framework highly similar to that of the bi-armed acm C biosynthetic gene cluster of Streptomyces chrysomallus but without an attached extra arm of orthologues as in the latter. Curiously, the extra arm of the S. chrysomallus gene cluster turned out to perfectly match the single arm of the S. antibioticus gene cluster in the same order of orthologues, including the presence of two pseudogenes, scacmM and scacmN, encoding a cytochrome P450 and its ferredoxin, respectively. Orthologues of the latter genes were both missing in the principal arm of the S. chrysomallus acm C gene cluster. All orthologues of the extra arm showed a G+C content different from that of their counterparts in the principal arm. Moreover, the translation products from the extra arm were all more similar to the corresponding translation products of orthologous genes from the S. antibioticus acm X gene cluster than to those encoded by the principal arm of their own gene cluster. This suggests that the duplicated structure of the S. chrysomallus acm C biosynthetic gene cluster evolved from a previous fusion between two one-armed acm gene clusters, each from a different genetic background. However, while scacmM and scacmN in the extra arm of the S. chrysomallus acm C gene cluster are mutated and therefore non-functional, their orthologues saacmM and saacmN in the S. antibioticus acm C gene cluster show no defects, seemingly encoding active enzymes with functions specific for Acm X biosynthesis. Both acm

  20. Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks

    Directory of Open Access Journals (Sweden)

    Mazo Ilya

    2007-07-01

    Full Text Available Abstract Background Uncovering cellular roles of a protein is a task of tremendous importance and complexity that requires dedicated experimental work as well as often sophisticated data mining and processing tools. Protein functions, often referred to as annotations, are believed to manifest themselves through the topology of the networks of inter-protein interactions. In particular, there is a growing body of evidence that proteins performing the same function are more likely to interact with each other than with proteins with other functions. However, since functional annotation and protein network topology are often studied separately, the direct relationship between them has not been comprehensively demonstrated. In addition to having general biological significance, such a demonstration would further validate the data extraction and processing methods used to compose protein annotation and protein-protein interaction datasets. Results We developed a method for automatic extraction of protein functional annotation from scientific text based on Natural Language Processing (NLP) technology. For the protein annotation extracted from the entire PubMed, we evaluated the precision and recall rates, and compared the performance of the automatic extraction technology to that of manual curation used in public Gene Ontology (GO) annotation. In the second part of our presentation, we reported a large-scale investigation into the correspondence between communities in the literature-based protein networks and GO annotation groups of functionally related proteins. We found a comprehensive two-way match: proteins within biological annotation groups form significantly denser linked network clusters than expected by chance and, conversely, densely linked network communities exhibit a pronounced non-random overlap with GO groups. We also expanded the publicly available GO biological process annotation using the relations extracted by our NLP technology
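
    The abstract does not say which statistic was used to judge whether the overlap between a network community and a GO group is non-random. One common choice, shown here purely as an assumed illustration, is the upper tail of the hypergeometric distribution; the function and parameter names are hypothetical.

```python
from math import comb

def hypergeom_tail(total, in_group, community_size, overlap):
    """P(X >= overlap) for a community drawn at random without replacement.

    total          : number of proteins in the network
    in_group       : proteins annotated with the GO group
    community_size : proteins in the network community
    overlap        : community members that carry the GO annotation
    """
    upper = min(in_group, community_size)
    favourable = sum(
        comb(in_group, i) * comb(total - in_group, community_size - i)
        for i in range(overlap, upper + 1)
    )
    return favourable / comb(total, community_size)

# Example: 10 proteins, 4 in the GO group, a community of 3 with 2 annotated.
p_value = hypergeom_tail(10, 4, 3, 2)
```

    A small tail probability indicates that the observed overlap would be unlikely if community membership were unrelated to the annotation; with many community-group pairs, a multiple-testing correction would also be needed.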

  1. Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

    Directory of Open Access Journals (Sweden)

    Andrew Williams

    2017-12-01

    Full Text Available This article contains data related to the research article ‘Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials’ (Williams and Halappanavar, 2015) [1]. The presence of diverse types of nanomaterials (NMs) in commerce has grown significantly in the past decade and as a result, human exposure to these materials in the environment is inevitable. The traditional toxicity testing approaches that are reliant on animals are both time- and cost-intensive; by employing them, it is not possible to complete the challenging task of safety assessment of NMs currently on the market in a timely manner. Thus, there is an urgent need for comprehensive understanding of the biological behavior of NMs, and efficient toxicity screening tools that will enable the development of predictive toxicology paradigms suited to rapidly assessing the human health impacts of exposure to NMs. In an effort to predict the long term health impacts of acute exposure to NMs, in Williams and Halappanavar (2015) [1], we applied bi-clustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related bi-clusters showing similar gene expression profiles were identified. The identified bi-clusters were then used to conduct a gene set enrichment analysis on lung gene expression profiles derived from mice exposed to nano-titanium dioxide, carbon black or carbon nanotubes (nano-TiO2, CB and CNTs) to determine the disease significance of these data-driven gene sets. The results of the analysis correctly identified all NMs to be inflammogenic, and only CB and CNTs as potentially fibrogenic. Here, we

  2. Characterization and detection of a widely distributed gene cluster that predicts anaerobic choline utilization by human gut bacteria.

    Science.gov (United States)

    Martínez-del Campo, Ana; Bodea, Smaranda; Hamer, Hilary A; Marks, Jonathan A; Haiser, Henry J; Turnbaugh, Peter J; Balskus, Emily P

    2015-04-14

    Elucidation of the molecular mechanisms underlying the human gut microbiota's effects on health and disease has been complicated by difficulties in linking metabolic functions associated with the gut community as a whole to individual microorganisms and activities. Anaerobic microbial choline metabolism, a disease-associated metabolic pathway, exemplifies this challenge, as the specific human gut microorganisms responsible for this transformation have not yet been clearly identified. In this study, we established the link between a bacterial gene cluster, the choline utilization (cut) cluster, and anaerobic choline metabolism in human gut isolates by combining transcriptional, biochemical, bioinformatic, and cultivation-based approaches. Quantitative reverse transcription-PCR analysis and in vitro biochemical characterization of two cut gene products linked the entire cluster to growth on choline and supported a model for this pathway. Analyses of sequenced bacterial genomes revealed that the cut cluster is present in many human gut bacteria, is predictive of choline utilization in sequenced isolates, and is widely but discontinuously distributed across multiple bacterial phyla. Given that bacterial phylogeny is a poor marker for choline utilization, we were prompted to develop a degenerate PCR-based method for detecting the key functional gene choline TMA-lyase (cutC) in genomic and metagenomic DNA. Using this tool, we found that new choline-metabolizing gut isolates universally possessed cutC. We also demonstrated that this gene is widespread in stool metagenomic data sets. Overall, this work represents a crucial step toward understanding anaerobic choline metabolism in the human gut microbiota and underscores the importance of examining this microbial community from a function-oriented perspective. Anaerobic choline utilization is a bacterial metabolic activity that occurs in the human gut and is linked to multiple diseases. While bacterial genes responsible for

  3. Characterization of a multicopper oxidase gene cluster in Phanerochaete chrysosporium and evidence of altered splicing of the mco transcripts

    Science.gov (United States)

    Luis F. Larrondo; Bernardo Gonzalez; Dan Cullen; Rafael Vicuna

    2004-01-01

    A cluster of multicopper oxidase genes (mco1, mco2, mco3, mco4) from the lignin-degrading basidiomycete Phanerochaete chrysosporium is described. The four genes share the same transcriptional orientation within a 25 kb region. mco1, mco2 and mco3 are tightly grouped, with intergenic regions of 2.3 and 0.8 kb, respectively, whereas mco4 is located 11 kb upstream of mco1...

  4. Genotyping of Campylobacter jejuni strains from Danish broiler chickens by restriction fragment length polymorphism of the LPS gene cluster

    DEFF Research Database (Denmark)

    Knudsen, K.N.; Bang, Dang Duong; Nielsen, E.M.

    2005-01-01

    Aims: To apply and evaluate LG (LPS genes) genotyping, which is a genotyping method based on a cluster of genes involved in the synthesis of surface lipopolysaccharides (LPS) in Campylobacter species, for typing of Campylobacter jejuni isolates obtained from Danish broiler chickens. Furthermore...... and LG genotyping was low when applied to poultry isolates. This is in contrast to previous studies on isolates of human origin that reported a high correlation between results obtained by the two typing methods....

  5. Glutamic acid promotes monacolin K production and monacolin K biosynthetic gene cluster expression in Monascus.

    Science.gov (United States)

    Zhang, Chan; Liang, Jian; Yang, Le; Chai, Shiyuan; Zhang, Chenxi; Sun, Baoguo; Wang, Chengtao

    2017-12-01

    This study investigated the effects of glutamic acid on production of monacolin K and expression of the monacolin K biosynthetic gene cluster. When Monascus M1 was grown in glutamic medium instead of in the original medium, monacolin K production increased from 48.4 to 215.4 mg l−1, an increase of 3.5 times. Glutamic acid enhanced monacolin K production by upregulating the expression of mokB-mokI; on day 8, the expression level of mokA tended to decrease, as measured by reverse transcription-polymerase chain reaction. Our findings demonstrated that mokA was not a key gene responsible for the quantity of monacolin K production in the presence of glutamic acid. Observation of Monascus mycelium morphology by scanning electron microscopy showed that glutamic acid significantly increased the content of Monascus mycelium, altered the permeability of Monascus mycelium, enhanced secretion of monacolin K from the cell, and reduced the monacolin K content in Monascus mycelium, thereby enhancing monacolin K production.

  6. Identification of a conserved cluster of skin-specific genes encoding secreted proteins.

    Science.gov (United States)

    Moffatt, Pierre; Salois, Patrick; St-Amant, Natalie; Gaumond, Marie-Hélène; Lanctôt, Christian

    2004-06-09

    Terminal differentiation of keratinocytes results in the formation of a cornified layer composed of cross-linked intracellular and extracellular material. Using a signal trap expression screening strategy, we have identified four cDNAs encoding secreted proteins potentially involved in this process. One of the cDNAs is identical to the short isoform of suprabasin, a recently described epidermis-specific protein, which is shown here to contain a functional secretory signal. The second cDNA, sk89, encodes a protein of 493 amino acids, rich in glycine and serine residues. The third cDNA encodes a C-terminal fragment of SK89 (amino acids 410-493). It comprises exons 13 to 18 of the sk89 locus but transcription starts at an isoform-specific exon encoding a distinct secretory signal. The fourth cDNA encodes keratinocyte differentiation-associated protein (KDAP), a precursor protein of 102 amino acids. Subcellular localization by immunofluorescence and detection of the tagged proteins by Western blotting confirmed that the four proteins are secreted. Northern analysis and in situ hybridization revealed that expression of the corresponding genes was restricted to the suprabasal keratinocytes of the epidermis. These genes encoding epidermis-specific secreted products are found in a conserved cluster on human chromosome 19q13.12 and on mouse chromosome 7A3.

  7. Identification of a gene cluster for biosynthesis of mannosylerythritol lipids in the basidiomycetous fungus Ustilago maydis.

    Science.gov (United States)

    Hewald, Sandra; Linne, Uwe; Scherer, Mario; Marahiel, Mohamed A; Kämper, Jörg; Bölker, Michael

    2006-08-01

    Many microorganisms produce surface-active substances that enhance the availability of water-insoluble substrates. Although many of these biosurfactants have interesting potential applications, very little is known about their biosynthesis. The basidiomycetous fungus Ustilago maydis secretes large amounts of mannosylerythritol lipids (MELs) under conditions of nitrogen starvation. We recently described a putative glycosyltransferase, Emt1, which is essential for MEL biosynthesis and whose expression is strongly induced by nitrogen limitation. We used DNA microarray analysis to identify additional genes involved in MEL biosynthesis. Here we show that emt1 is part of a gene cluster which comprises five open reading frames. Three of the newly identified proteins, Mac1, Mac2, and Mat1, contain short sequence motifs characteristic for acyl- and acetyltransferases. Mutational analysis revealed that Mac1 and Mac2 are essential for MEL production, which suggests that they are involved in the acylation of mannosylerythritol. Deletion of mat1 resulted in the secretion of completely deacetylated MELs, as determined by mass spectrometry. We overexpressed Mat1 in Escherichia coli and demonstrated that this enzyme acts as an acetyl coenzyme A-dependent acetyltransferase. Remarkably, Mat1 displays relaxed regioselectivity and is able to acetylate mannosylerythritol at both the C-4 and C-6 hydroxyl groups. Based on these results, we propose a biosynthesis pathway for the generation of mannosylerythritol lipids in U. maydis.

  8. Genetic analysis of capsular polysaccharide synthesis gene clusters in 79 capsular types of Klebsiella spp

    Science.gov (United States)

    Pan, Yi-Jiun; Lin, Tzu-Lung; Chen, Chun-Tang; Chen, Yi-Yin; Hsieh, Pei-Fang; Hsu, Chun-Ru; Wu, Meng-Chuan; Wang, Jin-Town

    2015-01-01

    A total of 79 capsular types have been reported in Klebsiella spp., whereas capsular polysaccharide synthesis (cps) regions were available for only 22 types. Given the limitations of serotyping, a complete repertoire of cps regions would be helpful for capsular genotyping. We therefore resolved the remaining 57 cps regions and conducted a comparative analysis. Clustering of 1,515 predicted proteins from the cps loci grouped similar proteins into homology groups (HGs) and revealed that 77 Wzy polymerases were classified into 56 HGs, which indicates the high specificity of wzy between different types. Accordingly, wzy-based capsular genotyping could differentiate capsule types except for those lacking wzy (K29 and K50) and those sharing an identical wzy (K22 vs. K37), and it should be applied carefully to those exhibiting high similarity (K12 vs. K41, K2 vs. K13, K74 vs. K80, K79 vs. KN1 and K30 vs. K69). Comparison of CPS structures in several capsular types that shared similarity in their gene contents implies possible functions of glycosyltransferases. Therefore, our results provide a complete set of cps regions for the various types of Klebsiella spp., which enables understanding of the relationship between genes and CPS structures and is useful for identification of documented or new capsular types. PMID:26493302

  9. pMH2, a small plasmid bearing the nif gene cluster of Enterobacter agglomerans 333 as an excisable cassette.

    Science.gov (United States)

    Stumpf, F; Halda, L; Klingmüller, W

    1993-10-01

    A small plasmid containing the entire nif gene cluster of Enterobacter agglomerans 333 as an excisable cassette has been constructed, using pACYC177 as a vector. Two cosmid clones taken from a gene library of E. agglomerans plasmid pEA3 were used as a source of nif genes. A SmaI fragment of peaMS2-2, containing the H,D,K,Y,E,N,X,U,S,V,W,Z,M,L,A and B genes and an ApaI fragment of peaMS2-16 containing nif A,B,Q,F and J were selected to construct pMH2. The resulting plasmid of 33 kb carries the complete nif gene cluster as a nif cassette on a single XbaI fragment. The nif construct pMH2 in Escherichia coli strains has significant nitrogenase activity compared to wild-type E. agglomerans 333. The nif gene cluster construct was found to be very stable.

  10. GenCLiP: a software program for clustering gene lists by literature profiling and constructing gene co-occurrence networks related to custom keywords

    Directory of Open Access Journals (Sweden)

    Zhou Yi-Bo

    2008-07-01

    Full Text Available Abstract Background Biomedical researchers often want to explore pathogenesis and pathways regulated by abnormally expressed genes, such as those identified by microarray analyses. Literature mining is an important way to assist in this task. Many literature mining tools are now available. However, few of them allow the user to make manual adjustments to zero in on what he/she wants to know in particular. Results We present our software program, GenCLiP (Gene Cluster with Literature Profiles), which is based on the methods presented by Chaussabel and Sher (Genome Biol 2002, 3(10):RESEARCH0055) that search gene lists to identify functional clusters of genes based on up-to-date literature profiling. Four features were added to this previously described method: the ability to 1) manually curate keywords extracted from the literature, 2) search genes and gene co-occurrence networks related to custom keywords, 3) compare analyzed gene results with negative and positive controls generated by GenCLiP, and 4) calculate probabilities that the resulting genes and gene networks are randomly related. In this paper, we show, with a set of genes differentially expressed between keloids and normal controls, how implementation of functions in GenCLiP successfully identified keywords related to the pathogenesis of keloids and unknown gene pathways involved in the pathogenesis of keloids. Conclusion With regard to the identification of disease-susceptibility genes, GenCLiP allows one to quickly acquire a primary pathogenesis profile and identify pathways involving abnormally expressed genes not previously associated with the disease.

  11. Assessment of clusters of transcription factor binding sites in relationship to human promoter, CpG islands and gene expression

    Directory of Open Access Journals (Sweden)

    Sakaki Yoshiyuki

    2004-02-01

    Full Text Available Abstract Background Gene expression is regulated mainly by transcription factors (TFs) that interact with regulatory cis-elements on DNA sequences. To identify functional regulatory elements, computer searching can predict TF binding sites (TFBS) using position weight matrices (PWMs) that represent positional base frequencies of collected experimentally determined TFBS. A disadvantage of this approach is the large output of results for genomic DNA. One strategy to identify genuine TFBS is to utilize local concentrations of predicted TFBS. It is unclear whether there is a general tendency for TFBS to cluster at promoter regions, although this is the case for certain TFBS. It is also unclear which TFs have TFBS concentrated in promoters and to what level this occurs. This study aims to answer some of these questions. Results We developed the cluster score measure to evaluate the correlation between predicted TFBS clusters and promoter sequences for each PWM. Non-promoter sequences were used as a control. Using the cluster score, we identified a PWM group called PWM-PCP, in which TFBS clusters positively correlate with promoters, and another PWM group called PWM-NCP, in which TFBS clusters negatively correlate with promoters. The PWM-PCP group comprises 47% of the 199 vertebrate PWMs, while the PWM-NCP group comprises 11%. After reducing the effect of CpG islands (CGI) on the clusters using partial correlation coefficients among three properties (promoter, CGI and predicted TFBS cluster), we identified two PWM groups: those strongly correlated with CGI and those not correlated with CGI. Conclusion Not all PWMs predict TFBS correlated with human promoter sequences. Two main PWM groups were identified: (1) those that show TFBS clustered in promoters associated with CGI, and (2) those that show TFBS clustered in promoters independent of CGI. Assessment of PWM matches will allow more positive interpretation of TFBS in
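
    As a rough illustration of PWM-based TFBS prediction and of counting local concentrations of predicted sites, the sketch below builds a log-odds matrix from per-position base counts, scans a sequence, and reports the largest number of hits inside any window. It is not the cluster score defined by the authors, which the abstract does not specify; the function names, the pseudocount, the uniform background and the threshold are all assumptions, and sequences are assumed to contain only A, C, G and T.

```python
import math

BASES = "ACGT"

def log_odds_pwm(counts, background=0.25, pseudocount=1.0):
    """Turn per-position base counts into log2 odds against a flat background."""
    pwm = []
    for column in counts:
        total = sum(column.get(b, 0) for b in BASES) + 4 * pseudocount
        pwm.append({
            b: math.log2(((column.get(b, 0) + pseudocount) / total) / background)
            for b in BASES
        })
    return pwm

def scan(seq, pwm, threshold):
    """Return start positions whose window score reaches the threshold."""
    width = len(pwm)
    hits = []
    for start in range(len(seq) - width + 1):
        score = sum(pwm[j][seq[start + j]] for j in range(width))
        if score >= threshold:
            hits.append(start)
    return hits

def max_hits_in_window(hits, window):
    """Largest number of (sorted) hit starts that fit in one window."""
    best = 0
    left = 0
    for right in range(len(hits)):
        while hits[right] - hits[left] >= window:
            left += 1
        best = max(best, right - left + 1)
    return best

# Toy matrix whose consensus is TATA, built from ten hypothetical sites.
toy_counts = [{"T": 10}, {"A": 10}, {"T": 10}, {"A": 10}]
toy_pwm = log_odds_pwm(toy_counts)
toy_hits = scan("GGTATAGGTATACC", toy_pwm, threshold=6.0)
```

    Comparing such window counts between promoter and non-promoter sequences is one simple way to ask whether predicted sites for a given PWM concentrate near promoters.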

  12. Evolution of the C-Type Lectin-Like Receptor Genes of the DECTIN-1 Cluster in the NK Gene Complex

    Directory of Open Access Journals (Sweden)

    Susanne Sattler

    2012-01-01

    Full Text Available Pattern recognition receptors are crucial in initiating and shaping innate and adaptive immune responses and often belong to families of structurally and evolutionarily related proteins. The human C-type lectin-like receptors encoded in the DECTIN-1 cluster within the NK gene complex contain prominent receptors with pattern recognition function, such as DECTIN-1 and LOX-1. All members of this cluster share significant homology and are considered to have arisen from subsequent gene duplications. Recent developments in sequencing and the availability of comprehensive sequence data comprising many species showed that the receptors of the DECTIN-1 cluster are not only homologous to each other but also highly conserved between species. Even in Caenorhabditis elegans, genes displaying homology to the mammalian C-type lectin-like receptors have been detected. In this paper, we conduct a comprehensive phylogenetic survey and give an up-to-date overview of the currently available data on the evolutionary emergence of the DECTIN-1 cluster genes.

  13. vanI: a novel d-Ala-d-Lac vancomycin resistance gene cluster found in Desulfitobacterium hafniense

    NARCIS (Netherlands)

    Kruse, T.; Levisson, M.; Vos, de W.M.; Smidt, H.

    2014-01-01

    The glycopeptide vancomycin was until recently considered a drug of last resort against Gram-positive bacteria. Increasing numbers of bacteria, however, are found to carry genes that confer resistance to this antibiotic. So far, 10 different vancomycin resistance clusters have been described. A

  14. Evolution and genetic population structure of prickly lettuce (Lactuca serriola) and its RGC2 resistance gene cluster

    NARCIS (Netherlands)

    Kuang, H.; Eck, van H.J.; Sicard, D.; Michelmore, R.; Nevo, E.

    2008-01-01

    Genetic structure and diversity of natural populations of prickly lettuce (Lactuca serriola) were studied using AFLP markers and then compared with the diversity of the RGC2 disease resistance gene cluster. Screening of 696 accessions from 41 populations using 319 AFLP markers showed that eastern

  15. Histone and Ribosomal RNA Repetitive Gene Clusters of the Boll Weevil are Linked in a Tandem Array

    Science.gov (United States)

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and ...

  16. Genetic diversity of K-antigen gene clusters of Escherichia coli and their molecular typing using a suspension array.

    Science.gov (United States)

    Yang, Shuang; Xi, Daoyi; Jing, Fuyi; Kong, Deju; Wu, Junli; Feng, Lu; Cao, Boyang; Wang, Lei

    2018-04-01

    Capsular polysaccharides (CPSs), or K-antigens, are the major surface antigens of Escherichia coli. More than 80 serologically unique K-antigens are classified into 4 groups (Groups 1-4) of capsules. Groups 1 and 4 contain the Wzy-dependent polymerization pathway and the gene clusters are in the order galF to gnd; Groups 2 and 3 contain the ABC-transporter-dependent pathway and the gene clusters consist of 3 regions, regions 1, 2 and 3. Little is known about the variations among the gene clusters. In this study, 9 serotypes of K-antigen gene clusters (K2ab, K11, K20, K24, K38, K84, K92, K96, and K102) were sequenced and correlated with their CPS chemical structures. On the basis of sequence data, a K-antigen-specific suspension array that detects 10 distinct CPSs, including the above 9 CPSs plus K30, was developed. This is the first report to catalog the genetic features of E. coli K-antigen variations and to develop a suspension array for their molecular typing. The method has a number of advantages over traditional bacteriophage and serum agglutination methods and lays the foundation for straightforward identification and detection of additional K-antigens in the future.

  17. Organization of nif gene cluster in Frankia sp. EuIK1 strain, a symbiont of Elaeagnus umbellata.

    Science.gov (United States)

    Oh, Chang Jae; Kim, Ho Bang; Kim, Jitae; Kim, Won Jin; Lee, Hyoungseok; An, Chung Sun

    2012-01-01

    The nucleotide sequence of a 20.5-kb genomic region harboring nif genes was determined and analyzed. The fragment was obtained from Frankia sp. EuIK1 strain, an indigenous symbiont of Elaeagnus umbellata. A total of 20 ORFs including 12 nif genes were identified and subjected to comparative analysis with the genome sequences of 3 Frankia strains representing diverse host plant specificities. The nucleotide and deduced amino acid sequences showed highest levels of identity with orthologous genes from an Elaeagnus-infecting strain. The gene organization patterns around the nif gene clusters were well conserved among all 4 Frankia strains. However, characteristic features appeared in the location of the nifV gene for each Frankia strain, depending on the type of host plant. Sequence analysis was performed to determine the transcription units and suggested that there could be an independent operon starting from the nifW gene in the EuIK strain. Considering the organization patterns and their total extensions on the genome, we propose that the nif gene clusters remained stable despite genetic variations occurring in the Frankia genomes.

  18. Evolution of Chromosomal Clostridium botulinum Type E Neurotoxin Gene Clusters: Evidence Provided by Their Rare Plasmid-Borne Counterparts.

    Science.gov (United States)

    Carter, Andrew T; Austin, John W; Weedmark, Kelly A; Peck, Michael W

    2016-03-02

    Analysis of more than 150 Clostridium botulinum Group II type E genomes identified a small fraction (6%) where neurotoxin-encoding genes were located on plasmids. Seven closely related (134-144 kb) neurotoxigenic plasmids of subtypes E1, E3, and E10 were characterized; all carried genes associated with plasmid mobility via conjugation. Each plasmid contained the same 24-kb neurotoxin cluster cassette (six neurotoxin cluster and six flanking genes) that had split a helicase gene, rather than the more common chromosomal rarA. The neurotoxin cluster cassettes had evolved as separate genetic units which had either exited their chromosomal rarA locus in a series of parallel events, inserting into the plasmid-borne helicase gene, or vice versa. A single intact version of the helicase gene was discovered on a nonneurotoxigenic form of this plasmid. The observed low frequency for the plasmid location may reflect one or more of the following: 1) Less efficient recombination mechanism for the helicase gene target, 2) lack of suitable target plasmids, and 3) loss of neurotoxigenic plasmids. Type E1 and E10 plasmids possessed a Clustered Regularly Interspaced Short Palindromic Repeats locus with spacers that recognized C. botulinum Group II plasmids, but not C. botulinum Group I plasmids, demonstrating their long-term separation. Clostridium botulinum Group II type E strains also carry nonneurotoxigenic plasmids closely related to C. botulinum Group II types B and F plasmids. Here, the absence of neurotoxin cassettes may be because recombination requires both a specific mechanism and specific target sequence, which are rarely found together. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. The medaka novel immune-type receptor (NITR) gene clusters reveal an extraordinary degree of divergence in variable domains

    Directory of Open Access Journals (Sweden)

    Litman Gary W

    2008-06-01

    Background: Novel immune-type receptor (NITR) genes are members of diversified multigene families that are found in bony fish and encode type I transmembrane proteins containing one or two extracellular immunoglobulin (Ig) domains. The majority of NITRs can be classified as inhibitory receptors that possess cytoplasmic immunoreceptor tyrosine-based inhibition motifs (ITIMs). A much smaller number of NITRs can be classified as activating receptors by the lack of cytoplasmic ITIMs and presence of a positively charged residue within their transmembrane domain, which permits partnering with an activating adaptor protein. Results: Forty-four NITR genes in medaka (Oryzias latipes) are located in three gene clusters on chromosomes 10, 18 and 21 and can be organized into 24 families including inhibitory and activating forms. The particularly large dataset acquired in medaka makes direct comparison possible to another complete dataset acquired in zebrafish in which NITRs are localized in two clusters on different chromosomes. The two largest medaka NITR gene clusters share conserved synteny with the two zebrafish NITR gene clusters. Shared synteny between NITRs and CD8A/CD8B is limited but consistent with a potential common ancestry. Conclusion: Comprehensive phylogenetic analyses between the complete datasets of NITRs from medaka and zebrafish indicate multiple species-specific expansions of different families of NITRs. The patterns of sequence variation among gene family members are consistent with recent birth-and-death events. Similar effects have been observed with mammalian immunoglobulin (Ig), T cell antigen receptor (TCR) and killer cell immunoglobulin-like receptor (KIR) genes. NITRs likely diverged along an independent pathway from that of the somatically rearranging antigen binding receptors but have undergone parallel evolution of V family diversity.

  20. Insights into the evolutionary origins of clostridial neurotoxins from analysis of the Clostridium botulinum strain A neurotoxin gene cluster.

    Science.gov (United States)

    Doxey, Andrew C; Lynch, Michael D J; Müller, Kirsten M; Meiering, Elizabeth M; McConkey, Brendan J

    2008-11-14

    Clostridial neurotoxins (CNTs) are the most deadly toxins known and causal agents of botulism and tetanus neuroparalytic diseases. Despite considerable progress in understanding CNT structure and function, the evolutionary origins of CNTs remain a mystery as they are unique to Clostridium and possess a sequence and structural architecture distinct from other protein families. Uncovering the origins of CNTs would be a significant contribution to our understanding of how pathogens evolve and generate novel toxin families. The C. botulinum strain A genome was examined for potential homologues of CNTs. A key link was identified between the neurotoxin and the flagellin gene (CBO0798) located immediately upstream of the BoNT/A neurotoxin gene cluster. This flagellin sequence displayed the strongest sequence similarity to the neurotoxin and NTNH homologue out of all proteins encoded within C. botulinum strain A. The CBO0798 gene contains a unique hypervariable region, which in closely related flagellins encodes a collagenase-like domain. Remarkably, these collagenase-containing flagellins were found to possess the characteristic HEXXH zinc-protease motif responsible for the neurotoxin's endopeptidase activity. Additional links to collagenase-related sequences and functions were detected by further analysis of CNTs and surrounding genes, including sequence similarities to collagen-adhesion domains and collagenases. Furthermore, the neurotoxin's HCRn domain was found to exhibit both structural and sequence similarity to eukaryotic collagen jelly-roll domains. Multiple lines of evidence suggest that the neurotoxin and adjacent genes evolved from an ancestral collagenase-like gene cluster, linking CNTs to another major family of clostridial proteolytic toxins. Duplication, reshuffling and assembly of neighboring genes within the BoNT/A neurotoxin gene cluster may have led to the neurotoxin's unique architecture. This work provides new insights into the evolution of C

  1. A Functional Bikaverin Biosynthesis Gene Cluster in Rare Strains of Botrytis cinerea Is Positively Controlled by VELVET

    Science.gov (United States)

    Schumacher, Julia; Gautier, Angélique; Morgant, Guillaume; Studt, Lena; Ducrot, Paul-Henri; Le Pêcheur, Pascal; Azeddine, Saad; Fillinger, Sabine; Leroux, Pierre; Tudzynski, Bettina; Viaud, Muriel

    2013-01-01

    The gene cluster responsible for the biosynthesis of the red polyketidic pigment bikaverin has only been characterized in Fusarium spp. so far. Recently, a highly homologous but incomplete and nonfunctional bikaverin cluster has been found in the genome of the unrelated phytopathogenic fungus Botrytis cinerea. In this study, we provided evidence that rare B. cinerea strains such as 1750 have a complete and functional cluster comprising the six genes orthologous to Fusarium fujikuroi ffbik1-ffbik6 and do produce bikaverin. Phylogenetic analysis confirmed that the whole cluster was acquired from Fusarium through a horizontal gene transfer (HGT). In the bikaverin-nonproducing strain B05.10, the genes encoding bikaverin biosynthesis enzymes are nonfunctional due to deleterious mutations (bcbik2-3) or missing (bcbik1) but interestingly, the genes encoding the regulatory proteins BcBIK4 and BcBIK5 do not harbor deleterious mutations, which suggests that they may still be functional. Heterologous complementation of the F. fujikuroi Δffbik4 mutant confirmed that bcbik4 of strain B05.10 is indeed fully functional. Deletion of bcvel1 in the pink strain 1750 resulted in loss of bikaverin and overproduction of melanin, indicating that the VELVET protein BcVEL1 regulates the biosynthesis of the two pigments in an opposite manner. Although strain 1750 itself expresses a truncated BcVEL1 protein (100 instead of 575 aa) that is nonfunctional with regard to sclerotia formation, virulence and oxalic acid formation, it is sufficient to regulate pigment biosynthesis (bikaverin and melanin) and fenhexamid HydR2 type of resistance. Finally, a genetic cross between strain 1750 and a bikaverin-nonproducing strain sensitive to fenhexamid revealed that the functional bikaverin cluster is genetically linked to the HydR2 locus. PMID:23308280

  2. Presence of CTX gene cluster in environmental non-O1/O139 Vibrio cholerae and its potential clinical significance

    Directory of Open Access Journals (Sweden)

    B Bakhshi

    2012-01-01

    Purpose: The aim of this study was to understand the epidemiological linkage of clinical and environmental isolates of Vibrio cholerae and to determine their genotypes and virulence gene content. Materials and Methods: A total of 60 V. cholerae strains obtained from clinical specimens (n = 40) and surface waters (n = 20) were subjected to genotyping using PFGE and determination of their virulence-associated gene clusters. Results: PCR analysis showed the presence of chromosomally located hly and RTX genetic elements in 100% and 90% of the environmental isolates, respectively. The phage-mediated genetic elements such as CTX, TLC and VPI were detected in 5% of the environmental isolates suggesting that the environmental isolates cannot acquire certain mobile gene clusters. A total of 4 and 18 pulsotypes were obtained among the clinical and environmental V. cholerae isolates, respectively. Non-pathogenic environmentally isolated V. cholerae constituted a distinct cluster with one single non-O1, non-O139 strain (EP6) carrying virulence genes similar to those of the epidemic strains. This may suggest the potential for conversion of a non-pathogenic environmental strain to a pathogenic one. Conclusions: The emergence of a single environmental isolate in our study containing the pathogenicity genes amongst the diverse non-pathogenic environmental isolates needs to be further studied in the context of V. cholerae pathogenicity sero-conversion.

  3. De novo deletion of HOXB gene cluster in a patient with failure to thrive, developmental delay, gastroesophageal reflux and bronchiectasis.

    Science.gov (United States)

    Pajusalu, Sander; Reimand, Tiia; Uibo, Oivi; Vasar, Maire; Talvik, Inga; Zilina, Olga; Tammur, Pille; Õunap, Katrin

    2015-01-01

    We report a female patient with a complex phenotype consisting of failure to thrive, developmental delay, congenital bronchiectasis, gastroesophageal reflux and bilateral inguinal hernias. Chromosomal microarray analysis revealed a 230 kilobase deletion in chromosomal region 17q21.32 (arr[hg19] 17q21.32(46 550 362-46 784 039)×1) encompassing only 9 genes - HOXB1 to HOXB9. The deletion was not found in her mother or father. This is the first report of a patient with a HOXB gene cluster deletion involving only the HOXB1 to HOXB9 genes. By comparing our case to five previously reported patients with larger chromosomal aberrations involving the HOXB gene cluster, we suppose that HOXB gene cluster deletions are responsible for growth retardation, developmental delay, and specific facial dysmorphic features. We also suppose that bilateral inguinal hernias, tracheo-esophageal abnormalities, and lung malformations represent features with incomplete penetrance. Interestingly, previously published knock-out mice with a targeted heterozygous deletion comparable to that of our patient did not show phenotypic alterations. Copyright © 2015 Elsevier Masson SAS. All rights reserved.

  4. Molecular evolution of the nif gene cluster carrying nifI1 and nifI2 genes in the Gram-positive phototrophic bacterium Heliobacterium chlorum.

    Science.gov (United States)

    Enkh-Amgalan, Jigjiddorj; Kawasaki, Hiroko; Seki, Tatsuji

    2006-01-01

    A major nif cluster was detected in the strictly anaerobic, Gram-positive phototrophic bacterium Heliobacterium chlorum. The cluster consisted of 11 genes arranged within a 10 kb region in the order nifI1, nifI2, nifH, nifD, nifK, nifE, nifN, nifX, fdx, nifB and nifV. The phylogenetic position of Hbt. chlorum was the same in the NifH, NifD, NifK, NifE and NifN trees; Hbt. chlorum formed a cluster with Desulfitobacterium hafniense, the closest neighbour of heliobacteria based on the 16S rRNA phylogeny, and two species of the genus Geobacter belonging to the Deltaproteobacteria. Two nifI genes, known to occur in the nif clusters of methanogenic archaea between nifH and nifD, were found upstream of the nifH gene of Hbt. chlorum. The organization of the nif operon and the phylogeny of individual and concatenated gene products showed that the Hbt. chlorum nif operon carrying nifI genes upstream of the nifH gene was an intermediate between the nif operon with nifI downstream of nifH (group II and III of the nitrogenase classification) and the nif operon lacking nifI (group I). Thus, the phylogenetic position of Hbt. chlorum nitrogenase may reflect an evolutionary stage of a divergence of the two nitrogenase groups, with group I consisting of the aerobic diazotrophs and group II consisting of strictly anaerobic prokaryotes.

  5. Relational analysis of CpG islands methylation and gene expression in human lymphomas using possibilistic C-means clustering and modified cluster fuzzy density.

    Science.gov (United States)

    Sjahputera, Ozy; Keller, James M; Davis, J Wade; Taylor, Kristen H; Rahmatpanah, Farahnaz; Shi, Huidong; Anderson, Derek T; Blisard, Samuel N; Luke, Robert H; Popescu, Mihail; Arthur, Gerald C; Caldwell, Charles W

    2007-01-01

    Heterogeneous genetic and epigenetic alterations are commonly found in human non-Hodgkin's lymphomas (NHL). One such epigenetic alteration is aberrant methylation of gene promoter-related CpG islands, where hypermethylation frequently results in transcriptional inactivation of target genes, while a decrease or loss of promoter methylation (hypomethylation) is frequently associated with transcriptional activation. Discovering genes with these relationships in NHL or other types of cancers could lead to a better understanding of the pathobiology of these diseases. The simultaneous analysis of promoter methylation using Differential Methylation Hybridization (DMH) and its associated gene expression using Expressed CpG Island Sequence Tag (ECIST) microarrays generates a large volume of methylation-expression relational data. To analyze this data, we propose a set of algorithms based on fuzzy sets theory, in particular Possibilistic c-Means (PCM) and cluster fuzzy density. For each gene, these algorithms calculate measures of confidence of various methylation-expression relationships in each NHL subclass. Thus, these tools can be used as a means of high volume data exploration to better guide biological confirmation using independent molecular biology methods.
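    The abstract names Possibilistic c-Means (PCM) but does not spell out the computation. As a rough illustration only, the sketch below evaluates the textbook PCM typicality of a point to a cluster, 1 / (1 + (d^2 / eta)^(1 / (m - 1))), where d^2 is the squared distance to the cluster center. The function names, the bandwidth values `etas`, the fuzzifier `m = 2`, and the toy points are all assumptions made for illustration; this is not the authors' modified cluster fuzzy density method and uses no data from the study.

```python
from typing import List, Sequence


def pcm_typicality(sq_dist: float, eta: float, m: float = 2.0) -> float:
    """Textbook PCM typicality of one point to one cluster.

    sq_dist: squared distance from the point to the cluster center
    eta:     per-cluster bandwidth (assumed value, must be > 0)
    m:       fuzzifier (assumed value, must be > 1)
    """
    if eta <= 0 or m <= 1:
        raise ValueError("eta must be > 0 and m must be > 1")
    return 1.0 / (1.0 + (sq_dist / eta) ** (1.0 / (m - 1.0)))


def typicality_matrix(
    points: Sequence[Sequence[float]],
    centers: Sequence[Sequence[float]],
    etas: Sequence[float],
    m: float = 2.0,
) -> List[List[float]]:
    """Return one row per cluster and one column per point.

    Unlike fuzzy c-means, the columns are not forced to sum to 1:
    each typicality depends only on the distance to its own cluster.
    """
    rows = []
    for center, eta in zip(centers, etas):
        row = []
        for point in points:
            sq_dist = sum((a - b) ** 2 for a, b in zip(point, center))
            row.append(pcm_typicality(sq_dist, eta, m))
        rows.append(row)
    return rows


# Hypothetical toy data: three 2-D points, one cluster centered at the origin.
toy_points = [(0.0, 0.0), (1.0, 0.0), (3.0, 4.0)]
toy_result = typicality_matrix(toy_points, centers=[(0.0, 0.0)], etas=[1.0])
print(toy_result)
```

    Because a point far from every cluster receives a low typicality everywhere, possibilistic memberships of this kind can be read as absolute rather than relative degrees of belonging, which is one plausible reason to prefer them when computing confidence measures.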

  6. Identification and characterization of a biosynthetic gene cluster for tryptophan dimers in deep sea-derived Streptomyces sp. SCSIO 03032.

    Science.gov (United States)

    Ma, Liang; Zhang, Wenjun; Zhu, Yiguang; Zhang, Guangtao; Zhang, Haibo; Zhang, Qingbo; Zhang, Liping; Yuan, Chengshan; Zhang, Changsheng

    2017-08-01

    Tryptophan dimers (TDs) are an important class of natural products with diverse bioactivities and share conserved biosynthetic pathways. We report the identification of a partial gene cluster (spm) responsible for the biosynthesis of a class of unusual TDs with non-planar skeletons including spiroindimicins (SPMs), indimicins (IDMs), and lynamicins (LNMs) from the deep-sea derived Streptomyces sp. SCSIO 03032. Bioinformatics analysis, targeted gene disruptions, and heterologous expression studies confirmed the involvement of the spm gene cluster in the biosynthesis of SPM/IDM/LNMs, and revealed the indispensable roles for the halogenase/reductase pair SpmHF, the amino acid oxidase SpmO, and the chromopyrrolic acid (CPA) synthase SpmD, as well as the positive regulator SpmR and the putative transporter SpmA. However, the spm gene cluster was unable to confer on a heterologous host the ability to produce SPM/IDM/LNMs. In addition, the P450 enzyme SpmP and the monooxygenase SpmX2 were found not to be relevant to the biosynthesis of SPM/IDM/LNMs. Sequence alignment and structure modeling suggested the lack of key conserved amino acid residues in the substrate-binding pocket of SpmP. Furthermore, feeding experiments in the non-producing ΔspmO mutant revealed several biosynthetic precursors en route to SPMs, indicating that key enzymes responsible for the biosynthesis of SPMs should be encoded by genes outside of the identified spm gene cluster. Finally, the biosynthetic pathways of SPM/IDM/LNMs are proposed to lay a basis for further insights into their intriguing biosynthetic machinery.

  7. Genes encoding Cher-TPR fusion proteins are predominantly found in gene clusters encoding chemosensory pathways with alternative cellular functions.

    Directory of Open Access Journals (Sweden)

    Francisco Muñoz-Martínez

    Chemosensory pathways correspond to major signal transduction mechanisms and can be classified into the functional families flagellum-mediated taxis, type four pili-mediated taxis or pathways with alternative cellular functions (ACF). CheR methyltransferases are core enzymes in all of these families. CheR proteins fused to tetratricopeptide repeat (TPR) domains have been reported and we present an analysis of this uncharacterized family. We show that CheR-TPRs are widely distributed in Gram-negative but almost absent from Gram-positive bacteria. Most strains contain a single CheR-TPR and its abundance does not correlate with the number of chemoreceptors. The TPR domain fused to CheR is comparatively short and frequently composed of 2 repeats. The majority of CheR-TPR genes were found in gene clusters that harbor multidomain response regulators in which the REC domain is fused to different output domains like HK, GGDEF, EAL, HPT, AAA, PAS, GAF, additional REC, HTH, phosphatase or combinations thereof. The response regulator architectures coincide with those reported for the ACF family of pathways. Since the presence of multidomain response regulators is a distinctive feature of this pathway family, we conclude that CheR-TPR proteins form part of ACF type pathways. The diversity of response regulator output domains suggests that the ACF pathways form a superfamily which regroups many different regulatory mechanisms, in which all CheR-TPR proteins appear to participate. In the second part we characterize WspC of Pseudomonas putida, a representative example of CheR-TPR. The affinities of WspC-Pp for S-adenosylmethionine and S-adenosylhomocysteine were comparable to those of prototypal CheR, indicating that WspC-Pp activity is, by analogy to prototypal CheRs, controlled by product feedback inhibition. The removal of the TPR domain did not significantly affect the binding constants and consequently did not affect the product feedback inhibition. WspC-Pp was

  8. Genes encoding Cher-TPR fusion proteins are predominantly found in gene clusters encoding chemosensory pathways with alternative cellular functions.

    Science.gov (United States)

    Muñoz-Martínez, Francisco; García-Fontana, Cristina; Rico-Jiménez, Miriam; Alfonso, Carlos; Krell, Tino

    2012-01-01

    Chemosensory pathways correspond to major signal transduction mechanisms and can be classified into the functional families flagellum-mediated taxis, type four pili-mediated taxis or pathways with alternative cellular functions (ACF). CheR methyltransferases are core enzymes in all of these families. CheR proteins fused to tetratricopeptide repeat (TPR) domains have been reported and we present an analysis of this uncharacterized family. We show that CheR-TPRs are widely distributed in Gram-negative but almost absent from Gram-positive bacteria. Most strains contain a single CheR-TPR and its abundance does not correlate with the number of chemoreceptors. The TPR domain fused to CheR is comparatively short and frequently composed of 2 repeats. The majority of CheR-TPR genes were found in gene clusters that harbor multidomain response regulators in which the REC domain is fused to different output domains like HK, GGDEF, EAL, HPT, AAA, PAS, GAF, additional REC, HTH, phosphatase or combinations thereof. The response regulator architectures coincide with those reported for the ACF family of pathways. Since the presence of multidomain response regulators is a distinctive feature of this pathway family, we conclude that CheR-TPR proteins form part of ACF type pathways. The diversity of response regulator output domains suggests that the ACF pathways form a superfamily which regroups many different regulatory mechanisms, in which all CheR-TPR proteins appear to participate. In the second part we characterize WspC of Pseudomonas putida, a representative example of CheR-TPR. The affinities of WspC-Pp for S-adenosylmethionine and S-adenosylhomocysteine were comparable to those of prototypal CheR, indicating that WspC-Pp activity is, by analogy to prototypal CheRs, controlled by product feedback inhibition. The removal of the TPR domain did not significantly affect the binding constants and consequently did not affect the product feedback inhibition. WspC-Pp was found to be

  9. Sequencing and Transcriptional Analysis of the Biosynthesis Gene Cluster of Putrescine-Producing Lactococcus lactis

    Science.gov (United States)

    Ladero, Victor; Rattray, Fergal P.; Mayo, Baltasar; Martín, María Cruz; Fernández, María; Alvarez, Miguel A.

    2011-01-01

    Lactococcus lactis is a prokaryotic microorganism with great importance as a culture starter and has become the model species among the lactic acid bacteria. The long and safe history of use of L. lactis in dairy fermentations has resulted in the classification of this species as GRAS (Generally Recognized As Safe) or QPS (Qualified Presumption of Safety). However, our group has identified several strains of L. lactis subsp. lactis and L. lactis subsp. cremoris that are able to produce putrescine from agmatine via the agmatine deiminase (AGDI) pathway. Putrescine is a biogenic amine that confers undesirable flavor characteristics and may even have toxic effects. The AGDI cluster of L. lactis is composed of a putative regulatory gene, aguR, followed by the genes (aguB, aguD, aguA, and aguC) encoding the catabolic enzymes. These genes are transcribed as an operon that is induced in the presence of agmatine. In some strains, an insertion (IS) element interrupts the transcription of the cluster, which results in a non-putrescine-producing phenotype. Based on this knowledge, a PCR-based test was developed in order to differentiate nonproducing L. lactis strains from those with a functional AGDI cluster. The analysis of the AGDI cluster and its flanking regions revealed that the capacity to produce putrescine via the AGDI pathway could be a specific characteristic that was lost during the adaptation to the milk environment by a process of reductive genome evolution. PMID:21803900

  10. Endophytic actinobacteria: Diversity, secondary metabolism and mechanisms to unsilence biosynthetic gene clusters.

    Science.gov (United States)

    Dinesh, Raghavan; Srinivasan, Veeraraghavan; T E, Sheeja; Anandaraj, Muthuswamy; Srambikkal, Hamza

    2017-09-01

    Endophytic actinobacteria, which reside in the inner tissues of host plants, are gaining serious attention due to their capacity to produce a plethora of secondary metabolites (e.g. antibiotics) possessing a wide variety of biological activities with diverse functions. This review encompasses the recent reports on endophytic actinobacterial species diversity, in planta habitats and mechanisms underlying their mode of entry into plants. In addition, their metabolic potential, the novel bioactive compounds they produce, and mechanisms to unravel their hidden metabolic repertoire by activation of cryptic or silent biosynthetic gene clusters (BGCs) for eliciting novel secondary metabolite production are discussed. The study also reviews the classical conservative techniques (chemical/biological/physical elicitation, co-culturing) as well as modern microbiology tools (e.g. next generation sequencing) that are being gainfully employed to uncover the vast hidden scaffolds for novel secondary metabolites produced by these endophytes, which would subsequently herald a revolution in drug engineering. The potential role of these endophytes in the agro-environment as promising biological candidates for inhibition of phytopathogens and the way forward to thoroughly exploit this unique microbial community by inducing expression of cryptic BGCs for encoding unseen products with novel therapeutic properties are also discussed.

  11. The N‐acetylglucosamine catabolic gene cluster in Trichoderma reesei is controlled by the Ndt80‐like transcription factor RON1

    Science.gov (United States)

    Kappel, Lisa; Gaderer, Romana; Flipphi, Michel

    2015-01-01

    Summary: Chitin is an important structural constituent of fungal cell walls composed of N‐acetylglucosamine (GlcNAc) monosaccharides, but catabolism of GlcNAc has not been studied in filamentous fungi so far. In the yeast Candida albicans, the genes encoding the three enzymes responsible for stepwise conversion of GlcNAc to fructose‐6‐phosphate are clustered. In this work, we analysed GlcNAc catabolism in ascomycete filamentous fungi and found that the respective genes are also clustered in these fungi. In contrast to C. albicans, the cluster often contains a gene for an Ndt80‐like transcription factor, which we named RON1 (regulator of N‐acetylglucosamine catabolism 1). Further, a gene for a glycoside hydrolase 3 protein related to bacterial N‐acetylglucosaminidases can be found in the GlcNAc gene cluster in filamentous fungi. Functional analysis in Trichoderma reesei showed that the transcription factor RON1 is a key activator of the GlcNAc gene cluster and essential for GlcNAc catabolism. Furthermore, we present an evolutionary analysis of Ndt80‐like proteins in Ascomycota. All GlcNAc cluster genes, as well as the GlcNAc transporter gene ngt1, and an additional transcriptional regulator gene, csp2, encoding the homolog of Neurospora crassa CSP2/GRHL, were functionally characterised by gene expression analysis and phenotypic characterisation of knockout strains in T. reesei. PMID:26481444

  12. Identification and Characterization of Mycemycin Biosynthetic Gene Clusters in Streptomyces olivaceus FXJ8.012 and Streptomyces sp. FXJ1.235

    Directory of Open Access Journals (Sweden)

    Fangying Song

    2018-03-01

    Mycemycins A–E are new members of the dibenzoxazepinone (DBP) family, derived from the gntR gene-disrupted deep sea strain Streptomyces olivaceus FXJ8.012Δ1741 and the soil strain Streptomyces sp. FXJ1.235. In this paper, we report the identification of the gene clusters and the inference of pathways for mycemycin biosynthesis in the two strains. Bioinformatics analyses of the genome sequences of S. olivaceus FXJ8.012Δ1741 and S. sp. FXJ1.235 predicted two divergent mycemycin gene clusters, mym and mye, respectively. Heterologous expression of the key enzyme genes of mym and genetic manipulation of mye as well as a feeding study in S. sp. FXJ1.235 confirmed the gene clusters and led to the proposed biosynthetic pathways for mycemycins. To the best of our knowledge, this is the first report on DBP biosynthetic gene clusters and pathways.

  13. Molecular Typing and Virulence Gene Profiles of Enterotoxin Gene Cluster (egc)-Positive Staphylococcus aureus Isolates Obtained from Various Food and Clinical Specimens.

    Science.gov (United States)

    Song, Minghui; Shi, Chunlei; Xu, Xuebing; Shi, Xianming

    2016-11-01

    The enterotoxin gene cluster (egc) has been proposed to contribute to Staphylococcus aureus colonization, which highlights the need to evaluate genetic diversity and virulence gene profiles of the egc-positive population. Here, a total of 43 egc-positive isolates (16.2%) were identified from 266 S. aureus isolates that were obtained from various food and clinical specimens in Shanghai. Seven different egc profiles were found based on the polymerase chain reaction (PCR) results for egc genes. Then, these 43 egc-positive isolates were further typed by multilocus sequence typing, pulsed-field gel electrophoresis (PFGE), multiple-locus variable-number tandem-repeat analysis (MLVA), and accessory gene regulator (agr) typing. The 43 egc-positive isolates displayed 17 sequence types, 28 PFGE patterns, 29 MLVA types, and 4 agr types. Among them, the dominant clonal lineage was CC5-agr II (48.84%). Thirty toxin and 20 adhesion-associated genes were detected by PCR in egc-positive isolates. Notably, invasive toxin genes showed a high prevalence, such as 76.7% for Panton-Valentine leukocidin encoding genes, 27.9% for sec, and 23.3% for tsst-1. Most of the examined adhesion-associated genes were found to be conserved (76.7-100%), whereas the fnbB gene was only found in 8 (18.6%) isolates. In addition, 33 toxin gene profiles and 13 adhesion gene profiles were identified. Our results imply that isolates belonging to the same clonal lineage harbored similar adhesion gene profiles but diverse toxin gene profiles. Overall, the high prevalence of invasive virulence genes increases the potential risk of egc-positive isolates in S. aureus infection.
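    The prevalence figures in this abstract can be cross-checked with simple arithmetic. The sketch below recomputes the two percentages for which the abstract states both the count and the denominator (43 of 266, and 8 of 43), and back-calculates the isolate counts implied by the other percentages. The helper names are hypothetical, and the back-calculated counts are inferences from rounding, not values reported by the authors.

```python
def percent(part: int, whole: int, digits: int = 1) -> float:
    """Percentage of `whole` represented by `part`, rounded to `digits` places."""
    return round(100.0 * part / whole, digits)


def implied_count(pct: float, whole: int) -> int:
    """Nearest whole number of isolates consistent with a reported percentage."""
    return round(pct / 100.0 * whole)


TOTAL_ISOLATES = 266  # stated in the abstract
EGC_POSITIVE = 43     # stated in the abstract

# Percentages whose numerator and denominator are both given in the abstract.
print(percent(EGC_POSITIVE, TOTAL_ISOLATES))  # egc-positive share of all isolates
print(percent(8, EGC_POSITIVE))               # fnbB-positive share of egc-positive isolates

# Counts inferred from reported percentages (assumption: denominator is 43).
reported = {"CC5-agr II": 48.84, "PVL genes": 76.7, "sec": 27.9, "tsst-1": 23.3}
for label, pct in reported.items():
    print(label, implied_count(pct, EGC_POSITIVE))
```

    If an implied count, converted back to a percentage, does not reproduce the reported figure, that would suggest a different denominator was used; here the check is only a sanity test on the abstract's internal consistency.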

  14. Identification of the chelocardin biosynthetic gene cluster from Amycolatopsis sulphurea: a platform for producing novel tetracycline antibiotics.

    Science.gov (United States)

    Lukežič, Tadeja; Lešnik, Urška; Podgoršek, Ajda; Horvat, Jaka; Polak, Tomaž; Šala, Martin; Jenko, Branko; Raspor, Peter; Herron, Paul R; Hunter, Iain S; Petković, Hrvoje

    2013-12-01

    Tetracyclines (TCs) are medically important antibiotics from the polyketide family of natural products. Chelocardin (CHD), produced by Amycolatopsis sulphurea, is a broad-spectrum tetracyclic antibiotic with potent bacteriolytic activity against a number of Gram-positive and Gram-negative multi-resistant pathogens. CHD has an unknown mode of action that is different from TCs. It has some structural features that define it as 'atypical' and, notably, is active against tetracycline-resistant pathogens. Identification and characterization of the chelocardin biosynthetic gene cluster from A. sulphurea revealed 18 putative open reading frames including a type II polyketide synthase. Compared to typical TCs, the chd cluster contains a number of features that relate to its classification as 'atypical': an additional gene for a putative two-component cyclase/aromatase that may be responsible for the different aromatization pattern, a gene for a putative aminotransferase for C-4 with the opposite stereochemistry to TCs and a gene for a putative C-9 methylase that is a unique feature of this biosynthetic cluster within the TCs. Collectively, these enzymes deliver a molecule with different aromatization of ring C that results in an unusual planar structure of the TC backbone. This is a likely contributor to its different mode of action. In addition CHD biosynthesis is primed with acetate, unlike the TCs, which are primed with malonamate, and offers a biosynthetic engineering platform that represents a unique opportunity for efficient generation of novel tetracyclic backbones using combinatorial biosynthesis.

  15. Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome

    Directory of Open Access Journals (Sweden)

    Dougan Gordon

    2009-12-01

    Background: Host defense peptides are a critical component of the innate immune system. Human alpha- and beta-defensin genes are subject to copy number variation (CNV) and historically the organization of mouse alpha-defensin genes has been poorly defined. Here we present the first full manual genomic annotation of the mouse defensin region on Chromosome 8 of the reference strain C57BL/6J, and the analysis of the orthologous regions of the human and rat genomes. Problems were identified with the reference assemblies of all three genomes. Defensins have been studied for over two decades and their naming has become a critical issue due to incorrect identification of defensin genes derived from different mouse strains and the duplicated nature of this region. Results: The defensin gene cluster region on mouse Chromosome 8 A2 contains 98 gene loci: 53 are likely active defensin genes and 22 defensin pseudogenes. Several TATA box motifs were found for human and mouse defensin genes that likely impact gene expression. Three novel defensin genes belonging to the Cryptdin Related Sequences (CRS) family were identified. All additional mouse defensin loci on Chromosomes 1, 2 and 14 were annotated and unusual splice variants identified. Comparison of the mouse alpha-defensins in the three main mouse reference gene sets, Ensembl, Mouse Genome Informatics (MGI), and NCBI RefSeq, reveals significant inconsistencies in annotation and nomenclature. We are collaborating with the Mouse Genome Nomenclature Committee (MGNC) to establish a standardized naming scheme for alpha-defensins. Conclusions: Prior to this analysis, there was no reliable reference gene set available for the mouse strain C57BL/6J defensin genes, demonstrating that manual intervention is still critical for the annotation of complex gene families and heavily duplicated regions. Accurate gene annotation is facilitated by the annotation of pseudogenes and regulatory elements. Manually curated gene

  16. Defining reference sequences for Nocardia species by similarity and clustering analyses of 16S rRNA gene sequence data.

    Directory of Open Access Journals (Sweden)

    Manal Helal

    Full Text Available BACKGROUND: The intra- and inter-species genetic diversity of bacteria and the absence of 'reference', or the most representative, sequences of individual species present a significant challenge for sequence-based identification. The aims of this study were to determine the utility, and compare the performance of several clustering and classification algorithms to identify the species of 364 sequences of 16S rRNA gene with a defined species in GenBank, and 110 sequences of 16S rRNA gene with no defined species, all within the genus Nocardia. METHODS: A total of 364 16S rRNA gene sequences of Nocardia species were studied. In addition, 110 16S rRNA gene sequences assigned only to the Nocardia genus level at the time of submission to GenBank were used for machine learning classification experiments. Different clustering algorithms were compared with a novel algorithm or the linear mapping (LM) of the distance matrix. Principal Components Analysis was used for the dimensionality reduction and visualization. RESULTS: The LM algorithm achieved the highest performance and classified the set of 364 16S rRNA sequences into 80 clusters, the majority of which (83.52%) corresponded with the original species. The most representative 16S rRNA sequences for individual Nocardia species have been identified as 'centroids' in respective clusters from which the distances to all other sequences were minimized; 110 16S rRNA gene sequences with identifications recorded only at the genus level were classified using machine learning methods. Simple kNN machine learning demonstrated the highest performance and classified Nocardia species sequences with an accuracy of 92.7% and a mean frequency of 0.578. CONCLUSION: The identification of centroids of 16S rRNA gene sequence clusters using novel distance matrix clustering enables the identification of the most representative sequences for each individual species of Nocardia and allows the quantitation of inter- and intra
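    The 'centroid' idea in the abstract above (the sequence in each cluster whose distances to all other members are smallest) can be illustrated with a short sketch. This is not the authors' LM algorithm; it only picks, for clusters that are already given, the member with the minimal summed distance. The distance matrix, the labels, and the function name `cluster_medoids` are hypothetical and chosen for illustration only.

```python
def cluster_medoids(dist, labels):
    """Return {cluster label: index of the member whose summed distance
    to the other members of the same cluster is smallest}.

    dist   -- square, symmetric matrix (list of lists) of pairwise distances
    labels -- cluster label for each row/column of dist
    Ties are broken in favour of the lowest index.
    """
    members = {}
    for idx, label in enumerate(labels):
        members.setdefault(label, []).append(idx)

    medoids = {}
    for label, idxs in members.items():
        # Sum of within-cluster distances for each candidate representative.
        medoids[label] = min(idxs, key=lambda i: sum(dist[i][j] for j in idxs))
    return medoids


# Toy data (hypothetical): five sequences in two clusters.
toy_dist = [
    [0, 1, 4, 9, 9],
    [1, 0, 2, 9, 9],
    [4, 2, 0, 9, 9],
    [9, 9, 9, 0, 3],
    [9, 9, 9, 3, 0],
]
toy_labels = ["A", "A", "A", "B", "B"]

representatives = cluster_medoids(toy_dist, toy_labels)
print(representatives)
```

    In the toy data, sequence 1 represents cluster A because its within-cluster distances sum to 3, against 5 and 6 for the other two members.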

  17. Clustered organization, polycistronic transcription, and evolution of modification-guide snoRNA genes in Euglena gracilis.

    Science.gov (United States)

    Moore, Ashley N; Russell, Anthony G

    2012-01-01

    Previous studies have shown that the eukaryotic microbe Euglena gracilis contains an unusually large assortment of small nucleolar RNAs (snoRNAs) and ribosomal RNA (rRNA) modification sites. However, little is known about the evolutionary mechanisms contributing to this situation. In this study, we have examined the organization and evolution of snoRNA genes in Euglena with the additional objective of determining how these properties relate to the rRNA modification pattern in this protist. We have identified and extensively characterized a clustered pattern of genes encoding previously biochemically isolated snoRNA sequences in E. gracilis. We show that polycistronic transcription is a prevalent snoRNA gene expression strategy in this organism. Further, we have identified 121 new snoRNA coding regions through sequence analysis of these clusters. We have identified an E. gracilis U14 snoRNA homolog clustered with modification-guide snoRNA genes. The U14 snoRNAs in other eukaryotic organisms examined to date typically contain both a modification and a processing domain. E. gracilis U14 lacks the modification domain but retains the processing domain. Our analysis of U14 structure and evolution in Euglena and other eukaryotes allows us to propose a model for its evolution and suggest its processing role may be its more important function, explaining its conservation in many eukaryotes. The preponderance of apparent small and larger-scale duplication events in the genomic regions we have characterized in Euglena provides a mechanism for the generation of the unusually diverse collection and abundance of snoRNAs and modified rRNA sites. Our findings provide the framework for more extensive whole genome analysis to elucidate whether these snoRNA gene clusters are spread across multiple chromosomes and/or form dense "arrays" at a limited number of chromosomal loci.

  18. A pseudogene cluster in the leader region of the Euglena chloroplast 16S-23S rRNA genes.

    Science.gov (United States)

    Miyata, T; Kikuno, R; Ohshima, Y

    1982-01-01

    The nucleotide sequence of a region (leader region) preceding the 5'-end of 16S-23S rRNA gene region of Euglena gracilis chloroplast DNA was compared with the homologous sequences that code for the 16S-23S rRNA operons of Euglena and E. coli. The leader region shows close homology in sequence to the 16S-23S rRNA gene region of Euglena (Orozco et al. (1980) J. Biol. Chem. 255, 10997-11003) as well as to the rrnD operon of E. coli, suggesting that it was derived from the 16S-23S rRNA gene region by gene duplication. It was shown that the leader region had accumulated nucleotide substitutions at an extremely rapid rate in its entirety, similar to the rate of tRNAIle pseudogene identified in the leader region. In addition, the leader region shows a unique base content which is quite distinct from those of 16S-23S rRNA gene regions of Euglena and E. coli, but again is similar to that of the tRNAIle pseudogene. The above two results strongly suggest that the leader region contains a pseudogene cluster which was derived from a gene cluster coding for the functional 16S-23S rRNA operon possibly by imperfect duplication during evolution of Euglena chloroplast DNA. PMID:7041094

  19. Phylogeography of var gene repertoires reveals fine-scale geospatial clustering of Plasmodium falciparum populations in a highly endemic area.

    Science.gov (United States)

    Tessema, Sofonias K; Monk, Stephanie L; Schultz, Mark B; Tavul, Livingstone; Reeder, John C; Siba, Peter M; Mueller, Ivo; Barry, Alyssa E

    2015-01-01

    Plasmodium falciparum malaria is a major global health problem that is being targeted for progressive elimination. Knowledge of local disease transmission patterns in endemic countries is critical to these elimination efforts. To investigate fine-scale patterns of malaria transmission, we have compared repertoires of rapidly evolving var genes in a highly endemic area. A total of 3680 high-quality DBLα-sequences were obtained from 68 P. falciparum isolates from ten villages spread over two distinct catchment areas on the north coast of Papua New Guinea (PNG). Modelling of the extent of var gene diversity in the two parasite populations predicts more than twice as many var gene alleles circulating within each catchment (Mugil = 906; Wosera = 1094) than previously recognized in PNG (Amele = 369). In addition, there were limited levels of var gene sharing between populations, consistent with local parasite population structure. Phylogeographic analyses demonstrate that while neutrally evolving microsatellite markers identified population structure only at the catchment level, var gene repertoires reveal further fine-scale geospatial clustering of parasite isolates. The clustering of parasite isolates by village in Mugil, but not in Wosera was consistent with the physical and cultural isolation of the human populations in the two catchments. The study highlights the microheterogeneity of P. falciparum transmission in highly endemic areas and demonstrates the potential of var genes as markers of local patterns of parasite population structure. © 2014 John Wiley & Sons Ltd.

  20. Molecular identification and characterization of clustered regularly interspaced short palindromic repeat (CRISPR) gene cluster in Taylorella equigenitalis.

    Science.gov (United States)

    Hara, Yasushi; Hayashi, Kyohei; Nakajima, Takuya; Kagawa, Shizuko; Tazumi, Akihiro; Moore, John E; Matsuda, Motoo

    2013-09-01

    Clustered regularly interspaced short palindromic repeats (CRISPRs), of approximately 10,000 base pairs (bp) in length, were shown to occur in the Japanese Taylorella equigenitalis strain, EQ59. The locus was composed of the putative CRISPRs-associated with 5 (cas5), RAMP csd1, csd2, recB, cas1, a leader region, 13 CRISPR consensus sequence repeats (each 32 bp; 5'-TCAGCCACGTTCGCGTGGCTGTGTGTTTAAAG-3'). These were in turn separated by 12 non repetitive unique spacer regions of similar length. In addition, a leader region, a transposase/IS protein, a leader region, and cas3 were also seen. All seven putative open reading frames carry their ribosome binding sites. Promoter consensus sequences at the -35 and -10 regions and putative intrinsic ρ-independent transcription terminator regions also occurred. A possible long overlap of 170 bp in length occurred between the recB and cas1 loci. Positive reverse transcription PCR signals of cas5, RAMP csd1, csd2-recB/cas1, and cas3 were generated. A putative secondary structure of the CRISPR consensus repeats was constructed. Following this, CRISPR results of the T. equigenitalis EQ59 isolate were subsequently compared with those from the Taylorella asinigenitalis MCE3 isolate.

  1. Overexpression of Hoxc13 in differentiating keratinocytes results in downregulation of a novel hair keratin gene cluster and alopecia.

    Science.gov (United States)

    Tkatchenko, A V; Visconti, R P; Shang, L; Papenbrock, T; Pruett, N D; Ito, T; Ogawa, M; Awgulewitsch, A

    2001-05-01

    Studying the roles of Hox genes in normal and pathological development of skin and hair requires identification of downstream target genes in genetically defined animal models. We show that transgenic mice overexpressing Hoxc13 in differentiating keratinocytes of hair follicles develop alopecia, accompanied by a progressive pathological skin condition that resembles ichthyosis. Large-scale analysis of differential gene expression in postnatal skin of these mice identified 16 previously unknown and 13 known genes as presumptive Hoxc13 targets. The majority of these targets are downregulated and belong to a subgroup of genes that encode hair-specific keratin-associated proteins (KAPs). Genomic mapping using a mouse hamster radiation hybrid panel showed these genes to reside in a novel KAP gene cluster on mouse chromosome 16 in a region of conserved linkage with human chromosome 21q22.11. Furthermore, data obtained by Hoxc13/lacZ reporter gene analysis in mice that overexpress Hoxc13 suggest negative autoregulatory feedback control of Hoxc13 expression levels, thus providing an entry point for elucidating currently unknown mechanisms that are required for regulating quantitative levels of Hox gene expression. Combined, these results provide a framework for understanding molecular mechanisms of Hoxc13 function in hair growth and development.

  2. Prevalence of the lmo0036-0043 gene cluster encoding arginine deiminase and agmatine deiminase systems in Listeria monocytogenes.

    Science.gov (United States)

    Chen, Jianshun; Chen, Fan; Cheng, Changyong; Fang, Weihuan

    2013-04-01

    Arginine deiminase and agmatine deiminase systems are involved in acid tolerance, and their encoding genes form the cluster lmo0036-0043 in Listeria monocytogenes. While lmo0042 and lmo0043 were conserved in all L. monocytogenes strains, the lmo0036-0041 region of this cluster was identified in all lineages I and II, and the majority of lineage IV (83.3%) strains, but absent in all lineage III and a small fraction of lineage IV (16.7%) strains, suggesting that the presence of the complete lmo0036-0043 cluster is dependent on lineages. lmo0036-0043-complete and -deficient lineage IV strains exhibit specific ascB-dapE profiles, which might represent two subpopulations with distinct genetic characteristics.

  3. Mapping in an apple (Malus x domestica) F1 segregating population based on physical clustering of differentially expressed genes.

    Science.gov (United States)

    Jensen, Philip J; Fazio, Gennaro; Altman, Naomi; Praul, Craig; McNellis, Timothy W

    2014-04-04

    Apple tree breeding is slow and difficult due to long generation times, self-incompatibility, and complex genetics. The identification of molecular markers linked to traits of interest is a way to expedite the breeding process. In the present study, we aimed to identify genes whose steady-state transcript abundance was associated with inheritance of specific traits segregating in an apple (Malus × domestica) rootstock F1 breeding population, including resistance to powdery mildew (Podosphaera leucotricha) disease and woolly apple aphid (Eriosoma lanigerum). Transcription profiling was performed for 48 individual F1 apple trees from a cross of two highly heterozygous parents, using RNA isolated from healthy, actively-growing shoot tips and a custom apple DNA oligonucleotide microarray representing 26,000 unique transcripts. Genome-wide expression profiles were not clear indicators of powdery mildew or woolly apple aphid resistance phenotype. However, standard differential gene expression analysis between phenotypic groups of trees revealed relatively small sets of genes with trait-associated expression levels. For example, thirty genes were identified that were differentially expressed between trees resistant and susceptible to powdery mildew. Interestingly, the genes encoding twenty-four of these transcripts were physically clustered on chromosome 12. Similarly, seven genes were identified that were differentially expressed between trees resistant and susceptible to woolly apple aphid, and the genes encoding five of these transcripts were also clustered, this time on chromosome 17. In each case, the gene clusters were in the vicinity of previously identified major quantitative trait loci for the corresponding trait. Similar results were obtained for a series of molecular traits. Several of the differentially expressed genes were used to develop DNA polymorphism markers linked to powdery mildew disease and woolly apple aphid resistance. Gene expression profiling

  4. In silico analysis highlights the frequency and diversity of type 1 lantibiotic gene clusters in genome sequenced bacteria

    LENUS (Irish Health Repository)

    Marsh, Alan J

    2010-11-30

    Abstract Background Lantibiotics are lanthionine-containing, post-translationally modified antimicrobial peptides. These peptides have significant, but largely untapped, potential as preservatives and chemotherapeutic agents. Type 1 lantibiotics are those in which lanthionine residues are introduced into the structural peptide (LanA) through the activity of separate lanthionine dehydratase (LanB) and lanthionine synthetase (LanC) enzymes. Here we take advantage of the conserved nature of LanC enzymes to devise an in silico approach to identify potential lantibiotic-encoding gene clusters in genome sequenced bacteria. Results In total 49 novel type 1 lantibiotic clusters were identified which unexpectedly were associated with species, genera and even phyla of bacteria which have not previously been associated with lantibiotic production. Conclusions Multiple type 1 lantibiotic gene clusters were identified at a frequency that suggests that these antimicrobials are much more widespread than previously thought. These clusters represent a rich repository which can yield a large number of valuable novel antimicrobials and biosynthetic enzymes.

  5. Identification and activation of novel biosynthetic gene clusters by genome mining in the kirromycin producer Streptomyces collinus Tü 365

    DEFF Research Database (Denmark)

    Iftime, Dumitrita; Kulik, Andreas; Härtner, Thomas

    2016-01-01

    Streptomycetes are prolific sources of novel biologically active secondary metabolites with pharmaceutical potential. S. collinus Tü 365 is a Streptomyces strain, isolated in 1972 from Kouroussa (Guinea). It is best known as a producer of the antibiotic kirromycin, an inhibitor of the protein biosynth...... of a lanthipeptide, a carotenoid, five terpenoid compounds, an ectoine, a siderophore and a spore pigment-associated gene cluster to their respective biosynthesis products....

  6. Variants in linkage disequilibrium with the late cornified envelope gene cluster deletion are associated with susceptibility to psoriatic arthritis.

    LENUS (Irish Health Repository)

    Bowes, John

    2010-12-01

    A common deletion mapping to the psoriasis susceptibility locus 4 on chromosome 1q21, encompassing two genes of the late cornified envelope (LCE) gene cluster, has been associated with an increased risk of psoriasis vulgaris (PsV). One previous report found no association of the deletion with psoriatic arthritis (PsA), suggesting it may be a specific risk factor for PsV. Given the genetic overlap between PsA and PsV, a study was undertaken to investigate whether single nucleotide polymorphisms (SNPs) mapping to this locus are risk factors for PsA in a UK and Irish population.

  7. Beta-globin gene cluster haplotypes in the Mapuche Indians of Argentina

    Directory of Open Access Journals (Sweden)

    Letícia Kaufman

    1998-12-01

    Full Text Available Haplotypes derived from five polymorphic restriction sites in the beta-globin gene cluster were investigated in 86 chromosomes from the Argentinian Mapuche. These results were integrated with those previously obtained for ten Brazilian Indian tribes. Eight haplotypes were identified, the most frequent being 2 (57%) and 6 (27%). The presence of haplotype 3 in 2% of the Mapuche chromosomes is probably evidence of admixture with individuals of African ancestry. Due to the high number of haplotypes observed, heterozygosity as measured by the Gini-Simpson index was higher in the Mapuche than in Brazilian Indians. The haplotypic distribution in the Mapuche was also significantly different from those of all Brazilian tribes investigated. This heterogeneity could be at least partially explained by admixture with non-Indian populations.
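    For readers unfamiliar with the Gini-Simpson index mentioned in this abstract, the sketch below shows how it is computed from haplotype counts: one minus the sum of squared haplotype frequencies. The counts used here are hypothetical toy values, not the study's data (the abstract reports frequencies for only three of the eight haplotypes), and the function name `gini_simpson` is an assumption made for illustration.

```python
def gini_simpson(counts):
    """Gini-Simpson index: 1 - sum(p_i ** 2), where p_i is the frequency
    of haplotype i among the sampled chromosomes."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("at least one chromosome must be counted")
    return 1.0 - sum((c / total) ** 2 for c in counts)


# Hypothetical haplotype counts for two toy populations (not study data).
many_haplotypes = [5, 3, 2]   # three haplotypes, fairly even
few_haplotypes = [9, 1]       # one haplotype dominates

print(round(gini_simpson(many_haplotypes), 2))
print(round(gini_simpson(few_haplotypes), 2))
```

    The first toy population scores higher because its chromosomes are spread over more haplotypes at more even frequencies, which is the pattern the abstract describes for the Mapuche relative to the Brazilian tribes.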

  8. Total alpha-globin gene cluster deletion has high frequency in Filipinos

    Energy Technology Data Exchange (ETDEWEB)

    Hunt, J.A.; Haruyama, A.Z.; Chu, B.M. [Kapiolani Medical Center, Honolulu, HI (United States)] [and others]

    1994-09-01

    Most α-thalassemias [Thal] are due to large deletions. In Southeast Asians, the (--SEA) double α-globin gene deletion is common; 3 (--Tot) total α-globin cluster deletions are known: Filipino (--Fil), Thai (--Thai), and Chinese (--Chin). In a Hawaii Thal project, provisional diagnosis of α-Thal-1 heterozygotes was based on microcytosis, normal isoelectric focusing, and no iron deficiency. One in 10 unselected Filipinos was an α-Thal-1 heterozygote; 2/3 of these had a (--Tot) deletion: a ζ-cDNA probe consistently showed fainter intensity of the constant 5.5 kb ζ2 BamHI band, with no heterozygosity for ζ-globin region polymorphisms; α-cDNA or ζ-cDNA probes showed no BamHI or BglII bands diagnostic of the (--SEA) deletion; bands for the (-α) α-Thal-2 single α-globin deletions were only seen in Hb H cases. A reliable monoclonal anti-ζ-peptide antibody test for the (--SEA) deletion was always negative in (--Tot) samples. Southern digests with the Lo probe, a gift from D. Higgs of Oxford Univ., confirmed that 49 of 50 (--Tot) chromosomes in Filipinos were (--Fil). Of 20 α-Thal-1 hydrops born to Filipinos, 11 were (--Fil/--SEA) compound heterozygotes; 9 were (--SEA/--SEA) homozygotes, but none was a (--Fil/--Fil).

  9. Cracking the regulatory code of biosynthetic gene clusters as a strategy for natural product discovery.

    Science.gov (United States)

    Rigali, Sébastien; Anderssen, Sinaeda; Naômé, Aymeric; van Wezel, Gilles P

    2018-01-05

    The World Health Organization (WHO) describes antibiotic resistance as "one of the biggest threats to global health, food security, and development today", as the number of multi- and pan-resistant bacteria is rising dangerously. Acquired resistance phenomena also impair antifungals, antivirals, and anti-cancer drug therapy, while herbicide resistance in weeds threatens the crop industry. On the positive side, it is likely that the chemical space of natural products goes far beyond what has currently been discovered. This idea is fueled by genome sequencing of microorganisms which unveiled numerous so-called cryptic biosynthetic gene clusters (BGCs), many of which are transcriptionally silent under laboratory culture conditions, and by the fact that most bacteria cannot yet be cultivated in the laboratory. However, brute force antibiotic discovery does not yield the same results as it did in the past, and researchers have had to develop creative strategies in order to unravel the hidden potential of microorganisms such as Streptomyces and other antibiotic-producing microorganisms. Identifying the cis elements and their corresponding transcription factor(s) involved in the control of BGCs through bioinformatic approaches is a promising strategy. Theoretically, we are a few 'clicks' away from unveiling the culturing conditions or genetic changes needed to activate the production of cryptic metabolites or increase the production yield of known compounds to make them economically viable. In this opinion article, we describe and illustrate the idea behind 'cracking' the regulatory code for natural product discovery, by presenting a series of proofs of concept, and discuss what still should be achieved to increase the rate of success of this strategy. Copyright © 2018 Elsevier Inc. All rights reserved.

  10. Serum paraoxonase activity is associated with variants in the PON gene cluster and risk of Alzheimer disease.

    Science.gov (United States)

    Erlich, Porat M; Lunetta, Kathryn L; Cupples, L Adrienne; Abraham, Carmela R; Green, Robert C; Baldwin, Clinton T; Farrer, Lindsay A

    2012-05-01

    Previous studies have shown association of single nucleotide polymorphisms (SNPs) in 3 contiguous genes (PON1, PON2, and PON3) encoding paraoxonase with risk of Alzheimer disease (AD). We evaluated the association of serum paraoxonase activity measured by phenyl acetate (PA) and thiobutyl butyrolactone (TBBL) with risk of AD and with 26 SNPs spanning the PON gene cluster in 266 AD cases and 306 sibling controls from the MIRAGE study. The odds of AD (adjusted for age, gender, and ethnicity) increased 20% for each standard deviation decrease in PA or TBBL activity. There were association signals with activity in all 3 genes. Haplotypes including SNPs spanning the PON genes were generally more significant than haplotypes comprising SNPs from 1 gene. Significant interactions were observed between SNP pairs located across the PON cluster with either serum activity measure as the outcome, and between several PON SNPs and PA activity with AD status as the outcome. Our results suggest that low serum paraoxonase activity is a risk factor for AD. Furthermore, multiple variants in PON influence serum paraoxonase activity and their effects may be synergistic. Copyright © 2012 Elsevier Inc. All rights reserved.
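    The reported effect size (odds of AD up 20% per standard deviation decrease in activity) can be unpacked with a little arithmetic. The sketch below assumes the usual logistic-regression reading, in which the effect is multiplicative per standard deviation; that modelling assumption, the constant name `OR_PER_SD`, and the function name are illustrative and not taken from the abstract.

```python
import math

# Odds ratio per 1-SD decrease in serum activity, as reported (20% increase).
OR_PER_SD = 1.20


def odds_ratio_for_decrease(n_sd):
    """Odds ratio for a decrease of n_sd standard deviations, assuming the
    per-SD effect is multiplicative (log-odds linear in activity)."""
    return OR_PER_SD ** n_sd


# Corresponding log-odds coefficient per SD decrease under that assumption.
beta_per_sd = math.log(OR_PER_SD)

print(round(beta_per_sd, 3))                  # log-odds per SD
print(round(odds_ratio_for_decrease(2), 2))   # two SDs below the mean
```

    Under this assumption, a person two standard deviations below the mean activity would have about 1.44 times the odds of AD of a person at the mean, other covariates being equal.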

  11. The nitrate-reduction gene cluster components exert lineage-dependent contributions to optimization of Sinorhizobium symbiosis with soybeans.

    Science.gov (United States)

    Liu, Li Xue; Li, Qin Qin; Zhang, Yun Zeng; Hu, Yue; Jiao, Jian; Guo, Hui Juan; Zhang, Xing Xing; Zhang, Biliang; Chen, Wen Xin; Tian, Chang Fu

    2017-12-01

    Receiving nodulation and nitrogen fixation genes does not guarantee rhizobia an effective symbiosis with legumes. Here, variations in gene content were determined for three Sinorhizobium species showing contrasting symbiotic efficiency on soybeans. A nitrate-reduction gene cluster absent in S. sojae was found to be essential for symbiotic adaptations of S. fredii and S. sp. III. In S. fredii, the deletion mutation of the nap (nitrate reductase), instead of nir (nitrite reductase) and nor (nitric oxide reductase), led to defects in nitrogen-fixation (Fix-). By contrast, none of these core nitrate-reduction genes were required for the symbiosis of S. sp. III. However, within the same gene cluster, the deletion of hemN1 (encoding oxygen-independent coproporphyrinogen III oxidase) in both S. fredii and S. sp. III led to the formation of nitrogen-fixing (Fix+) but ineffective (Eff-) nodules. These Fix+/Eff- nodules were characterized by significantly lower enzyme activity of glutamine synthetase indicating rhizobial modulation of nitrogen-assimilation by plants. A distant homologue of HemN1 from S. sojae can complement this defect in S. fredii and S. sp. III, but exhibited a more pleiotropic role in symbiosis establishment. These findings highlighted the lineage-dependent optimization of symbiotic functions in different rhizobial species associated with the same host. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  12. Structure of the neutral capsular polysaccharide of Acinetobacter baumannii NIPH146 that carries the KL37 capsule gene cluster.

    Science.gov (United States)

    Arbatsky, Nikolay P; Shneider, Mikhail M; Kenyon, Johanna J; Shashkov, Alexander S; Popova, Anastasiya V; Miroshnikov, Konstantin A; Volozhantsev, Nikolay V; Knirel, Yuriy A

    2015-09-02

    Capsular polysaccharide (CPS) was isolated from Acinetobacter baumannii NIPH146, and the following structure of branched pentasaccharide repeating unit was established by sugar analyses along with 1D and 2D NMR spectroscopy: In comparison to most other known capsular polysaccharides of A. baumannii, the CPS studied is neutral and lacks any specific monosaccharide component. The synthesis, assembly and export of this structure could be attributed to genes in a novel capsule biosynthesis gene cluster, designated KL37, which was found in the NIPH146 genome. The CPS of A. baumannii NIPH146 shares the α-d-Galp-(1→6)-β-d-Glcp-(1→3)-d-GalpNAc-(1→ trisaccharide fragment with the CPS units of several A. baumannii strains, including ATCC 17978 and LUH 5537 that carry the KL3 and KL22 gene clusters, respectively. KL37 contains two genes for glycosyltransferases that are related to two glycosyltransferase genes present in both KL3 and KL22, and the encoded proteins could be tentatively assigned to linkages between sugars in the CPS repeat. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. Transcriptional analysis of the jamaicamide gene cluster from the marine cyanobacterium Lyngbya majuscula and identification of possible regulatory proteins

    Directory of Open Access Journals (Sweden)

    Dorrestein Pieter C

    2009-12-01

    Full Text Available Abstract Background The marine cyanobacterium Lyngbya majuscula is a prolific producer of bioactive secondary metabolites. Although biosynthetic gene clusters encoding several of these compounds have been identified, little is known about how these clusters of genes are transcribed or regulated, and techniques targeting genetic manipulation in Lyngbya strains have not yet been developed. We conducted transcriptional analyses of the jamaicamide gene cluster from a Jamaican strain of Lyngbya majuscula, and isolated proteins that could be involved in jamaicamide regulation. Results An unusually long untranslated leader region of approximately 840 bp is located between the jamaicamide transcription start site (TSS) and gene cluster start codon. All of the intergenic regions between the pathway ORFs were transcribed into RNA in RT-PCR experiments; however, a promoter prediction program indicated the possible presence of promoters in multiple intergenic regions. Because the functionality of these promoters could not be verified in vivo, we used a reporter gene assay in E. coli to show that several of these intergenic regions, as well as the primary promoter preceding the TSS, are capable of driving β-galactosidase production. A protein pulldown assay was also used to isolate proteins that may regulate the jamaicamide pathway. Pulldown experiments using the intergenic region upstream of jamA as a DNA probe isolated two proteins that were identified by LC-MS/MS. By BLAST analysis, one of these had close sequence identity to a regulatory protein in another cyanobacterial species. Protein comparisons suggest a possible correlation between secondary metabolism regulation and light dependent complementary chromatic adaptation. Electromobility shift assays were used to evaluate binding of the recombinant proteins to the jamaicamide promoter region. Conclusion Insights into natural product regulation in cyanobacteria are of significant value to drug discovery

  14. The effect of alcohol on the differential expression of cluster of differentiation 14 gene, associated pathways, and genetic network.

    Science.gov (United States)

    Zhou, Diana X; Zhao, Yinghong; Baker, Jessica A; Gu, Qingqing; Hamre, Kristin M; Yue, Junming; Jones, Byron C; Cook, Melloni N; Lu, Lu

    2017-01-01

    Alcohol consumption affects human health in part by compromising the immune system. In this study, we examined the expression of the Cd14 (cluster of differentiation 14) gene, which is involved in the immune system through a proinflammatory cascade. Expression was evaluated in BXD mice treated with saline or acute 1.8 g/kg i.p. ethanol (12.5% v/v). Hippocampal gene expression data were generated to examine differential expression and to perform systems genetics analyses. The Cd14 gene expression showed significant changes among the BXD strains after ethanol treatment, and eQTL mapping revealed that Cd14 is a cis-regulated gene. We also identified eighteen ethanol-related phenotypes correlated with Cd14 expression related to either ethanol responses or ethanol consumption. Pathway analysis was performed to identify possible biological pathways involved in the response to ethanol and Cd14. We also constructed a genetic network for Cd14 using the top 20 correlated genes and present several genes possibly involved in Cd14 and ethanol responses based on differential gene expression. In conclusion, we found Cd14, along with several other genes and pathways, to be involved in ethanol responses in the hippocampus, such as increased susceptibility to lipopolysaccharides and neuroinflammation.

  15. The effect of alcohol on the differential expression of cluster of differentiation 14 gene, associated pathways, and genetic network.

    Directory of Open Access Journals (Sweden)

    Diana X Zhou

    Full Text Available Alcohol consumption affects human health in part by compromising the immune system. In this study, we examined the expression of the Cd14 (cluster of differentiation 14) gene, which is involved in the immune system through a proinflammatory cascade. Expression was evaluated in BXD mice treated with saline or acute 1.8 g/kg i.p. ethanol (12.5% v/v). Hippocampal gene expression data were generated to examine differential expression and to perform systems genetics analyses. The Cd14 gene expression showed significant changes among the BXD strains after ethanol treatment, and eQTL mapping revealed that Cd14 is a cis-regulated gene. We also identified eighteen ethanol-related phenotypes correlated with Cd14 expression related to either ethanol responses or ethanol consumption. Pathway analysis was performed to identify possible biological pathways involved in the response to ethanol and Cd14. We also constructed a genetic network for Cd14 using the top 20 correlated genes and present several genes possibly involved in Cd14 and ethanol responses based on differential gene expression. In conclusion, we found Cd14, along with several other genes and pathways, to be involved in ethanol responses in the hippocampus, such as increased susceptibility to lipopolysaccharides and neuroinflammation.

  16. Cloning, reassembling and integration of the entire nikkomycin biosynthetic gene cluster into Streptomyces ansochromogenes lead to an improved nikkomycin production

    Directory of Open Access Journals (Sweden)

    Yang Haihua

    2010-01-01

    Full Text Available Abstract Background Nikkomycins are a group of peptidyl nucleoside antibiotics produced by Streptomyces ansochromogenes. They are competitive inhibitors of chitin synthase and show potent fungicidal, insecticidal, and acaricidal activities. Nikkomycin X and Z are the main components produced by S. ansochromogenes. Generation of a high-producing strain is crucial to scale up nikkomycin production for further clinical trials. Results To increase the yields of nikkomycins, an additional copy of the nikkomycin biosynthetic gene cluster (35 kb) was introduced into the nikkomycin-producing strain S. ansochromogenes 7100. The gene cluster was first reassembled into an integrative plasmid by Red/ET technology combined with classic cloning methods, and the resulting plasmid (pNIK) was then introduced into S. ansochromogenes by conjugal transfer. Introduction of pNIK led to enhanced production of nikkomycins (880 mg L-1, 4-fold nikkomycin X and 210 mg L-1, 1.8-fold nikkomycin Z) in the resulting exconjugants compared with the parent strain (220 mg L-1 nikkomycin X and 120 mg L-1 nikkomycin Z). The exconjugants are genetically stable in the absence of antibiotic resistance selection pressure. Conclusion A high nikkomycin-producing strain (1100 mg L-1 nikkomycins) was obtained by introduction of an extra nikkomycin biosynthetic gene cluster into the genome of S. ansochromogenes. The strategies presented here could be applicable to other bacteria to improve the yields of secondary metabolites.
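
    The fold changes quoted in this abstract can be re-derived from the reported titres. The sketch below is only an arithmetic check on the published numbers; the variable and function names are illustrative and do not come from the paper.

```python
# Titres reported in the abstract, in mg per litre.
parent = {"nikkomycin X": 220, "nikkomycin Z": 120}
exconjugant = {"nikkomycin X": 880, "nikkomycin Z": 210}


def fold_change(after, before):
    """Ratio of the exconjugant titre to the parent-strain titre."""
    return after / before


folds = {name: fold_change(exconjugant[name], parent[name]) for name in parent}
total = sum(exconjugant.values())

for name, fold in folds.items():
    print(f"{name}: {fold:.2f}-fold")
print(f"combined titre: {total} mg/L")
```

    The ratio for nikkomycin Z is 1.75, which the abstract rounds to 1.8-fold, and the combined titre of 1090 mg/L is consistent with the roughly 1100 mg L-1 quoted in the conclusion.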

  17. Evolution of a pentameral body plan was not linked to translocation of anterior Hox genes: the echinoderm HOX cluster revisited.

    Science.gov (United States)

    Byrne, Maria; Martinez, Pedro; Morris, Valerie

    2016-01-01

    Echinodermata is a large phylum of marine invertebrates characterized by an adult, pentameral body plan. This morphology is clearly derived as all members of Deuterostomia (the superphylum to which they belong) have a bilateral body plan. The origin of the pentameral plan has been the subject of intense debate. It is clear that the ancestor of Echinodermata had a bilateral plan but how this ancestor transformed its body "architecture" in such a drastic manner is not clear. Data from the fossil record and ontogeny are sparse and, so far, not very informative. The sequencing of the sea urchin genome a decade ago opened the possibility that the pentameral body plan was a consequence of a broken Hox cluster and a series of papers dwelt on the putative relationship between Hox gene arrangements in the chromosomes and the origin of pentamery. This relationship, sound as it was, is challenged by the revelation that the sea star HOX cluster is, in fact, intact, thus falsifying the hypothesis of a direct relationship between HOX cluster arrangement and the origin of the pentameral body plan. Here, we explore the relationship between Hox gene arrangements and echinoderm body "architecture," the expression of Hox genes in development and alternative scenarios for the origin of pentamery, with putative roles for signaling centers in generating multiple axes. © 2016 Wiley Periodicals, Inc.

  18. An original SERPINA3 gene cluster: Elucidation of genomic organization and gene expression in the Bos taurus 21q24 region

    Directory of Open Access Journals (Sweden)

    Ouali Ahmed

    2008-04-01

    Full Text Available Abstract Background The superfamily of serine proteinase inhibitors (serpins) is involved in numerous fundamental biological processes such as inflammation, blood coagulation and apoptosis. Our interest is focused on the SERPINA3 sub-family. The major human plasma protease inhibitor, α1-antichymotrypsin, encoded by the SERPINA3 gene, is homologous to genes organized in clusters in several mammalian species. However, although there is a similar genic organization with a high degree of sequence conservation, the reactive-centre-loop domains, which are responsible for the protease specificity, show significant divergences. Results We provide additional information by analyzing the situation of SERPINA3 in the bovine genome. A cluster of eight genes and one pseudogene sharing a high degree of identity and the same structural organization was characterized. Bovine SERPINA3 genes were localized by radiation hybrid mapping on 21q24 and spanned only 235 kilobases. For all these genes, we propose a new nomenclature from SERPINA3-1 to SERPINA3-8. They share approximately 70% identity with the human SERPINA3 homologue. In the cluster, we described an original sub-group of six members with an unexpectedly high degree of conservation for the reactive-centre-loop domain, suggesting a similar peptidase inhibitory pattern. Preliminary expression analyses of these bovSERPINA3s showed different tissue-specific patterns and diverse states of glycosylation and phosphorylation. Finally, in the context of phylogenetic analyses, we improved our knowledge of mammalian SERPINA evolution. Conclusion Our experimental results update data of the bovine genome sequencing, substantially increase the bovSERPINA3 sub-family and enrich the phylogenetic tree of serpins. We provide new opportunities for future investigations to approach the biological functions of this unusual subset of serine proteinase inhibitors.

  19. [Sequence analysis of 16S rDNA and pmoCAB gene cluster of trichloroethylene-degrading methanotroph].

    Science.gov (United States)

    Zhang, Yunru; Chen, Huaqing; Gao, Yanhui; Xing, Zhilin; Zhao, Tiantao

    2014-12-01

    Methanotrophs can degrade methane and various chlorinated hydrocarbons. Analysis of the methane monooxygenase gene cluster sequence would help to understand its catalytic mechanism and enhance its application in pollutant biodegradation. Methanotrophs were enriched and isolated with methane as the sole carbon source in nitrate mineral salt medium. Then, five chlorinated hydrocarbons were selected as cometabolic substrates to study the biodegradation. A phylogenetic tree of 16S rDNA was constructed using MEGA 5.05 software to identify the methanotroph strain. The pmoCAB gene cluster encoding particulate methane monooxygenase (pMMO) was amplified in segments by semi-nested PCR. ExPASy was used to analyze the theoretical molecular weight of the three pMMO subunits. As a result, a strain of methanotroph was isolated. The phylogenetic analysis indicated that the strain belongs to a species of Methylocystis, and it was named Methylocystis sp. JTC3. The degradation rate of trichloroethylene (TCE) reached 93.79% after 5 days when its initial concentration was 15.64 μmol/L. We obtained the pmoCAB gene cluster of 3 227 bp, including the pmoC gene of 771 bp, the pmoA gene of 759 bp, the pmoB gene of 1 260 bp and two noncoding sequences in the middle, by semi-nested PCR, T-A cloning and sequencing. The theoretical molecular weights of the corresponding gamma, beta and alpha subunits were 29.1 kDa, 28.6 kDa and 45.6 kDa, respectively, as analyzed using the ExPASy tool. The pmoCAB gene cluster of JTC3 was highly identical to that of Methylocystis sp. strain M as analyzed by Blast, and the pmoA sequence is more conserved than those of pmoC and pmoB. Finally, Methylocystis sp. JTC3 could degrade TCE efficiently, and the detailed analysis of pmoCAB from Methylocystis sp. JTC3 laid a solid foundation for further study of its active site features and its selectivity for chlorinated hydrocarbons.
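
    The gene lengths reported in this abstract allow a quick consistency check on the cluster layout. The following is a minimal sketch using only the figures quoted above; the variable names are illustrative, and the split of the remaining bases between the two noncoding sequences is not given in the abstract and is not guessed here.

```python
# Lengths quoted in the abstract, in base pairs.
cluster_bp = 3227
genes_bp = {"pmoC": 771, "pmoA": 759, "pmoB": 1260}

# Bases accounted for by the three genes, and what is left for the
# two noncoding sequences combined.
coding_bp = sum(genes_bp.values())
noncoding_bp = cluster_bp - coding_bp

# Each gene length should be a whole number of codons (multiples of 3).
codons = {gene: length // 3 for gene, length in genes_bp.items()}
whole_codons = all(length % 3 == 0 for length in genes_bp.values())

print(f"coding: {coding_bp} bp, noncoding: {noncoding_bp} bp")
print(f"codons per gene: {codons}, whole codons: {whole_codons}")
```

    Under these numbers the three genes cover 2790 bp, leaving 437 bp for the two noncoding sequences together, and each gene length is a whole number of codons.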

  20. Genome-Wide Analysis of Secondary Metabolite Gene Clusters in Ophiostoma ulmi and Ophiostoma novo-ulmi Reveals a Fujikurin-Like Gene Cluster with a Putative Role in Infection

    Directory of Open Access Journals (Sweden)

    Nicolau Sbaraini

    2017-06-01

    Full Text Available The emergence of new microbial pathogens can result in destructive outbreaks, since their hosts have limited resistance and pathogens may be excessively aggressive. Described as the major ecological incident of the twentieth century, Dutch elm disease, caused by ascomycete fungi from the Ophiostoma genus, has caused a significant decline in elm tree populations (Ulmus sp.) in North America and Europe. Genome sequencing of the two main causative agents of Dutch elm disease (Ophiostoma ulmi and Ophiostoma novo-ulmi), along with closely related species with different lifestyles, allows for unique comparisons to be made to identify how pathogens and virulence determinants have emerged. Among several established virulence determinants, secondary metabolites (SMs) have been suggested to play significant roles during phytopathogen infection. Interestingly, the secondary metabolism of Dutch elm pathogens remains almost unexplored, and little is known about how SM biosynthetic genes are organized in these species. To better understand the metabolic potential of O. ulmi and O. novo-ulmi, we performed a deep survey and description of SM biosynthetic gene clusters (BGCs) in these species and assessed their conservation among eight species from the Ophiostomataceae family. Among 19 identified BGCs, a fujikurin-like gene cluster (OpPKS8) was unique to Dutch elm pathogens. Phylogenetic analysis revealed that orthologs for this gene cluster are widespread among phytopathogens and plant-associated fungi, suggesting that OpPKS8 may have been horizontally acquired by the Ophiostoma genus. Moreover, the detailed identification of several BGCs paves the way for future in-depth research and supports the potential impact of secondary metabolism on Ophiostoma genus’ lifestyle.

  1. Identification and functional clustering of genes regulating muscle protein degradation from amongst the known C. elegans muscle mutants.

    Directory of Open Access Journals (Sweden)

    Freya Shephard

    Full Text Available Loss of muscle mass via protein degradation is an important clinical problem but we know little of how muscle protein degradation is regulated genetically. To gain insight our labs developed C. elegans into a model for understanding the regulation of muscle protein degradation. Past studies uncovered novel functional roles for genes affecting muscle and/or involved in signalling in other cells or tissues. Here we examine most of the genes previously identified as the sites of mutations affecting muscle for novel roles in regulating degradation. We evaluate genomic (RNAi knockdown) approaches and combine them with our established genetic (mutant) and pharmacologic (drugs) approaches to examine these 159 genes. We find that RNAi usually recapitulates both organismal and sub-cellular mutant phenotypes but RNAi, unlike mutants, can frequently be used acutely to study gene function solely in differentiated muscle. In the majority of cases where RNAi does not produce organismal level phenotypes, sub-cellular defects can be detected; disrupted proteostasis is most commonly observed. We identify 48 genes in which mutation or RNAi knockdown causes excessive protein degradation; myofibrillar and/or mitochondrial morphologies are also disrupted in 19 of these 48 cases. These 48 genes appear to act via at least three sub-networks to control bulk degradation of protein in muscle cytosol. Attachment to the extracellular matrix regulates degradation via unidentified proteases and affects myofibrillar and mitochondrial morphology. Growth factor imbalance and calcium overload promote lysosome based degradation whereas calcium deficit promotes proteasome based degradation; in both cases myofibrillar and mitochondrial morphologies are largely unaffected. Our results provide a framework for effectively using RNAi to identify and functionally cluster novel regulators of degradation. This clustering allows prioritization of candidate genes/pathways for future

  2. Organization and subcloning of the dacA-rodA-pbpA cluster of cell shape genes in Escherichia coli.

    Science.gov (United States)

    Stoker, N G; Broome-Smith, J K; Edelman, A; Spratt, B G

    1983-01-01

    The transducing bacteriophage lambda pBS10 carries a small cluster of Escherichia coli penicillin-binding protein/cell shape genes, including pbpA, rodA, and dacA. Deletion mapping and subcloning showed that these genes, and the gene for a cytoplasmic membrane protein of molecular weight 54,000, are located within a 5.6-kilobase region and are probably contiguous. The dacA gene, which codes for penicillin-binding protein 5, was cloned on a 1.5-kilobase fragment into a low-copy-number plasmid vector, but insertion into high-copy-number plasmids produced deleterious effects on bacterial growth, and the plasmids could not be stably maintained. The direction of transcription of dacA was determined. The rodA gene was cloned on a 1.6-kilobase fragment into both low- and high-copy-number plasmids, and the identification of its gene product is described in the accompanying paper (Stoker et al., J. Bacteriol. 155:854-859). The pbpA gene, which codes for penicillin-binding protein 2, was cloned on a 3.7-kilobase fragment in low-copy-number plasmids, but insertion of the fragment into high-copy-number plasmids resulted in deleterious effects on bacterial growth, and the plasmids could not be stably maintained. PMID:6348028

  3. Rapid transcriptional plasticity of duplicated gene clusters enables a clonally reproducing aphid to colonise diverse plant species.

    Science.gov (United States)

    Mathers, Thomas C; Chen, Yazhou; Kaithakottil, Gemy; Legeai, Fabrice; Mugford, Sam T; Baa-Puyoulet, Patrice; Bretaudeau, Anthony; Clavijo, Bernardo; Colella, Stefano; Collin, Olivier; Dalmay, Tamas; Derrien, Thomas; Feng, Honglin; Gabaldón, Toni; Jordan, Anna; Julca, Irene; Kettles, Graeme J; Kowitwanich, Krissana; Lavenier, Dominique; Lenzi, Paolo; Lopez-Gomollon, Sara; Loska, Damian; Mapleson, Daniel; Maumus, Florian; Moxon, Simon; Price, Daniel R G; Sugio, Akiko; van Munster, Manuella; Uzest, Marilyne; Waite, Darren; Jander, Georg; Tagu, Denis; Wilson, Alex C C; van Oosterhout, Cock; Swarbreck, David; Hogenhout, Saskia A

    2017-02-13

    The prevailing paradigm of host-parasite evolution is that arms races lead to increasing specialisation via genetic adaptation. Insect herbivores are no exception and the majority have evolved to colonise a small number of closely related host species. Remarkably, the green peach aphid, Myzus persicae, colonises plant species across 40 families and single M. persicae clonal lineages can colonise distantly related plants. This remarkable ability makes M. persicae a highly destructive pest of many important crop species. To investigate the exceptional phenotypic plasticity of M. persicae, we sequenced the M. persicae genome and assessed how one clonal lineage responds to host plant species of different families. We show that genetically identical individuals are able to colonise distantly related host species through the differential regulation of genes belonging to aphid-expanded gene families. Multigene clusters collectively upregulate in single aphids within two days upon host switch. Furthermore, we demonstrate the functional significance of this rapid transcriptional change using RNA interference (RNAi)-mediated knock-down of genes belonging to the cathepsin B gene family. Knock-down of cathepsin B genes reduced aphid fitness, but only on the host that induced upregulation of these genes. Previous research has focused on the role of genetic adaptation of parasites to their hosts. Here we show that the generalist aphid pest M. persicae is able to colonise diverse host plant species in the absence of genetic specialisation. This is achieved through rapid transcriptional plasticity of genes that have duplicated during aphid evolution.

  4. Draft genome sequence of Streptomyces coelicoflavus ZG0656 reveals the putative biosynthetic gene cluster of acarviostatin family α-amylase inhibitors.

    Science.gov (United States)

    Guo, X; Geng, P; Bai, F; Bai, G; Sun, T; Li, X; Shi, L; Zhong, Q

    2012-08-01

    The aims of this study are to obtain the draft genome sequence of Streptomyces coelicoflavus ZG0656, which produces novel acarviostatin family α-amylase inhibitors, and then to reveal the putative acarviostatin-related gene cluster and the biosynthetic pathway. The draft genome sequence of S. coelicoflavus ZG0656 was generated using a shotgun approach employing a combination of 454 and Solexa sequencing technologies. Genome analysis revealed a putative gene cluster for acarviostatin biosynthesis, termed sct-cluster. The cluster contains 13 acarviostatin synthetic genes, six transporter genes, four starch degrading or transglycosylation enzyme genes and two regulator genes. On the basis of bioinformatic analysis, we proposed a putative biosynthetic pathway of acarviostatins. The intracellular steps produce a structural core, acarviostatin I00-7-P, and the extracellular assemblies lead to diverse acarviostatin end products. The draft genome sequence of S. coelicoflavus ZG0656 revealed the putative biosynthetic gene cluster of acarviostatins and a putative pathway of acarviostatin production. To our knowledge, S. coelicoflavus ZG0656 is the first strain in this species for which a genome sequence has been reported. The analysis of sct-cluster provided important insights into the biosynthesis of acarviostatins. This work will be a platform for producing novel variants and yield improvement. © 2012 The Authors. Letters in Applied Microbiology © 2012 The Society for Applied Microbiology.

  5. Two gene clusters co-ordinate for a functional N-acetylglucosamine catabolic pathway in Vibrio cholerae.

    Science.gov (United States)

    Ghosh, Swagata; Rao, K Hanumantha; Sengupta, Manjistha; Bhattacharya, Sujit K; Datta, Asis

    2011-06-01

    Pathogenic microorganisms like Vibrio cholerae are capable of adapting to diverse living conditions, especially when they transit from their environmental reservoirs to the human host. V. cholerae attaches to N-acetylglucosamine (GlcNAc) residues in glycoproteins and lipids present in the intestinal epithelium and on the chitinous surface of zoo-phytoplanktons in the aquatic environment for its survival and colonization. GlcNAc utilization thus appears to be important for the pathogen to reach sufficient titres in the intestine for producing clinical symptoms of cholera. We report here the involvement of a second cluster of genes working in combination with the classical genes of GlcNAc catabolism, suggesting the occurrence of a novel variant of the process of biochemical conversion of GlcNAc to fructose-6-phosphate as has been described in other organisms. Colonization was severely attenuated in mutants that were incapable of utilizing GlcNAc. It was also shown that the N-acetylglucosamine-specific repressor (NagC) performs a dual role: while the classical GlcNAc catabolic genes are under its negative control, the genes belonging to the second cluster are positively regulated by it. Further application of tandem affinity purification to NagC revealed its interaction with a novel partner. Our results provide a genetic program that probably enables V. cholerae to successfully utilize amino sugars and also highlight a new mode of transcriptional regulation, not described in this organism. © 2011 Blackwell Publishing Ltd.

  6. Ancestral and derived attributes of the dlx gene repertoire, cluster structure and expression patterns in an African cichlid fish

    Directory of Open Access Journals (Sweden)

    Renz Adina J

    2011-01-01

    Full Text Available Abstract Background Cichlid fishes have undergone rapid, expansive evolutionary radiations that are manifested in the diversification of their trophic morphologies, tooth patterning and coloration. Understanding the molecular mechanisms that underlie the cichlids' unique patterns of evolution requires a thorough examination of genes that pattern the neural crest, from which these diverse phenotypes are derived. Among those genes, the homeobox-containing Dlx gene family is of particular interest since it is involved in the patterning of the brain, jaws and teeth. Results In this study, we characterized the dlx genes of an African cichlid fish, Astatotilapia burtoni, to provide a baseline to later allow cross-species comparison within Cichlidae. We identified seven dlx paralogs (dlx1a, -2a, -4a, -3b, -4b, -5a and -6a), whose orthologies were validated with molecular phylogenetic trees. The intergenic regions of three dlx gene clusters (dlx1a-2a, dlx3b-4b, and dlx5a-6a) were amplified with long PCR. Intensive cross-species comparison revealed a number of conserved non-coding elements (CNEs) that are shared with other percomorph fishes. This analysis highlighted additional lineage-specific gains/losses of CNEs in different teleost fish lineages and a novel CNE that had previously not been identified. Our gene expression analyses revealed overlapping but distinct expression of dlx orthologs in the developing brain and pharyngeal arches. Notably, four of the seven A. burtoni dlx genes, dlx2a, dlx3b, dlx4a and dlx5a, were expressed in the developing pharyngeal teeth. Conclusion This comparative study of the dlx genes of A. burtoni has deepened our knowledge of the diversity of the Dlx gene family, in terms of gene repertoire, expression patterns and non-coding elements. We have identified possible cichlid lineage-specific changes, including losses of a subset of dlx expression domains in the pharyngeal teeth, which will be the targets of future functional

  7. Molecular characterization of a conserved archaeal copper resistance (cop) gene cluster and its copper-responsive regulator in Sulfolobus solfataricus P2

    NARCIS (Netherlands)

    Ettema, T.J.G.; Brinkman, A.B.; Lamers, P.P.; Kornet, N.; Vos, de W.M.; Oost, van der J.

    2006-01-01

    Using a comparative genomics approach, a copper resistance gene cluster has been identified in multiple archaeal genomes. The cop cluster is predicted to encode a metallochaperone (CopM), a P-type copper-exporting ATPase (CopA) and a novel, archaea-specific transcriptional regulator (CopT) which

  8. Haplotype diversity of VvTFL1A gene and association with cluster traits in grapevine (V. vinifera).

    Science.gov (United States)

    Fernandez, Lucie; Le Cunff, Loïc; Tello, Javier; Lacombe, Thierry; Boursiquot, Jean Michel; Fournier-Level, Alexandre; Bravo, Gema; Lalet, Sandrine; Torregrosa, Laurent; This, Patrice; Martinez-Zapater, José Miguel

    2014-08-05

    Interaction between TERMINAL FLOWER 1 (TFL1) and LEAFY (LFY) seems to determine the inflorescence architecture in Arabidopsis. In a parallel way, overexpression of VvTFL1A, a grapevine TFL1 homolog, causes delayed flowering and production of a ramose cluster in the reiterated reproductive meristem (RRM) somatic variant of cultivar Carignan. To analyze the possible contribution of this gene to cluster phenotypic variation in a diversity panel of cultivated grapevine (Vitis vinifera L. subsp. vinifera), its nucleotide diversity was characterized and association analyses among detected sequence polymorphisms and phenology and cluster traits were carried out. A total of 3.6 kb of the VvTFL1A gene, including its promoter, was sequenced in a core collection of 140 individuals designed to maximize phenotypic variation at agronomically relevant traits. Nucleotide variation for VvTFL1A within this collection was higher in the promoter and intron sequences than in the exon regions, where few polymorphisms were located, in agreement with a high conservation of coding sequence. Characterization of the VvTFL1A haplotype network identified three major haplogroups, consistent with the geographic origins and the use of the cultivars, that could correspond to three major ancestral alleles or evolutionary branches, based on the existence of mutations in linkage disequilibrium. Genetic association studies with cluster traits revealed the presence of major INDEL polymorphisms, explaining 16%, 13% and 25% of flowering time, cluster width and berry weight, respectively, and also structuring the three haplogroups. At least three major VvTFL1A haplogroups are present in cultivated grapevines, which are defined by the presence of three main polymorphism LD blocks and associated with characteristic phenotypic values for flowering time, cluster width and berry size. Phenotypic differences between haplogroups are consistent with differences observed between Eastern and Western grapevine cultivars and

  9. Diverse and Abundant Secondary Metabolism Biosynthetic Gene Clusters in the Genomes of Marine Sponge Derived Streptomyces spp. Isolates

    Directory of Open Access Journals (Sweden)

    Stephen A. Jackson

    2018-02-01

    Full Text Available The genus Streptomyces produces secondary metabolic compounds that are rich in biological activity. Many of these compounds are genetically encoded by large secondary metabolism biosynthetic gene clusters (smBGCs) such as polyketide synthases (PKS) and non-ribosomal peptide synthetases (NRPS), which are modular and can be highly repetitive. Due to the repeats, these gene clusters can be difficult to resolve using short read next generation datasets and are often quite poorly predicted using standard approaches. We have sequenced the genomes of 13 Streptomyces spp. strains isolated from shallow water and deep-sea sponges that display antimicrobial activities against a number of clinically relevant bacterial and yeast species. Draft genomes have been assembled and smBGCs have been identified using the antiSMASH (antibiotics and Secondary Metabolite Analysis Shell) web platform. We have compared the smBGCs amongst strains in the search for novel sequences conferring the potential to produce novel bioactive secondary metabolites. The strains in this study recruit to four distinct clades within the genus Streptomyces. The marine strains host abundant smBGCs which encode polyketides, NRPS, siderophores, bacteriocins and lantipeptides. The deep-sea strains appear to be enriched with gene clusters encoding NRPS. Marine adaptations are evident in the sponge-derived strains which are enriched for genes involved in the biosynthesis and transport of compatible solutes and for heat-shock proteins. Streptomyces spp. from marine environments are a promising source of novel bioactive secondary metabolites as the abundance and diversity of smBGCs show high degrees of novelty. Sponge derived Streptomyces spp. isolates appear to display genomic adaptations to marine living when compared to terrestrial strains.

  10. IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

    Science.gov (United States)

    Hadjithomas, Michalis; Chen, I-Min Amy; Chu, Ken; Ratner, Anna; Palaniappan, Krishna; Szeto, Ernest; Huang, Jinghua; Reddy, T B K; Cimermančič, Peter; Fischbach, Michael A; Ivanova, Natalia N; Markowitz, Victor M; Kyrpides, Nikos C; Pati, Amrita

    2015-07-14

    In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of "big" genomic data for discovering small molecules. IMG-ABC relies on IMG's comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC's focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG's extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to

  11. An epigenetic switch involving overlapping Fur and DNA methylation optimizes expression of a type VI secretion gene cluster.

    Directory of Open Access Journals (Sweden)

    Yannick R Brunet

    2011-07-01

    Full Text Available Type VI secretion systems (T6SS) are macromolecular machines of the cell envelope of Gram-negative bacteria responsible for bacterial killing and/or virulence towards different host cells. Here, we characterized the regulatory mechanism underlying expression of the enteroaggregative Escherichia coli sci1 T6SS gene cluster. We identified Fur as the main regulator of the sci1 cluster. A detailed analysis of the promoter region showed the presence of three GATC motifs, which are targets of the DNA adenine methylase Dam. Using a combination of reporter fusion, gel shift, and in vivo and in vitro Dam methylation assays, we dissected the regulatory role of Fur and Dam-dependent methylation. We showed that the sci1 gene cluster expression is under the control of an epigenetic switch depending on methylation: Fur binding prevents methylation of a GATC motif, whereas methylation at this specific site decreases the affinity of Fur for its binding box. A model is proposed in which the sci1 promoter is regulated by iron availability, adenine methylation, and DNA replication.
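
    Locating GATC motifs of the kind described in this abstract is a simple string scan. The sketch below is illustrative only: the `find_motif` helper and the example sequence are made up for demonstration and are not the sci1 promoter. Because GATC is its own reverse complement, scanning one strand is enough.

```python
def find_motif(seq, motif="GATC"):
    """Return the 0-based start positions of every occurrence of motif."""
    seq = seq.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        # Resume one base further on so overlapping hits are not skipped.
        start = seq.find(motif, start + 1)
    return positions


# Hypothetical sequence carrying three GATC sites (not real promoter data).
example_promoter = "ttGATCaacgtGATCggcatatGATCtt"
sites = find_motif(example_promoter)
print(sites)
```

    On a real promoter the reported positions could then be compared with the coordinates of a regulator binding box to see which site it overlaps.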

  12. The N-acetylglucosamine catabolic gene cluster in Trichoderma reesei is controlled by the Ndt80-like transcription factor RON1

    OpenAIRE

    Kappel, Lisa; Gaderer, Romana; Flipphi, Michel; Seidl-Seiboth, Verena

    2015-01-01

    Summary Chitin is an important structural constituent of fungal cell walls composed of N-acetylglucosamine (GlcNAc) monosaccharides, but catabolism of GlcNAc has not been studied in filamentous fungi so far. In the yeast Candida albicans, the genes encoding the three enzymes responsible for stepwise conversion of GlcNAc to fructose-6-phosphate are clustered. In this work, we analysed GlcNAc catabolism in ascomycete filamentous fungi and found that the respective genes are also clustered in ...

  13. High GC Content Cas9-Mediated Genome-Editing and Biosynthetic Gene Cluster Activation in Saccharopolyspora erythraea.

    Science.gov (United States)

    Liu, Yong; Wei, Wen-Ping; Ye, Bang-Ce

    2018-04-17

    The overexpression of bacterial secondary metabolite biosynthetic enzymes is the basis for industrial overproducing strains. Genome editing tools can be used to further improve gene expression and yield. Saccharopolyspora erythraea produces erythromycin, which has extensive clinical applications. In this study, the CRISPR-Cas9 system was used to edit genes in the S. erythraea genome. A temperature-sensitive plasmid containing the PermE promoter, to drive Cas9 expression, and the Pj23119 and PkasO promoters, to drive sgRNAs, was designed. Erythromycin esterase, encoded by S. erythraea SACE_1765, inactivates erythromycin by hydrolyzing the macrolactone ring. Sequencing and qRT-PCR confirmed that reporter genes were successfully inserted into the SACE_1765 gene. Deletion of SACE_1765 in a high-producing strain resulted in a 12.7% increase in erythromycin levels. Subsequent PermE-egfp knock-in at the SACE_0712 locus resulted in an 80.3% increase in erythromycin production compared with that of wild type. Further investigation showed that PermE promoter knock-in activated the erythromycin biosynthetic gene clusters at the SACE_0712 locus. Additionally, deletion of indA (SACE_1229) using dual sgRNA targeting without markers increased the editing efficiency to 65%. In summary, we have successfully applied Cas9-based genome editing to a bacterial strain, S. erythraea, with a high GC content. This system has potential application for both genome-editing and biosynthetic gene cluster activation in Actinobacteria.

  14. Cluster editing

    DEFF Research Database (Denmark)

    Böcker, S.; Baumbach, Jan

    2013-01-01

    . The problem has been the inspiration for numerous algorithms in bioinformatics, aiming at clustering entities such as genes, proteins, phenotypes, or patients. In this paper, we review exact and heuristic methods that have been proposed for the Cluster Editing problem, and also applications...
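
    For orientation, Cluster Editing asks for the smallest number of edge insertions and deletions that turn a graph into a disjoint union of cliques. The sketch below only scores a given candidate clustering; it is not one of the exact or heuristic solvers the review covers, and the function name `editing_cost` and the toy graph are illustrative assumptions.

```python
from itertools import combinations

def editing_cost(nodes, edges, clusters):
    """Count the edge insertions plus deletions needed so that every
    cluster becomes a clique and no edge joins two different clusters."""
    edge_set = {frozenset(e) for e in edges}
    # Map each node to the index of the cluster that contains it
    label = {v: i for i, cluster in enumerate(clusters) for v in cluster}
    cost = 0
    for u, v in combinations(nodes, 2):
        same_cluster = label[u] == label[v]
        has_edge = frozenset((u, v)) in edge_set
        # A missing edge inside a cluster must be inserted;
        # an existing edge between clusters must be deleted.
        if same_cluster != has_edge:
            cost += 1
    return cost

# Hypothetical toy graph: a path a-b-c-d
nodes = ["a", "b", "c", "d"]
edges = [("a", "b"), ("b", "c"), ("c", "d")]
print(editing_cost(nodes, edges, [{"a", "b", "c"}, {"d"}]))
print(editing_cost(nodes, edges, [{"a", "b"}, {"c", "d"}]))
```

    Comparing the two candidate clusterings shows why the choice of partition matters: splitting the path in the middle needs fewer edits than peeling off one end.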

  15. NJ cluster analysis of the SnRK2, PYR/PYL/RCAR, and ABF genes in Tibetan hulless barley.

    Science.gov (United States)

    Yuan, H J; Wang, Y L; Wei, Z X; Xu, Q J; Zeng, X Q; Tang, Y W; Nyima, T S

    2016-11-03

    The abscisic acid (ABA) signaling pathway is known as one of the most important signaling pathways in plants and is mediated by multiple regulators. The genes SnRK2, PYR/PYL/RCAR, and ABF are relevant to both ABA-dependent and -independent signaling pathways. To elucidate the profile of these genes from Tibetan hulless barley (Hordeum vulgare L. var. nudum Hook. f.), we collected available sequences from RNA-Seq data, together with NCBI data from five other model plant species (Arabidopsis thaliana, Brachypodium distachyon, Oryza sativa, Populus trichocarpa, and Sorghum bicolor). Gene trees of SnRK2, PYR/PYL/RCAR, and ABF were constructed using a neighbor joining (NJ) method. For all genes, we identified a dominant group in which all six species were represented. Three, four, and five groups were found in the NJ trees of SnRK2, PYR/PYL/RCAR, and ABF, respectively. For each gene, Tibetan hulless barley was divided into three groups. Our analyses indicated that Tibetan hulless barley was associated with B. distachyon. The NJ cluster analysis also suggested that Tibetan hulless barley was affiliated with S. bicolor (SnRK2), A. thaliana (PYR/PYL/RCAR), and O. sativa (ABF). These results illustrate a diverse expression of genes SnRK2, PYR/PYL/RCAR, and ABF, and suggest a relationship among the six species studied. Collectively, our characterization of the three components of the ABA signaling pathway may contribute to improve stress tolerance in Tibetan hulless barley.

  16. Functional characterization of diverse ring-hydroxylating oxygenases and induction of complex aromatic catabolic gene clusters in Sphingobium sp. PNB

    Directory of Open Access Journals (Sweden)

    Pratick Khara

    2014-01-01

    Full Text Available Sphingobium sp. PNB, like other sphingomonads, has multiple ring-hydroxylating oxygenase (RHO) genes. Three different fosmid clones have been sequenced to identify the putative genes responsible for the degradation of various aromatics in this bacterial strain. Comparison of the map of the catabolic genes with that of different sphingomonads revealed a similar arrangement of gene clusters that harbors seven sets of RHO terminal components and a sole set of electron transport (ET) proteins. The presence of distinctly conserved amino acid residues in ferredoxin and in silico molecular docking analyses of ferredoxin with the well characterized terminal oxygenase components indicated the structural uniqueness of the ET component in sphingomonads. The predicted substrate specificities, derived from the phylogenetic relationship of each of the RHOs, were examined based on transformation of putative substrates and their structural homologs by the recombinant strains expressing each of the oxygenases and the sole set of available ET proteins. The RHO AhdA1bA2b was functionally characterized for the first time and was found to be capable of transforming ethylbenzene, propylbenzene, cumene, p-cymene and biphenyl, in addition to a number of polycyclic aromatic hydrocarbons. Overexpression of aromatic catabolic genes in strain PNB, revealed by real-time PCR analyses, is a way forward to understand the complex regulation of degradative genes in sphingomonads.

  17. The ArcD1 and ArcD2 arginine/ornithine exchangers encoded in the arginine deiminase (ADI) pathway gene cluster of Lactococcus lactis

    NARCIS (Netherlands)

    Noens, Elke E E; Kaczmarek, Michał B; Żygo, Monika; Lolkema, Juke S

    2015-01-01

    The arginine deiminase pathway (ADI) gene cluster in Lactococcus lactis contains two copies of a gene encoding an L-arginine/L-ornithine exchanger, the arcD1 and arcD2 genes. The physiological function of ArcD1 and ArcD2 was studied by deleting the two genes. Deletion of arcD1 resulted in loss of

  18. ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis

    Directory of Open Access Journals (Sweden)

    Saurav Mallik

    2017-12-01

    Full Text Available For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages in finding causal relationships between the transcripts. In this work, we introduce two new rule-based similarity measures (weighted rank-based Jaccard and Cosine measures) and then propose a novel computational framework to detect condensed gene co-expression modules (ConGEMs) through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm, RANWAR, was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.
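
    As a rough illustration of scoring similarity between a pair of rules, the sketch below computes the plain (unweighted) Jaccard and cosine similarities between the gene sets of two rules. The abstract does not define the weighted rank-based variants, so none of the weighting is reproduced here; the function names and gene labels are hypothetical.

```python
import math

def jaccard(a, b):
    """Plain Jaccard similarity: shared items over all distinct items."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def cosine(a, b):
    """Cosine similarity of two item sets viewed as binary vectors."""
    a, b = set(a), set(b)
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))

# Hypothetical gene sets taken from two association rules
rule_1 = {"GENE_A", "GENE_B", "GENE_C"}
rule_2 = {"GENE_B", "GENE_C", "GENE_D"}
print(jaccard(rule_1, rule_2))  # 2 shared genes out of 4 distinct genes
print(cosine(rule_1, rule_2))   # 2 shared genes over sqrt(3 * 3)
```

    A matrix of such pairwise scores is the kind of input a clustering step could consume to group rules into modules.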

  19. Neural networks and Fuzzy clustering methods for assessing the efficacy of microarray based intrinsic gene signatures in breast cancer classification and the character and relations of identified subtypes.

    Science.gov (United States)

    Samarasinghe, Sandhya; Chaiboonchoe, Amphun

    2015-01-01

    In the classification of breast cancer subtypes using microarray data, hierarchical clustering is commonly used. Although this form of clustering shows basic cluster patterns, more needs to be done to investigate the accuracy of clusters as well as to extract meaningful cluster characteristics and their relations to increase our confidence in their use in a clinical setting. In this study, an in-depth investigation of the efficacy of three reported gene subsets in distinguishing breast cancer subtypes was performed using four advanced computational intelligence methods: Self-Organizing Maps (SOM), Emergent Self-Organizing Maps (ESOM), Fuzzy Clustering by Local Approximation of Memberships (FLAME), and Fuzzy C-means (FCM), each differing in the way they view data in terms of distance measures and fuzzy or crisp clustering. The gene subsets consisted of 71, 93, and 71 genes reported in the literature from three comprehensive experimental studies for distinguishing Luminal (A and B), Basal, Normal breast-like, and HER2 subtypes. Given the costly procedures involved in clinical studies, the proposed 93-gene set can be used for preliminary classification of breast cancer. Then, as a decision aid, SOM can be used to map the gene signature of a new patient to locate them with respect to all subtypes to get a comprehensive view of the classification. These can be followed by a deeper investigation in the light of the observations made in this study regarding overlapping subtypes. Results from the study could be used as the base for further refining the gene signatures from later experiments and from new experiments designed to separate overlapping clusters as well as to maximally separate all clusters.
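
    To make the fuzzy versus crisp distinction concrete, the sketch below computes textbook Fuzzy C-means memberships of a single sample with respect to fixed cluster centers. It assumes Euclidean distance and a fuzzifier m = 2; the function name and the toy coordinates are illustrative and are not taken from the study.

```python
def fcm_memberships(point, centers, m=2.0):
    """Textbook fuzzy c-means membership of one point in each cluster.

    u_i = 1 / sum_k (d_i / d_k) ** (2 / (m - 1)), with d_i the Euclidean
    distance from the point to center i and m > 1 the fuzzifier.
    """
    dists = [
        sum((p - c) ** 2 for p, c in zip(point, center)) ** 0.5
        for center in centers
    ]
    # A point lying exactly on a center belongs fully to that center
    if any(d == 0.0 for d in dists):
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_i / d_k) ** power for d_k in dists) for d_i in dists]

# Hypothetical 2-D sample between two cluster centers
u = fcm_memberships((1.0, 0.0), [(0.0, 0.0), (3.0, 0.0)])
print(u)  # memberships sum to 1; the nearer center gets the larger share
```

    A crisp method would assign this sample wholly to the nearer center, whereas the fuzzy memberships keep a graded share for the farther one, which is what makes overlapping subtypes visible.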

  20. IMG-ABC: An Atlas of Biosynthetic Gene Clusters to Fuel the Discovery of Novel Secondary Metabolites

    Energy Technology Data Exchange (ETDEWEB)

    Chen, I-Min; Chu, Ken; Ratner, Anna; Palaniappan, Krishna; Huang, Jinghua; Reddy, T. B.K.; Cimermancic, Peter; Fischbach, Michael; Ivanova, Natalia; Markowitz, Victor; Kyrpides, Nikos; Pati, Amrita

    2014-10-28

    In the discovery of secondary metabolites (SMs), large-scale analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of relevant computational resources. We present IMG-ABC (https://img.jgi.doe.gov/abc/) -- An Atlas of Biosynthetic gene Clusters within the Integrated Microbial Genomes (IMG) system. IMG-ABC is a rich repository of both validated and predicted biosynthetic clusters (BCs) in cultured isolates, single-cells and metagenomes linked with the SM chemicals they produce and enhanced with focused analysis tools within IMG. The underlying scalable framework enables traversal of phylogenetic dark matter and chemical structure space -- serving as a doorway to a new era in the discovery of novel molecules.

  1. Identification and Interrogation of the Herbicidin Biosynthetic Gene Cluster: First Insight into the Biosynthesis of a Rare Undecose Nucleoside Antibiotic.

    Science.gov (United States)

    Lin, Geng-Min; Romo, Anthony J; Liem, Priscilla H; Chen, Zhang; Liu, Hung-Wen

    2017-11-22

    Herbicidins are adenosine-based nucleoside antibiotics with an unusual tricyclic undecose core decorated with a (5-hydroxy)tiglyl moiety. Feeding studies are herein reported demonstrating that the tricyclic core is derived from d-glucose and d-ribose, whereas the tiglyl moiety is derived from an intermediate of l-isoleucine catabolism. Identification of the gene cluster for herbicidin A biosynthesis in Streptomyces sp. L-9-10 as well as its verification by heterologous expression in a nonproducing host are described, and the results of in vitro characterization of a carboxyl methyltransferase encoded in the cluster, Her8, are presented. Based on these observations, a biosynthetic pathway is proposed for herbicidins.

  2. Mandibulofacial dysostosis in a patient with a de novo 2;17 translocation that disrupts the HOXD gene cluster.

    Science.gov (United States)

    Stevenson, David A; Bleyl, Steven B; Maxwell, Teresa; Brothman, Arthur R; South, Sarah T

    2007-05-15

    Treacher Collins syndrome (TCS) is the prototypical mandibulofacial dysostosis syndrome, but other mandibulofacial dysostosis syndromes have been described. We report an infant with mandibulofacial dysostosis and an apparently balanced de novo 2;17 translocation. She presented with severe lower eyelid colobomas requiring skin grafting, malar and mandibular hypoplasia, bilateral microtia with external auditory canal atresia, dysplastic ossicles, hearing loss, bilateral choanal stenosis, cleft palate without cleft lip, several oral frenula of the upper lip/gum, and micrognathia requiring tracheostomy. Her limbs were normal. Chromosome analysis at the 600-band level showed a 46,XX,t(2;17)(q24.3;q23) karyotype. Sequencing of the entire TCOF1 coding region did not show evidence of a sequence variation. High-resolution genomic microarray analysis did not identify a cryptic imbalance. FISH mapping refined the breakpoints to 2q31.1 and 17q24.3-25.1 and showed the 2q31.1 breakpoint likely affects the HOXD gene cluster. Several atypical findings and lack of an identifiable TCOF1 mutation suggest that this child has a provisionally unique mandibulofacial dysostosis syndrome. The apparently balanced de novo translocation provides candidate loci for atypical and TCOF1 mutation negative cases of TCS. Based on the agreement of our findings with one previous case of mandibulofacial dysostosis with a 2q31.1 translocation, we hypothesize that misexpression of genes in the HOXD gene cluster produced the described phenotype in this patient.

  3. A species-specific cluster of defensin-like genes encodes diffusible pollen tube attractants in Arabidopsis.

    Directory of Open Access Journals (Sweden)

    Hidenori Takeuchi

    Full Text Available Genes directly involved in male/female and host/parasite interactions are believed to be under positive selection. The flowering plant Arabidopsis thaliana has more than 300 defensin-like (DEFL) genes, which are likely to be involved in both natural immunity and cell-to-cell communication including pollen-pistil interactions. However, little is known of the relationship between the molecular evolution of DEFL genes and their functions. Here, we identified a recently evolved cluster of DEFL genes in A. thaliana and demonstrated that these DEFL (cysteine-rich peptide [CRP810_1]) peptides, named AtLURE1 peptides, are pollen tube attractants guiding pollen tubes to the ovular micropyle. The AtLURE1 genes formed the sole species-specific cluster among DEFL genes compared to its close relative, A. lyrata. No evidence for positive selection was detected in AtLURE1 genes and their orthologs, implying neutral evolution of AtLURE1 genes. AtLURE1 peptides were specifically expressed in egg-accompanying synergid cells and secreted toward the funicular surface through the micropyle. Genetic analyses showed that gametophytic mutants defective in micropylar guidance (myb98, magatama3, and central cell guidance) do not express AtLURE1 peptides. Downregulation of the expression of these peptides impaired precise pollen tube attraction to the micropylar opening of some populations of ovules. Recombinant AtLURE1 peptides attracted A. thaliana pollen tubes at a higher frequency compared to A. lyrata pollen tubes, suggesting that these peptides are species-preferential attractants in micropylar guidance. In support of this idea, the heterologous expression of a single AtLURE1 peptide in the synergid cell of Torenia fournieri was sufficient to guide A. thaliana pollen tubes to the T. fournieri embryo sac and to permit entry into it. Our results suggest the unique evolution of AtLURE1 genes, which are directly involved in male-female interaction among the DEFL multigene

  4. An Ipomoea batatas iron-sulfur cluster scaffold protein gene, IbNFU1, is involved in salt tolerance.

    Science.gov (United States)

    Liu, Degao; Wang, Lianjun; Liu, Chenglong; Song, Xuejin; He, Shaozhen; Zhai, Hong; Liu, Qingchang

    2014-01-01

    Iron-sulfur cluster biosynthesis involving the nitrogen fixation (Nif) proteins has been proposed as a general mechanism acting in various organisms. NifU-like protein may play an important role in protecting plants against abiotic and biotic stresses. An iron-sulfur cluster scaffold protein gene, IbNFU1, was isolated from a salt-tolerant sweetpotato (Ipomoea batatas (L.) Lam.) line LM79 in our previous study, but its role in sweetpotato stress tolerance was not investigated. In the present study, the IbNFU1 gene was introduced into a salt-sensitive sweetpotato cv. Lizixiang to characterize its function in salt tolerance. The IbNFU1-overexpressing sweetpotato plants exhibited significantly higher salt tolerance compared with the wild-type. Proline and reduced ascorbate content were significantly increased, whereas malonaldehyde (MDA) content was significantly decreased in the transgenic plants. The activities of superoxide dismutase (SOD) and photosynthesis were significantly enhanced in the transgenic plants. H2O2 was also found to be significantly less accumulated in the transgenic plants than in the wild-type. Overexpression of IbNFU1 up-regulated pyrroline-5-carboxylate synthase (P5CS) and pyrroline-5-carboxylate reductase (P5CR) genes under salt stress. The systemic up-regulation of reactive oxygen species (ROS) scavenging genes was found in the transgenic plants under salt stress. These findings suggest that the IbNFU1 gene is involved in sweetpotato salt tolerance and enhances salt tolerance of the transgenic sweetpotato plants by regulating osmotic balance, protecting membrane integrity and photosynthesis and activating the ROS scavenging system.

  5. Interleukin‑1 gene cluster variants in hemodialysis patients with end stage renal disease: An association and meta‑analysis

    Directory of Open Access Journals (Sweden)

    G Tripathi

    2015-01-01

    Full Text Available We evaluated whether polymorphisms in the interleukin (IL)-1 gene cluster (IL-1 alpha [IL-1A], IL-1 beta [IL-1B], and IL-1 receptor antagonist [IL-1RN]) are associated with end stage renal disease (ESRD). A total of 258 ESRD patients and 569 ethnicity matched controls were examined for the IL-1 gene cluster. These were genotyped for five single-nucleotide gene polymorphisms in the IL-1A, IL-1B and IL-1RN genes and a variable number of tandem repeats (VNTR) in the IL-1RN. The IL-1B −3953 and IL-1RN +8006 polymorphism frequencies were significantly different between the two groups. At IL-1B, the T allele of −3953C/T was increased among ESRD (P = 0.0001). A logistic regression model demonstrated that two repeat (240 base pair [bp]) of the IL-1Ra VNTR polymorphism was associated with ESRD (P = 0.0001). The C/C/C/C/C/1 haplotype was more prevalent in ESRD (P = 0.007). No linkage disequilibrium (LD) was observed between six loci of the IL-1 gene. We further conducted a meta-analysis of existing studies and found that there is a strong association of IL-1 RN VNTR 86 bp repeat polymorphism with susceptibility to ESRD (odds ratio = 2.04, 95% confidence interval = 1.48-2.82; P = 0.000). IL-1B −5887, +8006 and the IL-1RN VNTR polymorphisms have been implicated as potential risk factors for ESRD. The meta-analysis showed a strong association of IL-1RN 86 bp VNTR polymorphism with susceptibility to ESRD.
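
    For readers unfamiliar with how an odds ratio and its 95% confidence interval relate, the sketch below computes both from a single 2x2 case-control table using the standard Wald interval on the log scale. The counts are hypothetical and are not taken from this study, and the pooled estimate reported by the meta-analysis would come from combining several studies, which this sketch does not attempt.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval for a 2x2 table.

    a: cases carrying the allele,    b: cases not carrying it,
    c: controls carrying the allele, d: controls not carrying it.
    z = 1.96 gives an approximate 95% interval.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of ln(OR)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper

# Hypothetical counts for illustration only
or_, lo, hi = odds_ratio_ci(20, 10, 10, 20)
print(or_, lo, hi)
```

    An interval that excludes 1, as the reported 1.48-2.82 does, is what supports calling the association statistically significant.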

  6. An Ipomoea batatas Iron-Sulfur Cluster Scaffold Protein Gene, IbNFU1, Is Involved in Salt Tolerance

    Science.gov (United States)

    Song, Xuejin; He, Shaozhen; Zhai, Hong; Liu, Qingchang

    2014-01-01

    Iron-sulfur cluster biosynthesis involving the nitrogen fixation (Nif) proteins has been proposed as a general mechanism acting in various organisms. NifU-like protein may play an important role in protecting plants against abiotic and biotic stresses. An iron-sulfur cluster scaffold protein gene, IbNFU1, was isolated from a salt-tolerant sweetpotato (Ipomoea batatas (L.) Lam.) line LM79 in our previous study, but its role in sweetpotato stress tolerance was not investigated. In the present study, the IbNFU1 gene was introduced into a salt-sensitive sweetpotato cv. Lizixiang to characterize its function in salt tolerance. The IbNFU1-overexpressing sweetpotato plants exhibited significantly higher salt tolerance compared with the wild-type. Proline and reduced ascorbate content were significantly increased, whereas malonaldehyde (MDA) content was significantly decreased in the transgenic plants. The activities of superoxide dismutase (SOD) and photosynthesis were significantly enhanced in the transgenic plants. H2O2 was also found to be significantly less accumulated in the transgenic plants than in the wild-type. Overexpression of IbNFU1 up-regulated pyrroline-5-carboxylate synthase (P5CS) and pyrroline-5-carboxylate reductase (P5CR) genes under salt stress. The systemic up-regulation of reactive oxygen species (ROS) scavenging genes was found in the transgenic plants under salt stress. These findings suggest that the IbNFU1 gene is involved in sweetpotato salt tolerance and enhances salt tolerance of the transgenic sweetpotato plants by regulating osmotic balance, protecting membrane integrity and photosynthesis and activating the ROS scavenging system. PMID:24695556

  7. Frequent long-range epigenetic silencing of protocadherin gene clusters on chromosome 5q31 in Wilms' tumor.

    Directory of Open Access Journals (Sweden)

    Anthony R Dallosso

    2009-11-01

    Full Text Available Wilms' tumour (WT) is a pediatric tumor of the kidney that arises via failure of the fetal developmental program. The absence of identifiable mutations in the majority of WTs suggests the frequent involvement of epigenetic aberrations in WT. We therefore conducted a genome-wide analysis of promoter hypermethylation in WTs and identified hypermethylation at chromosome 5q31 spanning 800 kilobases (kb) and more than 50 genes. The methylated genes all belong to alpha-, beta-, and gamma-protocadherin (PCDH) gene clusters (Human Genome Organization nomenclature PCDHA@, PCDHB@, and PCDHG@, respectively). This demonstrates that long-range epigenetic silencing (LRES) occurs in developmental tumors as well as in adult tumors. Bisulfite polymerase chain reaction analysis showed that PCDH hypermethylation is a frequent event found in all Wilms' tumor subtypes. Hypermethylation is concordant with reduced PCDH expression in tumors. WT precursor lesions showed no PCDH hypermethylation, suggesting that de novo PCDH hypermethylation occurs during malignant progression. Discrete boundaries of the PCDH domain are delimited by abrupt changes in histone modifications; unmethylated genes flanking the LRES are associated with permissive marks which are absent from methylated genes within the domain. Silenced genes are marked with non-permissive histone 3 lysine 9 dimethylation. Expression analysis of embryonic murine kidney and differentiating rat metanephric mesenchymal cells demonstrates that Pcdh expression is developmentally regulated and that Pcdhg@ genes are expressed in blastemal cells. Importantly, we show that PCDHs negatively regulate canonical Wnt signalling, as short-interfering RNA-induced reduction of PCDHG@ encoded proteins leads to elevated beta-catenin protein, increased beta-catenin/T-cell factor (TCF) reporter activity, and induction of Wnt target genes. Conversely, over-expression of PCDHs suppresses beta-catenin/TCF-reporter activity and also inhibits

  8. Synergistic effect of two β globin gene cluster mutations leading to the hereditary persistence of fetal hemoglobin (HPFH) phenotype.

    Science.gov (United States)

    Hariharan, Priya; Sawant, Madhavi; Gorivale, Manju; Manchanda, Ruma; Colah, Roshan; Ghosh, K; Nadkarni, Anita

    2017-10-01

    Co-inheritance of gamma and beta globin gene mutations in a compound heterozygous state is rare but of clinical interest, as it provides important data for understanding HbF expression. Hematological analysis was carried out (Sysmex KX-21). F-cells were enumerated using flow cytometry. The beta globin gene was analysed by the CRDB technique and by DNA sequencing. The gamma globin promoter region was sequenced and expression studies were carried out using a real-time TaqMan assay. We report a family in which two inherited defects of the β globin gene cluster segregate. The proband and her sibling were compound heterozygotes for a novel Gγ promoter mutation and the 619 bp deletion, a common Indian β thalassemia mutation. Molecular characterization revealed that the father (HbA2 5.1%, HbF 5.4%), proband (HbA2 3.6%, HbF 31.7%) and her brother (HbA2 3.9%, HbF 23.6%) were heterozygous for the 619 bp deletion. The mother (HbA2 2.1%, HbF 3.4%) had a normal β globin gene. As both the children showed high HbF levels, the γ globin gene work up was carried out. The Gγ-globin gene promoter analysis revealed that the mother and the two children were heterozygous for a 5 bp deletion -ATAAG (-533 to -529) that resides in the GATA binding site. These findings suggest that the 5 bp deletion in the Gγ globin promoter has a functional role in silencing the γ-globin gene expression in adults by disrupting GATA-1 binding and the associated repressor complex and results in the up-regulation of gamma globin gene expression. When co-inherited with β-thalassemia trait, it leads to a phenotype of HPFH.

  9. Nuclear topography of beta-like globin gene cluster in IL-3-stimulated human leukemic K-562 cells

    Czech Academy of Sciences Publication Activity Database

    Galiová-Šustáčková, Gabriela; Bártová, Eva; Kozubek, Stanislav

    2004-01-01

    Vol. 33, No. 1 (2004), pp. 4-14. ISSN 1079-9796. R&D Projects: GA ČR GA301/01/0186; GA AV ČR KSK5052113; GA AV ČR IAA5004306; GA ČR GA202/04/0907; GA MŠk ME 565. Institutional research plan: CEZ:AV0Z5004920. Keywords: beta-like globin gene cluster; K-562 cells; nuclear topography. Subject RIV: BO - Biophysics. Impact factor: 2.549, year: 2004

  10. Polymorphisms of ST2-IL18R1-IL18RAP gene cluster: a new risk for autoimmune thyroid diseases.

    Science.gov (United States)

    Wang, X; Zhu, Y F; Li, D M; Qin, Q; Wang, Q; Muhali, F S; Jiang, W J; Zhang, J A

    2016-02-01

    The interleukin 33 (IL33)/ST2 pathway and the ST2-interleukin 18 receptor 1-interleukin 18 receptor accessory protein (ST2-IL18R1-IL18RAP) gene cluster have been implicated in many autoimmune diseases, but there are few reports on autoimmune thyroid diseases (AITD). In this study, we investigated whether polymorphisms of IL33, ST2, IL18R1, and IL18RAP are associated with Graves' disease (GD) and Hashimoto's thyroiditis (HT), two major forms of AITD, among a Chinese population. A total of 11 SNPs of IL33 and the ST2-IL18R1-IL18RAP gene cluster (rs1929992, rs10975519, rs10208293, rs6543116, rs1041973, rs3732127, rs11465597, rs1035130, rs2293225, rs1035127, rs917997) were explored in a case-control study including 417 patients with GD, 250 HT patients and 301 controls. Genotyping of these SNPs was performed using the matrix-assisted laser desorption/ionization time-of-flight mass spectrometer (MALDI-TOF-MS) platform from Sequenom. The frequencies of allele A and the AA+AG genotype of rs6543116 (ST2) in HT patients were significantly increased compared with those of the controls (P = 0.029/0.021, OR = 1.31/1.62). For another SNP, rs917997, the AA+AG genotype presented an increased frequency in HT subjects compared with controls (P = 0.046, OR = 1.53). Furthermore, the haplotype GAGCCCG from the ST2-IL18R1-IL18RAP gene cluster (rs6543116, rs1041973, rs1035130, rs3732127, rs1035127, rs2293225, rs917997) was associated with increased susceptibility to GD with an OR of 2.03 (P = 0.022, 95% CI = 1.07-3.86). Some SNPs of the ST2-IL18R1-IL18RAP gene cluster might increase susceptibility to HT and GD in the Chinese Han population. © 2015 John Wiley & Sons Ltd.

  11. Identification of natural killer cell receptor clusters in the platypus genome reveals an expansion of C-type lectin genes.

    Science.gov (United States)

    Wong, Emily S W; Sanderson, Claire E; Deakin, Janine E; Whittington, Camilla M; Papenfuss, Anthony T; Belov, Katherine

    2009-08-01

    Natural killer (NK) cell receptors belong to two unrelated, but functionally analogous gene families: the immunoglobulin superfamily, situated in the leukocyte receptor complex (LRC) and the C-type lectin superfamily, located in the natural killer complex (NKC). Here, we describe the largest NK receptor gene expansion seen to date. We identified 213 putative C-type lectin NK receptor homologs in the genome of the platypus. Many have arisen as the result of a lineage-specific expansion. Orthologs of OLR1, CD69, KLRE, CLEC12B, and CLEC16p genes were also identified. The NKC is split into at least two regions of the genome: 34 genes map to chromosome 7, two map to a small autosome, and the remainder are unanchored in the current genome assembly. No NK receptor genes from the LRC were identified. The massive C-type lectin expansion and lack of Ig-domain-containing NK receptors represents the most extreme polarization of NK receptors found to date. We have used this new data from platypus to trace the possible evolutionary history of the NK receptor clusters.

  12. Association of the Porcine Cluster of Differentiation 4 Gene with T Lymphocyte Subpopulations and Its Expression in Immune Tissues

    Directory of Open Access Journals (Sweden)

    Jingen Xu

    2013-04-01

    Full Text Available Cluster of differentiation 4 (CD4) is mainly expressed on CD4+ T cells, which play an important role in the immune response. The aim of this study was to detect the association between polymorphisms of the CD4 gene and T lymphocyte subpopulations in pigs, and to investigate the effects of genetic variation on the CD4 gene expression level in immune tissues. Five missense mutations in the CD4 gene were identified using DNA pooling sequencing assays, and two main haplotypes (CCTCC and AGCTG) in strong linkage disequilibrium (with frequencies of 50.26% and 46.34%, respectively) were detected in the population of Large White pigs. Our results indicated that the five SNPs and the two haplotypes were significantly associated with the proportions of CD4−CD8−, CD4+CD8+, CD4+CD8−, CD4+ and CD4+/CD8+ in peripheral blood (p0.05. These results indicate that the CD4 gene may influence T lymphocyte subpopulations and can be considered as a candidate gene affecting immunity in pigs.

  13. Transcript profiles of Blumeria graminis development during infection reveal a cluster of genes that are potential virulence determinants.

    Science.gov (United States)

    Both, Maike; Eckert, Sabine E; Csukai, Michael; Müller, Elisabeth; Dimopoulos, George; Spanu, Pietro D

    2005-02-01

    High-density cDNA microarrays (2,027 unigenes) were used to analyze transcript profiles of the plant-pathogenic fungus Blumeria graminis f. sp. hordei throughout its asexual life cycle and development of infection. RNA was obtained from four stages preceding penetration and four stages after penetration of the host cells. The microarray data were validated by comparing the expression of a plasma membrane H+-ATPase and fructose-1,6-bisphosphatase with the data obtained from a quantitative polymerase chain reaction (PCR) assay. The results showed that there was a global switch in expression between the pre- and postpenetrative stages. This was largely due to accumulation of RNA encoding protein biosynthesis genes in the late stages. Other functional clusters, such as virulence-related genes and sterol metabolism genes, are up-regulated in pre- and postpenetration stages, respectively. A group of RNAs whose abundance correlated with the expression of cap20, a gene known to be required for virulence in Colletotrichum gloeosporioides, identified genes that are strong candidates for pathogenicity factors in B. graminis.

  14. Gene Repression in Haloarchaea Using the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-Cas I-B System.

    Science.gov (United States)

    Stachler, Aris-Edda; Marchfelder, Anita

    2016-07-15

    The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system is used by bacteria and archaea to fend off foreign genetic elements. Since its discovery, it has been developed into numerous applications such as genome editing and regulation of transcription in eukaryotes and bacteria. For archaea, no tools for transcriptional repression currently exist. Because molecular biology analyses in archaea are becoming more and more widespread, such a tool is vital for investigating the biological function of essential genes in archaea. Here we use the model archaeon Haloferax volcanii to demonstrate that its endogenous CRISPR-Cas system I-B can be harnessed to repress gene expression in archaea. Deletion of cas3 and cas6b genes results in efficient repression of transcription. crRNAs targeting the promoter region reduced transcript levels down to 8%. crRNAs targeting the reading frame have only a slight impact on transcription. crRNAs that target the coding strand repress expression only down to 88%, whereas crRNAs targeting the template strand repress expression down to 8%. Repression of an essential gene results in reduction of transcription levels down to 22%. Targeting efficiencies can be enhanced by expressing a catalytically inactive Cas3 mutant. Genes can be targeted on plasmids or on the chromosome; they can be monocistronic or part of a polycistronic operon. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  15. Gene Repression in Haloarchaea Using the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-Cas I-B System.

    Science.gov (United States)

    Stachler, Aris-Edda; Marchfelder, Anita

    2016-01-01

    The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system is used by bacteria and archaea to fend off foreign genetic elements. Since its discovery it has been developed into numerous applications like genome editing and regulation of transcription in eukaryotes and bacteria. For archaea currently no tools for transcriptional repression exist. Because molecular biology analyses in archaea become more and more widespread such a tool is vital for investigating the biological function of essential genes in archaea. Here we use the model archaeon Haloferax volcanii to demonstrate that its endogenous CRISPR-Cas system I-B can be harnessed to repress gene expression in archaea. Deletion of cas3 and cas6b genes results in efficient repression of transcription. crRNAs targeting the promoter region reduced transcript levels down to 8%. crRNAs targeting the reading frame have only slight impact on transcription. crRNAs that target the coding strand repress expression only down to 88%, whereas crRNAs targeting the template strand repress expression down to 8%. Repression of an essential gene results in reduction of transcription levels down to 22%. Targeting efficiencies can be enhanced by expressing a catalytically inactive Cas3 mutant. Genes can be targeted on plasmids or on the chromosome, they can be monocistronic or part of a polycistronic operon. PMID:27226589

  16. Disruption of Transporters Affiliated with Enantio-Pyochelin Biosynthesis Gene Cluster of Pseudomonas protegens Pf-5 Has Pleiotropic Effects.

    Science.gov (United States)

    Lim, Chee Kent; Penesyan, Anahit; Hassan, Karl A; Loper, Joyce E; Paulsen, Ian T

    2016-01-01

    Pseudomonas protegens Pf-5 (formerly Pseudomonas fluorescens) is a biocontrol bacterium that produces the siderophore enantio-pyochelin under conditions of iron starvation in a process that is often accompanied by the secretion of its biosynthesis intermediates, salicylic acid and dihydroaeruginoic acid. In this study, we investigated whether several transporters that are encoded by genes within or adjacent to the enantio-pyochelin biosynthetic cluster serve as efflux systems for enantio-pyochelin and/or its intermediates. In addition, we determined whether these transporters have broad substrate specificity using a Phenotype Microarray system. Intriguingly, knockouts of the pchH and fetF transporter genes resulted in mutant strains that secrete higher levels of enantio-pyochelin as well as its intermediates salicylic acid and dihydroaeruginoic acid. Analyses of these mutants did not indicate a significant change in transcription of biosynthetic genes involved in enantio-pyochelin production. In contrast, deletion of PFL_3504 resulted in reduced transcription of the biosynthetic genes as well as decreased dihydroaeruginoic acid concentrations in the culture supernatant, which could point either to regulation of gene expression by the transporter or to its role in dihydroaeruginoic acid transport. Disruption of each of the transporters resulted in an altered stress and/or chemical resistance profile of Pf-5, which may reflect that these transporters have specificity for a rather broad range of substrates.

  17. Disruption of Transporters Affiliated with Enantio-Pyochelin Biosynthesis Gene Cluster of Pseudomonas protegens Pf-5 Has Pleiotropic Effects.

    Directory of Open Access Journals (Sweden)

    Chee Kent Lim

    Pseudomonas protegens Pf-5 (formerly Pseudomonas fluorescens) is a biocontrol bacterium that produces the siderophore enantio-pyochelin under conditions of iron starvation in a process that is often accompanied by the secretion of its biosynthesis intermediates, salicylic acid and dihydroaeruginoic acid. In this study, we investigated whether several transporters that are encoded by genes within or adjacent to the enantio-pyochelin biosynthetic cluster serve as efflux systems for enantio-pyochelin and/or its intermediates. In addition, we determined whether these transporters have broad substrate specificity using a Phenotype Microarray system. Intriguingly, knockouts of the pchH and fetF transporter genes resulted in mutant strains that secrete higher levels of enantio-pyochelin as well as its intermediates salicylic acid and dihydroaeruginoic acid. Analyses of these mutants did not indicate a significant change in transcription of biosynthetic genes involved in enantio-pyochelin production. In contrast, deletion of PFL_3504 resulted in reduced transcription of the biosynthetic genes as well as decreased dihydroaeruginoic acid concentrations in the culture supernatant, which could point either to regulation of gene expression by the transporter or to its role in dihydroaeruginoic acid transport. Disruption of each of the transporters resulted in an altered stress and/or chemical resistance profile of Pf-5, which may reflect that these transporters have specificity for a rather broad range of substrates.

  18. Functional clustering and lineage markers: insights into cellular differentiation and gene function from large-scale microarray studies of purified primary cell populations.

    Science.gov (United States)

    Hume, David A; Summers, Kim M; Raza, Sobia; Baillie, J Kenneth; Freeman, Thomas C

    2010-06-01

    Very large microarray datasets showing gene expression across multiple tissues and cell populations provide a window on the transcriptional networks that underpin the differences in functional activity between biological systems. Clusters of co-expressed genes provide lineage markers, candidate regulators of cell function and, by applying the principle of guilt by association, candidate functions for genes of currently unknown function. We have analysed a dataset comprising pure cell populations from hemopoietic and non-hemopoietic cell types (http://biogps.gnf.org). Using a novel network visualisation and clustering approach, we demonstrate that it is possible to identify very tight expression signatures associated specifically with embryonic stem cells, mesenchymal cells and hematopoietic lineages. Selected examples validate the prediction that gene function can be inferred by co-expression. One expression cluster was enriched in phagocytes, which, alongside endosome-lysosome constituents, contains genes that may make up a 'pathway' for phagocyte differentiation. Promoters of these genes are enriched for binding sites for the ETS/PU.1 and MITF families. Another cluster was associated with the production of a specific extracellular matrix, with high levels of gene expression shared by cells of mesenchymal origin (fibroblasts, adipocytes, osteoblasts and myoblasts). We discuss the limitations placed upon such data by the presence of alternative promoters with distinct tissue specificity within many protein-coding genes. Copyright 2010 Elsevier Inc. All rights reserved.
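
    The guilt-by-association idea in this abstract can be illustrated with a much simpler stand-in than the authors' network clustering approach, which the abstract does not specify. The sketch below links genes whose expression profiles have a Pearson correlation at or above a threshold and reports connected components as clusters. The gene names, profiles, and the 0.9 threshold are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def coexpression_clusters(profiles, threshold=0.9):
    """Group genes into connected components of a correlation graph.

    profiles: dict mapping gene name -> list of expression values,
    one value per sample. The threshold is an illustrative assumption.
    """
    genes = list(profiles)
    neighbours = {g: set() for g in genes}
    # Add an edge between every pair of genes that is strongly co-expressed.
    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            if pearson(profiles[g], profiles[h]) >= threshold:
                neighbours[g].add(h)
                neighbours[h].add(g)
    # Depth-first search to collect connected components.
    seen = set()
    clusters = []
    for g in genes:
        if g in seen:
            continue
        seen.add(g)
        stack = [g]
        component = []
        while stack:
            current = stack.pop()
            component.append(current)
            for nb in neighbours[current]:
                if nb not in seen:
                    seen.add(nb)
                    stack.append(nb)
        clusters.append(sorted(component))
    return clusters


# Hypothetical expression profiles across four samples.
example_profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.5],
    "geneC": [4.0, 3.0, 2.0, 1.0],
    "geneD": [8.0, 6.0, 4.0, 2.2],
}
example_clusters = coexpression_clusters(example_profiles)
```

    Under this scheme an uncharacterised gene that lands in a component with well-annotated genes becomes a candidate for the same function, which is the inference the abstract describes.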

  19. Effects of variations in the APOA1/C3/A4/A5 gene cluster on different parameters of postprandial lipid metabolism in healthy young men

    Science.gov (United States)

    Background: The APOA1/C3/A4/A5 gene cluster encodes important regulators of fasting lipids, but the majority of lipid metabolism takes place in the postprandial state, and knowledge about gene regulation in this state is scarce. With the aim of characterizing possible regulators of lipid metabolism...

  20. Identification of the Biosynthetic Gene Clusters for the Lipopeptides Fusaristatin A and W493 B in Fusarium graminearum and F. pseudograminearum

    DEFF Research Database (Denmark)

    Sørensen, Jens Laurids; Sondergaard, Teis Esben; Covarelli, Lorenzo

    2014-01-01

    The closely related species Fusarium graminearum and Fusarium pseudograminearum differ in that each contains a gene cluster with a polyketide synthase (PKS) and a nonribosomal peptide synthetase (NRPS) that is not present in the other species. To identify their products, we deleted PKS6 and NRPS7...... Fusarium species. On the basis of genes in the putative gene clusters we propose a model for biosynthesis where the polyketide product is shuttled to the NRPS via a CoA ligase and a thioesterase in F. pseudograminearum. In F. graminearum the polyketide is proposed to be directly assimilated by the NRPS....

  1. Unusual organization for lactose and galactose gene clusters in Lactobacillus helveticus.

    Science.gov (United States)

    Fortina, Maria Grazia; Ricci, Giovanni; Mora, Diego; Guglielmetti, Simone; Manachini, Pier Luigi

    2003-06-01

    The nucleotide sequences of the Lactobacillus helveticus lactose utilization genes were determined, and these genes were located and oriented relative to one another. The lacLM genes (encoding the beta-galactosidase protein) were in a divergent orientation compared to lacR (regulatory gene) and lacS (lactose transporter). Downstream from lacM was an open reading frame (galE) encoding a UDP-galactose 4 epimerase, and the open reading frame had the same orientation as lacM. The lacR gene was separated from the downstream lacS gene by 2.0 kb of DNA containing several open reading frames that were derived from fragmentation of another permease gene (lacS'). Northern blot analysis revealed that lacL, lacM, and galE made up an operon that was transcribed in the presence of lactose from an upstream lacL promoter. The inducible genes lacL and lacM were regulated at the transcriptional level by the LacR repressor. In the presence of glucose and galactose galE was transcribed from its promoter, suggesting that the corresponding enzyme can be expressed constitutively. Lactose transport was inducible by addition of lactose to the growth medium.

  2. Gene clusters involved in isethionate degradation by terrestrial and marine bacteria.

    KAUST Repository

    Weinitschke, Sonja

    2010-01-01

    Ubiquitous isethionate (2-hydroxyethanesulfonate) is dissimilated by diverse bacteria. Growth of Cupriavidus necator H16 with isethionate was observed, as was inducible membrane-bound isethionate dehydrogenase (IseJ) and inducible transcription of the genes predicted to encode IseJ and a transporter (IseU). Biodiversity in isethionate transport genes was observed and investigated by transcription experiments.

  3. Metabologenomics: Correlation of Microbial Gene Clusters with Metabolites Drives Discovery of a Nonribosomal Peptide with an Unusual Amino Acid Monomer.

    Science.gov (United States)

    Goering, Anthony W; McClure, Ryan A; Doroghazi, James R; Albright, Jessica C; Haverland, Nicole A; Zhang, Yongbo; Ju, Kou-San; Thomson, Regan J; Metcalf, William W; Kelleher, Neil L

    2016-02-24

    For more than half a century the pharmaceutical industry has sifted through natural products produced by microbes, uncovering new scaffolds and fashioning them into a broad range of vital drugs. We sought a strategy to reinvigorate the discovery of natural products with distinctive structures using bacterial genome sequencing combined with metabolomics. By correlating genetic content from 178 actinomycete genomes with mass spectrometry-enabled analyses of their exported metabolomes, we paired new secondary metabolites with their biosynthetic gene clusters. We report the use of this new approach to isolate and characterize tambromycin, a new chlorinated natural product, composed of several nonstandard amino acid monomeric units, including a unique pyrrolidine-containing amino acid we name tambroline. Tambromycin shows antiproliferative activity against cancerous human B- and T-cell lines. The discovery of tambromycin via large-scale correlation of gene clusters with metabolites (a.k.a. metabologenomics) illuminates a path for structure-based discovery of natural products at a sharply increased rate.

  4. An indigoidine biosynthetic gene cluster from Streptomyces chromofuscus ATCC 49982 contains an unusual IndB homologue.

    Science.gov (United States)

    Yu, Dayu; Xu, Fuchao; Valiente, Jonathan; Wang, Siyuan; Zhan, Jixun

    2013-01-01

    A putative indigoidine biosynthetic gene cluster was located in the genome of Streptomyces chromofuscus ATCC 49982. The silent 9.4-kb gene cluster consists of five open reading frames, named orf1, Sc-indC, Sc-indA, Sc-indB, and orf2, respectively. Sc-IndC was functionally characterized as an indigoidine synthase through heterologous expression of the enzyme in both Streptomyces coelicolor CH999 and Escherichia coli BAP1. The yield of indigoidine in E. coli BAP1 reached 2.78 g/l under the optimized conditions. The predicted protein product of Sc-indB is unusual and much larger than any other reported IndB-like protein. The N-terminal portion of this enzyme resembles IdgB and the C-terminal portion is a hypothetical protein. Sc-IndA and/or Sc-IndB were co-expressed with Sc-IndC in E. coli BAP1, which demonstrated the involvement of Sc-IndB, but not Sc-IndA, in the biosynthetic pathway of indigoidine. The yield of indigoidine was dramatically increased by 41.4 % (3.93 g/l) when Sc-IndB was co-expressed with Sc-IndC in E. coli BAP1. Indigoidine is more stable at low temperatures.

  5. Fuzzy clustering demonstrates that codon 72 SNP rs1042522 of TP53 gene associated with HNSCC but not with prognoses.

    Science.gov (United States)

    Pinheiro, Ugo Borges; Fraga, Carlos Alberto de Carvalho; Mendes, Danilo Cangussu; Farias, Lucyana Conceição; Cardoso, Cláudio Marcelo; Silveira, Christine Mendes; D'Angelo, Marcos Flávio Silveira Vasconcelos; Jones, Kimberly Marie; Santos, Sérgio Henrique Souza; de Paula, Alfredo Maurício Batista; Guimarães, André Luiz Sena

    2015-12-01

    It is estimated that 7.6 million people will die as a consequence of head and neck squamous cell carcinoma (HNSCC). Genetic predisposition has emerged as an important risk factor in the development and prognosis of HNSCC. Considering this, the aim of the current study is to assess whether codon 72 SNP of the TP53 gene (rs1042522) is associated with an increased odds ratio of developing HNSCC or with a worse prognosis in patients with HNSCC. The rs1042522 SNP was analyzed in HNSCC patients and in control individuals. Differences between the case and control groups were determined using chi-squared tests. Multivariate analysis was performed to evaluate the odds ratio of HNSCC. Fuzzy C-means clustering was used to cluster HNSCC patients for survival analyses. Survival time was estimated using the Kaplan-Meier estimator and compared using the log-rank test. Statistical significance was set at p control group. Logistic regression demonstrated that the Arg/Arg genotype, smoking, and alcohol consumption increase the odds ratio of HNSCC. No association between TP53 codon 72 polymorphism and P53 expression was found. No association between rs1042522 and survival or prognoses was observed. This study identified that individuals carrying the arginine allele at rs1042522 have an increased odds ratio of HNSCC. However, no association between codon 72 SNP of the TP53 gene and HNSCC prognosis or P53 expression was observed.
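
    For readers unfamiliar with the Kaplan-Meier estimator mentioned in this abstract, the following is a minimal sketch of the product-limit calculation. It is not the authors' analysis code, and the follow-up times and event flags are hypothetical.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times:  follow-up time for each patient.
    events: 1 if the event (death) was observed at that time,
            0 if the patient was censored.
    Returns a list of (time, survival_probability) pairs, one per
    distinct time at which at least one event occurred.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Collect everyone whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            # Multiply by the conditional probability of surviving past t.
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        # Both deaths and censored patients leave the risk set.
        at_risk -= leaving
    return curve


# Hypothetical follow-up data (months) for five patients.
example_curve = kaplan_meier([1, 2, 2, 3, 4], [1, 1, 0, 1, 0])
```

    Censored patients contribute to the risk set up to their last follow-up but never lower the curve themselves, which is why the estimate only steps down at observed event times.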

  6. Dysregulated gliotoxin biosynthesis attenuates the production of unrelated biosynthetic gene cluster-encoded metabolites in Aspergillus fumigatus.

    Science.gov (United States)

    Doyle, Sean; Jones, Gary W; Dolan, Stephen K

    2018-04-01

    Gliotoxin is an epipolythiodioxopiperazine (ETP) class toxin that contains a disulfide bridge, which mediates its toxic effects via redox cycling, and is produced by the opportunistic fungal pathogen Aspergillus fumigatus. The gliotoxin bis-thiomethyltransferase, GtmA, attenuates gliotoxin biosynthesis in A. fumigatus by conversion of dithiol gliotoxin to bis-thiomethylgliotoxin (BmGT). Here we show that disruption of dithiol gliotoxin bis-thiomethylation functionality in A. fumigatus results in significant remodelling of the A. fumigatus secondary metabolome upon extended culture. RP-HPLC and LC-MS/MS analysis revealed that reduced production of a plethora of unrelated biosynthetic gene cluster-encoded metabolites, including pseurotin A, fumagillin, fumitremorgin C and tryprostatin B, occurs in A. fumigatus ΔgtmA upon extended incubation. Parallel quantitative proteomic analysis of A. fumigatus wild-type and ΔgtmA during extended culture revealed cognate abundance alteration of proteins encoded by relevant biosynthetic gene clusters, allied to multiple alterations in hypoxia-related proteins. The data presented herein reveal a previously concealed functionality of GtmA in facilitating the biosynthesis of other BGC-encoded metabolites produced by A. fumigatus. Copyright © 2017 British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  7. A highly conserved NB-LRR encoding gene cluster effective against Setosphaeria turcica in sorghum

    Directory of Open Access Journals (Sweden)

    Martin Tom

    2011-11-01

    Background: The fungal pathogen Setosphaeria turcica causes turcicum or northern leaf blight disease on maize, sorghum and related grasses. It is a prevalent foliar disease found worldwide where the two host crops, maize and sorghum, are grown. The aim of the present study was to find genes controlling the host defense response to this devastating plant pathogen. A cDNA-AFLP approach was taken to identify candidate sequences, whose functions were further validated via virus-induced gene silencing (VIGS) and real-time PCR analysis. Phylogenetic analysis was performed to address evolutionary events. Results: cDNA-AFLP analysis was run on susceptible and resistant sorghum and maize genotypes to identify resistance-related sequences. One CC-NB-LRR encoding gene, GRMZM2G005347, was found among the up-regulated maize transcripts after fungal challenge. The new plant resistance gene was designated St, referring to S. turcica. Genome sequence comparison revealed that the CC-NB-LRR encoding St genes are located on chromosome 2 in maize, and on chromosome 5 in sorghum. The six St sorghum genes reside in three pairs in one locus. When the sorghum St genes were silenced via VIGS, the resistance was clearly compromised, an observation that was supported by real-time PCR. Database searches and phylogenetic analysis suggest that the St genes have a common ancestor present before the grass subfamily split 50-70 million years ago. Today, 6 genes are present in sorghum, 9 each in rice and foxtail millet, 3 in maize and 4 in Brachypodium distachyon. The St gene homologs all have highly conserved sequences, and commonly reside as gene pairs in the grass genomes. Conclusions: Resistance genes to S. turcica, with a CC-NB-LRR protein domain architecture, have been found in maize and sorghum. VIGS analysis revealed their importance in the surveillance to S. turcica in sorghum. The St genes are highly conserved in sorghum, rice, foxtail millet, maize and

  8. Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data

    Directory of Open Access Journals (Sweden)

    Jonassen Inge

    2003-12-01

    Background: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the prediction of marker genes is accomplished by either manual feature selection (visualizing the weighted/mean SOM component plane) or automatic feature selection (by pair-wise Fisher's linear discriminant). Results: The proposed models were tested on four published datasets: (1) Leukemia, (2) Colon cancer, (3) Brain tumors and (4) NCI cancer cell lines. The models gave class prediction with markedly reduced error rates compared to other class prediction approaches, and the importance of feature selection on microarray data analysis was also emphasized. Conclusions: Our models identify marker genes with predictive potential, often better than other available methods in the literature. The models are potentially useful for medical diagnostics and may reveal some insights into cancer classification. Additionally, we illustrated two limitations in tumor classification from microarray data related to the biology underlying the data, in terms of (1) the class size of data, and (2) the internal structure of classes. These limitations are not specific for the classification models used.
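
    The Fuzzy C-means step named in this abstract can be sketched as below. This is the textbook alternating update of cluster centers and soft memberships, not the authors' implementation; the SOM summarization step is omitted, and the fuzzifier m = 2, the random initialization, and the toy points are assumptions made for illustration.

```python
import math
import random


def fuzzy_c_means(points, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means on a list of equal-length numeric tuples.

    m is the fuzzifier (must be > 1). Returns (centers, memberships),
    where memberships[i][k] is the degree to which point i belongs to
    cluster k and each row sums to 1.
    """
    rng = random.Random(seed)
    n = len(points)
    dim = len(points[0])
    # Assumption: start from randomly chosen data points as centers.
    centers = [list(p) for p in rng.sample(points, n_clusters)]
    memberships = [[1.0 / n_clusters] * n_clusters for _ in range(n)]
    for _ in range(n_iter):
        # Membership update: inverse-distance weighting with exponent 2/(m-1).
        updated = []
        for p in points:
            dists = [math.dist(p, c) for c in centers]
            if 0.0 in dists:
                # The point sits exactly on a center: assign it fully there.
                hit = dists.index(0.0)
                row = [1.0 if k == hit else 0.0 for k in range(n_clusters)]
            else:
                inv = [d ** (-2.0 / (m - 1.0)) for d in dists]
                total = sum(inv)
                row = [v / total for v in inv]
            updated.append(row)
        shift = max(
            abs(updated[i][k] - memberships[i][k])
            for i in range(n)
            for k in range(n_clusters)
        )
        memberships = updated
        # Center update: mean of the points weighted by membership ** m.
        centers = []
        for k in range(n_clusters):
            weights = [memberships[i][k] ** m for i in range(n)]
            total = sum(weights)
            centers.append(
                [sum(w * p[d] for w, p in zip(weights, points)) / total
                 for d in range(dim)]
            )
        if shift < tol:
            break
    return centers, memberships


# Hypothetical two-dimensional samples forming two well-separated groups.
toy_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
toy_centers, toy_memberships = fuzzy_c_means(toy_points, 2)
toy_labels = [row.index(max(row)) for row in toy_memberships]
```

    Unlike hard clustering, each sample keeps a graded membership in every class, so borderline tumor samples can be flagged by rows whose largest membership is close to 1/n_clusters.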

  9. Deletion of the MBII-85 snoRNA gene cluster in mice results in postnatal growth retardation.

    Directory of Open Access Journals (Sweden)

    Boris V Skryabin

    2007-12-01

    Prader-Willi syndrome (PWS) [MIM 176270] is a neurogenetic disorder characterized by decreased fetal activity, muscular hypotonia, failure to thrive, short stature, obesity, mental retardation, and hypogonadotropic hypogonadism. It is caused by the loss of function of one or more imprinted, paternally expressed genes on the proximal long arm of chromosome 15. Several potential PWS mouse models involving the orthologous region on chromosome 7C exist. Based on the analysis of deletions in the mouse and gene expression in PWS patients with chromosomal translocations, a critical region (PWScr) for neonatal lethality, failure to thrive, and growth retardation was narrowed to the locus containing a cluster of neuronally expressed MBII-85 small nucleolar RNA (snoRNA) genes. Here, we report the deletion of PWScr. Mice carrying the maternally inherited allele (PWScr(m-/p+)) are indistinguishable from wild-type littermates. All those with the paternally inherited allele (PWScr(m+/p-)) consistently display postnatal growth retardation, with about 15% postnatal lethality in C57BL/6, but not FVB/N crosses. This is the first example in a multicellular organism of genetic deletion of a C/D box snoRNA gene resulting in a pronounced phenotype.

  10. Discovering biomarkers from gene expression data for predicting cancer subgroups using neural networks and relational fuzzy clustering

    Directory of Open Access Journals (Sweden)

    Sharma Animesh

    2007-01-01

    Background: The four heterogeneous childhood cancers, neuroblastoma, non-Hodgkin lymphoma, rhabdomyosarcoma, and Ewing sarcoma, present a similar histology of small round blue cell tumor (SRBCT), which often leads to misdiagnosis. Identification of biomarkers for distinguishing these cancers is a well studied problem. Existing methods typically evaluate each gene separately and do not take into account the nonlinear interaction between genes and the tools that are used to design the diagnostic prediction system. Consequently, more genes are usually identified as necessary for prediction. We propose a general scheme for finding a small set of biomarkers to design a diagnostic system for accurate classification of the cancer subgroups. We use multilayer networks with online gene selection ability and relational fuzzy clustering to identify a small set of biomarkers for accurate classification of the training and blind test cases of a well studied data set. Results: Our method discerned just seven biomarkers that precisely categorized the four subgroups of cancer both in training and blind samples. For the same problem, others suggested 19–94 genes. These seven biomarkers include three novel genes (NAB2, LSP1 and EHD1), not identified by others, with distinct class-specific signatures and important roles in cancer biology, including cellular proliferation, transendothelial migration and trafficking of MHC class antigens. Interestingly, NAB2 is downregulated in other tumors including non-Hodgkin lymphoma and neuroblastoma, but we observed moderate to high upregulation in a few cases of Ewing sarcoma and rhabdomyosarcoma, suggesting that NAB2 might be mutated in these tumors. These genes can discover the subgroups correctly with unsupervised learning, can differentiate non-SRBCT samples and they perform equally well with other machine learning tools including support vector machines. These biomarkers lead to four simple human interpretable

  11. Expression of the gene cluster associated with the Escherichia coli pilus adhesin K99.

    OpenAIRE

    Lee, J H; Isaacson, R E

    1995-01-01

    The biogenesis of the pilus adhesin K99 is dependent on the expression of eight contiguous genes, fanA to fanH. Transposon mutants were prepared by using TnlacZ and TnphoA, and selected transposon mutants were used to measure expression of each K99 gene. Expression of the K99 genes is likely controlled at the transcription level, since in general, there were no differences between the results obtained with the two transposons. fanC was the most highly expressed, and fanD was expressed at very...

  12. Distinct Loci in the CHRNA5/CHRNA3/CHRNB4 Gene Cluster Are Associated With Onset of Regular Smoking

    Science.gov (United States)

    Stephens, Sarah H.; Hartz, Sarah M.; Hoft, Nicole R.; Saccone, Nancy L.; Corley, Robin C.; Hewitt, John K.; Hopfer, Christian J.; Breslau, Naomi; Coon, Hilary; Chen, Xiangning; Ducci, Francesca; Dueker, Nicole; Franceschini, Nora; Frank, Josef; Han, Younghun; Hansel, Nadia N.; Jiang, Chenhui; Korhonen, Tellervo; Lind, Penelope A.; Liu, Jason; Lyytikäinen, Leo-Pekka; Michel, Martha; Shaffer, John R.; Short, Susan E.; Sun, Juzhong; Teumer, Alexander; Thompson, John R.; Vogelzangs, Nicole; Vink, Jacqueline M.; Wenzlaff, Angela; Wheeler, William; Yang, Bao-Zhu; Aggen, Steven H.; Balmforth, Anthony J.; Baumeister, Sebastian E.; Beaty, Terri H.; Benjamin, Daniel J.; Bergen, Andrew W.; Broms, Ulla; Cesarini, David; Chatterjee, Nilanjan; Chen, Jingchun; Cheng, Yu-Ching; Cichon, Sven; Couper, David; Cucca, Francesco; Dick, Danielle; Foroud, Tatiana; Furberg, Helena; Giegling, Ina; Gillespie, Nathan A.; Gu, Fangyi; Hall, Alistair S.; Hällfors, Jenni; Han, Shizhong; Hartmann, Annette M.; Heikkilä, Kauko; Hickie, Ian B.; Hottenga, Jouke Jan; Jousilahti, Pekka; Kaakinen, Marika; Kähönen, Mika; Koellinger, Philipp D.; Kittner, Stephen; Konte, Bettina; Landi, Maria-Teresa; Laatikainen, Tiina; Leppert, Mark; Levy, Steven M.; Mathias, Rasika A.; McNeil, Daniel W.; Medland, Sarah E.; Montgomery, Grant W.; Murray, Tanda; Nauck, Matthias; North, Kari E.; Paré, Peter D.; Pergadia, Michele; Ruczinski, Ingo; Salomaa, Veikko; Viikari, Jorma; Willemsen, Gonneke; Barnes, Kathleen C.; Boerwinkle, Eric; Boomsma, Dorret I.; Caporaso, Neil; Edenberg, Howard J.; Francks, Clyde; Gelernter, Joel; Grabe, Hans Jörgen; Hops, Hyman; Jarvelin, Marjo-Riitta; Johannesson, Magnus; Kendler, Kenneth S.; Lehtimäki, Terho; Magnusson, Patrik K.E.; Marazita, Mary L.; Marchini, Jonathan; Mitchell, Braxton D.; Nöthen, Markus M.; Penninx, Brenda W.; Raitakari, Olli; Rietschel, Marcella; Rujescu, Dan; Samani, Nilesh J.; Schwartz, Ann G.; Shete, Sanjay; Spitz, Margaret; Swan, Gary E.; Völzke, Henry; Veijola, 
Juha; Wei, Qingyi; Amos, Chris; Cannon, Dale S.; Grucza, Richard; Hatsukami, Dorothy; Heath, Andrew; Johnson, Eric O.; Kaprio, Jaakko; Madden, Pamela; Martin, Nicholas G.; Stevens, Victoria L.; Weiss, Robert B.; Kraft, Peter; Bierut, Laura J.; Ehringer, Marissa A.

    2014-01-01

    Neuronal nicotinic acetylcholine receptor (nAChR) genes (CHRNA5/CHRNA3/CHRNB4) have been reproducibly associated with nicotine dependence, smoking behaviors, and lung cancer risk. Of the few reports that have focused on early smoking behaviors, association results have been mixed. This meta-analysis examines early smoking phenotypes and SNPs in the gene cluster to determine: (1) whether the most robust association signal in this region (rs16969968) for other smoking behaviors is also associated with early behaviors, and/or (2) if additional statistically independent signals are important in early smoking. We focused on two phenotypes: age of tobacco initiation (AOI) and age of first regular tobacco use (AOS). This study included 56,034 subjects (41 groups) spanning nine countries and evaluated five SNPs including rs1948, rs16969968, rs578776, rs588765, and rs684513. Each dataset was analyzed using a centrally generated script. Meta-analyses were conducted from summary statistics. AOS yielded significant associations with SNPs rs578776 (beta = 0.02, P = 0.004), rs1948 (beta = 0.023, P = 0.018), and rs684513 (beta = 0.032, P = 0.017), indicating protective effects. There were no significant associations for the AOI phenotype. Importantly, rs16969968, the most replicated signal in this region for nicotine dependence, cigarettes per day, and cotinine levels, was not associated with AOI (P = 0.59) or AOS (P = 0.92). These results provide important insight into the complexity of smoking behavior phenotypes, and suggest that association signals in the CHRNA5/A3/B4 gene cluster affecting early smoking behaviors may be different from those affecting the mature nicotine dependence phenotype. PMID:24186853
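
    The abstract states that meta-analyses were conducted from summary statistics but does not say which model was used. A common choice for combining per-study effect sizes is the inverse-variance fixed-effect estimate, sketched here under that assumption. The per-study betas and standard errors are hypothetical and are not taken from the study.

```python
import math


def fixed_effect_meta(betas, standard_errors):
    """Inverse-variance weighted fixed-effect meta-analysis.

    betas:           per-study effect estimates for one SNP.
    standard_errors: per-study standard errors of those estimates.
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p_value


# Hypothetical summary statistics from two cohorts.
meta_beta, meta_se, meta_z, meta_p = fixed_effect_meta([0.02, 0.04], [0.01, 0.01])
```

    Because only effect sizes and standard errors are pooled, each contributing group can run the same analysis script locally and share results without exchanging individual-level data, consistent with the centrally generated script the abstract describes.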

  13. Accuracy and differential bias in copy number measurement of CCL3L1 in association studies with three auto-immune disorders

    NARCIS (Netherlands)

    Carpenter, D.; Walker, S.; Prescott, N.; Schalkwijk, J.; Armour, J.A.

    2011-01-01

    BACKGROUND: Copy number variation (CNV) contributes to the variation observed between individuals and can influence human disease progression, but the accurate measurement of individual copy numbers is technically challenging. In the work presented here we describe a modification to a previously

  14. Isolation of Resistance Gene Candidates (RGCs) and characterization of an RGC cluster in cassava.

    Science.gov (United States)

    López, C E; Zuluaga, A P; Cooke, R; Delseny, M; Tohme, J; Verdier, V

    2003-08-01

    Plant disease resistance genes (R genes) show significant similarity amongst themselves in terms of both their DNA sequences and structural motifs present in their protein products. Oligonucleotide primers designed from NBS (Nucleotide Binding Site) domains encoded by several R-genes have been used to amplify NBS sequences from the genomic DNA of various plant species, which have been called Resistance Gene Analogues (RGAs) or Resistance Gene Candidates (RGCs). Using specific primers from the NBS and TIR (Toll/Interleukin-1 Receptor) regions, we identified twelve classes of RGCs in cassava (Manihot esculenta Crantz). Two classes were obtained from the PCR-amplification of the TIR domain. The other 10 classes correspond to the NBS sequences and were grouped into two subfamilies. Classes RCa1 to RCa5 are part of the first subfamily and were linked to a TIR domain in the N terminus. Classes RCa6 to RCa10 corresponded to non-TIR NBS-LRR encoding sequences. BAC library screening with the 12 RGC classes as probes allowed the identification of 42 BAC clones that were assembled into 10 contigs and 19 singletons. Members of the two TIR and non-TIR NBS-LRR subfamilies occurred together within individual BAC clones. The BAC screening and Southern hybridization analyses showed that all RGCs were single copy sequences except RCa6 that represented a large and diverse gene family. One BAC contained five NBS sequences and sequence analysis allowed the identification of two complete RGCs encoding two highly similar proteins. This BAC was located on linkage group J with three other RGC-containing BACs. At least one of these genes, RGC2, is expressed constitutively in cassava tissues.

  15. A proteomic approach to investigating gene cluster expression and secondary metabolite functionality in Aspergillus fumigatus.

    Directory of Open Access Journals (Sweden)

    Rebecca A Owens

    A combined proteomics and metabolomics approach was utilised to advance the identification and characterisation of secondary metabolites in Aspergillus fumigatus. Here, implementation of a shotgun proteomic strategy led to the identification of non-redundant mycelial proteins (n = 414) from A. fumigatus including proteins typically under-represented in 2-D proteome maps: proteins with multiple transmembrane regions, hydrophobic proteins and proteins with extremes of molecular mass and pI. Indirect identification of secondary metabolite cluster expression was also achieved, with proteins (n = 18) from LaeA-regulated clusters detected, including GliT encoded within the gliotoxin biosynthetic cluster. Biochemical analysis then revealed that gliotoxin significantly attenuates H2O2-induced oxidative stress in A. fumigatus (p>0.0001), confirming observations from proteomics data. A complementary 2-D/LC-MS/MS approach further elucidated significantly increased abundance (p<0.05) of proliferating cell nuclear antigen (PCNA), NADH-quinone oxidoreductase and the gliotoxin oxidoreductase GliT, along with significantly attenuated abundance (p<0.05) of a heat shock protein, an oxidative stress protein and an autolysis-associated chitinase, when gliotoxin and H2O2 were present, compared to H2O2 alone. Moreover, gliotoxin exposure significantly reduced the abundance of selected proteins (p<0.05) involved in de novo purine biosynthesis. Significantly elevated abundance (p<0.05) of a key enzyme, xanthine-guanine phosphoribosyl transferase Xpt1, utilised in purine salvage, was observed in the presence of H2O2 and gliotoxin. This work provides new insights into the A. fumigatus proteome and experimental strategies, plus mechanistic data pertaining to gliotoxin functionality in the organism.

  16. Temporal clustering of gene expression links the metabolic transcription factor HNF4α to the ER stress-dependent gene regulatory network

    Directory of Open Access Journals (Sweden)

    Angela M Arensdorf

    2013-09-01

    The unfolded protein response (UPR) responds to disruption of endoplasmic reticulum (ER) function by initiating signaling cascades that ultimately culminate in extensive transcriptional regulation. Classically, this regulation includes genes encoding ER chaperones, ER-associated degradation factors, and others involved in secretory protein folding and processing, and is carried out by the transcriptional activators that are produced as a consequence of UPR activation. However, up to half of the mRNAs regulated by ER stress are downregulated rather than upregulated, and the mechanisms linking ER stress and UPR activation to mRNA suppression are poorly understood. To begin to address this issue, we used a bottom-up approach to study the metabolic gene regulatory network controlled by the UPR in the liver, because ER stress in the liver leads to lipid accumulation, and fatty liver disease is the most common liver disease in the western world. qRT-PCR profiling of mouse liver mRNAs during ER stress revealed that suppression of the transcriptional regulators C/EBPα, PPARα, and PGC-1α preceded lipid accumulation, and was then followed by suppression of mRNAs encoding key enzymes involved in fatty acid oxidation and lipoprotein biogenesis and transport. Mice lacking the ER stress sensor ATF6α, which experience persistent ER stress and profound lipid accumulation during challenge, were then used as the basis for a functional genomics approach that allowed genes to be grouped into distinct expression profiles. This clustering predicted that ER stress would suppress the activity of the metabolic transcriptional regulator HNF4α, a finding subsequently confirmed by chromatin immunoprecipitation at the Cebpa and Pgc1a promoters. Our results establish a framework for hepatic gene regulation during ER stress and suggest that HNF4α occupies the apex of that framework. They also provide a unique resource for the community to further explore the temporal

  17. Antibiotic discovery throughout the Small World Initiative: A molecular strategy to identify biosynthetic gene clusters involved in antagonistic activity.

    Science.gov (United States)

    Davis, Elizabeth; Sloan, Tyler; Aurelius, Krista; Barbour, Angela; Bodey, Elijah; Clark, Brigette; Dennis, Celeste; Drown, Rachel; Fleming, Megan; Humbert, Allison; Glasgo, Elizabeth; Kerns, Trent; Lingro, Kelly; McMillin, MacKenzie; Meyer, Aaron; Pope, Breanna; Stalevicz, April; Steffen, Brittney; Steindl, Austin; Williams, Carolyn; Wimberley, Carmen; Zenas, Robert; Butela, Kristen; Wildschutte, Hans

    2017-06-01

    The emergence of bacterial pathogens resistant to all known antibiotics is a global health crisis. Adding to this problem is that major pharmaceutical companies have shifted away from antibiotic discovery due to low profitability. As a result, the pipeline of new antibiotics is essentially dry and many bacteria now resist the effects of most commonly used drugs. To address this global health concern, citizen science through the Small World Initiative (SWI) was formed in 2012. As part of SWI, students isolate bacteria from their local environments, characterize the strains, and assay for antibiotic production. During the 2015 fall semester at Bowling Green State University, students isolated 77 soil-derived bacteria and genetically characterized strains using the 16S rRNA gene, identified strains exhibiting antagonistic activity, and performed an expanded SWI workflow using transposon mutagenesis to identify a biosynthetic gene cluster involved in toxigenic compound production. We identified one mutant with loss of antagonistic activity and through subsequent whole-genome sequencing and linker-mediated PCR identified a 24.9 kb biosynthetic gene locus likely involved in inhibitory activity in that mutant. Further assessment against human pathogens demonstrated the inhibition of Bacillus cereus, Listeria monocytogenes, and methicillin-resistant Staphylococcus aureus in the presence of this compound, thus supporting our molecular strategy as an effective research pipeline for SWI antibiotic discovery and genetic characterization. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.

  18. Clusters of conserved beta cell marker genes for assessment of beta cell phenotype

    DEFF Research Database (Denmark)

    Martens, Geert A; Jiang, Lei; Hellemans, Karine H

    2011-01-01

    The aim of this study was to establish a gene expression blueprint of pancreatic beta cells conserved from rodents to humans and to evaluate its applicability to assess shifts in the beta cell differentiated state. Genome-wide mRNA expression profiles of isolated beta cells were compared to those...

  19. Functional Reconstitution of a Fungal Natural Product Gene Cluster by Advanced Genome Editing

    DEFF Research Database (Denmark)

    Weber, Jakob; Valiante, Vito; Nødvig, Christina Spuur

    2017-01-01

    is not produced among different isolates. Combining computational analysis with targeted gene editing, we could link a single nucleotide insertion in the polyketide synthase of the trypacidin biosynthetic pathway and reconstitute its production in a nonproducing strain. Thus, we present a CRISPR/Cas9-based tool...

  20. Biosynthesis of Antinutritional Alkaloids in Solanaceous Crops Is Mediated by Clustered Genes

    NARCIS (Netherlands)

    Itkin, M.; Heinig, U.; Tzfadia, O.; Bhide, A.J.; Shinde, B.; Cardenas, P.D.; Bocobza, S.E.; Unger, T.; Malitsky, S.; Finkers, H.J.; Tikunov, Y.M.; Bovy, A.G.; Chikate, Y.; Singh, P.; Rogachev, I.; Beekwilder, J.; Giri, A.P.; Aharoni, A.

    2013-01-01

    Steroidal glycoalkaloids (SGAs) such as α-solanine found in solanaceous food plants (for example, potato) are antinutritional factors for humans. Comparative coexpression analysis between tomato and potato coupled with chemical profiling revealed an array of 10 genes that partake in SGA

  1. ATAD3 gene cluster deletions cause cerebellar dysfunction associated with altered mitochondrial DNA and cholesterol metabolism.

    Science.gov (United States)

    Desai, Radha; Frazier, Ann E; Durigon, Romina; Patel, Harshil; Jones, Aleck W; Dalla Rosa, Ilaria; Lake, Nicole J; Compton, Alison G; Mountford, Hayley S; Tucker, Elena J; Mitchell, Alice L R; Jackson, Deborah; Sesay, Abdul; Di Re, Miriam; van den Heuvel, Lambert P; Burke, Derek; Francis, David; Lunke, Sebastian; McGillivray, George; Mandelstam, Simone; Mochel, Fanny; Keren, Boris; Jardel, Claude; Turner, Anne M; Ian Andrews, P; Smeitink, Jan; Spelbrink, Johannes N; Heales, Simon J; Kohda, Masakazu; Ohtake, Akira; Murayama, Kei; Okazaki, Yasushi; Lombès, Anne; Holt, Ian J; Thorburn, David R; Spinazzola, Antonella

    2017-06-01

    Although mitochondrial disorders are clinically heterogeneous, they frequently involve the central nervous system and are among the most common neurogenetic disorders. Identifying the causal genes has benefited enormously from advances in high-throughput sequencing technologies; however, once the defect is known, researchers face the challenge of deciphering the underlying disease mechanism. Here we characterize large biallelic deletions in the region encoding the ATAD3C, ATAD3B and ATAD3A genes. Although high homology complicates genomic analysis of the ATAD3 defects, they can be identified by targeted analysis of standard single nucleotide polymorphism array and whole exome sequencing data. We report deletions that generate chimeric ATAD3B/ATAD3A fusion genes in individuals from four unrelated families with fatal congenital pontocerebellar hypoplasia, whereas a case with genomic rearrangements affecting the ATAD3C/ATAD3B genes on one allele and ATAD3B/ATAD3A genes on the other displays later-onset encephalopathy with cerebellar atrophy, ataxia and dystonia. Fibroblasts from affected individuals display mitochondrial DNA abnormalities, associated with multiple indicators of altered cholesterol metabolism. Moreover, drug-induced perturbations of cholesterol homeostasis cause mitochondrial DNA disorganization in control cells, while mitochondrial DNA aggregation in the genetic cholesterol trafficking disorder Niemann-Pick type C disease further corroborates the interdependence of mitochondrial DNA organization and cholesterol. These data demonstrate the integration of mitochondria in cellular cholesterol homeostasis, in which ATAD3 plays a critical role. The dual problem of perturbed cholesterol metabolism and mitochondrial dysfunction could be widespread in neurological and neurodegenerative diseases. © The Author (2017). Published by Oxford University Press on behalf of the Guarantors of Brain.

  2. Characterization of the aldo-keto reductase 1C gene cluster on pig chromosome 10: possible associations with reproductive traits

    Directory of Open Access Journals (Sweden)

    Nonneman Dan J

    2006-09-01

    Background: The rate of pubertal development and weaning to estrus interval are correlated and affect reproductive efficiency of swine. Quantitative trait loci (QTL) for age of puberty, nipple number and ovulation rate have been identified in Meishan crosses on pig chromosome 10q (SSC10) near the telomere, which is homologous to human chromosome 10p15 and contains an aldo-keto reductase (AKR) gene cluster with at least six family members. AKRs are tissue-specific hydroxysteroid dehydrogenases that interconvert weak steroid hormones to their more potent counterparts and regulate processes involved in development, homeostasis and reproduction. Because of their location in the swine genome and their implication in reproductive physiology, this gene cluster was characterized and evaluated for effects on reproductive traits in swine. Results: Screening the porcine CHORI-242 BAC library with a full-length AKR1C4 cDNA identified 7 positive clones, and sample sequencing of 5 BAC clones revealed 5 distinct AKR1C genes (AKR1CL2 and AKR1C1 through 4), which mapped to 126–128 cM on SSC10. Using the IMpRH 7000-rad and IMNpRH2 12000-rad radiation hybrid panels, these 5 genes mapped between microsatellite markers SWR67 and SW2067. Comparison of sequence data with the porcine BAC fingerprint map shows that the cluster of genes resides in a 300 kb region. Twelve SNPs were genotyped in gilts observed for age at first estrus and ovulation rate from the F8 and F10 generations of one-quarter Meishan descendants of the USMARC resource population. Age at puberty, nipple number and ovulation rate data were analyzed for association with genotypes by MTDFREML using an animal model. One SNP, a phenylalanine to isoleucine substitution in AKR1C2, was associated with age of puberty (p = 0.07) and possibly ovulation rate (p = 0.102). Two SNPs in AKR1C4 were significantly associated with nipple number (p ≤ 0.03) and another possibly associated with age at puberty (p = 0

  3. BGDMdocker: a Docker workflow for data mining and visualization of bacterial pan-genomes and biosynthetic gene clusters

    Directory of Open Access Journals (Sweden)

    Gong Cheng

    2017-11-01

    Recently, Docker technology has received increasing attention throughout the bioinformatics community. However, its implementation has not yet been mastered by most biologists; accordingly, its application in biological research has been limited. In order to popularize this technology in the field of bioinformatics and to promote the use of publicly available bioinformatics tools, such as Dockerfiles and Images from communities, government sources, and private owners in the Docker Hub Registry and other Docker-based resources, we introduce here a complete and accurate bioinformatics workflow based on Docker. The present workflow enables analysis and visualization of pan-genomes and biosynthetic gene clusters of bacteria. This provides a new solution for bioinformatics mining of big data from various publicly available biological databases. The present step-by-step guide creates an integrative workflow through a Dockerfile to allow researchers to build their own Image and run a Container easily.

  4. Characterization of a large deletion in the β-globin gene cluster in a newborn with hemoglobin FE

    Energy Technology Data Exchange (ETDEWEB)

    Louie, E.; Dietz, L.; Shafer, F. [Children's Hospital, Oakland, CA (United States)] [and others]

    1994-09-01

    A sample from a newborn with hemoglobin FE screen results was obtained to investigate whether E/E or E/β° thalassemia was present using polymerase chain reaction (PCR) methodology. The newborn appeared homozygous for the hemoglobin E mutation in our initial study, but the parents' genotypes did not support this diagnosis. The father is homozygous for the absence of the hemoglobin E mutation (non E/non E) and the mother is heterozygous (E/non E) for this mutation. The limitation of PCR analysis is an assumption that the amplification of the two β-globin alleles is equivalent. A large deletion on one β-globin gene, which would produce E/β° thalassemia, would be missed if it included part or the entire region subjected to amplification. The family results were consistent with either non-paternity, sample mix-up or such a deletion of the β-globin gene in the father and child. To rule out the possibility of non-paternity, two polymorphic loci (HLA on chromosome 6 and a VNTR system on chromosome 17) that are outside of the β-globin gene were analyzed; they show that inheritance is consistent, and the likelihood of a sample mix-up is thereby reduced. We therefore believe there is a gene deletion in this family. At the present time, analyses of the RFLPs that are 5′ of the β-globin gene cluster show that the polymorphisms most distal from the 5′ β-globin gene are not being inherited as expected. These results support our interpretation that a deletion exists in the father and was inherited by the child. The father's clinical picture of possible HPFH (the father has 12% hemoglobin F) also supports the interpretation of a deletion in this family. Deletions of the β-globin gene within this ethnic group are rare. Currently, Southern blots on the family are being probed to determine the extent of the putative deletion.

  5. Combining multiple hypothesis testing and affinity propagation clustering leads to accurate, robust and sample size independent classification on gene expression data

    Directory of Open Access Journals (Sweden)

    Sakellariou Argiris

    2012-10-01

    Background: A feature selection method in microarray gene expression data should be independent of platform, disease and dataset size. Our hypothesis is that among the statistically significant ranked genes in a gene list, there should be clusters of genes that share similar biological functions related to the investigated disease. Thus, instead of keeping N top ranked genes, it would be more appropriate to define and keep a number of gene cluster exemplars. Results: We propose a hybrid FS method (mAP-KL), which combines multiple hypothesis testing and the affinity propagation (AP) clustering algorithm along with the Krzanowski & Lai cluster quality index, to select a small yet informative subset of genes. We applied mAP-KL on real microarray data, as well as on simulated data, and compared its performance against 13 other feature selection approaches. Across a variety of diseases and number of samples, mAP-KL presents competitive classification results, particularly in neuromuscular diseases, where its overall AUC score was 0.91. Furthermore, mAP-KL generates concise yet biologically relevant and informative N-gene expression signatures, which can serve as a valuable tool for diagnostic and prognostic purposes, as well as a source of potential disease biomarkers in a broad range of diseases. Conclusions: mAP-KL is a data-driven and classifier-independent hybrid feature selection method, which applies to any disease classification problem based on microarray data, regardless of the available samples. Combining multiple hypothesis testing and AP leads to subsets of genes, which classify unknown samples from both small and large patient cohorts with high accuracy.

  6. Identification of an extensive gene cluster among a family of PPOs in Trifolium pratense L. (red clover) using a large insert BAC library.

    Science.gov (United States)

    Winters, Ana; Heywood, Sue; Farrar, Kerrie; Donnison, Iain; Thomas, Ann; Webb, K Judith

    2009-07-20

    Polyphenol oxidase (PPO) activity in plants is a trait with potential economic, agricultural and environmental impact. In relation to the food industry, PPO-induced browning causes unacceptable discolouration in fruit and vegetables; from an agricultural perspective, PPO can protect plants against pathogens and environmental stress, and improve ruminant growth by increasing nitrogen absorption and decreasing nitrogen loss to the environment through the animal's urine. The high PPO legume, red clover, has a significant economic and environmental role in sustaining low-input organic and conventional farms. Molecular markers for a range of important agricultural traits are being developed for red clover, and improved knowledge of PPO genes and their structure will facilitate molecular breeding. A bacterial artificial chromosome (BAC) library comprising 26,016 BAC clones with an average 135 Kb insert size was constructed from Trifolium pratense L. (red clover), a diploid legume with a haploid genome size of 440-637 Mb. Library coverage of 6-8 genome equivalents ensured good representation of genes: the library was screened for polyphenol oxidase (PPO) genes. Two single copy PPO genes, PPO4 and PPO5, were identified to add to a family of three, previously reported, paralogous genes (PPO1-PPO3). Multiple PPO1 copies were identified and characterised, revealing a subfamily comprising three variants PPO1/2, PPO1/4 and PPO1/5. Six PPO genes clustered within the genome: four separate BAC clones could be assembled onto a predicted 190-510 Kb single BAC contig. A PPO gene family in red clover resides as a cluster of at least 6 genes. Three of these genes have high homology, suggesting a more recent evolutionary event. This PPO cluster covers a longer region of the genome than clusters detected in rice or previously reported in tomato. Full-length coding sequences from PPO4, PPO5, PPO1/5 and PPO1/4 will facilitate functional studies and provide genetic markers for plant breeding.
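The library coverage quoted in this abstract can be checked with a short calculation. The sketch below is not from the paper: the function names are hypothetical, and the representation probability uses the standard Clarke-Carbon relation P = 1 - (1 - f)^N, which assumes clones are sampled uniformly at random from the genome. Only the clone count, mean insert size and genome size range are taken from the abstract.

```python
def genome_equivalents(n_clones, mean_insert_kb, genome_size_mb):
    """Total cloned DNA divided by haploid genome size (both in kb)."""
    return n_clones * mean_insert_kb / (genome_size_mb * 1000.0)


def representation_probability(n_clones, mean_insert_kb, genome_size_mb):
    """Clarke-Carbon estimate of the chance that a given locus is in the library.

    Assumes random, unbiased cloning; real libraries deviate from this.
    """
    fraction = mean_insert_kb / (genome_size_mb * 1000.0)  # insert / genome
    return 1.0 - (1.0 - fraction) ** n_clones


# Figures reported in the abstract
N_CLONES = 26016
INSERT_KB = 135
GENOME_MB_RANGE = (440, 637)

for genome_mb in GENOME_MB_RANGE:
    cov = genome_equivalents(N_CLONES, INSERT_KB, genome_mb)
    prob = representation_probability(N_CLONES, INSERT_KB, genome_mb)
    print(f"genome {genome_mb} Mb: {cov:.1f} genome equivalents, P(locus present) = {prob:.4f}")
```

Under these assumptions the coverage works out to roughly 5.5 to 8.0 genome equivalents across the stated genome size range, broadly in line with the 6-8 reported by the authors.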

  7. Clusters of ancestrally related genes that show paralogy in whole or in part are a major feature of the genomes of humans and other species.

    Directory of Open Access Journals (Sweden)

    Michael B Walker

    Arrangements of genes along chromosomes are a product of evolutionary processes, and we can expect that preferable arrangements will prevail over the span of evolutionary time, often being reflected in the non-random clustering of structurally and/or functionally related genes. Such non-random arrangements can arise by two distinct evolutionary processes: duplications of DNA sequences that give rise to clusters of genes sharing both sequence similarity and common sequence features, and the migration together of genes related by function, but not by common descent. To provide a background for distinguishing between the two, which is important for future efforts to unravel the evolutionary processes involved, we here provide a description of the extent to which ancestrally related genes are found in proximity. Towards this purpose, we combined information from five genomic datasets: InterPro, SCOP, PANTHER, Ensembl protein families, and Ensembl gene paralogs. The results are provided in publicly available datasets (http://cgd.jax.org/datasets/clustering/paraclustering.shtml) describing the extent to which ancestrally related genes are in proximity beyond what is expected by chance (i.e. form paraclusters) in the human and nine other vertebrate genomes, as well as the D. melanogaster, C. elegans, A. thaliana, and S. cerevisiae genomes. With the exception of Saccharomyces, paraclusters are a common feature of the genomes we examined. In the human genome they are estimated to include at least 22% of all protein coding genes. Paraclusters are far more prevalent among some gene families than others, are highly species or clade specific and can evolve rapidly, sometimes in response to environmental cues. Altogether, they account for a large portion of the functional clustering previously reported in several genomes.

  8. A Metabolic Gene Cluster in the Wheat W1 and the Barley Cer-cqu Loci Determines β-Diketone Biosynthesis and Glaucousness.

    Science.gov (United States)

    Hen-Avivi, Shelly; Savin, Orna; Racovita, Radu C; Lee, Wing-Sham; Adamski, Nikolai M; Malitsky, Sergey; Almekias-Siegl, Efrat; Levy, Matan; Vautrin, Sonia; Bergès, Hélène; Friedlander, Gilgi; Kartvelishvily, Elena; Ben-Zvi, Gil; Alkan, Noam; Uauy, Cristobal; Kanyuka, Kostya; Jetter, Reinhard; Distelfeld, Assaf; Aharoni, Asaph

    2016-06-01

    The glaucous appearance of wheat (Triticum aestivum) and barley (Hordeum vulgare) plants, that is the light bluish-gray look of flag leaf, stem, and spike surfaces, results from deposition of cuticular β-diketone wax on their surfaces; this phenotype is associated with high yield, especially under drought conditions. Despite extensive genetic and biochemical characterization, the molecular genetic basis underlying the biosynthesis of β-diketones remains unclear. Here, we discovered that the wheat W1 locus contains a metabolic gene cluster mediating β-diketone biosynthesis. The cluster comprises genes encoding proteins of several families including type-III polyketide synthases, hydrolases, and cytochrome P450s related to known fatty acid hydroxylases. The cluster region was identified in both genetic and physical maps of glaucous and glossy tetraploid wheat, demonstrating entirely different haplotypes in these accessions. Complementary evidence obtained through gene silencing in planta and heterologous expression in bacteria supports a model for a β-diketone biosynthesis pathway involving members of these three protein families. Mutations in homologous genes were identified in the barley eceriferum mutants defective in β-diketone biosynthesis, demonstrating a gene cluster also in the β-diketone biosynthesis Cer-cqu locus in barley. Hence, our findings open new opportunities to breed major cereal crops for surface features that impact yield and stress response. © 2016 American Society of Plant Biologists. All rights reserved.

  9. Clusters of conserved beta cell marker genes for assessment of beta cell phenotype

    DEFF Research Database (Denmark)

    Martens, Geert A; Jiang, Lei; Hellemans, Karine H

    2011-01-01

    The aim of this study was to establish a gene expression blueprint of pancreatic beta cells conserved from rodents to humans and to evaluate its applicability to assess shifts in the beta cell differentiated state. Genome-wide mRNA expression profiles of isolated beta cells were compared to those...... microdissected beta cells, monitor adaptations of the beta cell phenotype to fasting, and retrieve possible conserved transcriptional regulators....

  10. Restriction site polymorphisms in the pig beta-globin gene cluster.

    Science.gov (United States)

    Rando, A; Masina, P

    1985-01-01

    A restriction fragment length polymorphism was detected in pig DNA digested with Hind III restriction endonuclease and probed with rabbit beta 1-globin gene. Eight different phenotypes were observed and for six of them family data demonstrated that they are determined by three alleles. As this polymorphism is not found with four other restriction endonucleases (Bam HI, Eco RI, Kpn I, and Pst I), single point mutations are proposed to explain the observed differences.

  11. Identification of the First Riboflavin Catabolic Gene Cluster Isolated from Microbacterium maritypicum G10

    OpenAIRE

    Xu, Hui; Chakrabarty, Yindrila; Philmus, Benjamin; Mehta, Angad P.; Bhandari, Dhananjay; Hohmann, Hans-Peter; Begley, Tadhg P.

    2016-01-01

    Riboflavin is a common cofactor, and its biosynthetic pathway is well characterized. However, its catabolic pathway, despite intriguing hints in a few distinct organisms, has never been established. This article describes the isolation of a Microbacterium maritypicum riboflavin catabolic strain, and the cloning of the riboflavin catabolic genes. RcaA, RcaB, RcaD, and RcaE were overexpressed and biochemically characterized as riboflavin kinase, riboflavin reductase, ribokinase, and riboflavin ...

  12. Genomic organisation of the Mal d 1 gene cluster on linkage group 16 in apple

    OpenAIRE

    Pagliarani, Giulia; Paris, Roberta; Iorio, Anna Rosa; Tartarini, Stefano; Del Duca, Stefano; Arens, Paul; Peters, Sander; van de Weg, Eric

    2011-01-01

    European populations exhibit progressive sensitisation to food allergens, and apples are one of the foods for which sensitisation is observed most frequently. Apple cultivars vary greatly in their allergenic characteristics, and a better understanding of the genetic basis of low allergenicity may therefore allow allergic individuals to increase their fruit intake. Mal d 1 is considered to be a major apple allergen, and this protein is encoded by the most complex allergen gene family. Not all ...

  13. Identification and functional analysis of the gene cluster for fructan utilization in Prevotella intermedia.

    Science.gov (United States)

    Fuse, Haruka; Fukamachi, Haruka; Inoue, Mitsuko; Igarashi, Takeshi

    2013-02-25

    Fructanase enzymes hydrolyze the β-2,6 and β-2,1 linkages of levan and inulin fructans, respectively. We analyzed the influence of fructan on the growth of Prevotella intermedia. The growth of P. intermedia was enhanced by addition of inulin, implying that P. intermedia could also use inulin. Based on this finding, we identified and analyzed the genes encoding a putative fructanase (FruA), sugar transporter (FruB), and fructokinase (FruK) in the genome of strain ATCC25611. Transcript analysis by RT-PCR showed that the fruABK genes were co-transcribed as a single mRNA and semi-quantitative analysis confirmed that the fruA gene was induced in response to fructose and inulin. Recombinant FruA and FruK were purified and characterized biochemically. FruA strongly hydrolyzed inulin, with slight degradation of levan via an exo-type mechanism, revealing that FruA is an exo-β-d-fructanase. FruK converted fructose to fructose-6-phosphate in the presence of ATP, confirming that FruK is an ATP-dependent fructokinase. These results suggest that P. intermedia can utilize fructan as a carbon source for growth, and that the fructanase, sugar transporter, and fructokinase proteins we identified are involved in this fructan utilization. Copyright © 2012 Elsevier B.V. All rights reserved.

  14. Acinetobacter baumannii K27 and K44 capsular polysaccharides have the same K unit but different structures due to the presence of distinct wzy genes in otherwise closely related K gene clusters.

    Science.gov (United States)

    Shashkov, Alexander S; Kenyon, Johanna J; Senchenkova, Sof'ya N; Shneider, Mikhail M; Popova, Anastasiya V; Arbatsky, Nikolay P; Miroshnikov, Konstantin A; Volozhantsev, Nikolay V; Hall, Ruth M; Knirel, Yuriy A

    2016-05-01

    Capsular polysaccharides (CPSs) from Acinetobacter baumannii isolates 1432, 4190 and NIPH 70, which have related gene content at the K locus, were examined, and the chemical structures established using 2D ¹H and ¹³C NMR spectroscopy. The three isolates produce the same pentasaccharide repeat unit, which consists of 5-N-acetyl-7-N-[(S)-3-hydroxybutanoyl] (major) or 5,7-di-N-acetyl (minor) derivatives of 5,7-diamino-3,5,7,9-tetradeoxy-D-glycero-D-galacto-non-2-ulosonic (legionaminic) acid (Leg5Ac7R), D-galactose, N-acetyl-D-galactosamine and N-acetyl-D-glucosamine. However, the linkage between repeat units in NIPH 70 was different to that in 1432 and 4190, and this significantly alters the CPS structure. The KL27 gene cluster in 4190 and KL44 gene cluster in NIPH 70 are organized identically and contain lga genes for Leg5Ac7R synthesis, genes for the synthesis of the common sugars, as well as an itrA2 initiating transferase and four glycosyltransferase genes. They share high-level nucleotide sequence identity for corresponding genes, but differ in the wzy gene encoding the Wzy polymerase. The Wzy proteins, which have different lengths and share no similarity, would form the unrelated linkages in the K27 and K44 structures. The linkages formed by the four shared glycosyltransferases were predicted by comparison with gene clusters that synthesize related structures. These findings unambiguously identify the linkages formed by WzyK27 and WzyK44, and show that the presence of different wzy genes in otherwise closely related K gene clusters changes the structure of the CPS. This may affect its capacity as a protective barrier for A. baumannii. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Related structures of neutral capsular polysaccharides of Acinetobacter baumannii isolates that carry related capsule gene clusters KL43, KL47, and KL88.

    Science.gov (United States)

    Shashkov, Alexander S; Kenyon, Johanna J; Arbatsky, Nikolay P; Shneider, Mikhail M; Popova, Anastasiya V; Miroshnikov, Konstantin A; Hall, Ruth M; Knirel, Yuriy A

    2016-11-29

    Capsular polysaccharides were recovered from four Acinetobacter baumannii isolates, and the following related structures of oligosaccharide repeating units were established by sugar analyses along with 1D and 2D ¹H and ¹³C NMR spectroscopy: NIPH 60 and LUH5544 (K43); NIPH 601 (K47). The K locus for capsule biosynthesis in the genome sequences available for NIPH 60 and LUH5544, designated KL43, was found to be related to gene clusters KL47 in NIPH 601 and KL88 in LUH5548. The three clusters share most gene content, differing in only a small portion that includes additional glycosyltransferase genes in KL47 and KL88, as well as genes encoding distinct Wzy polymerases that were found to form the same α-d-GlcpNAc-(1→6)-α-d-GlcpNAc linkage in K43 and K47. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. A genome-wide association study on androstenone levels in pigs reveals a cluster of candidate genes on chromosome 6

    Directory of Open Access Journals (Sweden)

    Groenen Martien AM

    2010-05-01

    concentration were identified in this commercial breeding line of pigs. Known and new candidate genes cluster especially on SSC6. For one of the most significant SNP variants, the difference in the proportion of animals surpassing the threshold of consumer acceptance between the two homozygous genotypes was as much as 15.6%.

  17. Cloning of the staurosporine biosynthetic gene cluster from Streptomyces sp. TP-A0274 and its heterologous expression in Streptomyces lividans.

    Science.gov (United States)

    Onaka, Hiroyasu; Taniguchi, Shin-ichi; Igarashi, Yasuhiro; Furumai, Tamotsu

    2002-12-01

    Staurosporine is a representative member of the indolocarbazole antibiotics. The entire staurosporine biosynthetic and regulatory gene cluster, spanning 20 kb, was cloned from Streptomyces sp. TP-A0274 and sequenced. The gene cluster consists of 14 ORFs, and an amino acid sequence homology search revealed that it contains three genes, staO, staD, and staP, coding for the enzymes involved in indolocarbazole aglycone biosynthesis; two genes, staG and staN, for bond formation between the aglycone and deoxysugar; eight genes, staA, staB, staE, staJ, staI, staK, staMA, and staMB, for deoxysugar biosynthesis; and one gene, staR, for a transcriptional regulator. Heterologous gene expression of a 38-kb fragment containing a complete set of the biosynthetic genes for staurosporine cloned into pTOYAMAcos confirmed its role in staurosporine biosynthesis. Moreover, the distribution of the gene for chromopyrrolic acid synthase, the key enzyme for the biosynthesis of indolocarbazole aglycone, in actinomycetes was investigated, and rebD homologs were shown to exist only in the strains producing indolocarbazole antibiotics.

  18. Identification of the First Riboflavin Catabolic Gene Cluster Isolated from Microbacterium maritypicum G10.

    Science.gov (United States)

    Xu, Hui; Chakrabarty, Yindrila; Philmus, Benjamin; Mehta, Angad P; Bhandari, Dhananjay; Hohmann, Hans-Peter; Begley, Tadhg P

    2016-11-04

    Riboflavin is a common cofactor, and its biosynthetic pathway is well characterized. However, its catabolic pathway, despite intriguing hints in a few distinct organisms, has never been established. This article describes the isolation of a Microbacterium maritypicum riboflavin catabolic strain, and the cloning of the riboflavin catabolic genes. RcaA, RcaB, RcaD, and RcaE were overexpressed and biochemically characterized as riboflavin kinase, riboflavin reductase, ribokinase, and riboflavin hydrolase, respectively. Based on these activities, a pathway for riboflavin catabolism is proposed. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  19. Identification of the First Riboflavin Catabolic Gene Cluster Isolated from Microbacterium maritypicum G10*

    Science.gov (United States)

    Xu, Hui; Chakrabarty, Yindrila; Philmus, Benjamin; Mehta, Angad P.; Bhandari, Dhananjay; Hohmann, Hans-Peter; Begley, Tadhg P.

    2016-01-01

    Riboflavin is a common cofactor, and its biosynthetic pathway is well characterized. However, its catabolic pathway, despite intriguing hints in a few distinct organisms, has never been established. This article describes the isolation of a Microbacterium maritypicum riboflavin catabolic strain, and the cloning of the riboflavin catabolic genes. RcaA, RcaB, RcaD, and RcaE were overexpressed and biochemically characterized as riboflavin kinase, riboflavin reductase, ribokinase, and riboflavin hydrolase, respectively. Based on these activities, a pathway for riboflavin catabolism is proposed. PMID:27590337

  20. Functional Operons in Secondary Metabolic Gene Clusters in Glarea lozoyensis (Fungi, Ascomycota, Leotiomycetes).

    Science.gov (United States)

    Yue, Qun; Chen, Li; Li, Yan; Bills, Gerald F; Zhang, Xinyu; Xiang, Meichun; Li, Shaojie; Che, Yongsheng; Wang, Chengshu; Niu, Xuemei; An, Zhiqiang; Liu, Xingzhong

    2015-06-16

    Operons are multigene transcriptional units which occur mostly in prokaryotes but rarely in eukaryotes. Protein-coding operons have not been reported in the Fungi even though they represent a very diverse kingdom of organisms. Here, we report a functional operon involved in the secondary metabolism of the fungus Glarea lozoyensis belonging to Leotiomycetes (Ascomycota). Two contiguous genes, glpks3 and glnrps7, encoding polyketide synthase and nonribosomal peptide synthetase, respectively, are cotranscribed into one dicistronic mRNA under the control of the same promoter, and the mRNA is then translated into two individual proteins, GLPKS3 and GLNRPS7. Heterologous expression in Aspergillus nidulans shows that the GLPKS3-GLNRPS7 enzyme complex catalyzes the biosynthesis of a novel pyrrolidinedione-containing compound, xenolozoyenone (compound 1), which indicates the operon is functional. Although it is structurally similar to prokaryotic operons, the glpks3-glnrps7 operon locus has a monophylogenic origin from fungi rather than having been horizontally transferred from prokaryotes. Moreover, two additional operons, glpks28-glnrps8 and glpks29-glnrps9, were verified at the transcriptional level in the same fungus. This is the first report of protein-coding operons in a member of the Fungi. Operons are multigene transcriptional units which occur mostly in prokaryotes but rarely in eukaryotes. Three operon-like gene structures for secondary metabolism that were discovered in the filamentous fungus Glarea lozoyensis are the first examples of protein-coding operons identified in a member of the Fungi. Among them, the glpks3-glnrps7 operon is responsible for the biosynthesis of xenolozoyenone, which is a novel tetramic acid-containing compound. Although structurally similar to prokaryotic operons, the glpks3-glnrps7 operon locus did not result from horizontal gene transfer from prokaryotes. 
In addition, operonlike structures have been predicted in silico to be common in

  1. Sustainable production of bioactive compounds by sponges--cell culture and gene cluster approach: a review.

    Science.gov (United States)

    Müller, Werner E G; Grebenjuk, Vladislav A; Le Pennec, Gaël; Schröder, Heinz- C; Brümmer, Franz; Hentschel, Ute; Müller, Isabel M; Breter, Hans- J

    2004-01-01

    Sponges (phylum Porifera) are sessile marine filter feeders that have developed efficient defense mechanisms against foreign attackers such as viruses, bacteria, or eukaryotic organisms. Protected by a highly complex immune system, as well as by the capacity to produce efficient antiviral compounds (e.g., nucleoside analogues), antimicrobial compounds (e.g., polyketides), and cytostatic compounds (e.g., avarol), they have not become extinct during the last 600 million years. It can be assumed that during this long period of time, bacteria and microorganisms coevolved with sponges, and thus acquired a complex common metabolism. It is suggested that (at least) some of the bioactive secondary metabolites isolated from sponges are produced by functional enzyme clusters, which originated from the sponges and their associated microorganisms. As a consequence, both the host cells and the microorganisms lost the ability to grow independently from each other. Therefore, it was--until recently--impossible to culture sponge cells in vitro. Also the predominant number of "symbiotic bacteria" proved to be nonculturable. In order to exploit the bioactive potential of both the sponge and the "symbionts," a 3D-aggregate primmorph culture system was established; also it was proved that one bioactive compound, avarol/avarone, is produced by the sponge Dysidea avara. Another promising way to utilize the bioactive potential of the microorganisms is the cloning and heterologous expression of enzymes involved in secondary metabolism, such as the polyketide synthases.

  2. A brain-specific gene cluster isolated from the region of the mouse obesity locus is expressed in the adult hypothalamus and during mouse development

    Energy Technology Data Exchange (ETDEWEB)

    Laig-Webster, M.; Lim, M.E.; Chehab, F.F. [Univ. of California, San Francisco, CA (United States)]

    1994-09-01

    The molecular defect underlying an autosomal recessive form of genetic obesity in a classical mouse model C57 BL/6J-ob/ob has not yet been elucidated. Whereas metabolic and physiological disturbances such as diabetes and hypertension are associated with obesity, the site of expression and the nature of the primary lesion responsible for this cascade of events remains elusive. Our efforts aimed at the positional cloning of the ob gene by YAC contig mapping and gene identification have resulted in the cloning of a brain-specific gene cluster from the ob critical region. The expression of this gene cluster is remarkably complex owing to the multitude of brain-specific mRNA transcripts detected on Northern blots. cDNA cloning of these transcripts suggests that they are expressed from different genes as well as by alternate splicing mechanisms. Furthermore, the genomic organization of the cluster appears to consist of at least two identical promoters displaying CpG islands characteristic of housekeeping genes, yet clearly involving tissue-specific expression. Sense and anti-sense synthetic RNA probes were derived from a common DNA sequence on 3 cDNA clones and hybridized to 8-16 days mouse embryonic stages and mouse adult brain sections. Expression in development was noticeable as of the 11th day of gestation and confined to the central nervous system mainly in the telencephalon and spinal cord. Coronal and sagittal sections of the adult mouse brain showed expression only in 3 different regions of the brain stem. In situ hybridization to mouse hypothalamus sections revealed the presence of a localized and specialized group of cells expressing high levels of mRNA, suggesting that this gene cluster may also be involved in the regulation of hypothalamic activities. 
The hypothalamus has long been hypothesized as a primary candidate tissue for the expression of the obesity gene mainly because of its well-established role in the regulation of energy metabolism and food intake.

  3. Unique actinomycetes from marine caves and coral reef sediments provide novel PKS and NRPS biosynthetic gene clusters.

    Science.gov (United States)

    Hodges, Tyler W; Slattery, Marc; Olson, Julie B

    2012-06-01

    In the ever-expanding search for novel bioactive molecules and enzymes, marine actinomycetes have proven to be a productive source. While open reef sediment and sponge-associated actinomycetes have been extensively examined, their marine cave counterparts remain unevaluated. Anchialine cave systems in the Bahamas offered an ideal setting to evaluate the occurrence and variation within sediment-associated actinomycete communities. While in close geographical proximity to open reef environments, these systems provide a specialized environmental niche devoid of light and direct exposure to nutrient input. In the present study, selective isolation techniques and molecular methods were used to test the hypothesis that variable distribution of actinomycetes and secondary metabolite gene clusters occur between open reef and marine cave systems. The results indicated that differences exist within the culturable sediment-associated actinomycete communities between marine caves and open reef systems, with members of the genus Streptomyces dominating cultures from open reef sediments and a more diverse suite of actinomycetes isolated from marine cave sediment samples. Within the cave isolates, members of the proposed genus Solwaraspora were the most represented. Based on PKS- and NRPS-gene-targeted PCR amplification and sequencing, geographic variation in the occurrence of these biosynthetic pathways was also observed. These findings indicate that marine cave systems are a lucrative source in the search for novel secondary metabolite producers with biotechnological applications and that environmental and geographic factors likely affect the occurrence of these biosynthetic pathways.

  4. The Escherichia coli Serogroup O1 and O2 Lipopolysaccharides Are Encoded by Multiple O-antigen Gene Clusters

    Science.gov (United States)

    Delannoy, Sabine; Beutin, Lothar; Mariani-Kurkdjian, Patricia; Fleiss, Aubin; Bonacorsi, Stéphane; Fach, Patrick

    2017-01-01

    Escherichia coli strains belonging to serogroups O1 and O2 are frequently associated with human infections, especially extra-intestinal infections such as bloodstream infections or urinary tract infections. These strains can be associated with a large array of flagellar antigens. Because of their frequency and clinical importance, a reliable detection of E. coli O1 and O2 strains and also the frequently associated K1 capsule is important for diagnosis and source attribution of E. coli infections in humans and animals. By sequencing the O-antigen clusters of various O1 and O2 strains we showed that the serogroups O1 and O2 are encoded by different sets of O-antigen encoding genes and identified potentially new O-groups. We developed qPCR-assays to detect the various O1 and O2 variants and the K1-encoding gene. These qPCR assays proved to be 100% sensitive and 100% specific and could be valuable tools for the investigations of zoonotic and food-borne infection of humans with O1 and O2 extra-intestinal (ExPEC) or Shiga toxin-producing E. coli (STEC) strains. PMID:28224115

  5. K19 capsular polysaccharide of Acinetobacter baumannii is produced via a Wzy polymerase encoded in a small genomic island rather than the KL19 capsule gene cluster.

    Science.gov (United States)

    Kenyon, Johanna J; Shneider, Mikhail M; Senchenkova, Sofya N; Shashkov, Alexander S; Siniagina, Maria N; Malanin, Sergey Y; Popova, Anastasiya V; Miroshnikov, Konstantin A; Hall, Ruth M; Knirel, Yuriy A

    2016-08-01

    Polymerization of the oligosaccharides (K units) of complex capsular polysaccharides (CPSs) requires a Wzy polymerase, which is usually encoded in the gene cluster that directs K unit synthesis. Here, a gene cluster at the Acinetobacter K locus (KL) that lacks a wzy gene, KL19, was found in Acinetobacter baumannii ST111 isolates 28 and RBH2 recovered from hospitals in the Russian Federation and Australia, respectively. However, these isolates produced long-chain capsule, and a wzy gene was found in a 6.1 kb genomic island (GI) located adjacent to the cpn60 gene. The GI also includes an acetyltransferase gene, atr25, which is interrupted by an insertion sequence (IS) in RBH2. The capsule structure from both strains was →3)-α-d-GalpNAc-(1→4)-α-d-GalpNAcA-(1→3)-β-d-QuipNAc4NAc-(1→, determined using NMR spectroscopy. Biosynthesis of the K unit was inferred to be initiated with QuiNAc4NAc, and hence the Wzy forms the β-(1→3) linkage between QuipNAc4NAc and GalpNAc. The GalpNAc residue is 6-O-acetylated in isolate 28 only, showing that atr25 is responsible for this acetylation. The same GI with or without an IS in atr25 was found in draft genomes of other KL19 isolates, as well as ones carrying a closely related CPS gene cluster, KL39, which differs from KL19 only in a gene for an acyltransferase in the QuiNAc4NR synthesis pathway. Isolates carrying a KL1 variant with the wzy and atr genes each interrupted by an ISAba125 also have this GI. To our knowledge, this study is the first report of genes involved in capsule biosynthesis normally found at the KL located elsewhere in A. baumannii genomes.

  6. Association analysis of the IL-1 gene cluster polymorphisms with aggressive and chronic periodontitis in the Algerian population.

    Science.gov (United States)

    Boukortt, Kawther Nourelhouda; Saidi-Ouahrani, Nadjia; Boukerzaza, Boubaker; Ouhaibi-Djellouli, Hadjira; Hachmaoui, Khalida; Benaissa, Fatima Zohra; Taleb, Leila; Drabla-Ouahrani, Hayet; Deba, Tahria; Ouledhamou, Sid Ahmed; Mehtar, Nadhera; Boudjema, Abdellah

    2015-10-01

    There is strong evidence that genetic as well as environmental factors affect the development of periodontitis. Various studies suggest that genetic polymorphisms of the interleukin-1 (IL-1) genes are associated with an increased risk of developing the disease. The aim of the present study was to investigate the possible relationship between two SNPs of the IL-1 gene cluster, IL-1B (C+3954T) (rs1143634) and IL-1A (C-889T) (rs1800587), and the risk of aggressive and chronic periodontitis in a case-control study in the Algerian population. A total of 279 subjects were recruited and received a periodontal examination: 128 healthy controls and 151 cases. Among the cases, 91 patients had chronic periodontitis and 60 had the aggressive form. All subjects were genotyped for the IL-1A (C-889T) and IL-1B (C+3954T) polymorphisms using TaqMan real-time PCR technology. Frequencies of IL-1 alleles, genotypes and haplotypes were also examined. Significant differences were found in the carriage rate of both minor alleles of the IL-1A (C-889T) and IL-1B (C+3954T) polymorphisms of aggressive periodontitis cases compared with healthy controls (OR [95%CI]=1.61 [1.03-2.49], p=0.03), (OR [95%CI]=1.69 [1.09-2.63], p=0.01), respectively. The result did not reach significance with the chronic form. The studied polymorphisms of the IL-1 genes appear to be associated with susceptibility to aggressive periodontitis (AgP) in the Algerian population. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Centrosome clustering and cyclin D1 gene amplification in double minutes are common events in chromosomal unstable bladder tumors

    International Nuclear Information System (INIS)

    Rey, Javier del; Prat, Esther; Ponsa, Immaculada; Lloreta, Josep; Gelabert, Antoni; Algaba, Ferran; Camps, Jordi; Miró, Rosa

    2010-01-01

    Aneuploidy, centrosome abnormalities and gene amplification are hallmarks of chromosome instability (CIN) in cancer. Yet there are no studies of the in vivo behavior of these phenomena within the same bladder tumor. Twenty-one paraffin-embedded bladder tumors were analyzed by conventional comparative genome hybridization and fluorescence in situ hybridization (FISH) with a cyclin D1 gene (CCND1)/centromere 11 dual-color probe. Immunofluorescent staining of α, β and γ tubulin was also performed. Based on the CIN index, defined as the percentage of cells not displaying the modal number for chromosome 11, tumors were classified as CIN-negative and CIN-positive. Fourteen out of 21 tumors were considered CIN-positive. All T1G3 tumors were included in the CIN-positive group whereas the majority of Ta samples were classified as CIN-negative tumors. Centrosome clustering was observed in six out of 12 CIN-positive tumors analyzed. CCND1 amplification in homogeneously staining regions was present in six out of 14 CIN-positive tumors; three of them also showed amplification of this gene in double minutes. Complex in vivo behavior of CCND1 amplicon in bladder tumor cells has been demonstrated by accurate FISH analysis on paraffin-embedded tumors. Positive correlation between high heterogeneity, centrosome abnormalities and CCND1 amplification was found in T1G3 bladder carcinomas. This is the first study to provide insights into the coexistence of CCND1 amplification in homogeneously staining regions and double minutes in primary bladder tumors. It is noteworthy that those patients whose tumors showed double minutes had a significantly shorter overall survival rate (p < 0.001)
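
    The CIN index described in the abstract above (the percentage of cells not displaying the modal number for chromosome 11) can be sketched as follows. This is only an illustration: the function name and the per-cell signal counts are hypothetical, and the abstract does not state the cutoff used to call a tumor CIN-positive, so none is applied here.

```python
from collections import Counter


def cin_index(copy_counts):
    """Return the percentage of cells whose chromosome count differs from the modal count.

    copy_counts: one integer per scored cell (e.g. centromere 11 FISH signals).
    """
    if not copy_counts:
        raise ValueError("need at least one scored cell")
    counts = Counter(copy_counts)
    # most_common(1) yields the modal copy number and how many cells show it
    _modal_number, modal_cells = counts.most_common(1)[0]
    return 100.0 * (len(copy_counts) - modal_cells) / len(copy_counts)


# Hypothetical signal counts for ten cells (not data from the study)
example_cells = [2, 2, 2, 3, 2, 4, 2, 2, 3, 2]
print(cin_index(example_cells))  # 7 of 10 cells are modal, so 30.0
```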

  8. The border sequence of the balhimycin biosynthesis gene cluster from Amycolatopsis balhimycina contains bbr, encoding a StrR-like pathway-specific regulator

    NARCIS (Netherlands)

    Shawky, Riham M.; Puk, Oliver; Wietzorrek, Andreas; Pelzer, Stefan; Takano, Eriko; Wohlleben, Wolfgang; Stegmann, Efthimia

    2007-01-01

    Balhimycin, produced by the actinomycete Amycolatopsis balhimycina DSM5908, is a glycopeptide antibiotic highly similar to vancomycin, the antibiotic of 'last resort' used for the treatment of resistant Gram-positive pathogenic bacteria. Partial sequence of the balhimycin biosynthesis gene cluster

  9. Genome Sequences and Photosynthesis Gene Cluster Composition of a Freshwater Aerobic Anoxygenic Phototroph, Sandarakinorhabdus sp. Strain AAP62, Isolated from the Shahu Lake in Ningxia, China

    Czech Academy of Sciences Publication Activity Database

    Zeng, Yonghui; Feng, F.; Liu, Y.; Koblížek, Michal

    2013-01-01

    Vol. 1, No. 1 (2013) ISSN 2169-8287 R&D Projects: GA ČR GAP501/10/0221; GA MŠk(CZ) ED2.1.00/03.0110 Institutional support: RVO:61388971 Keywords: Gene cluster * Sandarakinorhabdus Subject RIV: EE - Microbiology, Virology http://genomea.asm.org/content/1/1/e00034-13.full

  10. Polymorphic restriction sites in the horse beta-globin gene cluster.

    Science.gov (United States)

    Rando, A; Di Gregorio, P; Masina, P

    1986-01-01

    Horse DNA samples digested with PstI and probed with the rabbit beta 1 globin gene show three phenotypes determined by one fragment of variable length (about 5.1 or 3.3 kb). Family data demonstrate that these fragments segregate as Mendelian alleles. The frequencies of the two alleles are 0.66 for the 3.3-kb fragment and 0.34 for the 5.1-kb one. Another polymorphism has been detected with BamHI. Again three phenotypes determined by two alleles (fragments of 7.5 and 3.8 kb) have been observed. Allelic frequencies of the 7.5- and 3.8-kb fragments are 0.24 and 0.76 respectively. The two polymorphic sites are non-randomly associated.
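
    As a worked illustration of how two alleles give three phenotypes, the sketch below computes the genotype proportions expected from the reported PstI allele frequencies (0.66 for the 3.3-kb fragment, 0.34 for the 5.1-kb fragment). It assumes Hardy-Weinberg equilibrium, which the abstract does not claim; the function name is hypothetical.

```python
def hardy_weinberg(p, label_a, label_b):
    """Expected genotype proportions for a two-allele locus under Hardy-Weinberg equilibrium.

    p is the frequency of allele label_a; allele label_b has frequency 1 - p.
    The equilibrium assumption is ours, not the study's.
    """
    q = 1.0 - p
    return {
        f"{label_a}/{label_a}": p * p,      # homozygous for the first allele
        f"{label_a}/{label_b}": 2 * p * q,  # heterozygous
        f"{label_b}/{label_b}": q * q,      # homozygous for the second allele
    }


# Allele frequencies reported in the abstract for the PstI polymorphism
expected = hardy_weinberg(0.66, "3.3kb", "5.1kb")
for genotype, proportion in expected.items():
    print(f"{genotype}: {proportion:.4f}")
```

    Under that assumption, roughly 44% of horses would be homozygous for the 3.3-kb fragment, 45% heterozygous, and 12% homozygous for the 5.1-kb fragment.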

  11. Hyperdiverse Gene Cluster in Snail Host Conveys Resistance to Human Schistosome Parasites

    Science.gov (United States)

    Tennessen, Jacob A.; Théron, André; Marine, Melanie; Yeh, Jan-Ying; Rognon, Anne; Blouin, Michael S.

    2015-01-01

    Schistosomiasis, a neglected global pandemic, may be curtailed by blocking transmission of the parasite via its intermediate hosts, aquatic snails. Elucidating the genetic basis of snail-schistosome interaction is a key to this strategy. Here we map a natural parasite-resistance polymorphism from a Caribbean population of the snail Biomphalaria glabrata. In independent experimental evolution lines, RAD genotyping shows that the same genomic region responds to selection for resistance to the parasite Schistosoma mansoni. A dominant allele in this region conveys an 8-fold decrease in the odds of infection. Fine-mapping and RNA-Seq characterization reveal a 25%) haplotypes across the GRC, a significantly non-neutral pattern, suggests that balancing selection maintains diversity at the GRC. Thus, the GRC resembles immune gene complexes seen in other taxa and is likely involved in parasite recognition. The GRC is a potential target for controlling transmission of schistosomiasis, including via genetic manipulation of snails. PMID:25775214

  12. Clusters of conserved beta cell marker genes for assessment of beta cell phenotype

    DEFF Research Database (Denmark)

    Martens, Geert A; Jiang, Lei; Hellemans, Karine H

    2011-01-01

    The aim of this study was to establish a gene expression blueprint of pancreatic beta cells conserved from rodents to humans and to evaluate its applicability to assess shifts in the beta cell differentiated state. Genome-wide mRNA expression profiles of isolated beta cells were compared to those of a large panel of other tissue and cell types, and transcripts with beta cell-abundant and -selective expression were identified. Iteration of this analysis in mouse, rat and human tissues generated a panel of conserved beta cell biomarkers. This panel was then used to compare isolated versus laser capture microdissected beta cells, monitor adaptations of the beta cell phenotype to fasting, and retrieve possible conserved transcriptional regulators.

  13. Genetic relationships among Native Americans based on beta-globin gene cluster haplotype frequencies

    Directory of Open Access Journals (Sweden)

    Rita de Cassia Mousinho-Ribeiro

    2003-01-01

    The distribution of β-globin gene haplotypes was studied in 209 Amerindians from eight tribes of the Brazilian Amazon: Asurini from Xingú, Awá-Guajá, Parakanã, Urubú-Kaapór, Zoé, Kayapó (Xikrin from the Bacajá village), Katuena, and Tiriyó. Nine different haplotypes were found, two of which (nos. 11 and 13) had not been previously identified in Brazilian indigenous populations. Haplotype 2 (+ - - - -) was the most common in all groups studied, with frequencies varying from 70% to 100%, followed by haplotype 6 (- + + - +), with frequencies between 7% and 18%. The frequency distribution of the β-globin gene haplotypes in the eighteen Brazilian Amerindian populations studied to date is characterized by a reduced number of haplotypes (average of 3.5) and low levels of heterozygosity and intrapopulational differentiation, with a single clearly predominant haplotype in most tribes (haplotype 2). The Parakanã, Urubú-Kaapór, Tiriyó and Xavante tribes constitute exceptions, presenting at least four haplotypes with relatively high frequencies. The closest genetic relationships were observed between the Brazilian and the Colombian Amerindians (Wayuu, Kamsa and Inga), and, to a lesser extent, with the Huichol of Mexico. North-American Amerindians are more differentiated and clearly separated from all other tribes, except the Xavante, from Brazil, and the Mapuche, from Argentina. A restricted pool of ancestral haplotypes may explain the low diversity observed among most present-day Brazilian and Colombian Amerindian groups, while interethnic admixture could be the most important factor to explain the high number of haplotypes and high levels of diversity observed in some South-American and most North-American tribes.

  14. NFκB-mediated activation of the cellular FUT3, 5 and 6 gene cluster by herpes simplex virus type 1.

    Science.gov (United States)

    Nordén, Rickard; Samuelsson, Ebba; Nyström, Kristina

    2017-11-01

    Herpes simplex virus type 1 has the ability to induce expression of a human gene cluster located on chromosome 19 upon infection. This gene cluster contains three fucosyltransferases (encoded by FUT3, FUT5 and FUT6) with the ability to add a fucose to an N-acetylglucosamine residue. Little is known regarding the transcriptional activation of these three genes in human cells. Intriguingly, herpes simplex virus type 1 activates all three genes simultaneously during infection, a situation not observed in uninfected tissue, pointing towards a virus specific mechanism for transcriptional activation. The aim of this study was to define the underlying mechanism for the herpes simplex virus type 1 activation of FUT3, FUT5 and FUT6 transcription. The transcriptional activation of the FUT-gene cluster on chromosome 19 in fibroblasts was specific, not involving adjacent genes. Moreover, inhibition of NFκB signaling through panepoxydone treatment significantly decreased the induction of FUT3, FUT5 and FUT6 transcriptional activation, as did siRNA targeting of p65, in herpes simplex virus type 1 infected fibroblasts. NFκB and p65 signaling appears to play an important role in the regulation of FUT3, FUT5 and FUT6 transcriptional activation by herpes simplex virus type 1 although additional, unidentified, viral factors might account for part of the mechanism as direct interferon mediated stimulation of NFκB was not sufficient to induce the fucosyltransferase encoding gene cluster in uninfected cells. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. C4BPAL1, a member of the human regulator of complement activation (RCA) gene cluster that resulted from the duplication of the gene coding for the α-chain of C4b-binding protein

    Energy Technology Data Exchange (ETDEWEB)

    Sanchez-Corral, P.; Pardo-Manuel de Villena, F.; Rey-Campos, J.; Rodriguez de Cordoba, S. (Unidad de Immunologia, Madrid (Spain))

    1993-07-01

    The regulator of complement activation (RCA) gene cluster evolved by multiple gene duplications to produce a family of genes coding for proteins that collectively control the activation of the complement system. The authors report here the characterization of C4BPAL1, a member of the human RCA gene cluster that arose from the duplication of the C4BPA gene after the separation of rodent and primate lineages. C4BPAL1 maps 20 kb downstream of the C4BPA gene and is in the same 5′ to 3′ orientation found for all RCA genes characterized thus far. It includes nine exon-like regions homologous to exons 2-8, 11, and 12 of the C4BPA gene. Analysis of the C4BPAL1 sequence suggests that it is currently a pseudogene in humans. However, comparisons between C4BPAL1 and the human and murine C4BPA genes show sequence conservation, which strongly suggests that, for a long period of time, C4BPAL1 has been a functional gene coding for a protein with structural requirements similar to those of the α-chain of C4b-binding protein. 50 refs., 5 figs., 1 tab.

  16. Cloning and characterization of the biosynthetic gene cluster of 16-membered macrolide antibiotic FD-891: involvement of a dual functional cytochrome P450 monooxygenase catalyzing epoxidation and hydroxylation.

    Science.gov (United States)

    Kudo, Fumitaka; Motegi, Atsushi; Mizoue, Kazutoshi; Eguchi, Tadashi

    2010-07-26

    FD-891 is a 16-membered cytotoxic antibiotic macrolide that is especially active against human leukemia such as HL-60 and Jurkat cells. We identified the FD-891 biosynthetic (gfs) gene cluster from the producer Streptomyces graminofaciens A-8890 by using typical modular type I polyketide synthase (PKS) genes as probes. The gfs gene cluster contained five typical modular type I PKS genes (gfsA, B, C, D, and E), a cytochrome P450 gene (gfsF), a methyltransferase gene (gfsG), and a regulator gene (gfsR). The gene organization of PKSs agreed well with the basic polyketide skeleton of FD-891 including the oxidation states and alpha-alkyl substituent determined by the substrate specificities of the acyltransferase (AT) domains. To clarify the involvement of the gfs genes in the FD-891 biosynthesis, the P450 gfsF gene was inactivated; this resulted in the loss of FD-891 production. Instead, the gfsF gene-disrupted mutant accumulated a novel FD-891 analogue 25-O-methyl-FD-892, which lacked the epoxide and the hydroxyl group of FD-891. Furthermore, the recombinant GfsF enzyme coexpressed with putidaredoxin and putidaredoxin reductase converted 25-O-methyl-FD-892 into FD-891. In the course of the GfsF reaction, 10-deoxy-FD-891 was isolated as an enzymatic reaction intermediate, which was also converted into FD-891 by GfsF. Therefore, it was clearly found that the cytochrome P450 GfsF catalyzes epoxidation and hydroxylation in a stepwise manner in the FD-891 biosynthesis. These results clearly confirmed that the identified gfs genes are responsible for the biosynthesis of FD-891 in S. graminofaciens.

  17. Lactobacillus plantarum gene clusters encoding putative cell-surface protein complexes for carbohydrate utilization are conserved in specific gram-positive bacteria

    Directory of Open Access Journals (Sweden)

    Muscariello Lidia

    2006-05-01

    Abstract Background Genomes of gram-positive bacteria encode many putative cell-surface proteins, of which the majority has no known function. From the rapidly increasing number of available genome sequences it has become apparent that many cell-surface proteins are conserved, and frequently encoded in gene clusters or operons, suggesting common functions, and interactions of multiple components. Results A novel gene cluster encoding exclusively cell-surface proteins was identified, which is conserved in a subgroup of gram-positive bacteria. Each gene cluster generally has one copy of four new gene families called cscA, cscB, cscC and cscD. Clusters encoding these cell-surface proteins were found only in complete genomes of Lactobacillus plantarum, Lactobacillus sakei, Enterococcus faecalis, Listeria innocua, Listeria monocytogenes, Lactococcus lactis ssp lactis and Bacillus cereus and in incomplete genomes of L. lactis ssp cremoris, Lactobacillus casei, Enterococcus faecium, Pediococcus pentosaceus, Lactobacillus brevis, Oenococcus oeni, Leuconostoc mesenteroides, and Bacillus thuringiensis. These genes are neither present in the genomes of streptococci, staphylococci and clostridia, nor in the Lactobacillus acidophilus group, suggesting a niche-specific distribution, possibly relating to association with plants. All encoded proteins have a signal peptide for secretion by the Sec-dependent pathway, while some have cell-surface anchors, novel WxL domains, and putative domains for sugar binding and degradation. Transcriptome analysis in L. plantarum shows that the cscA-D genes are co-expressed, supporting their operon organization. Many gene clusters are significantly up-regulated in a glucose-grown, ccpA-mutant derivative of L. plantarum, suggesting catabolite control. This is supported by the presence of predicted CRE-sites upstream or inside the up-regulated cscA-D gene clusters. Conclusion We propose that the CscA, CscB, CscC and Csc

  18. Genome based analysis of type-I polyketide synthase and nonribosomal peptide synthetase gene clusters in seven strains of five representative Nocardia species.

    Science.gov (United States)

    Komaki, Hisayuki; Ichikawa, Natsuko; Hosoyama, Akira; Takahashi-Nakaguchi, Azusa; Matsuzawa, Tetsuhiro; Suzuki, Ken-ichiro; Fujita, Nobuyuki; Gonoi, Tohru

    2014-04-30

    Actinobacteria of the genus Nocardia usually live in soil or water and play saprophytic roles, but they also opportunistically infect the respiratory system, skin, and other organs of humans and animals. Primarily because of the clinical importance of the strains, some Nocardia genomes have been sequenced, and genome sequences have accumulated. Genome sizes of Nocardia strains are similar to those of Streptomyces strains, the producers of most antibiotics. In the present work, we compared secondary metabolite biosynthesis gene clusters of type-I polyketide synthase (PKS-I) and nonribosomal peptide synthetase (NRPS) among genomes of representative Nocardia species/strains based on domain organization and amino acid sequence homology. Draft genome sequences of Nocardia asteroides NBRC 15531(T), Nocardia otitidiscaviarum IFM 11049, Nocardia brasiliensis NBRC 14402(T), and N. brasiliensis IFM 10847 were read and compared with published complete genome sequences of Nocardia farcinica IFM 10152, Nocardia cyriacigeorgica GUH-2, and N. brasiliensis HUJEG-1. Genome sizes are as follows: N. farcinica, 6.0 Mb; N. cyriacigeorgica, 6.2 Mb; N. asteroides, 7.0 Mb; N. otitidiscaviarum, 7.8 Mb; and N. brasiliensis, 8.9 - 9.4 Mb. Predicted numbers of PKS-I, NRPS, and PKS-I/NRPS hybrid clusters ranged between 4-11, 7-13, and 1-6, respectively, depending on strains, and tended to increase with increasing genome size. Domain and module structures of representative or unique clusters are discussed in the text. We conclude the following: 1) genomes of Nocardia strains carry as many PKS-I and NRPS gene clusters as those of Streptomyces strains, 2) the number of PKS-I and NRPS gene clusters in Nocardia strains varies substantially depending on species, and N. brasiliensis strains carry the largest numbers of clusters among the species studied, 3) the seven Nocardia strains studied in the present work have seven common PKS-I and/or NRPS clusters, some of whose products are yet to be studied

  19. FADS gene cluster polymorphisms: important modulators of fatty acid levels and their impact on atopic diseases.

    Science.gov (United States)

    Lattka, Eva; Illig, Thomas; Heinrich, Joachim; Koletzko, Berthold

    2009-01-01

    Long-chain polyunsaturated fatty acids (LC-PUFAs) play an important role in several physiological processes and their concentration in phospholipids has been associated with several complex diseases, such as atopic disease. The level and composition of LC-PUFAs in the human body is highly dependent on their intake in the diet or on the intake of fatty acid precursors, which are endogenously elongated and desaturated to physiologically active LC-PUFAs. The most important enzymes in this reaction cascade are the Delta(5) and Delta(6) desaturases. Several studies in the last few years have revealed that single nucleotide polymorphisms (SNPs) in the 2 desaturase encoding genes (FADS1 and FADS2) are highly associated with the concentration of omega-6 and omega-3 fatty acids, showing that besides nutrition, genetic factors also play an important role in the regulation of LC-PUFAs. This review focuses on current knowledge of the impact of genetic polymorphisms on LC-PUFA metabolism and on their potential role in the development of atopic diseases. Copyright (c) 2009 S. Karger AG, Basel.

  20. CLEAN: CLustering Enrichment ANalysis

    Science.gov (United States)

    Freudenberg, Johannes M; Joshi, Vineet K; Hu, Zhen; Medvedovic, Mario

    2009-01-01

    Background Integration of biological knowledge encoded in various lists of functionally related genes has become one of the most important aspects of analyzing genome-wide functional genomics data. In the context of cluster analysis, functional coherence of clusters established through such analyses has been used to identify biologically meaningful clusters, compare clustering algorithms and identify biological pathways associated with the biological process under investigation. Results We developed a computational framework for analytically and visually integrating knowledge-based functional categories with the cluster analysis of genomics data. The framework is based on the simple, conceptually appealing, and biologically interpretable gene-specific functional coherence score (CLEAN score). The score is derived by correlating the clustering structure as a whole with functional categories of interest. We directly demonstrate that integrating biological knowledge in this way improves the reproducibility of conclusions derived from cluster analysis. The CLEAN score differentiates between the levels of functional coherence for genes within the same cluster based on their membership in enriched functional categories. We show that this aspect results in higher reproducibility across independent datasets and produces more informative genes for distinguishing different sample types than the scores based on the traditional cluster-wide analysis. We also demonstrate the utility of the CLEAN framework in comparing clusterings produced by different algorithms. CLEAN was implemented as an add-on R package and can be downloaded at . The package integrates routines for calculating gene specific functional coherence scores and the open source interactive Java-based viewer Functional TreeView (FTreeView). Conclusion Our results indicate that using the gene-specific functional coherence score improves the reproducibility of the conclusions made about clusters of co

  1. De Novo Assembly and Genome Analyses of the Marine-Derived Scopulariopsis brevicaulis Strain LF580 Unravels Life-Style Traits and Anticancerous Scopularide Biosynthetic Gene Cluster.

    Science.gov (United States)

    Kumar, Abhishek; Henrissat, Bernard; Arvas, Mikko; Syed, Muhammad Fahad; Thieme, Nils; Benz, J Philipp; Sørensen, Jens Laurids; Record, Eric; Pöggeler, Stefanie; Kempken, Frank

    2015-01-01

    The marine-derived Scopulariopsis brevicaulis strain LF580 produces scopularides A and B, which have anticancerous properties. We carried out genome sequencing using three next-generation DNA sequencing methods. De novo hybrid assembly yielded 621 scaffolds with a total size of 32.2 Mb and 16298 putative gene models. We identified a large non-ribosomal peptide synthetase gene (nrps1) and supporting pks2 gene in the same biosynthetic gene